Manuscript Number : IJSRSET218640
Accurate Classification for Imbalanced Data Analytics using RSB Ensemble Technique
Authors(3) :-Tanmayee Tushar Parbat, Honey Jain, Rohan Benhal
For the past few years, researchers have developed models using machine learning algorithms which demonstrated unsatisfactory performance on classifying imbalanced datasets. As a remedy, few researchers have experimented with synthetic minority over sampling technique (SMOTE) and Cost Sensitive methods. Results indicated that these methods also have certain drawbacks such as over fitting and high mis-classification rate. To overcome these problems, ensemble techniques were proved to be robust in handling imbalanced datasets. In this study, we have considered existing boosting, bagging ensemble techniques and improved them in several aspects by proposing an algorithm named Random Split bagging (RSB) such that imbalanced datasets can be handled effectively. Our approach presented a novel tuple and attribute selection strategy. Finally we have chosen splitting criteria to generate class label. The proposed Random split bagging ensemble technique shows best performance on classification of minority class examples i.e. classification of disease affected patients like malaria, dengue and jaundice diseases.
Tanmayee Tushar Parbat
RSB, SMOTE, Imbalanced Dataset
Publication Details
Published in :
Volume 9 | Issue 7 | September-October 2021 Article Preview
B.E IT, Dr. Vishwanath Karad MIT World Peace University, Pune, Maharashtra, India
Honey Jain
B.E IT, Dr. Vishwanath Karad MIT World Peace University, Pune, Maharashtra, India
Rohan Benhal
BBA IT, Dr. Vishwanath Karad MIT World Peace University, Pune, Maharashtra, India
Date of Publication :
2021-10-30
License: This work is licensed under a Creative Commons Attribution 4.0 International License.
Page(s) :
104-108
Manuscript Number :
IJSRSET218640
Publisher : Technoscience Academy