A Gaussian Mixture Based Boosted Classification Scheme For Imbalanced And Oversampled Data
File version
Author(s)
Paul, Mahit Kumar
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
Size
File type(s)
Location
Cox's Bazar, Bangladesh
License
Abstract
Dataset with imbalanced class distribution used to abate classification performance for most of the standard classifier learning algorithms. Moreover, some application area consists of scarcity of labeled training data where clustering is most prominent way to support classification process. Gaussian Mixture Model (GMM) being able to approximate arbitrary probability distribution, is a dominant tool for classification in such cases by means of clustering. An ensemble approach is presented in this paper considering GMM as a weak learner to boost the GMMs in a semi supervised manner via Adaptive Boosting technique. This paper, firstly investigates how much K-means and GMM suffers from uneven class distribution in data. Later experiment on benchmark imbalanced datasets with different imbalance ratio and over sampled datasets using Synthetic Minority Over-sampling Technique (SMOTE) has been carried out for proposed approach. For each case cluster forest has been used as an attribute selection technique. Efficacy of the proposed Boosted GMM approach compared to standard clustering approaches like K means and GMM is exhibited from empirical analysis.
Journal Title
Conference Title
2017 International Conference on Electrical, Computer and Communication Engineering (ECCE)
Book Title
Edition
Volume
Issue
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement
Item Access Status
Note
Access the data
Related item(s)
Subject
Science & Technology
Technology
Engineering, Electrical & Electronic
Telecommunications
Engineering
Persistent link to this record
Citation
Pal, B; Paul, MK, A Gaussian Mixture Based Boosted Classification Scheme For Imbalanced And Oversampled Data, 2017 International Conference on Electrical, Computer and Communication Engineering (ECCE), 2017, pp. 401-405