Subset Feature Selection Approach for Class Imbalance
| dc.contributor.author | Lachheta, Pawan | |
| dc.contributor.supervisor | Bawa, Seema | |
| dc.date.accessioned | 2016-08-10T09:42:19Z | |
| dc.date.available | 2016-08-10T09:42:19Z | |
| dc.date.issued | 2016-08-10 | |
| dc.description | Master of Engineering-Software Engineering | en_US |
| dc.description.abstract | In machine learning, building an effective classification model, when the high dimensional data is suffering from class imbalance problem, is a major challenge. The problem becomes severe when negative samples have large percentages than positive samples. Various techniques like cost sensitive learning techniques, recognition based techniques, and sampling based techniques, etc. exist to handle data imbalance problem. However, these techniques suffer from data loss and over fitting because they invariably change the original distribution of data. To surmount the data imbalance and high dimensionality issues in dataset, in this thesis we propose a framework named Subset Feature Selection (SFS). The proposed SFS framework comprises of SMOTE filters are used for balancing the datasets, as well as feature ranker for pre-processing of data. The framework SFS is developed using R language and various R packages. The performance of SFS framework is evaluated and results show that SFS framework outperforms than other existing techniques like cost sensitive learning, recognition based techniques etc. | en_US |
| dc.identifier.uri | http://hdl.handle.net/10266/4057 | |
| dc.language.iso | en | en_US |
| dc.subject | Class Imbalance | en_US |
| dc.subject | Minority Class | en_US |
| dc.subject | Majority Class | en_US |
| dc.subject | Computer Science | en_US |
| dc.title | Subset Feature Selection Approach for Class Imbalance | en_US |
| dc.type | Thesis | en_US |
