Subset Feature Selection Approach for Class Imbalance

dc.contributor.authorLachheta, Pawan
dc.contributor.supervisorBawa, Seema
dc.date.accessioned2016-08-10T09:42:19Z
dc.date.available2016-08-10T09:42:19Z
dc.date.issued2016-08-10
dc.descriptionMaster of Engineering-Software Engineeringen_US
dc.description.abstractIn machine learning, building an effective classification model, when the high dimensional data is suffering from class imbalance problem, is a major challenge. The problem becomes severe when negative samples have large percentages than positive samples. Various techniques like cost sensitive learning techniques, recognition based techniques, and sampling based techniques, etc. exist to handle data imbalance problem. However, these techniques suffer from data loss and over fitting because they invariably change the original distribution of data. To surmount the data imbalance and high dimensionality issues in dataset, in this thesis we propose a framework named Subset Feature Selection (SFS). The proposed SFS framework comprises of SMOTE filters are used for balancing the datasets, as well as feature ranker for pre-processing of data. The framework SFS is developed using R language and various R packages. The performance of SFS framework is evaluated and results show that SFS framework outperforms than other existing techniques like cost sensitive learning, recognition based techniques etc.en_US
dc.identifier.urihttp://hdl.handle.net/10266/4057
dc.language.isoenen_US
dc.subjectClass Imbalanceen_US
dc.subjectMinority Classen_US
dc.subjectMajority Classen_US
dc.subjectComputer Scienceen_US
dc.titleSubset Feature Selection Approach for Class Imbalanceen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
4057.PDF
Size:
832.44 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.03 KB
Format:
Item-specific license agreed upon to submission
Description: