مشخصات پژوهش

صفحه نخست /Diversity and Separable ...
عنوان Diversity and Separable Metrics in Over-Sampling Technique for Imbalanced Data Classification
نوع پژوهش مقاله ارائه شده کنفرانسی
کلیدواژه‌ها Diversity measure; Separable Measure; Over-Sampling; Imbalanced Data; Classification problems
چکیده The imbalance data problem in classification is a significant research area and has attracted a lot attention in recent years. Rebalancing class distribution techniques such as over-sampling or under-sampling are the most common approaches to deal with this problem. This paper presents a new method so called Diversity and Separable Metrics in Over-Sampling Technique (DSMOTE) to handle the imbalanced learning problems. The main idea of the DSMOTE is to use a diversity and separable measure which shows a positive impact on the minority class. This improvement is achieved by reduce overfitting by using a diversity measure. Moreover by using the separable measure the risk of generating new samples in decision boundaries with hard-to-learn samples is decreased. The proposed method improves the learning accuracy in three stages including; (1) removal of abnormal samples from minority class, (2) selecting the top three samples of minority class according to desired criteria and (3) generating new sample using selected samples. The experiments are conducted on five real world datasets which is taken from Iran University of Medical Science and also six different UCI datasets. Moreover, three different classifiers, four resampling algorithms and six performance evaluation measures are used to evaluate the proposed method. The reported results indicate that the proposed approach demonstrates a better or at least comparable performance compared to those of the state-of-the-art methods.
پژوهشگران ریزان مرادی (نفر چهارم)، فردین اخلاقیان طاب (نفر سوم)، پرهام مرادی دولت آبادی (نفر دوم)، شادی محمودی (نفر اول)