2024 : 5 : 24
Parham Moradi

Parham Moradi

Academic rank: Associate Professor
Education: PhD.
ScopusId: 654
Faculty: Faculty of Engineering
Address: Department of Computer Engineering, Faculty of Engineering, University of Kurdistan


Diversity and Separable Metrics in Over-Sampling Technique for Imbalanced Data Classification
Diversity measure; Separable Measure; Over-Sampling; Imbalanced Data; Classification problems
Researchers Shadi Mahmoudi ، Parham Moradi ، Fardin Akhlaghian Tab ، Rizan Moradi


The imbalance data problem in classification is a significant research area and has attracted a lot attention in recent years. Rebalancing class distribution techniques such as over-sampling or under-sampling are the most common approaches to deal with this problem. This paper presents a new method so called Diversity and Separable Metrics in Over-Sampling Technique (DSMOTE) to handle the imbalanced learning problems. The main idea of the DSMOTE is to use a diversity and separable measure which shows a positive impact on the minority class. This improvement is achieved by reduce overfitting by using a diversity measure. Moreover by using the separable measure the risk of generating new samples in decision boundaries with hard-to-learn samples is decreased. The proposed method improves the learning accuracy in three stages including; (1) removal of abnormal samples from minority class, (2) selecting the top three samples of minority class according to desired criteria and (3) generating new sample using selected samples. The experiments are conducted on five real world datasets which is taken from Iran University of Medical Science and also six different UCI datasets. Moreover, three different classifiers, four resampling algorithms and six performance evaluation measures are used to evaluate the proposed method. The reported results indicate that the proposed approach demonstrates a better or at least comparable performance compared to those of the state-of-the-art methods.