Full Text:   <246>

CLC number: 

On-line Access: 2023-10-07

Received: 2023-04-21

Revision Accepted: 2023-09-25

Crosschecked: 0000-00-00

Cited: 0

Clicked: 328

Citations:  Bibtex RefMan EndNote GB/T7714

-   Go to

Article info.
Open peer comments

Journal of Zhejiang University SCIENCE C 1998 Vol.-1 No.-1 P.

http://doi.org/10.1631/FITEE.2300278


A novel overlapping minimization SMOTE algorithm for imbalanced classification


Author(s):  Yu-lin HE, Xuan LU, Philippe FOURNIER-VIGER, Joshua Zhexue HUANG

Affiliation(s):  Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ), Shenzhen 518107, China; more

Corresponding email(s):   yulinhe@gml.ac.cn, 2110276215@email.szu.edu.cn, philfv@szu.edu.cn, zx.huang@szu.edu.cn

Key Words:  Imbalanced classification, Synthetic minority oversampling technique, Majority-class sample point, Minority-class sample point, Generalization capability, Overlapping minimization


Yu-lin HE, Xuan LU, Philippe FOURNIER-VIGER, Joshua Zhexue HUANG. A novel overlapping minimization SMOTE algorithm for imbalanced classification[J]. Frontiers of Information Technology & Electronic Engineering, 1998, -1(-1): .

@article{title="A novel overlapping minimization SMOTE algorithm for imbalanced classification",
author="Yu-lin HE, Xuan LU, Philippe FOURNIER-VIGER, Joshua Zhexue HUANG",
journal="Frontiers of Information Technology & Electronic Engineering",
volume="-1",
number="-1",
pages="",
year="1998",
publisher="Zhejiang University Press & Springer",
doi="10.1631/FITEE.2300278"
}

%0 Journal Article
%T A novel overlapping minimization SMOTE algorithm for imbalanced classification
%A Yu-lin HE
%A Xuan LU
%A Philippe FOURNIER-VIGER
%A Joshua Zhexue HUANG
%J Journal of Zhejiang University SCIENCE C
%V -1
%N -1
%P
%@ 2095-9184
%D 1998
%I Zhejiang University Press & Springer
%DOI 10.1631/FITEE.2300278

TY - JOUR
T1 - A novel overlapping minimization SMOTE algorithm for imbalanced classification
A1 - Yu-lin HE
A1 - Xuan LU
A1 - Philippe FOURNIER-VIGER
A1 - Joshua Zhexue HUANG
J0 - Journal of Zhejiang University Science C
VL - -1
IS - -1
SP -
EP -
%@ 2095-9184
Y1 - 1998
PB - Zhejiang University Press & Springer
ER -
DOI - 10.1631/FITEE.2300278


Abstract: 
The synthetic minority oversampling technique (SMOTE) is a popular algorithm to reduce the impact of class imbalance for building classifiers, which has received several enhancements over the past 20 years. SMOTE and its variants synthesize a number of minority-class sample points in the original sample space to alleviate the adverse effects of class imbalance. This approach works well in many cases, but problems arise when synthetic sample points are generated in overlapping areas between different classes, which further complicates classifier training. To address this issue, this paper proposes a novel generalization-oriented rather than imputation-oriented minority-class sample point generation algorithm, named overlapping minimization SMOTE (OM-SMOTE), this algorithm is designed specifically for binary imbalanced classification problems. OM-SMOTE first maps the original sample points into a new sample space by balancing the trade-off between sample encoding and classifier generalization. Then, OM-SMOTE employs a set of sophisticated minority-class sample point imputation rules to generate synthetic sample points that are as far as possible from overlapping areas between classes. Extensive experiments have been conducted on 32 imbalanced data sets to validate the effectiveness of the OM-SMOTE algorithm. Results show that using OM-SMOTE to generate synthetic minority-class sample points leads to better classifier training performances for the naive Bayes classifier, support vector machine, decision tree, and logistic regression than other 11 state-of-the-art SMOTE-based imputation algorithms. This demonstrates that OM-SMOTE is a viable approach to support the training of high-quality classifier for imbalanced classification.

Darkslateblue:Affiliate; Royal Blue:Author; Turquoise:Article

Open peer comments: Debate/Discuss/Question/Opinion

<1>

Please provide your name, email address and a comment





Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952783; E-mail: cjzhang@zju.edu.cn
Copyright © 2000 - 2024 Journal of Zhejiang University-SCIENCE