Journal of Guangxi Normal University (Natural Science Edition), 2019, Vol. 37, Issue (2): 75-81. doi: 10.16088/j.issn.1001-6600.2019.02.009


Improving Classification Rule with Lift Measure for KNN Classifier

WU Hao1*, QIN Lichun2, LUO Liurong2   

  1. College of Computer Science and Information Technology, Guangxi Normal University, Guilin Guangxi 541004, China;
  2. Liuzhou Railway Vocational Technology College, Liuzhou Guangxi 545616, China
  • Received: 2018-11-02  Online: 2019-04-25  Published: 2019-04-28

Abstract: A KNN classifier is presented for classifying imbalanced data. A gain model is constructed to measure the lift in the probability of a class label. This makes the minority class more competitive in imbalanced-class datasets, and the accuracy of classifying minority-class data is significantly improved. Experimental results show that, on imbalanced-class datasets, the proposed approach significantly improves classification accuracy compared with existing KNN classifiers.

Key words: classification, KNN algorithm, imbalanced data, lift

CLC Number: 

  • TP181
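
The abstract does not give the gain model's formula, so the following is only a minimal sketch of how a lift-based decision rule for KNN could look, not the authors' method. It assumes the common definition lift(c) = P(c | k nearest neighbours) / P(c), with P(c) estimated from class frequencies in the training set; the function name `lift_knn_predict` and the toy data are hypothetical.

```python
from collections import Counter
import math


def lift_knn_predict(train_X, train_y, query, k=5):
    """Return (label, lift_by_class) using a lift-based KNN rule.

    Assumption (not taken from the paper): lift(c) is the share of class c
    among the k nearest neighbours divided by its share in the training set.
    """
    n = len(train_y)
    prior_counts = Counter(train_y)  # class counts over the whole training set

    # Rank training points by Euclidean distance to the query.
    order = sorted(range(n), key=lambda i: math.dist(train_X[i], query))
    neighbour_counts = Counter(train_y[i] for i in order[:k])

    # lift(c) = (k_c / k) / (n_c / n), written as one fraction.
    lift = {
        c: (neighbour_counts[c] * n) / (k * prior_counts[c])
        for c in neighbour_counts
    }
    return max(lift, key=lift.get), lift


# Toy imbalanced data: 8 majority ("neg") points and 2 minority ("pos") points.
train_X = [(float(x),) for x in range(8)] + [(8.5,), (9.0,)]
train_y = ["neg"] * 8 + ["pos"] * 2

label, lift = lift_knn_predict(train_X, train_y, (8.0,), k=5)
print(label, lift)
```

In this toy case the five nearest neighbours contain three "neg" and two "pos" points, so a plain majority vote would pick "neg". Under the assumed lift rule, "pos" wins because its share in the neighbourhood (2/5) is twice its share in the training set (2/10), while the share of "neg" drops from 8/10 to 3/5.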