Journal of Guangxi Normal University(Natural Science Edition) ›› 2022, Vol. 40 ›› Issue (1): 57-67.doi: 10.16088/j.issn.1001-6600.2021060913

Previous Articles     Next Articles

New Category Classification Research Based on MEB and SVM Methods

YANG Di, FANG Yangxin, ZHOU Yan*   

  1. College of Mathematics and Statistics, Shenzhen University, Shenzhen Guangdong 518060, China
  • Received:2021-06-09 Revised:2021-07-13 Online:2022-01-25 Published:2022-01-24

Abstract: This paper mainly studies the following problems: if there is a training set containing only A and B classes,and a test set containing more than these two categories,how should the samples in the test set be classified? For this problem, three new category classification methods based on SVM and minimum enclosing ball method are proposed. These three new methods not only can solves the weakness of SVM that can't correctly identifying new categories, but also can obtain good effect in the real data analysis. The data set used in this paper is breast cancer molecular subtype data set. The final sample classification accuracy rate can reach more than 90%,and the classification accuracy of the new category samples can be more than 99%.

Key words: machine learning, multi-classification problem, support vector machine, MEB, SVDD

CLC Number: 

  • R737.9
[1] 刘宗超, 李哲轩, 张阳, 等. 2020全球癌症统计报告解读[J]. 肿瘤综合治疗电子杂志, 2021, 7(2): 1-13.
[2]CHEN W Q, ZHENG R S, BAADE P D, et al. Cancer statistics in China, 2015[J]. CA:a Cancer Journal for Clinicians, 2016, 66(2): 115-32.
[3]JEMAL A, SIEGEL R, WARD E, et al. Cancer statistics, 2006[J]. CA:a Cancer Journal for Clinicians, 2006, 56(2): 106-130.
[4]ZHENG X Q, ZHAO Q, WU H J, et al. Methylpurify: tumor purity deconvolution and differential methylation detection from single tumor DNA methylomes[J]. Genome Biology, 2014, 15(8): 419.
[5]DOU H X, FANG Y, ZHENG X Q. Universal informative CpG sites fbr inferring tumor purity from DNA methylation microarray data[J]. Journal of Bioinformatics and Computational Biology, 2018, 16(3): 1750030.
[6]CARTER S L,CIBULSKIS K, HELMAN E, et al. Absolutequantification of somatic DNA alterations in Human cancer[J]. Nature Biotechnology, 2012, 30(5): 413-421.
[7]OESPER L, MAHMOODY A, RAPHAEL B J. THetA:inferring intra-tumor heterogeneity from high-throughput DNA sequencing data[J]. Genome Biology, 2013, 14(7): R80.
[8]ANDOR N, HAMESS J V, MÜLLER S, et al. EXPANDS:expanding ploidy and allele frequency on nested subpopulations[J]. Bioinfbrmatics, 2014, 30(1): 50-60.
[9]任湘, 张朋, 范明, 等. 基于卷积神经网络的乳腺癌分子分型预测研究[J]. 杭州电子科技大学学报(自然科学版), 2018, 38(5): 66-71.
[10]DREISEITL S, OSL M, SCHEIBBÖCK C, et al. Outlier detection with one-class SVMs: an application to melanoma prognosis[J].AMIA Annual Symposium Proceedings. AMIA Symposium, 2010: 172-176.
[11]SCHOLKOPF B, SMOLA A J. Learning with kernels: support vector machines, regularization, optimization, and beyond[M].Cambridge, MA : MIT Press, 2001.
[12]董小瑞, 武雅文, 张志文, 等. 基于遗传算法和支持向量机的汽车行驶工况识别[J]. 车用发动机, 2021(2): 13-17.
[13]SCHÖLKOPF B, PLATT J C, SHAWE-TAYLOR J, et al. Estimating the support of a high-dimensional distribution[J]. Neural Computation, 2001, 13(7): 1443-1471.
[14]TAX D M J, DUIN R P W. Support vector domain description[J]. Pattern Recognition Letters, 1999, 20(11/12/13): 1191-1199.
[15]WANG K, STOLFO S. One-class SVM training for masquerade detection[J]. 3rd IEEE Conference Data Mining Workshop on Data Mining for Computer Security. Florida, 2003: 10-19.
[16]CHEN Y Q, ZHOU X S, HUANG T S. One-class SVM for learning in image retrieval[C]//Proceedings 2001 International Conference on Image Processing. Thessaloniki: IEEE, 2001: 34-37.
[17]姚力群, 陶卿. 局部线性与one-class结合的科技文本分类方法[J]. 计算机研究与发展, 2005, 42(11): 1862-1869.
[18]何书锋, 孙钿奇, 王诏, 等. 基于深度学习的多波束海底地质数据异常值检测方法[J]. 计算机应用与软件, 2021, 38(4): 95-100.
[19]PEROU C M, SERLIE T, EISEN M B, et al. Molecular portraits of human breast tumors[J]. Nature, 2000, 406: 747-752.
[20]PRISACK H B, KARREMAN C, MODLICH O, et al. Predictive biological markers for response of invasive breast cancer to anthracycline/cyclophosphamide-based primary (radio-)chemotherapy[J]. Anticancer Research, 2005, 25(6C): 4615-4621.
[21]曾天宇, 孙春晓, 杨帆, 等. 小剂量阿帕替尼治疗晚期乳腺癌的效果和安全性分析[J]. 临床肿瘤学杂志, 2020, 25(5): 451-455.
[1] LU Kaifeng, YANG Yilong, LI Zhi. A Web Service Classification Method Using BERT and DPCNN [J]. Journal of Guangxi Normal University(Natural Science Edition), 2021, 39(6): 87-98.
[2] ZHANG Yongsheng, ZHU Wenjun, SHI Ruoqi, DU Zhenhua, ZHANG Rui, WANG Zhi. A Confidence-guided Hybrid Android Malware DetectionSystem with Multiple Heterogeneous Algorithms [J]. Journal of Guangxi Normal University(Natural Science Edition), 2020, 38(2): 19-28.
[3] ZHU Yongjian, PENG Ke, QI Guangwen, XIA Haiying, SONG Shuxiang. Defect Detection of Solar Panel Based on Machine Vision [J]. Journal of Guangxi Normal University(Natural Science Edition), 2019, 37(2): 105-112.
[4] LÜ Kaichen, YAN Hongfei, CHEN Chong. Quantitative Investment Strategy Based on CSI 300 [J]. Journal of Guangxi Normal University(Natural Science Edition), 2019, 37(1): 1-12.
[5] LIN Yue,LIU Tingzhang,WANG Zhehe. Quantity Optimization of Virtual Sample Generation with Two Kinds of Upper Bound Conditions [J]. Journal of Guangxi Normal University(Natural Science Edition), 2019, 37(1): 142-148.
[6] LI Ziyan,LIU Weiming. New Method of Moving Vehicle Detection Based on Partial HOG Feature [J]. Journal of Guangxi Normal University(Natural Science Edition), 2017, 35(3): 1-13.
[7] LIU Yanhong, LUO Xiaoshu, CHEN Jin, GUO Lei. Research on Cervical Cell Image Feature Extraction and Recognition [J]. Journal of Guangxi Normal University(Natural Science Edition), 2016, 34(2): 61-66.
[8] CHEN Si-yi, LUO Qiang, HUANG Hui-xian. Division Method of Coordinated Control Sub-areas Based on Group Decision Making Theory and Support Vector Machine [J]. Journal of Guangxi Normal University(Natural Science Edition), 2014, 32(4): 18-25.
[9] ZUO Xin, HUANG Hai-long, LIU Jian-wei. Classifier of p-norm Regularizing SVM with Nonconvex Conjugate Gradient Algorithm [J]. Journal of Guangxi Normal University(Natural Science Edition), 2013, 31(3): 51-58.
[10] WANG Shi-ming, XU Jian-min, LI Ri-han. Improvement on On-ramp Control Algorithm of Urban Freeway [J]. Journal of Guangxi Normal University(Natural Science Edition), 2012, 30(2): 1-6.
[11] YAN Xiao-ming, ZHENG Zhi. Optimizing Parameters of SVM Based on Combined Bionic Algorithm [J]. Journal of Guangxi Normal University(Natural Science Edition), 2011, 29(2): 114-118.
[12] ZHANG Ren-jin, TANG Cui-fang, LIU Bin. Researching and Programming of Computer Games Using Artificial Neural Networks [J]. Journal of Guangxi Normal University(Natural Science Edition), 2011, 29(2): 119-124.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] LIU Guolun, SONG Shuxiang, CEN Mingcan, LI Guiqin, XIE Lina. Design of Bandwidth Tunable Band-Stop Filter[J]. Journal of Guangxi Normal University(Natural Science Edition), 2018, 36(3): 1 -8 .
[2] LIU Ming, ZHANG Shuangquan, HE Yude. Classification Study of Differential Telecom Users Based on SOM Neural Network[J]. Journal of Guangxi Normal University(Natural Science Edition), 2018, 36(3): 17 -24 .
[3] HU Yucong, CHEN Xu, LUO Jialing. Network Design Model of Customized Bus in Diversified Operationof Multi-origin-destination and Multi-type Vehicle Mixed Load[J]. Journal of Guangxi Normal University(Natural Science Edition), 2018, 36(4): 1 -11 .
[4] TANG Tang, WEI Chengyun, LUO Xiaoshu, QIU Senhui. Study of Seeker Optimization Algorithm with Inertia TermSelf-tuning to Attitude Stability of Quadrotor UAV[J]. Journal of Guangxi Normal University(Natural Science Edition), 2018, 36(4): 12 -19 .
[5] LIN Yue, LIU Tingzhang, HUANG Lirong, XI Xiaoye, PAN Jian. Anomalous State Detection of Power Transformer Basedon Bidirectional KL Distance Clustering Algorithm[J]. Journal of Guangxi Normal University(Natural Science Edition), 2018, 36(4): 20 -26 .
[6] WEI Zhenhan, SONG Shuxiang, XIA Haiying. State-of-charge Estimation Using Random Forest for Lithium Ion Battery[J]. Journal of Guangxi Normal University(Natural Science Edition), 2018, 36(4): 27 -33 .
[7] XU Yuanjing, HU Weiping. Identification of Pathological Voice of Different Levels Based on Random Forest[J]. Journal of Guangxi Normal University(Natural Science Edition), 2018, 36(4): 34 -41 .
[8] ZHANG Canlong, SU Jiancai, LI Zhixin, WANG Zhiwen. Infrared-Visible Target Tracking Basedon AdaBoost Confidence Map[J]. Journal of Guangxi Normal University(Natural Science Edition), 2018, 36(4): 42 -50 .
[9] LIU Dianting, WU Lina. Domain Experts Recommendation in Social Network Basedon the LDA Theme Model of Trust[J]. Journal of Guangxi Normal University(Natural Science Edition), 2018, 36(4): 51 -58 .
[10] JIANG Yingxing, HUANG Wennian. Ground State Solutions for the NonlinearSchrödinger-Maxwell Equations[J]. Journal of Guangxi Normal University(Natural Science Edition), 2018, 36(4): 59 -66 .