Journal of Guangxi Normal University(Natural Science Edition) ›› 2010, Vol. 28 ›› Issue (3): 113-117.

Previous Articles     Next Articles

Constructing Decision Tree Attribution Reduction Algorithms withGene Expression Programming Based on Information Gain

WANG Yan1, YUAN Chang-an2, LIU Fu-tian3   

  1. 1. College of Shiyuan,Guangxi Teachers Education University,Nanning Guangxi,530023,China;
    2. College of Computer and Information Engineering,Guangxi Teachers Education University, Nanning Guangxi 530023,China;
    3. Nanning City's Emergency Response Centre,Nanning Guangxi 530023,China
  • Received:2010-04-20 Online:2010-09-20 Published:2023-02-06

Abstract: Classification is an important sector of Data Mining,and decision tree is one of the efficient classification methods used constantly.Nowadays,thereare several classification algorithms which use decision tree,for instance ID3,C4.5 and CART.But there are some disadvantages to use them,for example,classset must be input manually,attribute must be separated and the best class setis needed.This paper makes the following contributions to avoid these disadvantages:on the one hand,proposing a new concept of using information gain to lineGEP chromosome's head;on the other hand,proposing the algorithms of IG-GEPDTAR (constructing decision tree attribution reduction algorithms with gene expression programming based on information gain) and validate it by using experiment data.The result shows that decision tree constructed by IG-GEPDTAR is absolutely correct and better,it has less redundancies than Candida Ferreira's,has 82.9%less nodes than that of ID3 algorithms,and has 31.2% less nodes than that of Wang Chunnian's.

Key words: GEP (gene expression programming), information gain, decision tree induction, entropy

CLC Number: 

  • TP301.6
[1] HAN Jia-wei,KAMBER M.数据挖掘概念与技术[M].范明,孟小峰,译.北京:机械工业出版社,2007:188-199.
[2] 常志玲,周庆敏,杨清莲,等.基于粗糙集理论的决策树构造算法[J].南京工业大学学报:自然科学版,2005,7(4):80-83.
[3] 洪家荣,丁明峰,李星原,等.一种新的决策树归纳学习算法[J].计算机学报,1995,18(6):470-474.
[4] 刘华富,王仲.基于决策树的排序学习算法[J].郑州大学学报:理学版,2007,39(2):153-156.
[5] 郭玉滨.一种基于离散度的决策树改进算法[J].山东师范大学学报:自然科学版,2006,21(3):106-108.
[6] FERREIRA C.Gene expression programming:mathematical modeling byan artificial intelligence[M].Berlin:Springer-Verlag,2006:337-474.
[7] PENG Yu-zong,YUAN Chang-an,WANG yan,et al.SGDE-GEP:a novel algorithm of GEP[C]//International Conference on Computer Science and Software Engineering(CSSE 2008).Wuhan:[s.n.],2008:419-422.
[8] 王晓东.计算机算法设计与分析[M].北京:电子工业出版社,2001:26-28.
[9] 王春年,梁吉业.基于粗糙集与属性值聚类的决策树改进算法[J].计算机工程与应用,2007,43(31):178-181.
[1] LIU Dong, ZHOU Li, ZHENG Xiaoliang. A Very Short-term Electric Load Forecasting Based on SA-DBN [J]. Journal of Guangxi Normal University(Natural Science Edition), 2021, 39(4): 21-33.
[2] XU Jianmin, WEI Jia, SHOU Yanfang. Comprehensive Evaluation of Urban Road Traffic Operation StatusBased on Game Theory-Cloud Model [J]. Journal of Guangxi Normal University(Natural Science Edition), 2020, 38(4): 1-10.
[3] ZOU Yanli,YAO Fei,WANG Yang,WANG Ruirui,WU Lingjie. Critical Node Identification for Power Systems Based on Network Structure and Power Tracing [J]. Journal of Guangxi Normal University(Natural Science Edition), 2019, 37(1): 133-141.
[4] LIN Yue,LIU Tingzhang,WANG Zhehe. Quantity Optimization of Virtual Sample Generation with Two Kinds of Upper Bound Conditions [J]. Journal of Guangxi Normal University(Natural Science Edition), 2019, 37(1): 142-148.
[5] WEI Miaoluan, LIN Jiayin, LIU Ru′e, LUO Jie. Relationship between the Information Entropy of Visual Sense and Vection Induced by Virtual Rotational Drum [J]. Journal of Guangxi Normal University(Natural Science Edition), 2018, 36(2): 134-140.
[6] MENG Yuanyuan,WEI Bo,ZOU Yao. The Landmark Decision Based on a New Vague Soft Set FuzzyEntropy-Topsis Theory [J]. Journal of Guangxi Normal University(Natural Science Edition), 2017, 35(4): 39-48.
[7] XIAO Fayuan,LI Haowei. A Routing Optimization Algorithm for Wireless Sensor Network Based on Fuzzy Theory [J]. Journal of Guangxi Normal University(Natural Science Edition), 2017, 35(1): 37-43.
[8] WANG Kai-ming, ZHOU Hai-yan, GUO Jia-liang,
YANG Xiao-jing, WANG Gang, ZHONG Ning. Analysis of Depression Electroencephalogram Basedon Statistics Distribution Entropy [J]. Journal of Guangxi Normal University(Natural Science Edition), 2015, 33(2): 29-35.
[9] HOU Xiao-dong, CAI Bin-bin, JIN Wei-dong, DUAN Wang-wang. A New Weighted Evidence Fusion Algorithm Based on Evidence Distanceand Fuzzy Entropy Theory [J]. Journal of Guangxi Normal University(Natural Science Edition), 2015, 33(1): 45-51.
[10] LIU Hai-feng, XU Xin-ying, SHEN Xue-fen, XIE Jun. Attribute Reduction of Incomplete Mixed Decision System Based on Limited Neighborhood Relation [J]. Journal of Guangxi Normal University(Natural Science Edition), 2013, 31(3): 30-36.
[11] XU Li, DING Shi-fei, GUO Feng-feng. A Rough Kernel Clustering Algorithm Based on ImprovedAttribute Reduction [J]. Journal of Guangxi Normal University(Natural Science Edition), 2011, 29(3): 105-109.
[12] XU Jiu-cheng, LI Xiao-yan, LI Shuang-qun, ZHANG Ling-jun. Feature Images Retrieval Method of Tolerance Granular-basedMulti-level Texture [J]. Journal of Guangxi Normal University(Natural Science Edition), 2011, 29(1): 186-187.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!