Journal of Guangxi Normal University(Natural Science Edition) ›› 2011, Vol. 29 ›› Issue (1): 82-86.

Previous Articles     Next Articles

Design and Implementation of Parallel Decision Tree ClassificationBased on MapReduce

ZHU Min, WAN Jian-yi, WANG Ming-wen   

  1. College of Computer Information and Engineering,Jiangxi NormalUniversity,Nanchang Jiangxi 330022,China
  • Received:2010-12-14 Published:2018-11-16

Abstract: Decision tree classification is an effective classification method in data mining,but its performance is severely affected by large dataset.This paper addresses the design and implementation of a parallel decisiontree classification algorithm based on MapReduce programming model.Experiment results show that this implementation works better than implementation based on other parallel programming models while running on more nodes.

Key words: MapReduce, decision tree classification, SPRINT

CLC Number: 

  • TP181
[1] SHAFER J,AGRAWAL R,MEHTA M.SPRINT:a scalable parallel classifier for data mining[C]//Processing of the 22th International Conference on VLDB,Bombay,India.San Frasisco:Morgan Kaufmann Publishers,1996:544-555.
[2] 魏红宁.基于SPRINT方法的并行决策树分类研究[J].计算机应用,2005,25(1):39-41.
[3] 郭玉滨.一种基于离散度的决策树改进算法[J].山东师范大学学报:自然科学版,2006,21(3):129-131.
[4] 王鄂,李铭.云计算环境下的海量数据挖掘研究[J].现代计算机,2009(319):22-26.
[5] WAN Jian-yi,LI Xiao-ying.Approach of generating parallel programs from parallelized algorithm design strategies[J].The Journal of China Universities of Posts and Telecommunications,2008,15(3):128-132.
[6] DEAN J,GHEMAWAT S.MapReduce:simplified data processing on large clusters[J].Communications of the ACM,2008,51(1):107-113.
[7] AGRAWAL R,IMIELINSKI T,SWAI A.Database mining:a performance perspective[J].IEEE Transaction on Kn-owledge and Data Engineering,1993,5(6):914-925.
[1] HE Mingxian, XU Shulin, LI Shilin, LUO Shuyi, YANG Chunsheng,CHENG Rui, WU Zhengjun. Correlation between Locomotor Performance and Body Measurements of Captive Breeding Shinisaurus crocodilurus [J]. Journal of Guangxi Normal University(Natural Science Edition), 2020, 38(1): 120-126.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!