| Downloads | Citations | Reads |
| 5,832 | 415 | 226 |
Abstract: Decision trees are an important method in inductive learning and data mining, commonly used to build classifiers and predictive models. This paper gives an overview of decision tree classification algorithms and identifies their two core techniques: selection of the test attribute and tree pruning. By analyzing and comparing representative classification algorithms used in data mining today, it summarizes the characteristics of each algorithm, giving users a basis for choosing an algorithm and researchers a basis for improving one. Finally, a worked example shows how decision tree classification is applied in real production.
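The first core technique named in the abstract, selection of the test attribute, can be illustrated with a minimal sketch. It assumes information gain (entropy reduction) as the selection criterion, as in ID3-style algorithms. The abstract does not say which criterion the paper itself favors, so the criterion, the toy weather data, and the function names `entropy`, `information_gain`, and `best_attribute` are all illustrative assumptions rather than the authors' method.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(rows, labels, attr):
    """Entropy reduction obtained by splitting on attribute index `attr`."""
    # Group the class labels by the value each row takes on the attribute.
    partitions = {}
    for row, label in zip(rows, labels):
        partitions.setdefault(row[attr], []).append(label)
    # Weighted average entropy of the partitions after the split.
    remainder = sum(len(p) / len(labels) * entropy(p) for p in partitions.values())
    return entropy(labels) - remainder


def best_attribute(rows, labels):
    """Index of the attribute with the highest information gain."""
    return max(range(len(rows[0])), key=lambda a: information_gain(rows, labels, a))


# Hypothetical toy data: each row is (outlook, humidity); the label is "play?".
rows = [("sunny", "high"), ("sunny", "normal"), ("rain", "high"), ("rain", "normal")]
labels = ["no", "yes", "no", "yes"]

chosen = best_attribute(rows, labels)  # index of the attribute to test at the root
```

In this toy data the humidity column separates the two classes perfectly while outlook carries no information, so the sketch selects humidity as the root test. Tree pruning, the second core technique, is a separate step that this sketch does not cover.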
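Basic information:
Chinese Library Classification (CLC) number: TP18
Citation information:
[1] Yang Xuebing, Zhang Jun. Decision tree algorithms and their core techniques [J]. Computer Technology and Development (计算机技术与发展), 2007(01): 43-45.
Funding information:
Key project supported by the Natural Science Foundation of the Anhui Provincial Department of Education (2004KJ053ZD)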
2007-01-10
2007-01-10