Top 10 algorithms in data mining
File version
Author(s)
Kumar, Vipin
Quinlan, J Ross
Ghosh, Joydeep
Yang, Qiang
Motoda, Hiroshi
McLachlan, Geoffrey J
Ng, Angus
Liu, Bing
Yu, Philip S
Zhou, Zhi-Hua
Steinbach, Michael
Hand, David J
Steinberg, Dan
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
Size
File type(s)
Location
License
Abstract
This paper presents the top 10 data mining algorithms identified by the IEEE International Conference on Data Mining (ICDM) in December 2006: C4.5, k-Means, SVM, Apriori, EM, PageRank, AdaBoost, kNN, Naive Bayes, and CART. These top 10 algorithms are among the most influential data mining algorithms in the research community. With each algorithm, we provide a description of the algorithm, discuss the impact of the algorithm, and review current and further research on the algorithm. These 10 algorithms cover classification, clustering, statistical learning, association analysis, and link mining, which are all among the most important topics in data mining research and development.
Journal Title
Knowledge and Information Systems
Conference Title
Book Title
Edition
Volume
14
Issue
1
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement
Item Access Status
Note
Access the data
Related item(s)
Subject
Information systems