African Journal of Mathematics and Computer Science Research

  • Abbreviation: Afr. J. Math. Comput. Sci. Res.
  • Language: English
  • ISSN: 2006-9731
  • DOI: 10.5897/AJMCSR
  • Start Year: 2008
  • Published Articles: 254

Full Length Research Paper

Comparison of open source data mining softwares on a data set

  • Abdullah BAYKAL
  • Department of Mathematics, Faculty of Science, Dicle University, Diyarbakır, Turkey.
  • Cengiz COSKUN
  • Department of Economics, Faculty of Economics and Administrative Sciences, Dicle University, Diyarbakır, Turkey.

  •  Received: 22 September 2018
  •  Accepted: 01 November 2018
  •  Published: 30 November 2018


Data mining is the process of extracting informative and useful rules or relations from existing data, which can then be used to predict the values of new instances. A wide range of commercial and open source software is used for data mining. This study compares several classification algorithms included in open source tools such as WEKA, Tanagra and Scikit-learn, using the SEER (Surveillance, Epidemiology, and End Results) data set, which consists of 60948 instances.

Key words: Data mining, classification analysis, open source data mining tools.
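
As an illustration of the kind of comparison described in the abstract, the following minimal sketch scores a few classifiers with cross-validation in Scikit-learn. It is not the authors' experimental setup: the three algorithms, the 10-fold scheme, the helper name compare_classifiers and the synthetic data produced by make_classification are all assumptions made for illustration, and the SEER data set is not used here.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier


def compare_classifiers(X, y, folds=10):
    """Return the mean cross-validated accuracy of each classifier.

    The choice of classifiers is an assumption for illustration only;
    the abstract does not list the algorithms that were compared.
    """
    classifiers = {
        "decision_tree": DecisionTreeClassifier(random_state=0),
        "naive_bayes": GaussianNB(),
        "k_nearest_neighbors": KNeighborsClassifier(n_neighbors=5),
    }
    return {
        name: cross_val_score(clf, X, y, cv=folds, scoring="accuracy").mean()
        for name, clf in classifiers.items()
    }


# Synthetic stand-in data (assumption): a small binary classification problem.
X, y = make_classification(
    n_samples=500, n_features=10, n_informative=5, random_state=0
)

scores = compare_classifiers(X, y)

# Print the classifiers from highest to lowest mean accuracy.
for name, accuracy in sorted(scores.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: {accuracy:.3f}")
```

Evaluating every classifier on the same folds of the same data keeps the accuracies comparable across algorithms; a comparison across different tools would additionally need matching preprocessing and parameter settings in each tool.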