Ghazal, T M, Afifi, M A M and Kalra, D (2020) Data Mining and Exploration: A Comparison Study among Data Mining Techniques on Iris Data Set. Talent Development & Excellence, 12 (1). pp. 3854-3861. ISSN 1869-0459
1339-articletext-2388-1-10-20200610.pdf - Published Version
Download (432kB)
Abstract
This work aims at investigating the efficiency of diverse methods of classification through the use of WEKA software for the well-known Iris data set. For the assessment of the classification algorithm performance, this paper adopted the use of Receiver Operating Characteristic (ROC) curves. The different classification algorithm techniques used for this work include neural networks, naïve Bayes and decision trees. The data set used in our investigation, Iris data, is one of the oldest and widely used data sets in data mining. For the three techniques of classification used in this study, a comparison of the ROC curves used in this study indicate that the Neural Network (NN) is the most appropriate method of evaluation investigated in this work. The other two methods, Bayes network classifier and decision trees, have their classical procedures for classification that might need to improve significantly.
Affiliation: | Skyline University College |
---|---|
SUC Author(s): | Ghazal, T M ORCID: https://orcid.org/0000-0002-7202-5165, Afifi, M A M and Kalra, D |
All Author(s): | Ghazal, T M, Afifi, M A M and Kalra, D |
Item Type: | Article |
Uncontrolled Keywords: | Data Mining, Iris data, Decision Trees, Naïve Bayes, Neural Networks, ROC Curve |
Subjects: | B Information Technology > BT Data Management |
Divisions: | Skyline University College > School of IT |
Depositing User: | Mr SUC Library |
Date Deposited: | 08 Aug 2022 14:32 |
Last Modified: | 08 Aug 2022 14:32 |
URI: | https://research.skylineuniversity.ac.ae/id/eprint/514 |
Publisher URL: | |
Publisher OA policy: | |
Related URLs: |
Actions (login required)
Statistics for this ePrint Item |