Clustering Algorithms on Diabetes Data: Comparative Case Study

UshaG Biradar, Dr. Deepa S Mugali


Data Clustering is used to extract meaningful information and plays a vital role in data mining. Its main job is to group the similar data together based on the characteristic they possess. This paper represents the performance of three clustering algorithms such as Hierarchical clustering, EM and K Means clustering algorithm.
The Diabetes dataset is used for the comparison of those clustering algorithms based on the performance. This comparative study focuses on use of data mining tool to analyze a previously obtained data set using Weka and Tanagra. The results were compared to find algorithm yields and best result presented.


Data Mining, Cluster techniques, Hierarchical clustering, EM and K Means clustering algorithm, Weka, Tanagra.

Full Text:




  • There are currently no refbacks.

Copyright (c) 2017 International Journal of Advanced Research in Computer Science