PREDICTION ACCURACY COMPARISON OF PREDICTIVE MODELS USING MACHINE LEARNING FOR DIABETES DATA SET

Doreswamy G S
Santosh Kumar J and Nandish

Abstract

Diagnosing diabetes at an early stage is important for effective treatment. Today, equipment such as sensors is used to detect disease, and accurate classification techniques are necessary for the automatic detection of disease samples. This study uses data mining techniques to classify diabetes patients. Five algorithms, including Logistic Regression (LR), Artificial Neural Network (ANN), Support Vector Machine (SVM), and Random Forest, were implemented for classification on the R platform. Classification and prediction on medical datasets pose real challenges in data mining, and LR, ANN, SVM, and Random Forest are commonly used to address them. LR examines the relationship between a categorical outcome and a set of explanatory variables: one or more independent variables determine the outcome. An ANN is loosely modeled on the human brain: information is processed by simple elements called neurons, and signals are transmitted between neurons. The experimental results show that, on the diabetes dataset, the neural network (NN) with 10 folds using percentage split achieves a prediction accuracy of 84.52%.
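The abstract names two evaluation schemes, percentage split and 10-fold evaluation, and describes logistic regression as relating a categorical outcome to a set of variables. The following is a minimal sketch of those ideas in Python, not the authors' R implementation. All function names are hypothetical, the 66% training fraction and the gradient-descent settings are assumptions, and the data is synthetic, so the sketch makes no attempt to reproduce the reported 84.52%.

```python
import math
import random


def sigmoid(z):
    """Logistic function mapping a real score to a probability in (0, 1)."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)  # stable branch for negative scores
    return e / (1.0 + e)


def train_logistic_regression(X, y, learning_rate=0.1, epochs=200):
    """Fit weights and a bias by batch gradient descent on the log-loss."""
    n_features = len(X[0])
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for row, label in zip(X, y):
            score = sum(w * v for w, v in zip(weights, row)) + bias
            error = sigmoid(score) - label  # predicted probability minus truth
            for j, v in enumerate(row):
                grad_w[j] += error * v
            grad_b += error
        weights = [w - learning_rate * g / len(X) for w, g in zip(weights, grad_w)]
        bias -= learning_rate * grad_b / len(X)
    return weights, bias


def predict(model, row):
    """Return class 1 when the predicted probability is at least 0.5."""
    weights, bias = model
    score = sum(w * v for w, v in zip(weights, row)) + bias
    return 1 if sigmoid(score) >= 0.5 else 0


def accuracy(model, X, y):
    """Fraction of rows whose predicted class matches the label."""
    correct = sum(predict(model, row) == label for row, label in zip(X, y))
    return correct / len(y)


def percentage_split_accuracy(X, y, train_fraction=0.66):
    """Train on the first train_fraction of the rows and test on the rest."""
    cut = int(len(X) * train_fraction)
    model = train_logistic_regression(X[:cut], y[:cut])
    return accuracy(model, X[cut:], y[cut:])


def k_fold_indices(n_samples, k):
    """Split range(n_samples) into k contiguous folds of near-equal size."""
    base, extra = divmod(n_samples, k)
    folds, start = [], 0
    for i in range(k):
        size = base + (1 if i < extra else 0)
        folds.append(list(range(start, start + size)))
        start += size
    return folds


def cross_validation_accuracy(X, y, k=10):
    """Average test accuracy over k folds, each fold held out exactly once."""
    scores = []
    for fold in k_fold_indices(len(X), k):
        held_out = set(fold)
        train_X = [X[i] for i in range(len(X)) if i not in held_out]
        train_y = [y[i] for i in range(len(X)) if i not in held_out]
        model = train_logistic_regression(train_X, train_y)
        scores.append(accuracy(model, [X[i] for i in fold], [y[i] for i in fold]))
    return sum(scores) / len(scores)


def make_synthetic_data(n_samples=200, seed=0):
    """Synthetic two-feature data (an assumption, NOT the diabetes dataset)."""
    rng = random.Random(seed)
    X = [[rng.uniform(-1, 1), rng.uniform(-1, 1)] for _ in range(n_samples)]
    y = [1 if row[0] + row[1] > 0 else 0 for row in X]
    return X, y


if __name__ == "__main__":
    X, y = make_synthetic_data()
    print("percentage split accuracy:", percentage_split_accuracy(X, y))
    print("10-fold accuracy:", cross_validation_accuracy(X, y))
```

A percentage split gives a single estimate from one held-out portion, whereas the k-fold scheme averages over k held-out portions so that every sample is tested exactly once. In practice the rows would normally be shuffled before splitting; the synthetic rows here are already in random order.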


Doreswamy G S et al, International Journal of Advanced Research in Computer Science, 9 (Special Issue III), May 2018, 86-88

Conference Paper: Third National Conference on “Advances in Computing and Information Technology”, organized by the School of Computing and Information Technology, REVA University, Bengaluru, India