Comparative Analysis with Implementation of Cluster Based, Distance Based and Density Based Outlier Detection Techniques Using Different Healthcare Datasets

Main Article Content

HARSHADA CHANDRAKANT MANDHARE
SONALI R. IADATE

Abstract

Outliers is view as an error data in information which is turned into important crisis that has been investigated in various areas of study plus functional fields. Several outlier detection methods have been implemented to assured functional fields, whereas several methods are supplementary basic. Various functional areas are also investigated in severe privacy like study on offense as well as terrorist behaviors. Through the improvement in information skills, the numeral of records, plus their measurement as well as difficulty, raise fast, that outcome in the need of computerized examination of huge quantity of various ordered data. For this intention, different data mining systems are utilized. The objective of these types of systems is to detect unseen dependencies from the records. Outlier detection in data mining is the detection of objects, remarks or observations that doesn’t match to a predictable sample in a set of record. This detection technique is more beneficial in the several areas such as health trade, offense finding, fake operation, community protection and so on. In this paper we have studied different outlier detection algorithms such as Cluster based outlier detection, Distance based outlier detection plus Density based outlier detection. Result experimentation is done on different four dataset to identify the outliers and the comparative result shows that the cluster based methods are efficient for calculation of clusters and density-based outlier detection algorithm offers improved accuracy and faster execution for identification of outliers than other two outlier detection algorithm.

Downloads

Download data is not yet available.

Article Details

Section
Articles

References

Kamal Malik, H.Sadawarti, Member IEEE, Kalra G.S., Member IEEE,†Comparative Analysis of Outlier Detection Techniquesâ€, International Journal of Computer Applications (0975 – 8887) Volume 97– No.8, July 2014.

Dr. S.Vijayarani, Ms. P. Jothi, “Comparative Analysis of Clustering Algorithms for Outlier Detection in Data Streamsâ€, International journal of engineering sciences & research technology, issn: 2277-9655, 2013.

Armin Daneshpazhouh, Ashkan Sami, “Entropy-based outlier detection using semi-supervised approach with few positive examples†, 0167-8655, 2014 Elsevier B.V.

Jihyun Ha, Seulgi Seok, Jong-Seok Lee, “Robust outlier detection using the instability factor†, Knowledge-Based Systems 63 (2014) 15–23, _ 2014 Elsevier B.V.

Seung Kim, Nam Wook Cho, Bokyoung Kang, Suk-Ho Kang, “Fast outlier detection for very large log dataâ€, Expert Systems with Applications 38 (2011) 9587–9596, 2011 Elsevier Ltd.

Peng Yang, Qingsheng Zhu, “Finding key attribute subset in dataset for outlier detectionâ€, Knowledge-Based Systems 24 (2011) 269–274, 2010 Elsevier B.V.

Ye Wang, Srinivasan Parthasarathy, Shirish Tatikonda, “Locality Sensitive Outlier Detection: A Ranking Driven Approachâ€, ICDE Conference 2011, IEEE.

Sheng-yi Jiang,Qing-bo An, “Clustering-Based Outlier Detection Methodâ€, Fifth International Conference on Fuzzy Systems and Knowledge Discovery, 2008 IEEE DOI 10.1109/FSKD.

Zhipeng Liu, Dechang Pi, and Jinfeng Jiang, “Density-based trajectory outlier detection algorithmâ€, Journal of Systems Engineering and Electronics Vol. 24, No. 2, April 2013, pp.335–340.

Haowen Guan, Qingzhong Li, “SLOF: Identify Density-based Local Outliers in Big Dataâ€, 2015 12th Web Information System and Application Conference , IEEE DOI 10.1109/WISA.

Mohiuddin Ahmed and Abdun Naser Mahmood, “A Novel Approach for Outlier Detection and Clustering Improvementâ€, 2013 IEEE 8th Conference on Industrial Electronics and Applications (ICIEA).

Ana arribas-gil and Juan romo, “Shape outlier detection and visualization for functional data: the outliergramâ€, Biostatistics Advance Access published March 11, 2014.

Saptarsi Goswami, Samiran Ghosh, and Amlan Chakrabarti, “Outlier Detection Techniques for SQL and ETL Tuningâ€, International Journal of Computer Applications (0975 – 8887), Volume 23– No.8, June 2011.

Bo Liu, Yanshan Xiao, Philip S. Yu, Zhifeng Hao, Longbing Cao, “An Efficient Approach for Outlier Detection with Imperfect Data Labelsâ€, IEEE transactions on knowledge and data engineering, 1041-4347/13/$31.00 © 2013 ieee.

Manzoor Elahi, Kun Li, Wasif Nisar, Xinjie Lv, Hongan Wang, “Efficient Clustering-Based Outlier Detection Algorithm for Dynamic Data Streamâ€, Fifth International Conference on Fuzzy Systems and Knowledge Discovery, 2008 IEEE.

Jingke Xi, “Outlier Detection Algorithms in Data Miningâ€, Second International Symposium on Intelligent Information Technology Application, 2008 IEEE.

Christy.A, MeeraGandhi.G, S. Vaithyasubramanian, “Cluster Based Outlier Detection Algorithm For Healthcare Dataâ€, 2nd International Symposium on Big Data and Cloud Computing, Published by Elsevier B.V. 2015.

UCI machine repository- link- https://archive.ics.uci.edu/ml/datasets.html