An Effective Method of Dimensionality Reduction for High Dimensional Datasets Using PCA


D. Napoleon
S. Pavalakodi


Data mining employs a variety of traditional statistical methods such as cluster analysis, discriminant analysis, logistic regression, and time series forecasting. Because many datasets are very high dimensional, dimensionality reduction has drawn special attention for this type of analysis. Feature extraction can be viewed as a preprocessing step that removes distracting variance from a dataset so that clustering algorithms, classifiers, and estimators perform better. In this paper, principal component analysis, a linear transformation, is used for dimensionality reduction; the k-means clustering algorithm is then applied to the reduced data, and the results are presented.




Keywords: Principal component analysis, dimensionality reduction, k-means clustering.
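
The pipeline described in the abstract (PCA for dimensionality reduction, then k-means on the reduced data) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `pca_reduce` and `kmeans`, the synthetic two-cluster dataset, the choice of two components, and the farthest-point initialization of the centroids are all assumptions made here for illustration.

```python
import numpy as np


def pca_reduce(X, n_components):
    """Project X (n_samples, n_features) onto its top principal components."""
    X_centered = X - X.mean(axis=0)
    # SVD of the centered data: the rows of Vt are the principal directions,
    # ordered by decreasing singular value (i.e. decreasing variance).
    _, _, Vt = np.linalg.svd(X_centered, full_matrices=False)
    return X_centered @ Vt[:n_components].T


def kmeans(X, k, n_iter=100, seed=0):
    """Lloyd's k-means with farthest-point initialization (an assumed choice)."""
    rng = np.random.default_rng(seed)
    # First centroid: a random data point. Each further centroid: the point
    # farthest from all centroids chosen so far.
    centroids = [X[rng.integers(len(X))]]
    for _ in range(1, k):
        d = np.min([((X - c) ** 2).sum(axis=1) for c in centroids], axis=0)
        centroids.append(X[d.argmax()])
    centroids = np.array(centroids)

    for _ in range(n_iter):
        # Assignment step: nearest centroid by squared Euclidean distance.
        dists = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        # Update step: mean of each cluster (keep old centroid if empty).
        new_centroids = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return labels, centroids


# Synthetic high-dimensional data (assumed): two Gaussian blobs in 50 dimensions.
rng = np.random.default_rng(42)
blob_a = rng.normal(loc=0.0, scale=1.0, size=(100, 50))
blob_b = rng.normal(loc=5.0, scale=1.0, size=(100, 50))
X = np.vstack([blob_a, blob_b])

Z = pca_reduce(X, n_components=2)      # 50 features reduced to 2
labels, centroids = kmeans(Z, k=2)     # cluster in the reduced space
print(Z.shape, np.bincount(labels))
```

Clustering runs on the two-dimensional projection `Z` rather than on the original 50 features, which is the point of the preprocessing step: the distance computations become cheaper, and variance along the discarded directions no longer influences the cluster assignment.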

