An Efficient Bayesian Classifier in SQL with PCA

Main Article Content

DS.Bhupal Naik
S.Deva kumar, G Sridhar Reddy


As Bayesian classifier is a fundamental classification technique. We focus on an efficient Bayesian classifier programmed in sql
with PCA. We consider three classifiers: Naive Bayes and a classifier based on class decomposition using K-means clustering and Bayesian
classifier with dimensionality reduction technique PCA. We consider two complementary tasks: model computation and scoring a data set. We
introduce one of the dimensionality reduction techniques, Principal component analysis (PCA) to achieve more accuracy and reduction of
storage space. We study several layouts for tables and several indexing alternatives. We analyse how to transform equations into efficient SQL
queries. We also analyse how to calculate covariance matrix for PCA using SQL. We perform experiments on wbcancer and bscale datasets to
evaluate classification accuracy, query optimization & scalability. Our approach shows improvement in accuracy over the existing approach.

Keywords— Bayesian classification, principal component analysis, covariance matrix, Mean, variance.


Download data is not yet available.

Article Details