Survey on Preprocessing in Web Server Log Files

Main Article Content

Priyanka V. Patil
Mrs. Ujawala M. Patil

Abstract

The World Wide Web is a system of hypertext documents accessed via the Internet. In that web pages may contain text, images, videos, and other multimedia and navigate between them via hyperlinks. World Wide Web gives large information to internet user. World Wide Web is a huge repository of web pages and links. When user accesses websites are recorded in web logs file. Web server log file is a simple plain text file. Display of log file data in different format like W3C Extended log file format, NCSA common log file format, IIS log file format. To improve quality of data, log file should be preprocessed. Log files usually contain noisy and unnecessary data. Preprocessing reduce log file size also increase quality of available data. log file is input for mining algorithm. It gives detailed discussion about web log file, web log file format. In this paper we survey about data preprocessing of web log file.


Keywords: preprocessing; web log file; web log file format.

Downloads

Download data is not yet available.

Article Details

Section
Articles