Noise Reduction using Character Density Approach
Jincymol Joseph1, J R Jeba2
1Jincymol Joseph, Department of Computer Science, St. Pius X College Rajapuram, Kasargod (Kerala), India.
2Dr. J R Jeba, Department of Computer Applications, Noorul Islam Centre for Higher Education, Kumaracoil (Tamil Nadu), India.
Manuscript received on 13 December 2018 | Revised Manuscript received on 22 December 2018 | Manuscript Published on 30 December 2018 | PP: 425-428 | Volume-8 Issue-2S, December 2018 | Retrieval Number: 100.1/ijeat.B10891282S18/18©BEIESP
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Web mining is the application of data mining to extract informative content from the World Wide Web (WWW), which has become one of the most significant information resources. A web page may contain informative as well as non-informative content. Non-informative content, such as headers, footers, advertisements, and copyright information, is called noisy data. A user needs only the main content. Web mining methods are useful for removing the noisy parts and extracting the main content from a web page. These methods save time and give users the information they need. This paper describes various methods for eliminating non-informative content from the large volume of data present on the World Wide Web.
Keywords: Noisy Data, Web Mining, Cluster, Outlier.
Scope of the Article: Web Mining