Performance Exploration on Various Document Clustering Techniques with K-Means Family
V.Kumaresan1, R.Nagarajan2
1V.Kumaresan*1 , Assistant Professor, Annamalai University,Tamilnadu, India.
2Dr. R. Nagarajan*2, Assistant Professor, Annamalai University, Annamalai Nagar, India.
Manuscript received on November 22, 2019. | Revised Manuscript received on December 15, 2019. | Manuscript published on December 30, 2019. | PP: 2852-2857 | Volume-9 Issue-2, December, 2019. | Retrieval Number: B3462129219/2019©BEIESP | DOI: 10.35940/ijeat.B3462.129219
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Clustering performs a important position in numerous fields which include Computer science & packages, facts, pattern reputation, system studying technique and find out dating among the files. Clustering focuses on document clustering, and other related area. Increase within the extent of statistics saved in virtual form (text, photograph, audio) has improved the need for requirement of an automated tool, that allows people to find and manage the records in an efficient way. Usually clustering refer to document clustering technique investigates the documents and find its relation. This paper center of attention on the a range of clustering methods and evaluation its overall performance. This paper also categories the document clustering techniques as three major groups, namely Group K-means, Expectation Maximization and Semantic-based techniques (Hybrid method). Several experiments were conducted to analyse the performance accuracy and Speed.
Keywords: K means, K*, Hybrid, data set, bisection.