Text Categorization Techniques and Current Trends
Abhisu Jain1, Aditya Goyal2, Vikrant Singh3, Anshul Tripathi4, Saravanakumar Kandasamy5
1Abhisu Jain*, Student, Department of Computer Science and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India.
2Aditya Goyal, Student, Department of Computer Science and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India.
3Vikrant Singh, Student, Department of Computer Science and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India.
4Anshul Tripathi, Student, Department of Computer Science and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India.
5Saravanakumar Kandasamy, Assistant Professor, School of Information Technology and Engineering, Vellore Institute of Technology, Vellore campus, Tamil Nadu, India.
Manuscript received on April 11, 2020. | Revised Manuscript received on May 15, 2020. | Manuscript published on June 30, 2020. | PP: 335-345 | Volume-9 Issue-5, June 2020. | Retrieval Number: E9620069520/2020©BEIESP | DOI: 10.35940/ijeat.E9620.069520
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: With the development of online data, text categorization has become one of the key procedures for taking care of and sorting out content information. Text categorization strategies are utilized to order reports, to discover fascinating data on the world wide web. Text Categorization is a task for categorizing information based on text and it has been important for effective analysis of textual data frameworks. There are systems which are designed to analyse and make distinctions between meaningful classes of information and text, such system is known as text classification systems. The above-mentioned system is widely accepted and has been used for the purpose of retrieval of information and natural language processing. The archives can be ordered in three different ways unsupervised, supervised and semi supervised techniques. Text categorization alludes to the procedure of dole out a classification or a few classes among predefined ones to each archive, naturally. For the given text data, these words that can be expressed in the correct meaning of a word in different documents are usually considered as good features. In the paper, we have used certain measures to ensure meaningful text categorization. One such method is through feature selection which is the solution proposed in this paper which does not change the physicality of the original features. We have taken into account all meaningful features to distinguish between different text categorization approaches and highlighted the evaluation metrics, advantages and limitations of each approach. We conclusively studied the working of several approaches and drew conclusion of best suited algorithm by performing practical evaluation. We are going to review different papers on the basis of different text categorization sections and a comparative and conclusive analysis is presented in this paper. This paper will present classification on various kinds of ways to deal and compare with text categorization.
Keywords: Attention Mechanism, BRCAN, Convolutional Neural Network, Feature Evaluation Function, Few Short