Machine Learning Techniques for Text Classification: A Survey
Abstract
Text classification is one of the hot or trending topicsdue to its real time applications. Text classification comes under natural language processing technique that helps in classifying the text into different categories based on context, type of words used in a sentence. Text classification plays a significant part in the sentiment analysis, information retrieval, email filtering, text summarization, spam detection etc. Text can be classified manually or automatically by using different machine learning techniques. This paper talks about the different data pre-processing technique such as number of words in the document(Term-Frequency), importance of each word in the document (TF-IDF) technique used in text classification. In this paper a comparativestudy of different machine learning algorithms based on metrics like performance, accuracy for text classification is presented.



