Toward Representing Automatic Knowledge Discovery from Social Media Contents Based on Document Classification

  • Zeinab Shahbazi, Yung Cheol Byun, Dong Cheol Lee

Abstract

Representing documents is one of the critical steps in natural language processing and text mining that focus on converting unstructured to structured documents with numeric vectors to get access to machine learning and data mining algorithms. Bag of word (BOW) model is an adopted text representation system in document classification. Based on BOW, document demonstrate as fixed-length. This process means word dimensions presented as a numerical value that is defined as TF-IDF or word frequency.  In this paper, we analyze, the combination of Bag-of-Concept (BOC) and BOW demonstration applying attention mechanism to operate information of word-level and concept-level to achieve the optimal performance of document classification.

Published
2020-03-30
How to Cite
Zeinab Shahbazi, Yung Cheol Byun, Dong Cheol Lee. (2020). Toward Representing Automatic Knowledge Discovery from Social Media Contents Based on Document Classification . International Journal of Advanced Science and Technology, 29(3), 14089 - 14096. Retrieved from http://sersc.org/journals/index.php/IJAST/article/view/31840
Section
Articles