Performance Analysis of Twitter Data Extraction Using Apache Flume

  • Harbhajan Singh, Vijay Dhir

Abstract

Social media usage has risen tremendously in recent years, making huge volumes of data available online for analysis. Various organisations use this data to study consumer patterns and behaviour. Sentiment analysis classifies such data as expressing positive, negative or neutral sentiment. Healthcare centres can use sentiment analysis to study patient behaviour and serve patients better with regard to treatment, diet and medicine. In this paper, we extract tweets through a Twitter agent in Apache Flume using different block sizes, store them in HDFS, and study the effect of block size on the speed of data generation.

Published
2020-02-11
How to Cite
Singh, H., & Dhir, V. (2020). Performance Analysis of Twitter Data Extraction Using Apache Flume. International Journal of Advanced Science and Technology, 29(3), 2374-2383. Retrieved from http://sersc.org/journals/index.php/IJAST/article/view/4332
Section
Articles