Analisa Sentimen Menggunakan Data Twitter, Flume, Hive Pada Hadoop dan Java Untuk Deteksi Kemacetan di Jakarta

Authors

  • Nurhayati Buslim UIN Syarif Hidayatullah Jakarta, Indonesia http://orcid.org/0000-0002-6564-6641
  • Busman Busman Sekolah Tinggi Ilmu Ekonomi (STIE) Gotong Royong Jakarta, Indonesia
  • Nadika Sigit Sinatrya Syarif Hidayatullah State Islamic University Jakarta, Indonesia
  • Tifani Shallynda Kania Syarif Hidayatullah State Islamic University Jakarta, Indonesia

DOI:

https://doi.org/10.15575/join.v3i1.141

Keywords:

Sentiment Analysis, Big Data, Twitter Data, Congestion Detection

Abstract

Berikut adalah paragraf yang sudah dirapikan:

Traffic congestion in big cities in Indonesia is unavoidable, especially in Jakarta. The increasing number of vehicles and the lack of public transportation are the main causes of traffic congestion in Jakarta. It disturbs people's activities. The government has already made various efforts to resolve the congestion problem; however, it requires high installation and maintenance costs and takes time to be implemented. People often complain about traffic congestion in Jakarta by posting on Twitter, which are called tweets. Every tweet posted is saved in the Twitter API and used for sentiment analysis. It analyzes the emotions of the users. Based on these problems, we conducted research on how to detect traffic congestion in Jakarta. Therefore, we tried to create a Congestion Detection App. We designed the app using UML diagrams. The Congestion Detection App is connected with Hadoop, Flume, Hive, and Derby. The app streams Twitter data collected by connecting with the Twitter API. This app is a Java-based application that can create and view data tables. It performs searches on tweets by ID and analyzes traffic conditions in specific regions in Jakarta. The app performs sentiment analysis on certain tweets and displays the results based on the data tables. The result of the research compares data from the Congestion Detection App with data from Google Maps. We made three value categories consisting of three colors: green for less traffic congestion with a value of 1, orange for medium-scale traffic congestion with a value of 2, and red for heavy traffic congestion with a value of 3. Based on these three categories and values, we used four regions as samples and compared the values with those from Google Maps data to determine accuracy. We achieved an 81% average accuracy from the four samples. The data from the tweet samples compared with Google Maps data showed significant congestion detection with the Congestion Detection App.

Author Biographies

Nurhayati Buslim, UIN Syarif Hidayatullah Jakarta

Informatics engineering Department, Sains and Technology Faculty Syarif Hidayatullah State Islamic University

Busman Busman, Sekolah Tinggi Ilmu Ekonomi (STIE) Gotong Royong Jakarta

Management Department

Nadika Sigit Sinatrya, Syarif Hidayatullah State Islamic University Jakarta

Department of Informatics Engineering, Sains and Technologu Faculty

Tifani Shallynda Kania, Syarif Hidayatullah State Islamic University Jakarta

Department of Informatics Engineering, Sains and Technologu Faculty

References

S. Bhosale and S. Kokate, “Traffic Detection Using Tweets on Twitter Social Network,” Int. J. Sci. Res., vol. 14611, no. 12, pp. 2319–7064, 2013.

R. P. Agus, “IPTEK: Solusi Komprehensif Atasi Kemacetan Lalu Lintas.” [Online]. Available: http://dikti.go.id/iptek-solusi-komprehensif-atasi-kemacetan-lalu-lintas/. [Accessed: 22-May-2017].

S. Kamran, M. Shaikh, A. Naseem, and P. Kamble, “Exploiting Social Media Data for Traffic Monitoring Using the Techniques of Data Mining,” Int. J. Innov. Res. Comput. Commun. Eng., vol. 4, no. 3, 2016.

R. S. Aisah, “Analisis Klasifikasi Sentimen Twitter terhadap Kinerja Layanan Provider Telekomunikasi menggunakan varian Naïve Bayes,” Institut Pertanian Bogor, 2016.

S. Shinde and P. S. Jagtap, “Big Data Solution for Improving Traffic Management System Effectively for Vehicular Networks,” Int. J. Adv. Res. Comput. Commun. Eng., vol. 5, no. 6, pp. 800–805, 2016.

M. Vadivukarassi, N. Puviarasan, and P. Aruna, “Sentimental Analysis of Tweets Using Naive Bayes Algorithm,” World Appl. Sci. J., vol. 35, no. 1, pp. 54–59, 2017.

H. Rajwani, S. Somvanshi, A. Upadhye, R. Vaidya, and T. Dange, “Dynamic Traffic Analyzer Using Twitter,” Int. J. Recent Innov. Trends Comput. Commun., vol. 4, no. 10, pp. 2013–2016, 2015.

M. Cindy, “What Is Twitter API,” 2015. [Online]. Available: https://www.quora.com/What-is-Twitter-API. [Accessed: 22-May-2017].

Sangeeta, “Twitter Data Analysis Using FLUME & HIVE on Hadoop Framework,” vol. 4, no. 2, pp. 119–123, 2016.

R. Wardhani and M. H. Yaqin, “Game Dasar-Dasar Hukum Islam Dalam Kitab Mabadi ’ ul Fiqh Jilid I,” Teknika, vol. 5, no. 2, pp. 473–478, 2013.

R. Margaret, “Definition NetBeans,” 2014. [Online]. Available: http://www.theserverside.com/definition/NetBeans. [Accessed: 03-Jul-2017].

Anonim, “NetBeans IDE 8.2 Information,” 2016. [Online]. Available: https://netbeans.org/community/releases/82/. [Accessed: 03-Jul-2017].

Y. Emma, Sharia Banking Information System Analysis and Design. CCIT-FTUI, 2012.

Published

2018-06-30

Issue

Section

Article

Citation Check