Analisa Sentimen Menggunakan Data Twitter, Flume, Hive Pada Hadoop dan Java Untuk Deteksi Kemacetan di Jakarta

Authors

  • Nurhayati Buslim UIN Syarif Hidayatullah Jakarta, Indonesia http://orcid.org/0000-0002-6564-6641
  • Busman Busman Sekolah Tinggi Ilmu Ekonomi (STIE) Gotong Royong Jakarta, Indonesia
  • Nadika Sigit Sinatrya Syarif Hidayatullah State Islamic University Jakarta, Indonesia
  • Tifani Shallynda Kania Syarif Hidayatullah State Islamic University Jakarta, Indonesia

DOI:

https://doi.org/10.15575/join.v3i1.141

Keywords:

Sentiment Analysis, Big Data, Twitter Data, Congestion Detection

Abstract

Traffic congestion big cities in Indonesia is unavoidable, especially in Jakarta. The increasing number of vehicle and the lack of public transportation is the main cause of traffic congestion in Jakarta. It disturb people activities. Government already did various efforts to resolve congestion problem, however it needs high installation, maintenance cost and need time to be implemented. Peoples often complained about traffic congestion in Jakarta by posting in Twitter which called tweets. Every tweets post are saved in API Twitter and used for sentiment analysis. It analyzed emotion of the user. Based of problems we do research how  to detect traffic congestion in Jakarta. Therefore, we try to makes Congestion Detection App. We design the app using UML diagrams. Congestion Detection App is connected with Hadoop, Flume, Hive and Derby. The app stream twitters data to colected by connecting with API Twitter. This app is Java-based application which can makes and view data tables. It  performance searching tweets data by ID and analyze traffic condition on a certain region in Jakarta. The perform sentiment analysis to a certain tweet and display the result based on the data table. The result of research is comparing Data from Congestion Detection App with data from Google Maps. We make three valus categories which consist of three colors: green for less traffic congestion have a value of 1. Orange for medium-scale traffic congestion has value of 2 and Red for heavily traffic congestion has a value of 3.  Based on three categories and value we use 4 regions for sample and comparing the values with value from Google Maps Data to get the accuracy. We got 81% average accuracy from the four samples. The result of Data from tweet sample compared with Google Maps Data. It  have big detected congestion with Congestion Detection App.

Author Biographies

Nurhayati Buslim, UIN Syarif Hidayatullah Jakarta

Informatics engineering Department, Sains and Technology Faculty Syarif Hidayatullah State Islamic University

Busman Busman, Sekolah Tinggi Ilmu Ekonomi (STIE) Gotong Royong Jakarta

Management Department

Nadika Sigit Sinatrya, Syarif Hidayatullah State Islamic University Jakarta

Department of Informatics Engineering, Sains and Technologu Faculty

Tifani Shallynda Kania, Syarif Hidayatullah State Islamic University Jakarta

Department of Informatics Engineering, Sains and Technologu Faculty

References

S. Bhosale and S. Kokate, “Traffic Detection Using Tweets on Twitter Social Network,†Int. J. Sci. Res. ISSN (Online Index Copernicus Value Impact Factor, vol. 14611, no. 12, pp. 2319–7064, 2013.

R. P. Agus, “IPTEK: Solusi Komprehensif Atasi Kemacetan Lalu Lintas.†[Online]. Available: http://dikti.go.id/iptek-solusi-komprehensif-atasikemacetan-lalu-lintas/. [Accessed: 22-May-2017].

S. Kamran, M. Shaikh, A. Naseem, and P. Kamble, “Exploiting Social Media Data for Traffic Monitoring Using the Techniques of Data Mining,†Int. J. Innov. Res. Comput. Commun. Eng., vol. 4, no. 3, 2016.

R. S. Aisah, “Analisis Klasifikasi Sentimen Twitter terhadap Kinerja Layanan Provider Telekomunikasi menggunakan varian Naïve Bayes,†Institut Pertanian Bogor, 2016.

S. Shinde and P. S. Jagtap, “Big data solution for improving traffic management System effectively for vehicular networks,†Int. J. Adv. Res. Comput. Commun. Eng., vol. 5, no. 6, pp. 800–805, 2016.

M. Vadivukarassi, N. Puviarasan, and P. Aruna, “Sentimental Analysis of Tweets Using Naive Bayes Algorithm,†World Appl. Sci. J., vol. 35, no. 1, pp. 54–59, 2017.

H. Rajwani, S. Somvanshi, A. Upadhye, R. Vaidya, and T. Dange, “Dynamic Traffic Analyzer Using Twitter,†Int. J. Recent Innov. Trends Comput. Commun., vol. 4, no. 10, pp. 2013–2016, 2015.

M. Cindy, “What Is Twitter API,†2015. [Online]. Available: https://www.quora.com/What-isTwitter-API. [Accessed: 22-May-2017].

Sangeeta, “Twitter Data Analysis Using FLUME & HIVE on Hadoop Framework,†vol. 4, no. 2, pp. 119–123, 2016.

R. Wardhani and M. H. Yaqin, “Game Dasar Dasar Hukum Islam Dalam Kitab Mabadi ’ ul Fiqh Jilid I,†Teknika, vol. 5, no. 2, pp. 473–478, 2013.

R. Margaret, “Definition NetBeans,†2014. [Online]. Available: http://www.theserverside.com/definition/NetBeans. [Accessed: 03-Jul-2017].

Anonim, “NetBeans IDE 8.2 Information,†2016. [Online]. Available: https://netbeans.org/community/releases/82/. [Accessed: 03-Jul-2017].

Y.Emma, Sharia Banking Information System Analysis and Design. CCIT-FTUI, 2012.

Published

2018-06-30

Issue

Section

Article

Citation Check