YouTube X-Rating Detection with Bahasa-Slang Title Using Query Expansion and Rule Based Approaches

Authors

  • Dewi Wisnu Wardani Department of Informatics, Universitas Sebelas Maret, Indonesia
  • Salsabila F Shabihah Department of Informatics, Universitas Sebelas Maret, Indonesia

DOI:

https://doi.org/10.15575/join.v7i2.799

Keywords:

Bahasa Slang, Detection of X-rating, Pornography, Query Expansion, Rule Based, YouTube content

Abstract

The detection of X-rating content on the Internet is still rarely done in Indonesia and the performance of the existing work to detect X-rating content, especially in video is still low. The largest video portal, YouTube, does not yet have automatic X-rating content detection through its content either. Some X-rating content prevention service providers in Indonesia, such as the Internet Positive and Nawala Project, detect X-rating content using the keyword detection method of a web page and then block the web page with DNS filtering. However, that method does not pay attention to using  Bahasa-Slang. This work developed Metasearch named Safedio. Safedio aims to detect X-rating content on YouTube content through video titles that contain Bahasa-Slang. Safedio utilizes Query Expansion and Rule-Based approaches. The Query Expansion is a technique to get additional rules in search. In the end, Safedio can detect X-rating content through video titles in both Bahasa and Bahasa-Slang. The average results return with precision 71%, recall 46% and accuracy 72%.

References

APJII, PENETRASI & PROFIL PERILAKU PENGGUNA INTERNET INDONESIA SURVEI 2018. 2019. Accessed: Oct. 30, 2019. [Online]. Available: https://apjii.or.id/survei2018s/download/SPt3b2klhjHziyMBGOq0x4vA8fsDoL

B. B. Pratamawaty, P. Limilia, and P. Prihandini, “Young people’s perception of internet pornography: Case of junior high school students’ in West Java Indonesia,†International Journal of Psychosocial Rehabilitation, vol. 24, no. 1, pp. 492–501, 2020.

Z. Ardi, K. Viola, and I. Sukmawati, “An Analysis of Internet Abuses Impact on Children’s Moral Development,†JPPI (Jurnal Penelitian Pendidikan Indonesia), vol. 4, no. 1, pp. 44–50, 2018.

R. Panuju, “PUBLIC NON-COMPLIANCE WITH PROHIBITION OF ACCESSING PORNOGRAPHY ON THE INTERNET: A CASE IN INDONESIA,†Journal of Critical Reviews, vol. 7, no. 19, pp. 1274–1285, 2020.

R. Puspita and D. Rohedi, “The impact of internet use for students,†in IOP Conference Series: Materials Science and Engineering, 2018, vol. 306, no. 1, p. 012106.

E. M. S. Sulhan, “Text Filtering Kata Porno dengan Metode Boyer Moore pada Aplikasi Elearning Berbasis Cms di Sdn,†Bimasakti : Jurnal Riset Mahasiswa FTI UNIKAMA, 2014, Accessed: Dec. 04, 2019. [Online]. Available: https://www.neliti.com/publications/182947/text-filtering-kata-porno-dengan-metode-boyer-moore-pada-aplikasi-elearning-berb

D. A. W. Nurhayati, “MORPHOLOGICAL AND MORPHOPHONEMIC PROCESS OF ALAY VARIATION,†LINGUA: Journal of Language, Literature and Teaching, vol. 12, no. 1, pp. 59–70, 2015, doi: 10.30957/lingua.v12i1.71.

D. Wardani, R. Siwi, B. Harjito, and M. Marshallia, “Using Metadata in Detection Spam Email with Pornography Content,†in 2018 International Conference on Electrical Engineering and Computer Science (ICECOS), 2018, pp. 213–218.

N. D. Kelana, “FENOMENA BAHASA ALAY: PROSES PEMBENTUKAN DAN IMPLIKASINYA TERHADAP PERKEMBANGAN BAHASA INDONESIA,†Thesis, Universitas Diponegoro, 2011. Accessed: Dec. 04, 2019. [Online]. Available: https://fib.undip.ac.id/digilib/home/fib.undip.ac.id/files/e_book/Fenomena%20Bahasa%20Alay%20(Proses%20Pembentukan%20Kata%20dan%20Implikasin.pdf

S. S. Andriani and I. D. P. Wijana, “BAHASA ALAY,†Doctoral dissertation, Universitas Gadjah Mada, 2016. Accessed: Dec. 04, 2019. [Online]. Available: http://etd.repository.ugm.ac.id/index.php?mod=penelitian_detail&sub=PenelitianDetail&act=view&typ=html&buku_id=94623&obyek_id=4

M. B. Garcia, T. F. Revano, B. G. M. Habal, J. O. Contreras, and J. B. R. Enriquez, “A pornographic image and video filtering application using optimized nudity recognition and detection algorithm,†in 2018 IEEE 10th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM), 2018, pp. 1–5.

E. Yiallourou, R. Demetriou, and A. Lanitis, “On the detection of images containing child-pornographic material,†in 2017 24th International Conference on Telecommunications (ICT), 2017, pp. 1–5.

H. Li, X. Guo, H. Wang, B. Yu, and P. Zhao, “Application of Computational Complexity Theory to Pornographic Detection in Live Based on Frame Sampling,†in 2020 IEEE 6th International Conference on Computer and Communications (ICCC), 2020, pp. 1334–1338.

J. Mallmann, A. O. Santin, E. K. Viegas, R. R. dos Santos, and J. Geremias, “PPCensor: Architecture for real-time pornography detection in video streaming,†Future Generation Computer Systems, vol. 112, pp. 945–955, 2020.

A. Tabone, A. Bonnici, S. Cristina, R. A. Farrugia, and K. P. Camilleri, “Private Body Part Detection using Deep Learning.,†in ICPRAM, 2020, pp. 205–211.

C. Tian, X. Zhang, W. Wei, and X. Gao, “Color pornographic image detection based on color-saliency preserved mixture deformable part model,†Multimedia Tools and Applications, vol. 77, no. 6, pp. 6629–6645, 2018.

C. Carpineto and G. Romano, “A Survey of Automatic Query Expansion in Information Retrieval,†ACM Comput. Surv., vol. 44, no. 1, pp. 1–50, 2012, doi: 10.1145/2071389.2071390.

B. He and I. Ounis, “Studying Query Expansion Effectiveness,†in Advances in Information Retrieval, Berlin, Heidelberg, 2009, pp. 611–619. doi: 10.1007/978-3-642-00958-7_57.

L. Lengyel, “Validating Rule-based Algorithms,†Acta Polytechnica Hungarica, vol. 12, no. 4, p. 17, 2015.

S. Ekalestari, “Pengaruh Penggunaan Bahasa Alay Terhadap Penggunaan Bahasa Indonesia yang Baik dan Benar,†2017.

A. Wicaksono and S. Supriyono, “TINJAUAN SOSIOLINGUISTIK BAHASA ALAY DALAM KONS℡ASI KEBAHASAAN,†Ksatra: Jurnal Kajian Bahasa dan Sastra, vol. 3, no. 1, pp. 19–28, 2021.

A. Lutfiatuna, A. Novitasari, and A. Helfiyana, “Bahasa Alay Pada Chating Di Medsos Remaja Millenial (Bahasa Alay Vs Remaja Millenial),†in Prosiding Seminar Nasional Bahasa dan Sastra Indonesia (SENASBASA), 2018, vol. 2, no. 2.

K. Inoue, “文字ã§ã¿ã‚‹è‹¥è€…言葉―インドãƒã‚·ã‚¢ã® Bahasa Alay ã‚’é¡Œæã¨ã—ã¦â€•,†Ayumi: Jurnal Budaya, Bahasa dan Sastra, vol. 4, no. 1, 2017.

L. Laelasari, L. Oktavia, and I. Mustika, “Pengaruh bahasa alay terhadap penggunaan bahasa indonesia di kalangan mahasiswa ikip siliwangi,†Parole (Jurnal Pendidikan Bahasa dan Sastra Indonesia), vol. 1, no. 5, pp. 675–680, 2018.

S. Abid, “KEBERADAAN BAHASA ALAY DALAM PERKEMBANGAN BAHASA INDONESIA,†Jurnal Perspektif Pendidikan, vol. 11, no. 2, pp. 22–29, 2017.

P. DEAMA, S. Sayuti, and I. Romi, ““Penggunaan Bahasa Alay Remaja Pada Status Media Sosial Facebook: Tinjauan Sosiolinguistik,†PhD Thesis, UNIVERSITAS BUNG HATTA, 2020.

S. Agustina, D. Indihadi, and H. Hodidjah, “ANALISIS PENGGUNAAN BAHASA ALAY DALAM KOSAKATA BAHASA INDONESIA SISWA SEKOLAH DASAR,†PEDADIDAKTIKA: Jurnal Ilmiah Pendidikan Guru Sekolah Dasar, vol. 2, no. 1, pp. 1–8.

Z. Abedjan and F. Naumann, “Synonym Analysis for Predicate Expansion,†in The Semantic Web: Semantics and Big Data, vol. 7882, P. Cimiano, O. Corcho, V. Presutti, L. Hollink, and S. Rudolph, Eds. Springer Berlin Heidelberg, 2013, pp. 140–154. [Online]. Available: http://dx.doi.org/10.1007/978-3-642-38288-8_10

C. Manning and H. Schutze, Foundations of statistical natural language processing. MIT press, 1999.

Downloads

Published

2022-12-29

Issue

Section

Article

Citation Check

Most read articles by the same author(s)