YouTube X-Rating Detection with Bahasa-Slang Title Using Query Expansion and Rule Based Approaches
DOI:
https://doi.org/10.15575/join.v7i2.799
Keywords:
Bahasa Slang, Detection of X-rating, Pornography, Query Expansion, Rule Based, YouTube content
Abstract
Detection of X-rating content on the Internet is still rare in Indonesia, and existing approaches perform poorly, especially on video. YouTube, the largest video portal, does not yet detect X-rating content automatically from the content itself. Some X-rating content prevention services in Indonesia, such as Internet Positive and the Nawala Project, detect X-rating content by matching keywords on a web page and then block the page with DNS filtering. That method, however, does not account for Bahasa-Slang. This work develops a metasearch engine named Safedio, which aims to detect X-rating content on YouTube through video titles that contain Bahasa-Slang. Safedio combines Query Expansion and Rule-Based approaches; Query Expansion is a technique for obtaining additional rules for the search. As a result, Safedio can detect X-rating content through video titles written in both Bahasa and Bahasa-Slang. On average, it achieves 71% precision, 46% recall, and 72% accuracy.
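The combination of Query Expansion and rule-based matching described in the abstract can be illustrated with a minimal Python sketch. It is not Safedio's implementation: the digit-to-letter map, the seed term, the expansion dictionary, and all function names below are hypothetical placeholders chosen for illustration, and a mild stand-in keyword ("dewasa", meaning "adult") is used instead of a real rule set.

```python
import re

# Assumption: a few digit-for-letter substitutions of the kind seen in
# slang spellings. This is an illustrative map, not the paper's rule set.
SLANG_MAP = str.maketrans({"4": "a", "3": "e", "1": "i", "0": "o", "5": "s", "7": "t"})

# Assumption: placeholder seed keyword and expansion dictionary.
SEED_TERMS = {"dewasa"}
EXPANSIONS = {"dewasa": {"adult", "khusus dewasa"}}


def normalize(title: str) -> str:
    """Lowercase a title, undo digit substitutions, and collapse stretched letters."""
    text = title.lower().translate(SLANG_MAP)
    text = re.sub(r"(.)\1{2,}", r"\1", text)   # runs of 3+ identical characters -> one
    text = re.sub(r"[^a-z\s]", " ", text)      # drop anything that is not a letter
    return re.sub(r"\s+", " ", text).strip()


def expand_query(seeds: set[str], expansions: dict[str, set[str]]) -> set[str]:
    """Query expansion: add the related terms of every seed to the term set."""
    terms = set(seeds)
    for seed in seeds:
        terms |= expansions.get(seed, set())
    return terms


def is_flagged(title: str, terms: set[str]) -> bool:
    """Rule: flag a title if any expanded term occurs as a whole word or phrase."""
    normalized = normalize(title)
    return any(
        re.search(r"\b" + re.escape(term) + r"\b", normalized) for term in terms
    )


if __name__ == "__main__":
    terms = expand_query(SEED_TERMS, EXPANSIONS)
    for title in ["V1d30 D3w4s4aaa terbaru", "Tutorial memasak nasi goreng"]:
        print(title, "->", is_flagged(title, terms))
```

Normalizing before matching lets a single rule cover several slang spellings of the same word. The trade-off is that the digit map also rewrites genuine numbers in a title, so a real system would need more careful rules than this sketch.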
License
You are free to:
- Share: copy and redistribute the material in any medium or format for any purpose, even commercially.
- The licensor cannot revoke these freedoms as long as you follow the license terms.
Under the following terms:
- Attribution: You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- NoDerivatives: If you remix, transform, or build upon the material, you may not distribute the modified material.
- No additional restrictions: You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.
Notices:
- You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable exception or limitation.
- No warranties are given. The license may not give you all of the permissions necessary for your intended use. For example, other rights such as publicity, privacy, or moral rights may limit how you use the material.
This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License