Automatic Detection of Hijaiyah Letters Pronunciation using Convolutional Neural Network Algorithm

Authors

  • Yana Aditia Gerhana Faculty of Information and Communication Technology, Asia e-University, Malaysia; Department of Informatics, UIN Sunan Gunung Djati Bandung, Indonesia http://orcid.org/0000-0003-0105-9170
  • Aaz Muhammad Hafidz Azis UIN Sunan Gunung Djati Bandung, Indonesia
  • Diena Rauda Ramdania UIN Sunan Gunung Djati Bandung, Indonesia
  • Wildan Budiawan Dzulfikar UIN Sunan Gunung Djati Bandung, Indonesia
  • Aldy Rialdy Atmadja UIN Sunan Gunung Djati Bandung, Indonesia
  • Deden Suparman UIN Sunan Gunung Djati Bandung, Indonesia
  • Ayu Puji Rahayu Faculty of Education, Fujian Normal University, Fuzhou, China

DOI:

https://doi.org/10.15575/join.v7i1.882

Keywords:

Hijaiyah, Speech recognition, MFCC, CNN, CRISP-DM

Abstract

Speech recognition technology can support learning to read the letters of the Qur'an. This study implements a convolutional neural network (CNN) to recognize the pronunciation of the hijaiyah letters. Each pronunciation recording is converted into features using Mel-frequency cepstral coefficients (MFCC) and then classified by a deep learning model built on the CNN algorithm. The system was developed following the CRISP-DM process model. In tests on 616 voice recordings covering the 28 hijaiyah letters, the best results were an accuracy of 62.45%, a precision of 75%, a recall of 50%, and an F1-score of 58%.
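The article itself provides no code, so the following is only an illustrative numpy sketch of the pipeline the abstract describes: a simplified MFCC front end (framing, Hamming window, power spectrum, mel filterbank, DCT) followed by a toy CNN with random, untrained weights standing in for the trained 28-class classifier. All parameter values (16 kHz sample rate, 25 ms frames, 26 mel bands, 13 coefficients, 8 kernels) are common defaults assumed here, not values reported by the paper; a real implementation would typically use a library such as librosa for MFCC and a trained Keras or PyTorch model.

```python
import numpy as np

def hz_to_mel(hz):
    return 2595.0 * np.log10(1.0 + hz / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def mfcc(signal, sr=16000, frame_len=400, hop=160, n_fft=512, n_mels=26, n_ceps=13):
    """Simplified MFCC: frame -> window -> power spectrum -> mel filterbank -> log -> DCT."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([signal[i * hop: i * hop + frame_len] for i in range(n_frames)])
    frames = frames * np.hamming(frame_len)
    power = np.abs(np.fft.rfft(frames, n_fft)) ** 2 / n_fft      # (n_frames, n_fft//2 + 1)
    # Triangular mel filterbank, equally spaced on the mel scale
    mel_pts = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_pts) / sr).astype(int)
    fbank = np.zeros((n_mels, n_fft // 2 + 1))
    for m in range(1, n_mels + 1):
        left, center, right = bins[m - 1], bins[m], bins[m + 1]
        for k in range(left, center):
            fbank[m - 1, k] = (k - left) / max(center - left, 1)
        for k in range(center, right):
            fbank[m - 1, k] = (right - k) / max(right - center, 1)
    log_mel = np.log(power @ fbank.T + 1e-10)
    # DCT-II to decorrelate; keep the first n_ceps coefficients
    n = np.arange(n_mels)
    dct = np.cos(np.pi * np.outer(np.arange(n_ceps), (2 * n + 1) / (2 * n_mels)))
    return log_mel @ dct.T                                       # (n_frames, n_ceps)

def tiny_cnn(feat, n_classes=28, n_kernels=8, seed=0):
    """Toy CNN forward pass over the MFCC 'image': conv -> ReLU -> global pooling -> softmax."""
    rng = np.random.default_rng(seed)
    kernels = rng.standard_normal((n_kernels, 3, 3)) * 0.1       # untrained 3x3 kernels
    H, W = feat.shape
    maps = np.zeros((n_kernels, H - 2, W - 2))
    for f in range(n_kernels):
        for i in range(H - 2):
            for j in range(W - 2):
                maps[f, i, j] = np.sum(feat[i:i + 3, j:j + 3] * kernels[f])
    maps = np.maximum(maps, 0.0)                                 # ReLU
    pooled = maps.mean(axis=(1, 2))                              # global average pooling
    w = rng.standard_normal((n_classes, n_kernels)) * 0.1        # untrained dense layer
    logits = w @ pooled
    exp = np.exp(logits - logits.max())
    return exp / exp.sum()                                       # probabilities over 28 letters

# A synthetic tone stands in for one recorded hijaiyah pronunciation
sr = 16000
t = np.arange(sr) / sr
sig = np.sin(2 * np.pi * 220 * t)
feat = mfcc(sig)          # (98, 13) feature matrix for a 1-second clip
probs = tiny_cnn(feat)    # (28,) class probabilities, summing to 1
```

In the study's actual system the kernel and dense-layer weights would be learned from the 616 labeled recordings; here they are random, so `probs` only demonstrates the shape and flow of the classifier, not its accuracy.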

Author Biography

Yana Aditia Gerhana, Faculty of Information and Communication Technology, Asia e-University; Department of Informatics, UIN Sunan Gunung Djati Bandung


Published

2022-06-30

Section

Article
