Wavelet-MFCC and Correlation Methods in Digit Speech Recognition (Metode Wavelet-MFCC dan Korelasi dalam Pengenalan Suara Digit)

  • Zaurarista Dyarbirru, Universitas Bumigora Mataram
  • Syahroni Hidayat, Universitas Bumigora Mataram
Keywords: Automatic Speech Recognition, MFCC method, wavelet, wavelet-MFCC, K-Fold Cross Validation

Abstract

Voice is the sound emitted by living things. With the development of Automatic Speech Recognition (ASR) technology, voice can be used to make everyday tasks easier for humans. In ASR, feature extraction plays an important role in the recognition process. The feature extraction methods commonly applied in ASR are MFCC and wavelets, each with its own advantages and disadvantages. This study therefore combines wavelet and MFCC feature extraction to make the most of the advantages of both; the proposed method is called Wavelet-MFCC. The voice recognition method does not use recommendations. System performance is measured with the Word Recognition Rate (WRR), validated using K-Fold Cross Validation with 5 folds. The dataset consists of voice recordings of the digits 0-9 spoken in English. The results show that the digit speech recognition system achieves its highest average recognition rate, 63%, for digit 4 using the Daubechies DB3 wavelet with the dyadic wavelet transform. Comparing the wavelet decomposition methods, the dyadic wavelet transform performs better than the wavelet packet transform.
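
The evaluation protocol described in the abstract (WRR averaged over 5 folds) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the classifier details, so the correlation-based template matching below is an assumption suggested only by the word "Korelasi" in the title. The feature vectors are synthetic placeholders standing in for Wavelet-MFCC features, and all function names are hypothetical.

```python
import numpy as np


def make_synthetic_features(n_classes=10, per_class=10, dim=13, seed=0):
    """Placeholder data: one random prototype per digit plus small noise.

    Stands in for real Wavelet-MFCC feature vectors, which are not computed here.
    """
    rng = np.random.default_rng(seed)
    prototypes = rng.normal(size=(n_classes, dim))
    features = np.repeat(prototypes, per_class, axis=0)
    features = features + 0.01 * rng.normal(size=features.shape)
    labels = np.repeat(np.arange(n_classes), per_class)
    return features, labels


def k_fold_indices(n_samples, k=5, seed=0):
    """Shuffle sample indices and split them into k nearly equal folds."""
    rng = np.random.default_rng(seed)
    return np.array_split(rng.permutation(n_samples), k)


def word_recognition_rate(true_labels, predicted_labels):
    """WRR in percent: share of utterances whose label was predicted correctly."""
    true_labels = np.asarray(true_labels)
    predicted_labels = np.asarray(predicted_labels)
    return 100.0 * np.mean(true_labels == predicted_labels)


def correlation_classify(train_x, train_y, test_x):
    """Assumed classifier: pick the class whose mean training vector has the
    highest Pearson correlation with the test vector."""
    classes = np.unique(train_y)
    templates = np.stack([train_x[train_y == c].mean(axis=0) for c in classes])
    predictions = []
    for x in test_x:
        scores = [np.corrcoef(x, t)[0, 1] for t in templates]
        predictions.append(classes[int(np.argmax(scores))])
    return np.array(predictions)


def cross_validated_wrr(features, labels, k=5, seed=0):
    """Return one WRR value per fold; each fold serves once as the test set."""
    folds = k_fold_indices(len(labels), k, seed)
    rates = []
    for i, test_idx in enumerate(folds):
        train_idx = np.concatenate([f for j, f in enumerate(folds) if j != i])
        predicted = correlation_classify(
            features[train_idx], labels[train_idx], features[test_idx]
        )
        rates.append(word_recognition_rate(labels[test_idx], predicted))
    return rates


if __name__ == "__main__":
    x, y = make_synthetic_features()
    fold_rates = cross_validated_wrr(x, y, k=5)
    print("WRR per fold (%):", fold_rates)
    print("Average WRR (%):", float(np.mean(fold_rates)))
```

With real data, `make_synthetic_features` would be replaced by the Wavelet-MFCC extraction step, and per-digit averages like the one reported for digit 4 would be obtained by computing the WRR separately for each label.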

Published
2020-08-21
How to Cite
[1]
Z. Dyarbirru and S. Hidayat, “Metode Wavelet-MFCC dan Korelasi dalam Pengenalan Suara Digit”, jtim, vol. 2, no. 2, pp. 100-108, Aug. 2020.