Abstract

Each speaker's voice has a distinctive character shaped by gender, age, emotion, dialect, and other factors. Voice-based gender identification is increasingly used in fields such as security systems, speech recognition, and artificial intelligence. Speech processing nevertheless faces two difficulties: the characteristics of the speech signal change with age, which makes them hard to determine accurately, and the fundamental frequency values of males and females overlap. This research models a real-time, voice-based gender identification system on a Raspberry Pi device. The system uses two methods: the YIN algorithm and Mel-Frequency Cepstral Coefficient (MFCC) feature extraction. The test results show that the second parameter-tuning scheme, which narrows the overlapping frequency parameters, identifies gender more successfully than the first scheme. For the female test data, accuracy rose from 98% to 100% in the closed test and from 92% to 96% in the open test. For the male test data, accuracy rose from 92% to 98% in the closed test and from 90% to 94% in the open test. These results indicate that the second parameter-tuning scheme is better suited to the data used in this research and yields higher accuracy.
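To illustrate the pitch-based part of such a system, the following is a minimal sketch of a simplified YIN fundamental frequency estimator followed by a threshold decision. It is not the authors' implementation: the function names, the 60 to 400 Hz search range, the 0.1 YIN threshold, and in particular the 165 Hz male/female boundary are assumptions chosen for illustration, and the MFCC stage is left out entirely.

```python
import math


def yin_f0(frame, sample_rate, f_min=60.0, f_max=400.0, threshold=0.1):
    """Estimate the fundamental frequency (Hz) of one frame, or None if unvoiced.

    Simplified YIN: difference function, cumulative mean normalized
    difference, absolute threshold, parabolic interpolation.
    """
    tau_min = int(sample_rate / f_max)
    tau_max = int(sample_rate / f_min)
    window = len(frame) - tau_max  # samples compared at every lag
    if window <= 0:
        raise ValueError("frame too short for the requested f_min")

    # Step 1: squared difference between the frame and its lagged copy.
    diff = [0.0] * (tau_max + 1)
    for tau in range(1, tau_max + 1):
        diff[tau] = sum((frame[j] - frame[j + tau]) ** 2 for j in range(window))

    # Step 2: cumulative mean normalized difference.
    cmnd = [1.0] * (tau_max + 1)
    running_sum = 0.0
    for tau in range(1, tau_max + 1):
        running_sum += diff[tau]
        if running_sum > 0:
            cmnd[tau] = diff[tau] * tau / running_sum

    # Step 3: first lag below the threshold, then walk down to the local minimum.
    tau = tau_min
    while tau < tau_max:
        if cmnd[tau] < threshold:
            while tau + 1 <= tau_max and cmnd[tau + 1] < cmnd[tau]:
                tau += 1
            break
        tau += 1
    else:
        return None  # no clear periodicity in this frame

    # Step 4: parabolic interpolation around the minimum for sub-sample precision.
    period = float(tau)
    if tau - 1 >= 1 and tau + 1 <= tau_max:
        a, b, c = cmnd[tau - 1], cmnd[tau], cmnd[tau + 1]
        denom = a - 2 * b + c
        if denom != 0:
            period += 0.5 * (a - c) / denom
    return sample_rate / period


def classify_gender(f0, boundary_hz=165.0):
    """Label a pitch estimate; boundary_hz is a hypothetical tuning parameter."""
    if f0 is None:
        return "unknown"
    return "male" if f0 < boundary_hz else "female"


def make_tone(freq_hz, sample_rate=8000, n_samples=1024):
    """Synthetic sine wave standing in for a voiced speech frame."""
    return [math.sin(2 * math.pi * freq_hz * n / sample_rate) for n in range(n_samples)]


if __name__ == "__main__":
    for freq in (120.0, 220.0):
        f0 = yin_f0(make_tone(freq), 8000)
        print(freq, round(f0, 1), classify_gender(f0))
```

A single fixed boundary is the simplest possible decision rule. Because male and female fundamental frequency values overlap, a practical system would need to tune the decision parameters on its own data and combine pitch with further features such as MFCCs.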

Keywords

real-time, gender, YIN, MFCC, Raspberry Pi

Article Details

Author Biographies

Mirza Ardiana, Politeknik Elektronika Negeri Surabaya

Electrical Engineering, Politeknik Elektronika Negeri Surabaya

Titon Dutono, Politeknik Elektronika Negeri Surabaya

Telecommunication Engineering, Politeknik Elektronika Negeri Surabaya

Tri Budi Santoso, Politeknik Elektronika Negeri Surabaya

Telecommunication Engineering, Politeknik Elektronika Negeri Surabaya
How to Cite

Ardiana, M., Dutono, T., & Budi Santoso, T. (2022). Real-time Gender Identification Using Voice On Raspberry Pi. Jurnal Komputer Terapan, 8(1), 158–167. https://doi.org/10.35143/jkt.v8i1.5320

