Document Type : Research
Authors
1 Assistant Professor in General Linguistics, University of Isfahan, Isfahan, Iran
2 Ph.D. in General Linguistics, Alzahra University, Tehran, Iran
3 Associate Professor in General Linguistics, University of Isfahan, Isfahan, Iran
Abstract
Human voices are unique, and for this reason, speakers can be identified by their voices. This shows that speech sounds contain speaker-specific information that can be reflected in the acoustic properties of speech signals. There are many individuals around the world who speak two or more languages, adding a fascinating dimension of variability to language perception and production. However, it remains unclear whether bilinguals alter their voice when switching between languages. A holistic view of bilingualism suggests that bilinguals are an integrated whole that cannot be separated into distinct parts; instead, they possess their own specific linguistic configuration (Grosjean, 1989). Moreover, languages differ in their segment inventories, rules of segmental combinations, as well as spectral and rhythmic characteristics of speech. Speaking styles can also contribute to within-speaker variability in acoustic parameters. Despite these factors, little is known about the influence of language and speaking style on within- and between-speaker vocal variability. This study aims to investigate how acoustic features, specifically long-term F0 and long-term formant frequencies (F1-F4), contribute to speaker individuality in Persian-English bilingual speakers and to what extent these features can discriminate between bilingual speakers.
Keywords
- Abercrombie, D. (1967). Elements of General Phonetics. UK: Edinburgh University Press.
- Afshan, A., Kreiman, J., & Alwan, A. (2020). Speaker discrimination in humans and machines: Effects of speaking style variability. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2020-Octob(September), 3136–3140. https://doi.org/10.21437/Interspeech.2020-3004
- Altenberg, E. P., & Ferrand, C. T. (2006). Fundamental Frequency in Monolingual English, Bilingual English / Russian , and Bilingual English / Cantonese Young Adult Women. Journal of Voice, 20(1), 86–96. https://doi.org/10.1016/j.jvoice.2005.01.005
- Asadi, H., Nourbakhsh, M., Sasani, F., & Dellwo, V. (2018). Examining long-term formant frequency as a forensic cue for speaker identification : An experiment on Persian. In M. Nourbakhsh, H. Asadi, & M. Asiaee (Eds.), Proceedings of the First International Conference on Laboratory Phonetics and Phonology (pp. 21–28). Tehran: Neveesh Parsi Publications.
- Asadi, H., Nourbakhsh, M., & Sassani, F. (2019). Between-speaker variability of voiceless fricatives in Persian. Language Related Research, 10(1), 129–147. http://lrr.modares.ac.ir/article-14-3720-en.html
- Asiaee, M., Nourbakhsh, M., & Skarnitzl, R. (2019). Can LTF discriminate bilingual speakers? Proceedings of the 28th Annual Conference of the International Association for Forensic Phonetics and Acoustics (IAFPA), 41–42.
- Baldwin, J. R., & French, P. (1990). Forensic Phonetics. London: Printer Publishers. http://hdl.handle.net/1885/91429
- Boersma, P., & Weenink, D. (2022). Praat: Doing phonetics by computer (6.1.39). http://www.praat.org/
- Cheng, A. (2020). Cross-linguistic f 0 differences in bilingual speakers of English and Korean . The Journal of the Acoustical Society of America, 147(2), EL67–EL73. https://doi.org/10.1121/10.0000498
- Corretge, R. (2022). Praat Vocal Toolkit. http://www.praatvocaltoolkit.com
- Debruyne, F., Decoster, W., Van Gijsel, A., & Vercammen, J. (2002). Speaking fundamental frequency in monozygotic and dizygotic twins. Journal of Voice : Official Journal of the Voice Foundation, 16(4), 466–471. https://doi.org/10.1016/s0892-1997(02)00121-2
- Dorreen, K. (2017). FUNDAMENTAL FREQUENCY DISTRIBUTIONS OF BILINGUAL SPEAKERS IN FORENSIC SPEAKER COMPARISON [The University of Canterbury]. Retrieved from <https://doi.org/10.1017/CBO9781107415324.004>
- Foulkes, P., & Barron, A. (2000). Telephone speaker recognition amongst members of a close social network. International Journal of Speech, Language and the Law, 7(2), 180–198. https://doi.org/10.1558/sll.2000.7.2.180
- Gold, E. (2014). Calculating likelihood ratios for forensic speaker comparisons using phonetic and linguistic parameters. (Phd dissertation), The University of York, Toronto, Canada.
- Gold, E., French, P., & Harrison, P. (2013). Examining long-term formant distributions as a discriminant in forensic speaker comparisons under a likelihood ratio framework. Proceedings of Meetings on Acoustics, 19. https://doi.org/10.1121/1.4800285
- Grosjean, F. (1989). Neurolinguists, beware! The bilingual is not two monolinguals in one person. Brain and Language, 36(1), 3–15. https://doi.org/https://doi.org/10.1016/0093-934X(89)90048-5
- Grosjean, F. (2010). Bilingual: Life and Reality. Cambridge: Harvard University Press. http://www.jstor.org/stable/j.ctt13x0ft8
- He, L., Zhang, Y., & Dellwo, V. (2019). Between-speaker variability and temporal organization of the first formant. In The Journal of the Acoustical Society of America, 145(3), 209–214. https://doi.org/10.1121/1.5093450
- Heeren, W., Vloed, D., & Vermeulen, J. (2014). Exploring long-term formants in bilingual speakers. IAFPA’s 2014 Annual Conference Book of Abstracts, 2014, 3, 39.
- Holm, S. (2003). Individual use of acoustic parameters in read and spontaneous speech. Phonum 9, 9, 157–160.
- Järvinen, K., Laukkanen, A., & Aaltonen, O. (2013). Speaking a foreign language and its effect on F0. Logopedics Phoniatrics Vocology, 38(2), 47–51. https://doi.org/10.3109/14015439.2012.687764
- Johnson, K. A., Babel, M., & Fuhrman, R. A. (2020). Bilingual acoustic voice variation is similarly structured across languages. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2020-Octob, 2387–2391. https://doi.org/10.21437/Interspeech.2020-3095
- Keshavarz, M. H., & Ingram, D. (2002). The early phonological development of a Farsi-English bilingual child. International Journal of Bilingualism, 6(3), 255–269. https://doi.org/10.1177/13670069020060030301
- Kinoshita, Y. (2001). Testing Realistic Forensic Speaker A Likelihood Ratio Based Approach Using Formants [The Australian National University]. Retrieved from <https://doi.org/10.1558/ijsll.v9i1.133>.
- Laan, G. P. M. (1997). The contribution of intonation, segmental durations, and spectral features to the perception of a spontaneous and a read speaking style. Speech Communication, 22(1), 43–65. https://doi.org/https://doi.org/10.1016/S0167-6393(97)00012-5
- Ladefoged, P. (2006). A course in phonetics (5nd ed.). USA: Wadsworth Cengage Learning.
- Lo, J. J. H. (2021). Cross-Linguistic Speaker Individuality of Long-Term Formant Distributions: Phonetic and Forensic Perspectives. 416–420. https://doi.org/10.21437/interspeech.2021-1699
- McDougall, K. (2004). Speaker-specific formant dynamics: An experiment on Australian English /aI/. International Journal of Speech Language and the Law, 11(1 SE-Articles), 103–130. https://doi.org/10.1558/sll.2004.11.1.103
- Mok, P. P. K., Xu, R. B., & Zuo, D. (2015). Bilingual speaker identification: Chinese and English. International Journal of Speech, Language and the Law, 22(1), 57–77. https://doi.org/10.1558/ijsll.v22i1.18636
- Moos, A. (2010). Long-term formant distribution as a measure of speaker characteristics in read and spontaneous speech. The Phonetician, 101/102, 7–24.
- Nolan, F., & Grigoras, C. (2005). A case for formant analysis in forensic speaker identification. International Journal of Speech, Language and the Law, 12(2), 143–173. https://doi.org/10.1558/sll.2005.12.2.143
- Paradis, J. (2001). Do bilingual two-year-olds have separate phonological systems? International Journal of Bilingualism, 5(1), 19–38. https://doi.org/10.1177/13670069010050010201
- Park, S. J., Yeung, G., Vesselinova, N., Kreiman, J., Keating, P. A., & Alwan, A. (2018). Towards understanding speaker discrimination abilities in humans and machines for text-independent short utterances of different speech styles. The Journal of the Acoustical Society of America, 144(1), 375–386. https://doi.org/10.1121/1.5045323
- R Core Team. (2021). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing. Retrieved from <https://www.r-project.org/>
- Rose, P. (2002). Forensic Speaker Identification. London: Taylor and Francis.
- Rose, P., & Clermont, F. (2001). A comparison of two acoustic methods for forensic speaker discrimination. Acoustics Australia, 29(1), 31–35.
- Sambur, M. R. (1975). Selection of Acoustic Features for Speaker Identification. IEEE Transactions on Acoustics, Speech, and Signal Processing, 23(2), 176–182. https://doi.org/10.1109/TASSP.1975.1162664
- Vaheb, A., Choobbasti, A. J., Sabeti, B., Mortazavi Najafabadi, S. H. E., & Safavi, S. (2018a). Mirasvoice: A bilingual (English-Farsi) speech corpus. LREC 2018 - 11th International Conference on Language Resources and Evaluation, 3, 2903–2906.
- Vaheb, A., Choobbasti, A. J., Sabeti, B., Mortazavi Najafabadi, S. H. E., & Safavi, S. (2018b). Mirasvoice: A bilingual (English-Farsi) speech corpus. LREC 2018 - 11th International Conference on Language Resources and Evaluation, May, 2903–2906.