非同时期普通话语音对说话人辨别及自评置信度的影响
Effects of Non-Contemporaneous Mandarin Speech on Talker Discrimination and Self-Reported Confidence
DOI: 10.12677/ap.2026.165266, PDF,    科研立项经费支持
作者: 王丹琳:北京警察学院刑事科学技术系,北京;柯青青:长沙市天心区人民法院,湖南 长沙;曹洪林*:中国政法大学证据科学教育部重点实验室,北京;法大法庭科学技术鉴定研究所,北京
关键词: 非同时期普通话语音说话人辨别自评置信度信号检测理论Non-Contemporaneous Speech Mandarin Speech Talker Discrimination Self-Reported Confidence Signal Detection Theory
摘要: 说话人辨别是听者基于听觉线索判断两段语音是否来源于同一说话人的过程。本研究在同异判断(same-different) AX范式下,构建1年和16年时间间隔的非同时期普通话语音材料,控制语料长度和说话人性别,招募42名男性听音被试完成同异判断及置信度评分,采用广义线性混合模型进行分析。结果表明,时间跨度显著降低辨别准确率;语料长度呈“长句 > 短句 > 词语”的非显著提升趋势;说话人性别对辨别准确率无显著影响;自评置信度能够显著正向预测判断准确率。信号检测分析结果显示,被试整体具备一定感知敏感性且个体差异明显,判别标准整体接近中性。研究表明,时间跨度会削弱普通话语音的可辨识性,自评置信度可作为判断可靠性的有效参考指标。
Abstract: Talker discrimination refers to the process by which listeners judge whether two speech utterances originate from the same talker based on auditory cues. In this study, non-simultaneous Mandarin speech materials with time intervals of 1 year and 16 years were constructed under the AX same-different judgment paradigm. Speech length and talker gender were controlled, and 42 male listeners were recruited to perform same-different judgments and confidence ratings. Generalized linear mixed models were used for data analysis. The results showed that time span significantly reduced discrimination accuracy; speech length exhibited a non-significant increasing trend of “long sentences > short sentences > words”; talker gender had no significant effect on discrimination accuracy; and self-rated confidence significantly and positively predicted judgment accuracy. Signal detection analysis revealed that listeners generally possessed certain perceptual sensitivity with obvious individual differences, and the overall decision criterion was close to neutral. The study indicates that time span weakens the identifiability of Mandarin speech, and self-rated confidence can serve as an effective indicator for judging the reliability of discrimination.
文章引用:王丹琳, 柯青青, 曹洪林 (2026). 非同时期普通话语音对说话人辨别及自评置信度的影响. 心理学进展, 16(5), 311-323. https://doi.org/10.12677/ap.2026.165266

参考文献

[1] Afshan, A., Kreiman, J., & Alwan, A. (2022). Speaker Discrimination Performance for “Easy” versus “Hard” Voices in Style-Matched and Mismatched Speech. The Journal of the Acoustical Society of America, 151, 1393-1403.[CrossRef] [PubMed]
[2] Bartle, A., & Dellwo, V. (2015). Auditory Speaker Discrimination by Forensic Phoneticians and Naive Listeners in Voiced and Whispered Speech. The International Journal of Speech, Language and the Law, 22, 229-248.[CrossRef
[3] Bradshaw, L., Chodroff, E., & Dellwo, V. (2025). The Role of Phonetic Overlap for Speaker Discrimination. The Journal of the Acoustical Society of America, 157, 3572-3589.[CrossRef] [PubMed]
[4] Hollien, H., & Schwartz, R. (2000). Aural-Perceptual Speaker Identification: Problems with Noncontemporary Samples. The International Journal of Speech, Language and the Law, 7, 199-211.[CrossRef
[5] Hu, X., Wang, X., Gu, Y. et al. (2017). Phonological Experience Modulates Voice Discrimination: Evidence from Functional Brain Networks Analysis. Brain and Language, 173, 67-75.[CrossRef] [PubMed]
[6] Narayan, C. R., Mak, L., & Bialystok, E. (2017). Words Get in the Way: Linguistic Effects on Talker Discrimination. Cognitive Science, 41, 1361-1376.[CrossRef] [PubMed]
[7] Park, S. J. (2019). Towards Understanding Voice Discrimination Abilities of Humans and Machines. Doctoral Dissertation, University of California.
[8] Quinto, A., Abu El Adas, S., & Levi, S. V. (2020). Re-Examining the Effect of Top-Down Linguistic Information on Speaker-Voice Discrimination. Cognitive Science, 44, e12902.[CrossRef] [PubMed]
[9] Rose, P., & Duncan, S. (1995). Naive Auditory Identification and Discrimination of Similar Voices by Familiar Listeners. The International Journal of Speech, Language and the Law, 2, 1-17.[CrossRef
[10] Santos, S. C., Kapadia, A., & Feinberg, D. R. (2025). Hearing People Speak in Different Accents Biases Voice Discrimination. Scientific Reports, 15, Article No. 30775.[CrossRef] [PubMed]
[11] Schäfer, S., & Foulkes, P. (2022). The Impact of Voice Recognition Skills on Earwitness Testimony. In Proceedings of the Voice Identity (VoiceID) Conference 2022 (p. 42). University of Zurich.
[12] Smith, H. M. J., Baguley, T. S., Robson, J. et al. (2019). Forensic Voice Discrimination by Lay Listeners: The Effect of Speech Type and Background Noise on Performance. Applied Cognitive Psychology, 33, 272-287.[CrossRef
[13] Stevenage, S. V., Neil, G. J., Parsons, B. et al. (2018). A Sound Effect: Exploration of the Distinctiveness Advantage in Voice Recognition. Applied Cognitive Psychology, 32, 526-536.[CrossRef] [PubMed]
[14] Stevenage, S. V., Tomlin, R., Neil, G. J. et al. (2021). May I Speak Freely? The Difficulty in Vocal Identity Processing Across Free and Scripted Speech. Journal of Nonverbal Behavior, 45, 149-163.[CrossRef
[15] Suire, A., Raymond, M., & Barkat-Defradas, M. (2019). Male Vocal Quality and Its Relation to Females’ Preferences. Evolutionary Psychology, 17, 1-12.[CrossRef] [PubMed]
[16] Vyshnevetska, V., Giroud, N., Ramon, M., & Dellwo, V. (2025). Listeners are Biased Towards Voices of Young Speakers and Female Speakers When Discriminating Voices. Cognitive Research: Principles and Implications, 10, Article No. 28.[CrossRef] [PubMed]