|
[1]
|
Rao, K., Peng, F., Sak, H., et al. (2015) Grapheme-to-Phoneme Conversion Using Long Short-Term Memory Recurrent Neural Networks. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), South Brisbane, 19-24 April 2015, 4225-4229. [Google Scholar] [CrossRef]
|
|
[2]
|
胡伟湘, 徐波, 黄泰翼. 汉语语音韵律边界的检测和识别研究[C]//第六届全国人机语音通讯学术会议论文集. 北京: 中国中文信息学会, 2001: 39-42.
|
|
[3]
|
Vaswani, A., Shazeer, N., Parmar, N., et al. (2017) Attention Is All You Need. Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, Long Beach, 4-9 December 2017, 6000-6010.
|
|
[4]
|
Mnih, V., Heess, N. and Graves, A. (2014) Recurrent Models of Visual Attention. Proceedings of the 27th International Conference on Neural Information Processing Systems, Volume 2, 2204-2212.
|
|
[5]
|
Yolchuyeva, S., Németh, G. and Gyires-Tóth, B. (2020) Transformer Based Grapheme-to-Phoneme Conversion. 20th Annual Conference of the International Speech Communication Association, Graz, 15-19 September 2019, 2095-2099. [Google Scholar] [CrossRef]
|
|
[6]
|
Sutskever, I., Vinyals, O. and Le, Q.V. (2014) Sequence to Sequence Learning with Neural Networks. Proceedings of the 27th International Conference on Neural Information Processing Systems, Volume 2, 3104-3112.
|
|
[7]
|
LeCun, Y., Boser, B., Denker, J.S., et al. (1989) Backpropagation Ap-plied to Handwritten Zip Code Recognition. Neural Computation, 1, 541-551. [Google Scholar] [CrossRef]
|
|
[8]
|
Mikolov, T., Karafiát, M., Burget, L., et al. (2010) Recurrent Neural Network Based Language Model. Proceedings Interspeech, Vol. 2, 1045-1048. [Google Scholar] [CrossRef]
|
|
[9]
|
Hochreiter, S. and Schmidhuber, J. (1997) Long Short-Term Memory. Neural Computation, 9, 1735-1780. [Google Scholar] [CrossRef] [PubMed]
|
|
[10]
|
The CMU Pronouncing Dictionary. http://www.speech.cs.cmu.edu/cgi-bin/cmudict
|
|
[11]
|
Sejnowski, T.J. (1988) The NetTalk Corpus: Phonetic Transcrip-tion of 20008 English Words.
|
|
[12]
|
Kingsbury, P., Strassel, S., McLemore, C., et al. (1997) CALLHOME American English Lexicon (PRONLEX). Linguistic Data Consortium, Philadelphia.
|
|
[13]
|
Bisani, M. and Ney, H. (2008) Joint-Sequence Models for Grapheme-to-Phoneme Conversion. Speech Communication, 50, 434-451. [Google Scholar] [CrossRef]
|
|
[14]
|
Yao, K. and Zweig, G. (2015) Sequence-to-Sequence Neural Net Models for Grapheme-to-Phoneme Conversion. INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, 6-10 September 2015, 3330-3334. [Google Scholar] [CrossRef]
|
|
[15]
|
Ashby, L.F.E., Bartley, T.M., Clematide, S., et al. (2021) Re-sults of the Second Sigmorphon Shared Task on Multilingual Grapheme-to-Phoneme Conversion. Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, August 2021, 115-125. [Google Scholar] [CrossRef]
|
|
[16]
|
El Saadany, O. and Suter, B. (2020) Graph-eme-to-Phoneme Conversion with a Multilingual Transformer Model. Proceedings of the 17th SIGMORPHON Work-shop on Computational Research in Phonetics, Phonology, and Morphology, July 2020, 85-89. [Google Scholar] [CrossRef]
|
|
[17]
|
Levenshtein, V.I. (1966) Binary Codes Capable of Correct-ing Deletions, Insertions, and Reversals. Soviet Physics Doklady, 10, 707-710.
|
|
[18]
|
Galescu, L. and Allen, J.F. (2002) Pronunciation of Proper Names with a Joint n-Gram Model for Bi-Directional Grapheme-to-Phoneme Conversion. 7th International Conference on Spoken Language Processing, Denver, 16-20 September 2002, 109-112. [Google Scholar] [CrossRef]
|
|
[19]
|
Yolchuyeva, S., Németh, G. and Gyires-Tóth, B. (2019) Graph-eme-to-Phoneme Conversion with Convolutional Neural Networks. Applied Sciences, 9, 1143. [Google Scholar] [CrossRef]
|