基于机器学习的译者风格研究——以《红字》两中译本为例
A ML-Based Study on Translator’s Style—A Case Study of Two Chinese Translations of The Scarlet Letter
DOI: 10.12677/ml.2025.135555, PDF,   
作者: 徐颖楠:中国矿业大学外国语言文化学院,江苏 徐州
关键词: 译者风格《红字》机器学习Translator’s Style The Scarlet Letter Machine Learning
摘要: 基于机器学习的方法,采用卡方评估法提取有效特征集,使用支持向量机(SVM)分类器对《红字》傅东华与潘庆舲译本进行分类。研究发现,SVM能够有效区分两个译本,分类正确率达到98.0%。通过分析文本,揭示了两译者风格的差异。傅东华在翻译中偏好使用复杂定语、显化原文时态,词汇选择相对简洁,并频繁运用中文特殊句式,更多地使用连词和破折号作为衔接手段。潘庆舲的翻译风格更贴近现代汉语的表达习惯,倾向于隐化助词,偏好使用三字词语和四字成语来装点译文,常以冒号作为衔接手段,更加注重译文的流畅性和可读性。通过研究验证,机器学习方法在译者风格研究中具有可行性。
Abstract: Based on the machine learning approach, the distinctions was revealed between the Chinese translations of The Scarlet Letter by Fu Donghua and Pan Qingling. Utilizing the Chi-squared Attribute Eval feature selection method to extract significant style features, a Support Vector Machine (SVM) classifier was trained, achieving an accuracy of 98.0%. The analysis reveals marked stylistic divergences between the two translators. Fu’s translation exhibits a preference for complex attributive structures, explicit temporal markers, and simplified lexical choices, alongside frequent use of Chinese-specific syntactic patterns and cohesive devices such as conjunctions and dashes. In contrast, Pan’s translation aligns closely with modern Mandarin conventions, characterized by the omission of auxiliary particles, the incorporation of tri-syllabic expressions and quadri-syllabic idioms for stylistic embellishment, and a reliance on colons as cohesive markers, prioritizing textual fluency and readability. These findings underscore the efficacy of computational methods in identifying translator-specific stylistic fingerprints.
文章引用:徐颖楠. 基于机器学习的译者风格研究——以《红字》两中译本为例[J]. 现代语言学, 2025, 13(5): 908-915. https://doi.org/10.12677/ml.2025.135555

参考文献

[1] Baker, M. (2000) Towards a Methodology for Investigating the Style of a Literary Translator. Target. International Journal of Translation Studies, 12, 241-266. [Google Scholar] [CrossRef
[2] 黄立波. 语料库译者风格研究反思[J]. 外语教学, 2018, 39(1): 77-81.
[3] 詹菊红, 蒋跃. 机器学习算法在翻译风格研究中的应用[J]. 外语教学, 2017, 38(5): 80-85.
[4] Lynch, G. and Vogel, C. (2018) The Translator’s Visibility: Detecting Translatorial Fingerprints in Contemporaneous Parallel Translations. Computer Speech & Language, 52, 79-104. [Google Scholar] [CrossRef
[5] 仲文明, 姚梦妮. 基于降维分类模型的译者风格研究[J]. 外语电化教学, 2023(4): 24-31.
[6] 张华平, 商建云. NLPIR-Parser: 大数据语义智能分析平台[J]. 语料库语言学, 2019(1): 87-104.
[7] Olohan M. (2004) Introducing Corpora in Translation Studies, Routledge.
[8] Hu, K.B. (2016) New Frontiers in Translation Studies. Springer, Berlin & Heidelberg.
[9] 胡开宝, 黑黟. 数字人文视域下翻译研究: 特征、领域与意义[J].中国翻译, 2020, 41(2): 5-15.
[10] Leech, G. and Short, M.H. (2007) Style in Fiction: A Linguistic Introduction to English Fictional Prose. 2nd Edition, Pearson Education.
[11] 胡开宝, 崔薇. 基于语料库的莎士比亚戏剧汉译本中“使”字句应用的研究[J]. 外语教学, 2016, 37(1): 102-106.
[12] 刘艳春, 胡显耀. 基于语料库的翻译汉语变体多特征统计分析[J]. 语料库研究前沿, 2021(1): 25-36.
[13] 吴思远, 于东, 江新. 汉语文本可读性特征体系构建和效度验证[J]. 世界汉语教学, 2020, 34(1): 81-97.
[14] 董琇. 基于降维法的译者风格研究[J]. 外语教学与研究, 2014, 46(2): 282-293.
[15] 孔德璐. 机器学习视域下《三国演义》三译本翻译风格对比研究[J]. 大连大学学报, 2023, 44(4): 38-47.
[16] Hawthorn, N. (2002) The Scarlet Letter. Penguin Classics.
[17] 霍桑. 猩红文[M]. 傅东华, 译. 北京: 商务印书馆, 1947.
[18] 霍桑. 猩红A字母[M]. 潘庆舲, 译. 北京: 九州出版社, 2017.
[19] 胡显耀. 基于语料库的汉语翻译语体特征多维分析[J]. 外语教学与研究, 2010, 42(6): 451-458.
[20] 肖忠华. 英汉翻译中的汉语译文语料库研究[M]. 上海: 上海交通大学出版社, 2012.
[21] 戴光荣. 译文源语透过效应研究[M]. 上海: 上海交通大学出版社, 2013.
[22] 黄伯荣, 廖序东. 现代汉语(增订六版)上册[M]. 北京: 高等教育出版社, 2017.
[23] 胡明亮. 语篇衔接与翻译[M]. 成都: 巴蜀书社, 2007.