five

Training Translation Style: A Q-Learning Approach to Enhancing Syntactic Diversity in AI-Generated English from Chinese Source Texts

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://doi.org/10.7910/DVN/ZXBWCL
下载链接
链接失效反馈
官方服务:
资源简介:
This study investigates whether reinforcement learning can mitigate the stylistic flattening observed in AI-generated translations. By training a reward-aware T5-small model on a subset of the UN parallel corpus, we demonstrate that Q-learning can effectively enhance syntactic diversity in machine translations from Chinese to English. Our approach operationalizes three key stylistic features—parse tree depth, clause variety, and lexical distribution—as reward signals to guide the model toward more structurally diverse translations. Results show significant improvements in stylometric measures compared to standard GPT-4 outputs, with human evaluators rating reinforcement learning-optimized translations as more natural and stylistically varied in 73% of cases. This research contributes to translation studies by demonstrating that AI translation systems can be reoriented toward stylistic goals beyond mere fluency, potentially supporting human translators in maintaining stylistic richness across languages.
创建时间:
2025-05-18
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作