Multilingual Aligned Corpus
arXiv · Updated 2014-07-07 · Indexed 2024-08-06
Download link:
http://arxiv.org/abs/1407.1605v1
Resource description:
This study builds a multilingual aligned corpus from ten language versions of *Around the World in Eighty Days* in order to study how proper nouns are translated. The dataset contains 3,415 proper nouns, covering personal names, place names, and other types, and the texts are aligned at the sentence level so that corresponding sentences can be compared across languages. The corpus was created to investigate the translatability of proper nouns, in particular the translation strategies and methods used in cross-lingual settings, and to provide empirical support for both translation practice and theoretical research.
Provided by:
Université François-Rabelais de Tours
Created:
2014-07-07



