PrompTartLAB/PTT_en_ko
收藏Hugging Face2025-03-01 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/PrompTartLAB/PTT_en_ko
下载链接
链接失效反馈官方服务:
资源简介:
括号内术语翻译(PTT)数据集是专门为评估和训练在保持专业领域清晰度和准确性下进行技术术语翻译的模型而设计的数据集。它包含了英语-韩语双语句子对,原始英语技术术语以括号形式显示在韩语翻译旁边。主要数据集涉及人工智能领域,同时提供物理学和生物学领域的离域数据集作为评估使用。数据集的主领域为人工智能,离域评估领域包括生物学和物理学。
The Parenthetical Terminology Translation (PTT) dataset is designed for evaluating and training models to translate technical terms with clarity and accuracy in specialized fields. It consists of English-Korean bilingual sentence pairs with the original English technical terms presented in parentheses next to their Korean translations. The main dataset covers the domain of Artificial Intelligence (AI), with additional out-of-domain (OOD) datasets for Physics and Biology used for evaluation purposes. The primary domain is Artificial Intelligence, and the out-of-domain evaluation covers Biology and Physics.
提供机构:
PrompTartLAB



