five

Shi Shuo Xin Yu

收藏
arXiv2025-09-30 收录
下载链接:
https://github.com/NiuTrans/Classical-Modern
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含了来自《世说新语》的双语数据,涵盖了36个章节,主要用于古文到现代汉语的翻译以及人名识别。数据集按章节划分,提供了原文、译文以及双语对照三种格式,是评估ChatGPT在古文翻译和人名识别性能方面的全面资源。该数据集规模包含3923个句子,其中有300个句子是手动标注用于人名识别的。任务重点在于古文到现代汉语的翻译以及人名识别。

This dataset contains bilingual parallel data sourced from *A New Account of the Tales of the World*, covering 36 chapters. It is primarily intended for classical Chinese to modern Chinese translation and personal name recognition. Organized by chapters, it provides three formats: original classical Chinese text, modern Chinese translated text, and bilingual parallel text. With a total of 3,923 sentences, 300 of which are manually annotated for personal name recognition tasks, this dataset serves as a comprehensive resource for evaluating ChatGPT's performance in classical Chinese translation and personal name recognition. The core tasks of this dataset focus on classical Chinese to modern Chinese translation and personal name recognition.
提供机构:
NiuTrans
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作