Travel Conversation Exercise Corpus
收藏DataCite Commons2021-10-05 更新2025-04-16 收录
下载链接:
https://ieee-dataport.org/documents/travel-conversation-exercise-corpus
下载链接
链接失效反馈官方服务:
资源简介:
The Travel Conversation Exercise (TCE) corpus contains 888 parallel sentences and their zero pronoun label (true or false) which are written in Japanese and English.The sentences in this dataset are sentences that are often used to practice English conversations, when travelling abroad, for L2 English learners in Japan.Some of the conversation themes include (but are not limited to) conversations at the airport, hotel reservations, sightseeing, and emergencies.Several example usages include using the dataset as an evaluation set to compare machine translation results, compare machine translation models' ability in translating zero pronoun sentences and those which are not, and build a zero pronoun binary classification model to automatically label new Japanese-English parallel sentences.
旅行会话练习(Travel Conversation Exercise,简称TCE)语料库包含888组日英平行句对及其零代词(zero pronoun)标注(true或false)。本数据集收录的语句均为日本地区第二语言(L2)英语学习者用于出国旅行时的英语会话练习常用语句。该数据集涵盖的会话主题包括(但不限于)机场场景对话、酒店预订、观光游览及应急场景对话。该数据集的典型应用场景包括:将其作为评测集用于对比机器翻译(machine translation)结果、评测机器翻译模型对零代词语句与非零代词语句的翻译能力,以及构建零代词二分类模型以自动为新增日英平行句对添加标注。
提供机构:
IEEE DataPort
创建时间:
2021-10-05



