five

Business Scene Dialogue Dataset

收藏
paperswithcode.com2025-03-26 收录
下载链接:
https://paperswithcode.com/dataset/business-scene-dialogue
下载链接
链接失效反馈
官方服务:
资源简介:
The Japanese-English business conversation corpus, namely Business Scene Dialogue corpus, was constructed in 3 steps: selecting business scenes, writing monolingual conversation scenarios according to the selected scenes, and translating the scenarios into the other language. Half of the monolingual scenarios were written in Japanese and the other half were written in English. The whole construction process was supervised by a person who satisfies the following conditions to guarantee the conversations to be natural: has the experience of being engaged in language learning programs, especially for business conversations is able to smoothly communicate with others in various business scenes both in Japanese and English has the experience of being involved in business The BSD corpus is split into balanced training, development and evaluation sets. The documents in these sets are balanced in terms of scenes and original languages. In this repository we publicly share the full development and evaluation sets and a part of the training data set.

《日英商业对话语料库》,亦称商业场景对话语料库,系通过三阶段构建而成:首先,选择商业场景;其次,根据所选场景撰写单语种对话场景;最后,将场景翻译为另一种语言。其中,单语种场景一半以日语撰写,另一半则以英语完成。整个构建过程均由满足以下条件的人员监督,以确保对话的自然性:具有参与语言学习项目尤其是商业对话项目的经验;能够在日英两种语言中流畅地与其他人在各种商业场景中进行沟通;具备参与商业活动的经验。BSD 语料库被分为均衡的训练集、开发集和评估集,其中各集合在场景和原始语言方面均保持平衡。在本存储库中,我们公开发布了全部开发集和评估集,以及部分训练数据。
提供机构:
Papers with Code
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作