Persian ATIS

Source: arXiv. Updated: 2023-03-01. Indexed: 2024-08-06.
Download link:
http://arxiv.org/abs/2303.00408v1
Resource description:
The Persian ATIS dataset was created by a research team at Amirkabir University of Technology as a benchmark for joint intent detection and slot filling in Persian. It is derived from the English ATIS dataset and contains 5,871 Persian utterances in the travel domain. The team produced a preliminary machine translation with the EasyNMT tool, after which native Persian speakers proofread the translations and annotated them. The dataset is intended for natural language processing work, particularly Persian intent understanding and dialog system development, and addresses the scarcity of Persian data in this field.
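
For readers unfamiliar with the task, the following is a minimal sketch of what a joint intent detection and slot filling example typically looks like: one intent label per utterance plus one BIO slot tag per token. The `AnnotatedUtterance` class and `extract_slots` helper are hypothetical names introduced here for illustration, and the sample sentence is an English example in the style of ATIS, not a record from the Persian dataset; the dataset's actual file format is not described on this page.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class AnnotatedUtterance:
    """Hypothetical container for one annotated utterance (an assumption)."""
    tokens: List[str]     # whitespace-separated tokens
    slot_tags: List[str]  # one BIO tag per token
    intent: str           # a single utterance-level label


def extract_slots(example: AnnotatedUtterance) -> List[Tuple[str, str]]:
    """Collapse BIO tags into (slot_name, text) pairs."""
    if len(example.tokens) != len(example.slot_tags):
        raise ValueError("tokens and slot_tags must have the same length")
    slots: List[Tuple[str, str]] = []
    current_name, current_tokens = None, []
    for token, tag in zip(example.tokens, example.slot_tags):
        if tag.startswith("B-"):
            # A new span begins; flush any span that was still open.
            if current_name is not None:
                slots.append((current_name, " ".join(current_tokens)))
            current_name, current_tokens = tag[2:], [token]
        elif tag.startswith("I-") and current_name == tag[2:]:
            # Continuation of the currently open span.
            current_tokens.append(token)
        else:
            # "O" or an inconsistent "I-" tag closes the open span.
            if current_name is not None:
                slots.append((current_name, " ".join(current_tokens)))
            current_name, current_tokens = None, []
    if current_name is not None:
        slots.append((current_name, " ".join(current_tokens)))
    return slots


# Illustrative English example in the style of ATIS; not taken from the dataset.
example = AnnotatedUtterance(
    tokens=["show", "flights", "from", "boston", "to", "new", "york"],
    slot_tags=["O", "O", "O", "B-fromloc.city_name", "O",
               "B-toloc.city_name", "I-toloc.city_name"],
    intent="atis_flight",
)
slots = extract_slots(example)
```

A joint model predicts `intent` and `slot_tags` together from `tokens`; a helper like `extract_slots` is only a convenience for reading the slot predictions back as spans.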
Provided by:
Amirkabir University of Technology
Created:
2023-03-01