Persian ATIS
收藏arXiv2023-03-01 更新2024-08-06 收录
下载链接:
http://arxiv.org/abs/2303.00408v1
下载链接
链接失效反馈官方服务:
资源简介:
Persian ATIS数据集是由阿米尔卡比尔理工大学的研究团队基于英文ATIS数据集创建的,旨在为波斯语提供一个用于联合意图检测和槽填充的基准。该数据集包含5871条波斯语语音,内容涉及旅行相关领域,通过机器翻译和人工校正的方式从英文ATIS数据集转换而来。创建过程中,研究团队采用了EasyNMT工具进行初步翻译,并由波斯语母语者进行校对和标注。Persian ATIS数据集的应用领域主要集中在自然语言处理,特别是波斯语的意图理解和对话系统开发,以解决波斯语在自然语言处理领域的数据稀缺问题。
The Persian ATIS dataset was created by a research team from Amirkabir University of Technology based on the English ATIS dataset, aiming to provide a benchmark for joint intent detection and slot filling in the Persian language. This dataset contains 5,871 Persian speech utterances related to the travel domain, which was converted from the English ATIS dataset through machine translation and manual correction. During the development process, the research team adopted the EasyNMT tool for preliminary translation, and native Persian speakers conducted proofreading and annotation work. The application scenarios of the Persian ATIS dataset mainly focus on natural language processing, especially Persian intent understanding and dialog system development, to address the data scarcity problem of Persian in the field of natural language processing.
提供机构:
阿米尔卡比尔理工大学
创建时间:
2023-03-01



