QALD-9-plus
收藏arXiv2022-02-07 更新2024-06-21 收录
下载链接:
https://doi.org/10.6084/m9.figshare.16864273
下载链接
链接失效反馈官方服务:
资源简介:
QALD-9-plus是由安哈尔特应用科技大学计算机科学与语言系创建的多语言数据集,旨在通过高质量的本地语言翻译增强知识图谱问答系统的多语言访问性。该数据集包含4930条问题翻译,覆盖9种语言,包括英语、德语、法语、俄语等,其中部分语言如亚美尼亚语、巴什基尔语等首次被纳入研究。数据集的创建过程涉及众包翻译,确保了翻译的准确性和语言的多样性。QALD-9-plus的应用领域主要集中在提升知识图谱问答系统的多语言处理能力,特别是对于低资源和濒危语言的支持,以实现更广泛的用户群体访问。
QALD-9-plus is a multilingual dataset developed by the Department of Computer Science and Linguistics, Anhalt University of Applied Sciences, aiming to enhance the multilingual accessibility of Knowledge Graph Question Answering (KGQA) systems through high-quality native-language translations. This dataset includes 4,930 question translations covering 9 languages, such as English, German, French, Russian and others. Notably, some languages like Armenian and Bashkir are included in the research for the first time. The dataset construction process involved crowdsourced translations, ensuring translation accuracy and linguistic diversity. The primary application scenarios of QALD-9-plus focus on improving the multilingual processing capabilities of KGQA systems, especially supporting low-resource and endangered languages, to enable access for a broader user base.
提供机构:
安哈尔特应用科技大学计算机科学与语言系
创建时间:
2022-02-01



