PCoQA
Source: arXiv | Updated: 2023-12-07 | Indexed: 2024-06-21
Download link:
https://github.com/HamedHematian/PCoQA
Resource overview:
PCoQA is the first Persian conversational question answering dataset, created by the Artificial Intelligence Group at Sharif University of Technology. It contains 870 conversations with a total of 9,026 question-answer pairs, sourced from Wikipedia. To ensure data quality, the researchers took several measures during construction, such as limiting the proportion of unanswerable questions and reducing lexical overlap. PCoQA is particularly suited to research on conversational question answering systems: it targets the conversational dynamics that traditional question answering systems overlook, supporting more natural and interactive question answering.
Provided by:
Department of Computer Engineering, Artificial Intelligence Group, Sharif University of Technology
Created:
2023-12-07



