five

Textbook Dataset from NCTB

收藏
Mendeley Data2024-03-27 更新2024-06-26 收录
下载链接:
https://data.mendeley.com/datasets/gktc5y2sy2
下载链接
链接失效反馈
官方服务:
资源简介:
In our quest to advance Bangla language processing, we have created a specialized dataset tailored to our project's objectives. This dataset is a cornerstone in developing an effective Bangla Question-Answering system with a strong emphasis on customization. It comprises approximately 3,000 meticulously curated question-and-answer pairs. Human annotators, guided by NCTB textbooks from classes six to ten, painstakingly selected these pairs. Each passage in the dataset, averaging 387 words, offers rich context for meaningful question answering. Human annotators also diligently collected responses for various question types, ensuring the dataset's reliability and relevance in Bangla. Our primary goal is to develop a proficient Bangla question-answering system. We have organized the dataset into training and validation subsets to achieve this, conveniently encapsulated within CSV files. These files seamlessly integrate multiple passages with corresponding questions and expertly annotated answers. Our dataset forms the foundation for a precision-driven, context-aware Bangla question-answering system. It serves as a vital resource for researchers and developers working to enhance Bangla language processing capabilities, poised to advance the state of the art in this field.
创建时间:
2024-01-23
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作