five

XCommonsense-BN: A Bangla Commonsense Reasoning Dataset

收藏
DataCite Commons2026-04-06 更新2026-05-04 收录
下载链接:
https://data.mendeley.com/datasets/cwzv55wtvf/2
下载链接
链接失效反馈
官方服务:
资源简介:
The XCommonsense-BN dataset is a curated collection of Bangla multiple-choice questions designed for commonsense reasoning research. It is organized in a tabular format, where each entry includes a unique identifier, the reasoning category, the question in Bangla, four answer options labeled A through D, the correct option, and a short explanation in Bangla justifying the correct answer. The dataset comprises over 1,000 entries spanning five categories: causal, temporal, social, physical, and intentional reasoning, with each category containing at least 150–200 questions to ensure balanced coverage. The dataset is provided in Excel formats encoded in UTF-8 to support the Bangla script. Sample entries illustrate typical questions, answer options, correct labels, and explanations, providing a representative view of the dataset’s structure and content. This dataset enables the development, evaluation, and benchmarking of machine learning models in Bangla commonsense reasoning tasks and contributes to research in low-resource language NLP. Value of the Data: 1. Enables research in Bangla NLP, particularly in commonsense reasoning. 2. Can be used to train, evaluate, and benchmark machine learning and AI models for Bangla question-answering systems. 3. Promotes data-driven AI research in low-resource languages. 4. Supports cross-lingual and multilingual model development by providing high-quality, curated Bangla data.
提供机构:
Mendeley Data
创建时间:
2026-04-06
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作