XCommonsense-BN: A Bangla Commonsense Reasoning Dataset
Source: DataCite Commons. Updated: 2026-04-06. Indexed: 2026-05-04.
Download link:
https://data.mendeley.com/datasets/cwzv55wtvf/2
Description:
XCommonsense-BN is a curated collection of Bangla multiple-choice questions for commonsense reasoning research. It is organized as a table in which each entry has a unique identifier, a reasoning category, the question in Bangla, four answer options labeled A through D, the correct option, and a short Bangla explanation justifying the correct answer. The dataset comprises over 1,000 entries across five categories (causal, temporal, social, physical, and intentional reasoning), and each category contains at least 150 to 200 questions to keep coverage balanced. The files are provided in Excel format with UTF-8 encoding to support the Bangla script. Sample entries illustrate typical questions, answer options, correct labels, and explanations, giving a representative view of the dataset’s structure and content. The dataset supports the development, evaluation, and benchmarking of machine learning models on Bangla commonsense reasoning tasks and contributes to NLP research for low-resource languages.
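The entry structure described above can be sketched as a small validation routine. This is an illustration only: the column names (`id`, `category`, `question`, `option_a` through `option_d`, `answer`, `explanation`) are assumptions, since the listing does not give the actual headers, and the sample row is invented for the example rather than taken from the dataset.

```python
from collections import Counter
from dataclasses import dataclass

# The five reasoning categories named in the description.
CATEGORIES = {"causal", "temporal", "social", "physical", "intentional"}
OPTION_LABELS = ("A", "B", "C", "D")


@dataclass(frozen=True)
class Entry:
    entry_id: str
    category: str
    question: str
    options: dict      # option label ("A".."D") -> option text
    answer: str        # label of the correct option
    explanation: str


def parse_entry(row: dict) -> Entry:
    """Validate one row (a dict keyed by the ASSUMED column names)."""
    category = str(row["category"]).strip().lower()
    if category not in CATEGORIES:
        raise ValueError(f"unknown category: {category!r}")
    options = {
        label: str(row[f"option_{label.lower()}"]).strip()
        for label in OPTION_LABELS
    }
    answer = str(row["answer"]).strip().upper()
    if answer not in options:
        raise ValueError(f"answer must be one of A-D, got {answer!r}")
    return Entry(
        entry_id=str(row["id"]),
        category=category,
        question=str(row["question"]).strip(),
        options=options,
        answer=answer,
        explanation=str(row["explanation"]).strip(),
    )


def category_counts(entries) -> Counter:
    """Count entries per category, e.g. to check the balance claim."""
    return Counter(entry.category for entry in entries)


# Invented sample row (NOT from the dataset), for illustration only.
sample_row = {
    "id": "demo-1",
    "category": "Causal",
    "question": "বৃষ্টি হলে মাটি কেমন হয়?",
    "option_a": "ভেজা",
    "option_b": "শুকনো",
    "option_c": "গরম",
    "option_d": "শক্ত",
    "answer": "a",
    "explanation": "বৃষ্টির পানিতে মাটি ভিজে যায়।",
}
entry = parse_entry(sample_row)
counts = category_counts([entry])
```

Rows read from the Excel file with any spreadsheet reader could be passed through `parse_entry` after renaming the real headers to the assumed ones.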
Value of the Data:
1. Enables research in Bangla NLP, particularly in commonsense reasoning.
2. Can be used to train, evaluate, and benchmark machine learning and AI models for Bangla question-answering systems.
3. Promotes data-driven AI research in low-resource languages.
4. Supports cross-lingual and multilingual model development by providing high-quality, curated Bangla data.
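
As one possible way to use the dataset for evaluation, the sketch below computes per-category accuracy for a model's predicted option labels. The function name `accuracy_by_category` and the input shapes are assumptions made for this example, not part of the dataset or of any published evaluation protocol.

```python
from collections import defaultdict


def accuracy_by_category(gold, predictions):
    """Return {category: accuracy} for multiple-choice predictions.

    gold: iterable of (entry_id, category, correct_label) tuples.
    predictions: dict mapping entry_id -> predicted label ("A".."D").
    A missing prediction is counted as wrong.
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for entry_id, category, answer in gold:
        total[category] += 1
        predicted = str(predictions.get(entry_id, "")).strip().upper()
        if predicted == str(answer).strip().upper():
            correct[category] += 1
    return {category: correct[category] / total[category] for category in total}


# Toy inputs (invented, not dataset content).
gold = [("1", "causal", "A"), ("2", "causal", "B"), ("3", "social", "C")]
predictions = {"1": "A", "2": "C", "3": "c"}
scores = accuracy_by_category(gold, predictions)
```

Reporting accuracy per category rather than one overall number makes it easier to see whether a model is weaker on, say, temporal than on physical reasoning.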
Provider:
Mendeley Data
Created:
2026-04-06



