BengaliTenseCorpus: A comprehensive corpus in Bengali texts categorized in Present, Past, and Future
Source: NIAID Data Ecosystem (indexed 2026-05-02)
Download link:
https://data.mendeley.com/datasets/w9mdy6tw84
Description:
The BengaliTenseCorpus was sourced from publicly accessible Bangla blogs, Facebook pages,
magazines, books, and news articles, supplemented by some sentences written by the dataset's creators,
to give a diverse representation of contemporary language use. A key curation goal was to keep the
three tense categories (present, past, and future) close to equally represented. The dataset comprises
13,500 Bangla sentences in three classes: 4,550 present-tense sentences, 4,460 past-tense sentences,
and 4,490 future-tense sentences. Each sentence carries a numerical label: 0 for present tense,
1 for past tense, and 2 for future tense.
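The label scheme and class sizes described above can be sketched in a few lines of Python. This is only an illustration built from the numbers in the description: the names `LABELS`, `CLASS_COUNTS`, `decode`, and `class_shares` are hypothetical, and nothing here reflects the actual file layout or column names of the published dataset.

```python
# Label scheme and class sizes as stated in the dataset description.
# All names below are illustrative assumptions, not part of the dataset itself.
LABELS = {0: "present", 1: "past", 2: "future"}
CLASS_COUNTS = {"present": 4550, "past": 4460, "future": 4490}


def decode(label):
    """Map a numeric label (0, 1, or 2) to its tense name."""
    return LABELS[label]


def class_shares(counts):
    """Return each class's share of the total as a fraction."""
    total = sum(counts.values())
    return {name: n / total for name, n in counts.items()}


total = sum(CLASS_COUNTS.values())
shares = class_shares(CLASS_COUNTS)

print(f"total sentences: {total}")
for name, share in shares.items():
    print(f"{name}: {CLASS_COUNTS[name]} sentences ({share:.1%})")
```

Because the three classes differ by fewer than 100 sentences each, a classifier trained on this corpus would likely not need class reweighting, though that depends on how the data is split.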
Created:
2024-12-06



