BengaliTenseCorpus: A comprehensive corpus of Bengali texts categorized into Present, Past, and Future tenses

Indexed from NIAID Data Ecosystem on 2026-05-02

Download link:

https://data.mendeley.com/datasets/w9mdy6tw84

Resource description:

The BengaliTenseCorpus was sourced from publicly accessible Bangla blogs, Facebook pages, magazines, books, and news articles, supplemented by some self-made sentences, to give a diverse representation of contemporary language use. A key goal of the curation was to keep the three tense categories (past, present, and future) roughly equal in size. The dataset comprises 13,500 Bangla sentences in three classes: 4,550 present-tense sentences, 4,460 past-tense sentences, and 4,490 future-tense sentences. The classes are labeled numerically: 0 for present tense, 1 for past tense, and 2 for future tense.
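The label scheme and class sizes described above can be written down as a small Python sketch, which may help when checking a downloaded copy against the stated figures. The names `LABEL_TO_TENSE`, `STATED_COUNTS`, `class_shares`, and `count_labels` are illustrative assumptions, not part of the dataset; the listing does not describe the file layout, so the sketch works on a plain list of integer labels rather than on any particular file.

```python
from collections import Counter

# Label scheme as given in the description: 0 = present, 1 = past, 2 = future.
LABEL_TO_TENSE = {0: "present", 1: "past", 2: "future"}

# Class sizes as stated in the description (hypothetical constant name).
STATED_COUNTS = {"present": 4550, "past": 4460, "future": 4490}


def class_shares(counts):
    """Return each class's fraction of the total number of sentences."""
    total = sum(counts.values())
    return {name: n / total for name, n in counts.items()}


def count_labels(labels):
    """Tally integer labels (0, 1, 2) into per-tense counts."""
    tally = Counter(LABEL_TO_TENSE[label] for label in labels)
    return {name: tally.get(name, 0) for name in LABEL_TO_TENSE.values()}


# Print the share of each class implied by the stated counts.
for tense, share in class_shares(STATED_COUNTS).items():
    print(f"{tense}: {share:.1%}")
```

Passing the label column of a loaded copy to `count_labels` and comparing the result with `STATED_COUNTS` would be one way to confirm that the copy matches the description.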

Created:

2024-12-06