five

BengaliTenseCorpus: A comprehensive corpus in Bengali texts categorized in Present , Past, and Future

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://data.mendeley.com/datasets/w9mdy6tw84
下载链接
链接失效反馈
官方服务:
资源简介:
The BengaliTenseCorpus has been sourced from various publicly accessible Bangla blogs, Facebook pages, magazines, books, and news articles, and some of the data are self-made, which ensures a diverse representation of contemporary language use. A critical aspect of the dataset’s curation was maintaining an equal distribution of sentences across three tense categories: past, present, and future. The dataset comprises 13,500 Bangla sentences that are categorized into three classes: present tense with 4,550 sentences, past tense with 4,460, and future tense collection with 4,490 sentences. For labeling purposes, 3 numerical values are used as - 0, 1, and 2, respectively, for present tense, past tense, and future tense.
创建时间:
2024-12-06
二维码
社区交流群
二维码
科研交流群
商业服务