Bangla Text Paraphrase Corpus for Natural Language Processing
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://data.mendeley.com/datasets/ffkm5rk2yg
下载链接
链接失效反馈官方服务:
资源简介:
This dataset is Bangla Paraphrase Sentence Pair Dataset (BPDS), contains pairs of Bangla sentences labeled as paraphrase (same meaning) or non-paraphrase (different meaning). The data has been collected from diverse Bangla sources including books, newspapers and literature articles, covering a wide range of topics and writing styles. It is designed for research in natural language processing tasks such as paraphrase detection, semantic textual similarity, text generation and plagiarism detection in Bangla. The dataset is provided in .xlsx format with three columns: Sentence1, Sentence2.
创建时间:
2025-08-12



