MASBA: A Large-Scale Dataset for Multi-Level Abstractive Summarization of Bangla Articles
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://data.mendeley.com/datasets/rxhj7g6y2k
下载链接
链接失效反馈官方服务:
资源简介:
Our research hypothesis is to evaluate the effectiveness of different Bangla text summarization methods compared to the original text ('main'). The data shows that:
- The average length of the main text is 2482.72 characters.
- The average length of the summaries are:
- sum1: 293.75 characters,
- sum2: 506.10 characters,
- sum3: 688.50 characters.
The compression ratio of each summary method (summary length divided by main length) reveals that:
- sum1's mean compression ratio is 0.14,
- sum2's mean compression ratio is 0.24, and
- sum3's mean compression ratio is 0.33.
Notable findings:
- sum1 appears to be the shortest summary on average, with a higher degree of compression.
- sum2 produces summaries of medium length, while sum3 tends to generate the longest summaries.
Data Gathering and Interpretation:
The data can be interpreted to assess which method produces the most concise, yet meaningful, summaries. Researchers can use these findings to evaluate the trade-offs between summary length and completeness of information conveyed.
创建时间:
2025-05-21



