BengaliNewspaperCommonCoverageArticleDataset
Source: NIAID Data Ecosystem (indexed 2026-05-10)
Download link:
https://data.mendeley.com/datasets/msmhb5fmf6
Description:
The dataset was gathered from five well-known newspapers in Bangladesh. It contains 1056 titles, one per row. Under each title, the dataset provides five sets of information, one per newspaper, each including the date, URL, headline, and description.
Data Format:
• Articles are encoded as UTF-8 text files.
• The title is in the first row of each file.
• The content of the article starts from the second row onward.
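The file layout described above (title in the first row, content from the second row onward) can be read with a few lines of Python. This is a minimal sketch, not part of the dataset: the function names `split_article` and `read_article` are hypothetical, and it assumes each article file is plain UTF-8 text with ordinary line breaks.

```python
from pathlib import Path


def split_article(text: str) -> tuple[str, str]:
    """Split raw article text into (title, body).

    Assumes the first row is the title and every later row is content.
    """
    lines = text.splitlines()
    if not lines:
        return "", ""
    title = lines[0].strip()
    body = "\n".join(lines[1:]).strip()
    return title, body


def read_article(path: str) -> tuple[str, str]:
    """Read one article file as UTF-8 and split it into (title, body)."""
    return split_article(Path(path).read_text(encoding="utf-8"))
```

For example, `split_article("শিরোনাম\nপ্রথম লাইন")` would return the Bangla title and the single content line as separate strings.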
Each row represents one article with a unique serial number (SL), its common Bangla headline, and its publication date (yy/mm/dd format). For each article, the dataset includes the URL, headline, and description from each of five major Bangladeshi newspapers:
Prothom Alo
Daily Inqilab
Daily Ittefaq
Bangladesh Pratidin
Kaler Kantho
This allows comparison of how the same news article was reported, titled, and described across different newspapers.
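As a rough illustration of such a comparison, the sketch below parses a yy/mm/dd publication date and collects the five headlines of one row. The column naming scheme (`"<Newspaper> Headline"`) and the function names are assumptions made for this example; the actual column names in the dataset may differ and should be checked against the downloaded files.

```python
from datetime import date, datetime

# Newspaper names as listed in the dataset description.
NEWSPAPERS = [
    "Prothom Alo",
    "Daily Inqilab",
    "Daily Ittefaq",
    "Bangladesh Pratidin",
    "Kaler Kantho",
]


def parse_pub_date(value: str) -> date:
    """Parse a publication date given in yy/mm/dd format."""
    return datetime.strptime(value.strip(), "%y/%m/%d").date()


def headlines_by_paper(row: dict) -> dict:
    """Map each newspaper to its headline for one row.

    Assumes hypothetical column names of the form "<Newspaper> Headline";
    missing columns yield an empty string.
    """
    return {paper: row.get(f"{paper} Headline", "") for paper in NEWSPAPERS}
```

Given a row loaded as a dictionary, `headlines_by_paper(row)` returns one headline per newspaper, which can then be compared side by side for the same story.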
Created:
2025-11-10



