shironaam
收藏OpenXLab2026-04-18 收录
下载链接:
https://openxlab.org.cn/datasets/OpenDataLab/shironaam
下载链接
链接失效反馈官方服务:
资源简介:
Automatic headline generation systems have the potential to assist editors in finding interesting headlines to attract visitors or readers.
However, the performance of headline generation systems remains challenging due to the unavailability of sufficient parallel data for
low-resource languages like Bengali. We provide Shironaam, a large-scale news headline generation dataset of a low-resource language
i.e., Bengali containing over 240K news headline-article pairings with auxiliary information such as image captions, topic words,
and category information. Also, this dataset can potentially be used for other tasks such as document categorization, news clustering,
keyword identification, etc.
提供机构:
OpenDataLab
创建时间:
2023-12-06



