aloobun/dhpileIN
Source: Hugging Face | Updated: 2024-12-10 | Indexed: 2024-12-14
Download link:
https://hf-mirror.com/datasets/aloobun/dhpileIN
Resource description:
Vārta is a large-scale headline-generation dataset for Indic languages. It supports multiple Indic languages, including Bengali (bn), Gujarati (gu), Hindi (hi), Kannada (kn), Tamil (ta), Telugu (te), and Malayalam (ml). The dataset size is between 1 million and 10 million.
Provider:
aloobun



