jhu-clsp/mmBERT-decay-data
收藏Hugging Face2025-12-11 更新2025-09-13 收录
下载链接:
https://hf-mirror.com/datasets/jhu-clsp/mmBERT-decay-data
下载链接
链接失效反馈官方服务:
资源简介:
MMBERT退火阶段数据集,包含1000亿个token,支持1833种语言,采用级联退火语言学习方法,特别适合低资源语言的学习。
MMBERT Decay Phase Data, containing 100B tokens, supporting 1833 languages, utilizing the novel Cascading Annealed Language Learning approach, particularly suitable for low-resource language learning.
提供机构:
jhu-clsp



