Wikitext-103
收藏academictorrents.com2025-03-22 收录
下载链接:
https://academictorrents.com/details/a4fee5547056c845e31ab952598f43b42333183c
下载链接
链接失效反馈官方服务:
资源简介:
A collection of over 100 million tokens extracted from the set of verified Good and Featured articles on Wikipedia. Widely used for language modeling, including the pretrained models used in the fastai library and ULMFiT algorithm.
本数据集汇聚了超过一亿个标记,源自维基百科经过验证的优质和特色文章集合。该数据集被广泛应用于语言建模,包括在 fastai 库和 ULMFiT 算法中使用的预训练模型。
提供机构:
academictorrents.com



