OpenWebText
收藏NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://zenodo.org/record/3834941
下载链接
链接失效反馈官方服务:
资源简介:
An open-source replication of the WebText dataset from OpenAI.
For more info please visit https://skylion007.github.io/OpenWebTextCorpus/
@misc{Gokaslan2019OpenWeb,
title={OpenWebText Corpus},
author={Aaron Gokaslan*, Vanya Cohen*, Ellie Pavlick, Stefanie Tellex},
howpublished{\url{http://Skylion007.github.io/OpenWebTextCorpus}},
year={2019}
}
创建时间:
2020-05-27



