ittia/wiki_dpr
收藏Hugging Face2024-08-22 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/ittia/wiki_dpr
下载链接
链接失效反馈官方服务:
资源简介:
Wiki-DPR项目包含用于RAG训练和研究的索引、数据集和检查点。数据集由众包创建,语言为英语,许可证为cc-by-nc-4.0,具有多语言性,规模在10M到100M之间,来源于原始数据集,任务类别包括填充掩码和文本生成,任务ID包括语言建模和掩码语言建模。
This dataset is used for RAG training and research, containing indexes, datasets, and checkpoints. The dataset language is English and is a multilingual dataset. Task categories include fill-mask and text-generation, with specific tasks including language modeling and masked language modeling. The dataset size is between 10M and 100M, and it is an original dataset.
提供机构:
ittia



