guus4324343/NexusCorpus-1B
Source: Hugging Face. Updated 2026-03-04. Indexed 2026-03-29.
Download link:
https://hf-mirror.com/datasets/guus4324343/NexusCorpus-1B
Resource description:
---
license: apache-2.0
language:
- en
task_categories:
- text-generation
---
# NexusCorpus-1B
NexusCorpus-1B is a cleaned English training corpus for language modeling.
## Contents
A mix of subtitle and web text sources, stored as JSONL (`text` field).
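Since the card only states that records are JSONL with a `text` field, the following is a minimal sketch of how such records might be read. It assumes one JSON object per line and that `text` holds a string; the sample lines and the helper name `iter_texts` are illustrative and not part of the dataset.

```python
import json
from typing import Iterable, Iterator


def iter_texts(lines: Iterable[str]) -> Iterator[str]:
    """Yield the `text` field from each non-empty JSONL line.

    Assumption: every line is a standalone JSON object. Records without
    a non-empty string `text` field are skipped rather than raising.
    """
    for line in lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        record = json.loads(line)
        text = record.get("text")
        if isinstance(text, str) and text:
            yield text


# Illustrative in-memory sample; real shards would be opened as files.
sample = [
    '{"text": "Hello there."}',
    "",
    '{"text": "A second document."}',
]
texts = list(iter_texts(sample))
```

Because `iter_texts` accepts any iterable of strings, an open file handle (for example `open(path, encoding="utf-8")`, with `path` pointing at a downloaded shard) can be passed directly, so the corpus is streamed line by line instead of loaded into memory.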
## Intended use
Training and fine-tuning causal language models.
Provided by:
guus4324343



