Lumia101/Fineweb-2-kor-500MT
Source: Hugging Face. Updated: 2026-04-12. Indexed: 2026-04-12.
Download link:
https://hf-mirror.com/datasets/Lumia101/Fineweb-2-kor-500MT
Resource description:
---
license: cc-by-4.0
task_categories:
- text-generation
language:
- ko
size_categories:
- 100K<n<1M
---
# Lumia101/Fineweb-2-kor-500MT
This dataset is a 500-million-token Korean subset extracted from [FineWeb-2](https://huggingface.co/datasets/HuggingFaceFW/fineweb-2).
Because it contains only 500M tokens, consider combining it with other high-quality data, or use the [original dataset](https://huggingface.co/datasets/HuggingFaceFW/fineweb-2) instead.
The repository owner created it to verify the effect of the additional filtering used for [Lumia101/Nari-C4-ko-500MT](https://huggingface.co/datasets/Lumia101/Nari-C4-ko-500MT).
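
As a rough illustration of combining this subset with other data, the sketch below streams documents with the `datasets` library and mixes in a second source at a fixed ratio. The split name `train` and the column name `text` are assumptions carried over from FineWeb-2 and are not confirmed by this card. `stream_texts` and `interleave` are hypothetical helper names introduced only for this example.

```python
from typing import Iterable, Iterator

REPO_ID = "Lumia101/Fineweb-2-kor-500MT"  # repository name taken from this card


def stream_texts(
    repo_id: str = REPO_ID,
    split: str = "train",        # assumption: the card does not state the split name
    text_column: str = "text",   # assumption: same column name as FineWeb-2
) -> Iterator[str]:
    """Stream documents from the Hugging Face Hub without downloading everything."""
    # Lazy import so the pure-Python helper below works without `datasets` installed.
    from datasets import load_dataset

    dataset = load_dataset(repo_id, split=split, streaming=True)
    for row in dataset:
        yield row[text_column]


def interleave(
    primary: Iterable[str], extra: Iterable[str], extra_every: int = 3
) -> Iterator[str]:
    """Yield every item of `primary`, adding one `extra` item after each
    `extra_every` primary items until `extra` runs out."""
    extra_iter = iter(extra)
    for i, doc in enumerate(primary, start=1):
        yield doc
        if i % extra_every == 0:
            nxt = next(extra_iter, None)
            if nxt is not None:
                yield nxt
```

For example, `interleave(stream_texts(), other_texts, extra_every=3)` would yield three documents from this subset followed by one from `other_texts`, where `other_texts` stands for any other iterable of strings you supply. The fixed ratio is only one simple mixing choice; sampling by token count may be more appropriate for actual training.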
Provider:
Lumia101



