cosmopedia-100k
ModelScope Community | Updated 2026-01-06 | Indexed 2024-06-08
Download link:
https://modelscope.cn/datasets/swift/cosmopedia-100k
Resource description:
# Dataset description
This is a 100k-sample subset of the [Cosmopedia](https://huggingface.co/datasets/HuggingFaceTB/cosmopedia) dataset, a synthetic dataset of textbooks, blog posts, stories, posts, and WikiHow articles generated by Mixtral-8x7B-Instruct-v0.1.
Here's how you can load the dataset:
```python
from datasets import load_dataset
ds = load_dataset("HuggingFaceTB/cosmopedia-100k", split="train")
```
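Once the dataset is loaded, a quick look at a few rows helps confirm what fields it contains. The following is a minimal sketch, not part of the original card: `preview_rows` and `preview_cosmopedia` are hypothetical helper names, and the sketch makes no assumption about column names. It uses the `streaming=True` option of `load_dataset` so that the first rows can be read without downloading the full split.

```python
from itertools import islice


def preview_rows(rows, n=3, max_chars=80):
    """Return the first n rows, truncating long string values for display.

    `rows` is any iterable of dict-like records, such as a loaded dataset.
    """
    out = []
    for row in islice(rows, n):
        short = {}
        for key, value in row.items():
            # Generated documents can be long, so shorten string fields.
            if isinstance(value, str) and len(value) > max_chars:
                value = value[:max_chars] + "..."
            short[key] = value
        out.append(short)
    return out


def preview_cosmopedia(n=3):
    """Hypothetical helper: stream the subset and preview its first n rows."""
    # Imported here so preview_rows works without `datasets` installed.
    from datasets import load_dataset

    ds = load_dataset(
        "HuggingFaceTB/cosmopedia-100k", split="train", streaming=True
    )
    return preview_rows(ds, n)
```

Calling `preview_cosmopedia(2)` requires network access and returns a list of two dictionaries whose keys are whatever columns the dataset provides.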
Provider:
maas
Created:
2024-06-05



