five

Sellopale/Assistantsellopale53oi

收藏
Hugging Face2025-12-18 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/Sellopale/Assistantsellopale53oi
下载链接
链接失效反馈
官方服务:
资源简介:
OpenAssistant Conversations (OASST1)是一个人类生成、人类注释的助手风格对话语料库,旨在推动大规模对齐研究的民主化。该数据集包含161,443条消息,涵盖35种不同语言,并包含461,292个质量评分,形成了超过10,000个完全注释的对话树。数据集由全球超过13,500名志愿者共同创建,每条消息都有prompter或assistant角色,并在对话线程中严格交替。数据集提供了多种文件格式,包括嵌套消息树和扁平消息列表,适用于监督微调(SFT)和奖励模型(RM)训练。

OpenAssistant Conversations (OASST1) is a human-generated, human-annotated assistant-style conversation corpus aimed at democratizing research on large-scale alignment. The dataset consists of 161,443 messages in 35 different languages, annotated with 461,292 quality ratings, resulting in over 10,000 fully annotated conversation trees. It is a product of a worldwide crowd-sourcing effort involving over 13,500 volunteers. Each message has a role of either prompter or assistant, strictly alternating in conversation threads. The dataset is provided in various formats, including nested message trees and flat message lists, suitable for supervised fine-tuning (SFT) and reward model (RM) training.
提供机构:
Sellopale
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作