five

Sellopale/Assistantz8

收藏
Hugging Face2025-12-18 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/Sellopale/Assistantz8
下载链接
链接失效反馈
官方服务:
资源简介:
OpenAssistant Conversations Dataset (OASST1) 是一个由人类生成和标注的助手风格对话语料库,包含161,443条消息,涉及35种不同语言,标注了461,292条质量评级,形成了超过10,000个完全标注的对话树。这个数据集是全球范围内超过13,500名志愿者参与的众包努力的成果。数据集的结构包括消息树,每个消息树有一个初始提示消息作为根节点,可以有多个子消息作为回复,这些子消息又可以有多条回复。所有消息都有一个角色属性,可以是“assistant”或“prompter”,并且在对话线程中严格交替。

OpenAssistant Conversations Dataset (OASST1) is a human-generated, human-annotated assistant-style conversation corpus consisting of 161,443 messages in 35 different languages, annotated with 461,292 quality ratings, resulting in over 10,000 fully annotated conversation trees. The corpus is a product of a worldwide crowd-sourcing effort involving over 13,500 volunteers. The dataset contains message trees where each tree has an initial prompt message as the root node, which can have multiple child messages as replies, and these child messages can have multiple replies. All messages have a role property, either assistant or prompter, and the roles in conversation threads strictly alternate between prompter and assistant.
提供机构:
Sellopale
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作