five

Avelina/UltraSteer-v0

收藏
Hugging Face2024-10-04 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/Avelina/UltraSteer-v0
下载链接
链接失效反馈
官方服务:
资源简介:
UltraSteer是一个大规模的对话数据集,包含单轮和多轮对话,并带有细粒度的标签。这些标签由Nvidia的Llama2-13B-SteerLM-RM奖励模型生成,使用了NeMo Aligner框架。数据集中的每个助手回复都根据多个属性进行评分,包括质量、毒性、幽默感、创造力、帮助性、正确性、连贯性、复杂性和详细程度。数据集还进行了因果去重处理,确保每个对话的最后一个助手回复都有标签且唯一。数据来源于多个高质量的对齐数据集和聊天数据集,并进行了过滤和去重处理。

UltraSteer is a large-scale dataset of single- and multi-turn dialogue with fine-grained labels produced by Nvidias Llama2-13B-SteerLM-RM reward model using the NeMo Aligner framework. Each assistant turn is rated with attributes such as quality, toxicity, humor, creativity, helpfulness, correctness, coherence, complexity, and verbosity. The dataset has undergone causal deduplication to ensure that the final assistant message in every conversation is labeled and unique. The data is sourced from multiple high-quality alignment datasets and chat datasets, and has been filtered and deduplicated.
提供机构:
Avelina
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作