five

Weyaxi/HelpSteer-filtered

收藏
Hugging Face2023-11-24 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Weyaxi/HelpSteer-filtered
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-4.0 --- # HelpSteer-filtered This dataset is a highly filtered version of the [nvidia/HelpSteer](https://huggingface.co/datasets/nvidia/HelpSteer) dataset. # ❓ How this dataset was filtered: 1. I calculated the sum of the columns `["helpfulness," "correctness," "coherence," "complexity," "verbosity"]` and created a new column named `sum`. 2. I changed some column names and added a **empty column** to match the Alpaca format. 3. The dataset was then filtered to include only those entries with a sum greater than or equal to 16. # 🧐 More Information You can find more information about the unfiltered dataset here: - [nvidia/HelpSteer](https://huggingface.co/datasets/nvidia/HelpSteer)

--- 许可证:CC BY 4.0 --- # 经筛选的HelpSteer数据集 本数据集是[nvidia/HelpSteer](https://huggingface.co/datasets/nvidia/HelpSteer) 数据集的高度筛选版本。 # ❓ 本数据集的筛选流程: 1. 对`["helpfulness", "correctness", "coherence", "complexity", "verbosity"]`列的数值进行求和,并创建名为`sum`的新列。 2. 修改了部分列名,并新增**空列**以适配Alpaca格式。 3. 随后对数据集进行筛选,仅保留`sum`值大于或等于16的条目。 # 🧐 更多信息 您可通过以下链接获取未筛选原始数据集的详细信息: - [nvidia/HelpSteer](https://huggingface.co/datasets/nvidia/HelpSteer)
提供机构:
Weyaxi
原始信息汇总

HelpSteer-filtered

数据集概述

过滤过程

  1. 计算总和:计算列 ["helpfulness," "correctness," "coherence," "complexity," "verbosity"] 的总和,并创建新列 sum
  2. 列名修改:修改部分列名,并添加一个空列以匹配 Alpaca 格式。
  3. 数据过滤:仅保留总和大于或等于 16 的条目。
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作