Weyaxi/HelpSteer-filtered
收藏Hugging Face2023-11-24 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Weyaxi/HelpSteer-filtered
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-4.0
---
# HelpSteer-filtered
This dataset is a highly filtered version of the [nvidia/HelpSteer](https://huggingface.co/datasets/nvidia/HelpSteer) dataset.
# ❓ How this dataset was filtered:
1. I calculated the sum of the columns `["helpfulness," "correctness," "coherence," "complexity," "verbosity"]` and created a new column named `sum`.
2. I changed some column names and added a **empty column** to match the Alpaca format.
3. The dataset was then filtered to include only those entries with a sum greater than or equal to 16.
# 🧐 More Information
You can find more information about the unfiltered dataset here:
- [nvidia/HelpSteer](https://huggingface.co/datasets/nvidia/HelpSteer)
---
许可证:CC BY 4.0
---
# 经筛选的HelpSteer数据集
本数据集是[nvidia/HelpSteer](https://huggingface.co/datasets/nvidia/HelpSteer) 数据集的高度筛选版本。
# ❓ 本数据集的筛选流程:
1. 对`["helpfulness", "correctness", "coherence", "complexity", "verbosity"]`列的数值进行求和,并创建名为`sum`的新列。
2. 修改了部分列名,并新增**空列**以适配Alpaca格式。
3. 随后对数据集进行筛选,仅保留`sum`值大于或等于16的条目。
# 🧐 更多信息
您可通过以下链接获取未筛选原始数据集的详细信息:
- [nvidia/HelpSteer](https://huggingface.co/datasets/nvidia/HelpSteer)
提供机构:
Weyaxi
原始信息汇总
HelpSteer-filtered
数据集概述
- 原始数据集:nvidia/HelpSteer
- 版本:高度过滤版本
过滤过程
- 计算总和:计算列
["helpfulness," "correctness," "coherence," "complexity," "verbosity"]的总和,并创建新列sum。 - 列名修改:修改部分列名,并添加一个空列以匹配 Alpaca 格式。
- 数据过滤:仅保留总和大于或等于 16 的条目。



