distilabel-internal-testing/embeddings-dataset-answer
Dataset Overview
Dataset Name
- Name: embeddings-dataset-answer
Creation Tool
- Created with: distilabel
Dataset Size
- Size: n<1K
Dataset Tags
- Tags:
  - synthetic
  - distilabel
  - rlaif
Dataset Structure
- Structure:
  - The dataset includes a pipeline.yaml file that can be used to reproduce, in distilabel, the pipeline that generated it.
  - Each example has the following structure:

```json
{
    "anchor": "Astrology: I am a Capricorn Sun Cap moon and cap rising...what does that say about me?",
    "distilabel_metadata": {
        "raw_output_generate_sentence_pair_0": "## Positive\nAs a triple Capricorn, you're likely to be an ambitious, disciplined, and responsible individual with a strong sense of duty and a natural flair for leadership, which can help you achieve great success in your personal and professional life.\n\n## Negative\nThe cap on my favorite pen has gone missing, and I'm left struggling to find a suitable replacement."
    },
    "model_name": "meta-llama/Meta-Llama-3-70B-Instruct",
    "negative": "The cap on my favorite pen has gone missing, and I'm left struggling to find a suitable replacement.",
    "positive": "As a triple Capricorn, you're likely to be an ambitious, disciplined, and responsible individual with a strong sense of duty and a natural flair for leadership, which can help you achieve great success in your personal and professional life."
}
```
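The example record suggests that the `positive` and `negative` fields are derived from the raw model output stored under `distilabel_metadata`. The following is a minimal sketch of how such a split could be done, assuming the raw output always uses the `## Positive` / `## Negative` headings seen in the example. The function name `split_sentence_pair` is hypothetical and this is not distilabel's own parsing code.

```python
import re


def split_sentence_pair(raw_output: str) -> dict:
    """Split a raw generation into its positive and negative sentences.

    Assumes the "## Positive" / "## Negative" heading layout seen in the
    example record; returns None values when the layout does not match.
    """
    match = re.search(
        r"##\s*Positive\s*(.*?)\s*##\s*Negative\s*(.*)",
        raw_output,
        flags=re.DOTALL,
    )
    if match is None:
        return {"positive": None, "negative": None}
    return {
        "positive": match.group(1).strip(),
        "negative": match.group(2).strip(),
    }


# Shortened, illustrative raw output (not a record from the dataset).
raw = (
    "## Positive\n"
    "A triple Capricorn is often described as disciplined.\n\n"
    "## Negative\n"
    "The cap on my pen has gone missing."
)
pair = split_sentence_pair(raw)
```

Returning `None` values for unmatched input keeps malformed generations easy to filter out afterwards.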
Loading the Dataset
- How to load:
  - Load the dataset with the following code:

```python
from datasets import load_dataset

ds = load_dataset("distilabel-internal-testing/embeddings-dataset-answer", "default")
```

  - Or, more simply:

```python
from datasets import load_dataset

ds = load_dataset("distilabel-internal-testing/embeddings-dataset-answer")
```
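Once loaded, each record can be reduced to an (anchor, positive, negative) triplet, a shape often used for training embedding models. The sketch below works on plain dictionaries so that it runs without network access. The helper name `to_triplets` is hypothetical, and passing it `ds["train"]` assumes the dataset exposes a `train` split, which the card does not state.

```python
from typing import Iterable, List, Tuple


def to_triplets(records: Iterable[dict]) -> List[Tuple[str, str, str]]:
    """Collect (anchor, positive, negative) triplets, skipping incomplete rows."""
    triplets = []
    for row in records:
        anchor = row.get("anchor")
        positive = row.get("positive")
        negative = row.get("negative")
        # Keep only rows where all three fields are present and non-empty.
        if anchor and positive and negative:
            triplets.append((anchor, positive, negative))
    return triplets


# Illustrative in-memory rows that mirror the record layout shown above.
sample_rows = [
    {"anchor": "question A", "positive": "related answer", "negative": "unrelated text"},
    {"anchor": "question B", "positive": None, "negative": "unrelated text"},
]
triplets = to_triplets(sample_rows)
```

With the real dataset, the same function could be called as `to_triplets(ds["train"])`, under the split-name assumption stated above.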



