lamm-mit/bio-inspired-DPO
收藏Hugging Face2024-08-26 更新2025-04-08 收录
下载链接:
https://hf-mirror.com/datasets/lamm-mit/bio-inspired-DPO
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: prompt
dtype: string
- name: chosen
list:
- name: content
dtype: string
- name: role
dtype: string
- name: rejected
list:
- name: content
dtype: string
- name: role
dtype: string
- name: source_mmd
dtype: string
- name: analysis_formatted
dtype: string
- name: analysis
struct:
- name: answers
sequence: string
- name: comparisons
sequence: string
- name: details
sequence: string
- name: facts
sequence: string
- name: insights
sequence: string
- name: questions
sequence: string
- name: title
dtype: string
splits:
- name: train
num_bytes: 63831272.0
num_examples: 4928
download_size: 26419505
dataset_size: 63831272.0
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
数据集信息:
特征字段:
- 名称:提示词(Prompt),数据类型:字符串
- 名称:选中回复(Chosen),数据类型:列表,包含子字段:
- 内容(content):字符串类型
- 角色(role):字符串类型
- 名称:拒选回复(Rejected),数据类型:列表,包含子字段:
- 内容(content):字符串类型
- 角色(role):字符串类型
- 名称:源MMD(source_mmd),数据类型:字符串
- 名称:格式化分析结果(analysis_formatted),数据类型:字符串
- 名称:分析结构体(analysis),数据类型:结构体,包含子字段:
- 答案集(answers):字符串序列
- 对比项集(comparisons):字符串序列
- 细节集(details):字符串序列
- 事实集(facts):字符串序列
- 见解集(insights):字符串序列
- 问题集(questions):字符串序列
- 标题(title):字符串类型
数据集划分:
- 划分名称:训练集(train),数据字节数:63831272.0,样本数量:4928
下载大小:26419505
数据集总大小:63831272.0
配置项:
- 配置名称:默认配置(default),数据文件配置:
- 对应划分:训练集(train),文件路径:data/train-*
提供机构:
lamm-mit



