jarod0411/llm_sim_new
收藏Hugging Face2024-06-26 更新2024-06-29 收录
下载链接:
https://hf-mirror.com/datasets/jarod0411/llm_sim_new
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: validation
path: data/validation-*
dataset_info:
features:
- name: p1
dtype: string
- name: p2
dtype: string
- name: Scaffold
dtype: string
- name: similarity
dtype: float64
splits:
- name: train
num_bytes: 826422294.521295
num_examples: 6827666
- name: validation
num_bytes: 439895818.32039374
num_examples: 2991768
download_size: 181952658
dataset_size: 1266318112.8416886
---
# Dataset Card for "llm_sim_new"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
jarod0411
原始信息汇总
数据集概述
数据集名称
- llm_sim_new
数据集配置
- 默认配置:default
数据文件
- 训练集(train):路径为
data/train-* - 验证集(validation):路径为
data/validation-*
数据集特征
- 特征名称及数据类型:
- p1: string
- p2: string
- Scaffold: string
- similarity: float64
数据集分割
- 训练集(train):
- 字节数:826422294.521295
- 样本数:6827666
- 验证集(validation):
- 字节数:439895818.32039374
- 样本数:2991768
数据集大小
- 下载大小:181952658
- 数据集总大小:1266318112.8416886



