Razvan27/dataset_forge_paper2024
Source: Hugging Face | Updated: 2024-06-02 | Indexed: 2024-06-12
Download link:
https://hf-mirror.com/datasets/Razvan27/dataset_forge_paper2024
Resource description:
---
dataset_info:
- config_name: dedup
  features:
  - name: comments
    dtype: string
  splits:
  - name: train
    num_bytes: 380
    num_examples: 6
  download_size: 1238
  dataset_size: 380
- config_name: dedup2
  features:
  - name: comments
    dtype: string
  splits:
  - name: train
    num_bytes: 63
    num_examples: 1
  download_size: 1161
  dataset_size: 63
- config_name: dedup3
  features:
  - name: comments
    dtype: string
  splits:
  - name: train
    num_bytes: 374
    num_examples: 2
  download_size: 1953
  dataset_size: 374
- config_name: dedup4
  features:
  - name: comments
    dtype: string
  splits:
  - name: train
    num_bytes: 802
    num_examples: 5
  download_size: 1592
  dataset_size: 802
configs:
- config_name: dedup
  data_files:
  - split: train
    path: data/Dedup/train-*
- config_name: dedup2
  data_files:
  - split: train
    path: data/Dedup/train-*
- config_name: dedup3
  data_files:
  - split: train
    path: data/Dedup3/train-*
- config_name: dedup4
  data_files:
  - split: train
    path: data/Dedup4/train-*
---
The dataset includes four configurations (dedup, dedup2, dedup3, dedup4), each with a single string feature named comments and a single train split. The splits are very small: 6, 1, 2, and 5 examples (380, 63, 374, and 802 bytes), respectively. Each configuration also records its download size and dataset size in bytes.
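As a minimal sketch of how one of these configurations might be loaded, the snippet below uses the Hugging Face `datasets` library. It assumes that the repository is still publicly reachable under the ID shown in the listing and that `datasets` is installed. The helper name `load_comments` is hypothetical and is not part of the dataset card.

```python
REPO_ID = "Razvan27/dataset_forge_paper2024"

# Configuration names as declared in the dataset card metadata.
CONFIG_NAMES = ("dedup", "dedup2", "dedup3", "dedup4")


def load_comments(config_name):
    """Return the `comments` column of one configuration's train split.

    Hypothetical helper for illustration; requires network access and
    the `datasets` package when called with a valid configuration name.
    """
    if config_name not in CONFIG_NAMES:
        raise ValueError(f"unknown configuration: {config_name!r}")

    # Imported lazily so the name check above works without the library.
    from datasets import load_dataset

    ds = load_dataset(REPO_ID, name=config_name, split="train")
    return list(ds["comments"])
```

If the repository is available, `load_comments("dedup")` should return a list of strings. Going by the metadata, that list would have six entries, but this has not been verified here.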
Provider:
Razvan27
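Summary of original information
Dataset details
Configuration information
Configuration name: dedup
- Features:
  - Name: comments
  - Data type: string
- Splits:
  - Name: train
  - Number of bytes: 380
  - Number of examples: 6
- Download size: 1238
- Dataset size: 380
- Data files:
  - Split: train
  - Path: data/Dedup/train-*
Configuration name: dedup2
- Features:
  - Name: comments
  - Data type: string
- Splits:
  - Name: train
  - Number of bytes: 63
  - Number of examples: 1
- Download size: 1161
- Dataset size: 63
- Data files:
  - Split: train
  - Path: data/Dedup/train-*
Configuration name: dedup3
- Features:
  - Name: comments
  - Data type: string
- Splits:
  - Name: train
  - Number of bytes: 374
  - Number of examples: 2
- Download size: 1953
- Dataset size: 374
- Data files:
  - Split: train
  - Path: data/Dedup3/train-*
Configuration name: dedup4
- Features:
  - Name: comments
  - Data type: string
- Splits:
  - Name: train
  - Number of bytes: 802
  - Number of examples: 5
- Download size: 1592
- Dataset size: 802
- Data files:
  - Split: train
  - Path: data/Dedup4/train-*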
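
The figures in the listing above can be cross-checked with a few lines of Python. The sketch below transcribes the values by hand into a dictionary, totals the examples and bytes, and reports any data path used by more than one configuration. The names `CONFIGS`, `shared_paths`, and `totals` are illustrative and do not come from the dataset card.

```python
from collections import defaultdict

# Values transcribed from the configuration listing (sizes in bytes).
CONFIGS = {
    "dedup": {"num_bytes": 380, "num_examples": 6,
              "download_size": 1238, "path": "data/Dedup/train-*"},
    "dedup2": {"num_bytes": 63, "num_examples": 1,
               "download_size": 1161, "path": "data/Dedup/train-*"},
    "dedup3": {"num_bytes": 374, "num_examples": 2,
               "download_size": 1953, "path": "data/Dedup3/train-*"},
    "dedup4": {"num_bytes": 802, "num_examples": 5,
               "download_size": 1592, "path": "data/Dedup4/train-*"},
}


def shared_paths(configs):
    """Return {path: [config names]} for paths used by several configs."""
    by_path = defaultdict(list)
    for name, info in configs.items():
        by_path[info["path"]].append(name)
    return {path: names for path, names in by_path.items() if len(names) > 1}


def totals(configs):
    """Sum the example and byte counts over all configurations."""
    return {
        "num_examples": sum(c["num_examples"] for c in configs.values()),
        "num_bytes": sum(c["num_bytes"] for c in configs.values()),
    }


print(totals(CONFIGS))        # {'num_examples': 14, 'num_bytes': 1619}
print(shared_paths(CONFIGS))  # {'data/Dedup/train-*': ['dedup', 'dedup2']}
```

The second result shows that dedup and dedup2 point at the same path pattern even though their recorded sizes differ. The listing alone does not indicate whether this is intentional or an error in the card.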



