gayanin/pubmed-abstracts-noised-with-prob-dist
Source: Hugging Face. Updated: 2024-02-05. Indexed: 2024-03-04.
Download link:
https://hf-mirror.com/datasets/gayanin/pubmed-abstracts-noised-with-prob-dist
Resource description:
---
dataset_info:
- config_name: babylon
  features:
  - name: refs
    dtype: string
  - name: trans
    dtype: string
  splits:
  - name: train
    num_bytes: 6285769
    num_examples: 24908
  - name: test
    num_bytes: 792114
    num_examples: 3113
  - name: validation
    num_bytes: 782322
    num_examples: 3114
  download_size: 4422353
  dataset_size: 7860205
- config_name: gcd
  features:
  - name: refs
    dtype: string
  - name: trans
    dtype: string
  splits:
  - name: train
    num_bytes: 5315199
    num_examples: 24908
  - name: test
    num_bytes: 732074
    num_examples: 3114
  - name: validation
    num_bytes: 730918
    num_examples: 3114
  download_size: 3902937
  dataset_size: 6778191
- config_name: kaggle
  features:
  - name: refs
    dtype: string
  - name: trans
    dtype: string
  splits:
  - name: train
    num_bytes: 6277893
    num_examples: 24908
  - name: test
    num_bytes: 789835
    num_examples: 3113
  - name: validation
    num_bytes: 786894
    num_examples: 3114
  download_size: 4398378
  dataset_size: 7854622
configs:
- config_name: babylon
  data_files:
  - split: train
    path: babylon/train-*
  - split: test
    path: babylon/test-*
  - split: validation
    path: babylon/validation-*
- config_name: gcd
  data_files:
  - split: train
    path: gcd/train-*
  - split: test
    path: gcd/test-*
  - split: validation
    path: gcd/validation-*
- config_name: kaggle
  data_files:
  - split: train
    path: kaggle/train-*
  - split: test
    path: kaggle/test-*
  - split: validation
    path: kaggle/validation-*
---
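The metadata above declares three configurations, each with `train`, `test`, and `validation` splits. A minimal sketch of loading one of them with the Hugging Face `datasets` library follows. It assumes the dataset is still published under the ID shown in the title and that `datasets` is installed; the helper name `load_split` is hypothetical and not part of the dataset card.

```python
# Sketch only: the dataset ID and config/split names are taken from the metadata above.
DATASET_ID = "gayanin/pubmed-abstracts-noised-with-prob-dist"
CONFIGS = ("babylon", "gcd", "kaggle")
SPLITS = ("train", "test", "validation")


def load_split(config: str, split: str = "train"):
    """Load one split of one configuration (hypothetical helper).

    Requires network access and the `datasets` package.
    """
    if config not in CONFIGS:
        raise ValueError(f"unknown config {config!r}; expected one of {CONFIGS}")
    if split not in SPLITS:
        raise ValueError(f"unknown split {split!r}; expected one of {SPLITS}")
    # Imported lazily so that argument validation works without the library.
    from datasets import load_dataset
    return load_dataset(DATASET_ID, config, split=split)


# Example usage (downloads data, so it is left commented out):
# ds = load_split("babylon", "validation")
# print(ds.column_names)   # the metadata lists the columns `refs` and `trans`
# print(ds[0]["refs"])
# print(ds[0]["trans"])
```

Both columns are declared as strings. The metadata does not describe what `refs` and `trans` contain, so inspecting a few rows is the safest way to confirm their meaning.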
Provided by:
gayanin
Summary of original information
Dataset overview
Dataset configurations
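Configuration: babylon
- Features:
  - refs: string
  - trans: string
- Splits:
  - train:
    - Bytes: 6285769
    - Examples: 24908
  - test:
    - Bytes: 792114
    - Examples: 3113
  - validation:
    - Bytes: 782322
    - Examples: 3114
- Download size: 4422353 bytes
- Dataset size: 7860205 bytes
Configuration: gcd
- Features:
  - refs: string
  - trans: string
- Splits:
  - train:
    - Bytes: 5315199
    - Examples: 24908
  - test:
    - Bytes: 732074
    - Examples: 3114
  - validation:
    - Bytes: 730918
    - Examples: 3114
- Download size: 3902937 bytes
- Dataset size: 6778191 bytes
Configuration: kaggle
- Features:
  - refs: string
  - trans: string
- Splits:
  - train:
    - Bytes: 6277893
    - Examples: 24908
  - test:
    - Bytes: 789835
    - Examples: 3113
  - validation:
    - Bytes: 786894
    - Examples: 3114
- Download size: 4398378 bytes
- Dataset size: 7854622 bytes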
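As a consistency check on the figures listed for each configuration, the per-split byte counts can be summed and compared with the reported dataset size, and the example counts give the split proportions. The sketch below uses only numbers from the listing; the function name `summarize` is an illustrative choice.

```python
# (num_bytes, num_examples) per split, copied from the listing above.
SPLIT_STATS = {
    "babylon": {"train": (6285769, 24908), "test": (792114, 3113), "validation": (782322, 3114)},
    "gcd": {"train": (5315199, 24908), "test": (732074, 3114), "validation": (730918, 3114)},
    "kaggle": {"train": (6277893, 24908), "test": (789835, 3113), "validation": (786894, 3114)},
}
# Reported dataset_size per configuration, in bytes.
DATASET_SIZE = {"babylon": 7860205, "gcd": 6778191, "kaggle": 7854622}


def summarize(config: str):
    """Return (total bytes, total examples, fraction of examples per split)."""
    splits = SPLIT_STATS[config]
    total_bytes = sum(num_bytes for num_bytes, _ in splits.values())
    total_examples = sum(num_examples for _, num_examples in splits.values())
    fractions = {name: n / total_examples for name, (_, n) in splits.items()}
    return total_bytes, total_examples, fractions


for config in SPLIT_STATS:
    total_bytes, total_examples, fractions = summarize(config)
    # The split byte counts should add up to the reported dataset size.
    assert total_bytes == DATASET_SIZE[config]
    shares = ", ".join(f"{name} {share:.1%}" for name, share in fractions.items())
    print(f"{config}: {total_examples} examples, {total_bytes} bytes ({shares})")
```

In all three configurations the split sizes sum to the reported dataset size, and the example counts correspond to roughly an 80/10/10 train/test/validation split.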
Data file paths
Configuration: babylon
- Training set: babylon/train-*
- Test set: babylon/test-*
- Validation set: babylon/validation-*
Configuration: gcd
- Training set: gcd/train-*
- Test set: gcd/test-*
- Validation set: gcd/validation-*
Configuration: kaggle
- Training set: kaggle/train-*
- Test set: kaggle/test-*
- Validation set: kaggle/validation-*
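The path entries above are glob patterns, so each one can match one or more shard files in the repository. The sketch below illustrates how such patterns map a file path to a configuration and split, using Python's standard `fnmatch` module. The shard file names in the usage comments are hypothetical, since the listing does not give actual file names, and `resolve` is an illustrative helper rather than anything the `datasets` library exposes.

```python
from fnmatch import fnmatch

CONFIGS = ("babylon", "gcd", "kaggle")
SPLITS = ("train", "test", "validation")

# Rebuild the patterns from the listing: "<config>/<split>-*".
PATTERNS = {
    (config, split): f"{config}/{split}-*"
    for config in CONFIGS
    for split in SPLITS
}


def resolve(path: str):
    """Return the (config, split) whose pattern matches `path`, or None."""
    for key, pattern in PATTERNS.items():
        if fnmatch(path, pattern):
            return key
    return None


# Hypothetical shard names, for illustration only:
# resolve("babylon/train-00000-of-00001.parquet")
# resolve("gcd/validation-00000-of-00001.parquet")
# resolve("README.md")
```

Because every pattern starts with a distinct `<config>/<split>-` prefix, a given file can match at most one configuration and split.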



