Mint1456/NLP_VIN2023
Source: Hugging Face. Updated: 2023-10-17. Indexed: 2024-03-04.
Download link:
https://hf-mirror.com/datasets/Mint1456/NLP_VIN2023
Resource description:
---
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
  - split: validation
    path: data/validation-*
  - split: test
    path: data/test-*
dataset_info:
  features:
  - name: input_ids
    sequence: int32
  - name: attention_mask
    sequence: int8
  - name: labels
    sequence: int64
  splits:
  - name: train
    num_bytes: 1696181647
    num_examples: 2884451
  - name: validation
    num_bytes: 7002160
    num_examples: 11316
  - name: test
    num_bytes: 6952905
    num_examples: 11225
  download_size: 383293199
  dataset_size: 1710136712
---
# Dataset Card for "NLP_VIN2023"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
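The split figures in the metadata can be cross-checked with a little arithmetic. The sketch below copies the numbers from the card into a plain dictionary (the variable names are illustrative, not part of any library) and confirms that the per-split `num_bytes` values add up to `dataset_size`. It also derives the ratio of `dataset_size` to `download_size` and the share of examples that fall in the train split.

```python
# Figures copied verbatim from the card's dataset_info block.
SPLITS = {
    "train": {"num_bytes": 1696181647, "num_examples": 2884451},
    "validation": {"num_bytes": 7002160, "num_examples": 11316},
    "test": {"num_bytes": 6952905, "num_examples": 11225},
}
DOWNLOAD_SIZE = 383293199   # bytes, as listed under download_size
DATASET_SIZE = 1710136712   # bytes, as listed under dataset_size

# Sum the per-split sizes and example counts.
total_bytes = sum(s["num_bytes"] for s in SPLITS.values())
total_examples = sum(s["num_examples"] for s in SPLITS.values())

# Ratio of the prepared dataset size to the downloaded files.
size_ratio = DATASET_SIZE / DOWNLOAD_SIZE

# Fraction of all examples that belong to the train split.
train_share = SPLITS["train"]["num_examples"] / total_examples

# Average stored bytes per example, per split.
avg_bytes = {name: s["num_bytes"] / s["num_examples"] for name, s in SPLITS.items()}

print(f"sum of split bytes matches dataset_size: {total_bytes == DATASET_SIZE}")
print(f"total examples: {total_examples}")
print(f"dataset_size / download_size: {size_ratio:.2f}")
print(f"train share of examples: {train_share:.4f}")
for name, value in avg_bytes.items():
    print(f"average bytes per example ({name}): {value:.1f}")
```

The validation and test splits are each under half a percent of the data, so almost all of the download is training material.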
Provider:
Mint1456
Summary of original information
Dataset overview
Dataset configuration
- Config name: default
- Data files:
  - train: data/train-*
  - validation: data/validation-*
  - test: data/test-*
Dataset information
- Features:
  - input_ids: sequence of int32
  - attention_mask: sequence of int8
  - labels: sequence of int64
- Splits:
  - train:
    - Bytes: 1696181647
    - Examples: 2884451
  - validation:
    - Bytes: 7002160
    - Examples: 11316
  - test:
    - Bytes: 6952905
    - Examples: 11225
- Download size: 383293199 bytes
- Dataset size: 1710136712 bytes
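
A minimal loading sketch, assuming the Hugging Face `datasets` library is installed and the repository is reachable under the ID shown at the top of this page. `load_split` and `check_record` are hypothetical helper names introduced here for illustration. The check that `input_ids` and `attention_mask` have equal length is an assumption based on how tokenizers usually emit these two columns. The card itself does not state it, and it makes no claim about the length of `labels`.

```python
EXPECTED_COLUMNS = {"input_ids", "attention_mask", "labels"}  # feature names from the card


def load_split(split: str = "validation"):
    """Load one split from the Hub (needs `pip install datasets` and network access)."""
    # Imported lazily so that this module can be imported without the library.
    from datasets import load_dataset

    return load_dataset("Mint1456/NLP_VIN2023", split=split)


def check_record(record: dict) -> bool:
    """Return True if a record has the card's three columns.

    Also requires input_ids and attention_mask to have the same length,
    which is an assumption, not something the card guarantees.
    """
    if set(record) != EXPECTED_COLUMNS:
        return False
    return len(record["input_ids"]) == len(record["attention_mask"])


if __name__ == "__main__":
    # The validation split is small (11316 examples), so it is a cheap first check.
    validation = load_split("validation")
    print(validation)
    print(check_record(validation[0]))
```

Starting with the validation or test split avoids pulling the full training data while inspecting the schema.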



