tyzhu/fwv2_squad_num_train_10000_eval_100
收藏Hugging Face2023-08-29 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/tyzhu/fwv2_squad_num_train_10000_eval_100
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: train_doc2id
path: data/train_doc2id-*
- split: train_id2doc
path: data/train_id2doc-*
- split: train_find_word
path: data/train_find_word-*
- split: eval_find_word
path: data/eval_find_word-*
- split: id_context_mapping
path: data/id_context_mapping-*
dataset_info:
features:
- name: inputs
dtype: string
- name: targets
dtype: string
- name: text
dtype: string
splits:
- name: train
num_bytes: 2877195
num_examples: 20100
- name: train_doc2id
num_bytes: 1736997
num_examples: 10100
- name: train_id2doc
num_bytes: 1767297
num_examples: 10100
- name: train_find_word
num_bytes: 1109898
num_examples: 10000
- name: eval_find_word
num_bytes: 10775
num_examples: 100
- name: id_context_mapping
num_bytes: 1444097
num_examples: 10100
download_size: 4619144
dataset_size: 8946259
---
# Dataset Card for "fwv2_squad_num_train_10000_eval_100"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
tyzhu
原始信息汇总
数据集概述
配置信息
- 默认配置:
- 训练数据:
- 分割:train
- 路径:data/train-*
- 文档到ID映射:
- 分割:train_doc2id
- 路径:data/train_doc2id-*
- ID到文档映射:
- 分割:train_id2doc
- 路径:data/train_id2doc-*
- 查找单词:
- 分割:train_find_word
- 路径:data/train_find_word-*
- 分割:eval_find_word
- 路径:data/eval_find_word-*
- ID上下文映射:
- 分割:id_context_mapping
- 路径:data/id_context_mapping-*
- 训练数据:
数据集信息
-
特征:
- 输入(inputs):字符串类型
- 目标(targets):字符串类型
- 文本(text):字符串类型
-
分割信息:
- 训练集:
- 名称:train
- 字节数:2877195
- 样本数:20100
- 文档到ID映射:
- 名称:train_doc2id
- 字节数:1736997
- 样本数:10100
- ID到文档映射:
- 名称:train_id2doc
- 字节数:1767297
- 样本数:10100
- 查找单词(训练):
- 名称:train_find_word
- 字节数:1109898
- 样本数:10000
- 查找单词(评估):
- 名称:eval_find_word
- 字节数:10775
- 样本数:100
- ID上下文映射:
- 名称:id_context_mapping
- 字节数:1444097
- 样本数:10100
- 训练集:
-
数据集大小:
- 下载大小:4619144 字节
- 数据集大小:8946259 字节



