tyzhu/fw_num_bi_train_1000_eval_100
收藏Hugging Face2023-08-21 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/tyzhu/fw_num_bi_train_1000_eval_100
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: train_doc2id
path: data/train_doc2id-*
- split: train_id2doc
path: data/train_id2doc-*
- split: train_find_word
path: data/train_find_word-*
- split: eval_find_word
path: data/eval_find_word-*
dataset_info:
features:
- name: inputs
dtype: string
- name: targets
dtype: string
splits:
- name: train
num_bytes: 225375
num_examples: 3200
- name: train_doc2id
num_bytes: 87993
num_examples: 1100
- name: train_id2doc
num_bytes: 91293
num_examples: 1100
- name: train_find_word
num_bytes: 46089
num_examples: 1000
- name: eval_find_word
num_bytes: 4723
num_examples: 100
download_size: 104282
dataset_size: 455473
---
# Dataset Card for "fw_num_bi_train_1000_eval_100"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
tyzhu
原始信息汇总
数据集卡片 "fw_num_bi_train_1000_eval_100"
配置
- 默认配置
- 数据文件:
- 训练集:
data/train-* - 训练集(文档到ID):
data/train_doc2id-* - 训练集(ID到文档):
data/train_id2doc-* - 训练集(查找单词):
data/train_find_word-* - 评估集(查找单词):
data/eval_find_word-*
- 训练集:
- 数据文件:
数据集信息
-
特征
- 输入:字符串类型
- 目标:字符串类型
-
分割
- 训练集:
- 字节数:225375
- 样本数:3200
- 训练集(文档到ID):
- 字节数:87993
- 样本数:1100
- 训练集(ID到文档):
- 字节数:91293
- 样本数:1100
- 训练集(查找单词):
- 字节数:46089
- 样本数:1000
- 评估集(查找单词):
- 字节数:4723
- 样本数:100
- 训练集:
-
下载大小:104282字节
-
数据集大小:455473字节



