uyentk/thucuc_data
Source: Hugging Face. Updated: 2024-01-22. Indexed: 2024-03-04.
Download link:
https://hf-mirror.com/datasets/uyentk/thucuc_data
Resource description:
---
dataset_info:
- config_name: QA_data
  features:
  - name: quest_content
    dtype: string
  - name: text_ans
    dtype: string
  - name: url
    dtype: string
  - name: quest
    dtype: string
  splits:
  - name: train
    num_bytes: 3918114
    num_examples: 1944
  download_size: 1881638
  dataset_size: 3918114
- config_name: default
  features:
  - name: text
    dtype: string
  - name: metadata
    struct:
    - name: desc
      dtype: string
    - name: title
      dtype: string
    - name: url
      dtype: string
  - name: type
    dtype: string
  splits:
  - name: train
    num_bytes: 54612520
    num_examples: 6735
  download_size: 15710416
  dataset_size: 54612520
- config_name: full_qa
  features:
  - name: metadata
    struct:
    - name: url
      dtype: string
    - name: quest
      dtype: string
  - name: quest_content
    dtype: string
  - name: text_ans
    dtype: string
  splits:
  - name: train
    num_bytes: 3677460
    num_examples: 1944
  download_size: 1759570
  dataset_size: 3677460
- config_name: news6
  features:
  - name: text_ans
    dtype: string
  - name: metadata
    struct:
    - name: quest
      dtype: string
    - name: url
      dtype: string
  - name: quest_content
    dtype: string
  splits:
  - name: train
    num_bytes: 71898
    num_examples: 44
  download_size: 49839
  dataset_size: 71898
- config_name: news_data
  features:
  - name: type
    dtype: string
  - name: text
    dtype: string
  - name: url
    dtype: string
  - name: title
    dtype: string
  - name: desc
    dtype: string
  splits:
  - name: train
    num_bytes: 792249025
    num_examples: 114323
  download_size: 257336967
  dataset_size: 792249025
configs:
- config_name: QA_data
  data_files:
  - split: train
    path: QA_data/train-*
- config_name: default
  data_files:
  - split: train
    path: data/train-*
- config_name: full_qa
  data_files:
  - split: train
    path: full_qa/train-*
- config_name: news6
  data_files:
  - split: train
    path: news6/train-*
- config_name: news_data
  data_files:
  - split: train
    path: news_data/train-*
---
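
The configurations declared in the card above can be loaded one at a time. The following is a minimal sketch, assuming the Hugging Face `datasets` library is installed and the repository is reachable under the ID `uyentk/thucuc_data`. The helper `load_config` and the table `CONFIG_PATHS` are illustrative names and are not part of the dataset.

```python
# Config names and data-file globs copied from the `configs` section of the card.
CONFIG_PATHS = {
    "QA_data": "QA_data/train-*",
    "default": "data/train-*",
    "full_qa": "full_qa/train-*",
    "news6": "news6/train-*",
    "news_data": "news_data/train-*",
}

# Assumption: the Hub repository ID matches the page title.
REPO_ID = "uyentk/thucuc_data"


def load_config(name: str, streaming: bool = False):
    """Load the train split of one configuration (hypothetical helper)."""
    if name not in CONFIG_PATHS:
        raise KeyError(f"unknown config {name!r}; expected one of {sorted(CONFIG_PATHS)}")
    # Imported lazily so the lookup table stays usable without the library.
    from datasets import load_dataset

    return load_dataset(REPO_ID, name, split="train", streaming=streaming)
```

For instance, `load_config("news6")` would request the smallest configuration (44 examples), while `streaming=True` may be preferable for `news_data`, whose download size is listed as 257336967 bytes.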
Provider:
uyentk
Summary of original information
Dataset overview
Config name: QA_data
- Features:
  - quest_content: string
  - text_ans: string
  - url: string
  - quest: string
- Splits:
  - train:
    - Bytes: 3918114
    - Examples: 1944
- Download size (bytes): 1881638
- Dataset size (bytes): 3918114
Config name: default
- Features:
  - text: string
  - metadata: struct
    - desc: string
    - title: string
    - url: string
  - type: string
- Splits:
  - train:
    - Bytes: 54612520
    - Examples: 6735
- Download size (bytes): 15710416
- Dataset size (bytes): 54612520
Config name: full_qa
- Features:
  - metadata: struct
    - url: string
    - quest: string
  - quest_content: string
  - text_ans: string
- Splits:
  - train:
    - Bytes: 3677460
    - Examples: 1944
- Download size (bytes): 1759570
- Dataset size (bytes): 3677460
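
The `full_qa` and `QA_data` configurations list the same four field names and the same number of examples, with `full_qa` grouping `url` and `quest` under a `metadata` struct. Whether the two hold the same rows is not stated in the card. As a sketch under that assumption, a nested record could be flattened into the `QA_data` layout as follows; `flatten_metadata` is a hypothetical helper and the sample values are placeholders, not real rows.

```python
def flatten_metadata(record: dict) -> dict:
    """Return a copy of `record` with the fields of its `metadata` struct lifted to the top level."""
    flat = {key: value for key, value in record.items() if key != "metadata"}
    # A missing or null `metadata` field contributes nothing.
    flat.update(record.get("metadata") or {})
    return flat


# Placeholder record shaped like the `full_qa` schema (assumed nesting).
nested = {
    "metadata": {"url": "https://example.com/q/1", "quest": "example title"},
    "quest_content": "example question body",
    "text_ans": "example answer",
}

flat = flatten_metadata(nested)
print(sorted(flat))  # ['quest', 'quest_content', 'text_ans', 'url']
```

With the `datasets` library, the same function could be passed to `Dataset.map` together with `remove_columns=["metadata"]`, although that usage is not exercised here.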
Config name: news6
- Features:
  - text_ans: string
  - metadata: struct
    - quest: string
    - url: string
  - quest_content: string
- Splits:
  - train:
    - Bytes: 71898
    - Examples: 44
- Download size (bytes): 49839
- Dataset size (bytes): 71898
Config name: news_data
- Features:
  - type: string
  - text: string
  - url: string
  - title: string
  - desc: string
- Splits:
  - train:
    - Bytes: 792249025
    - Examples: 114323
- Download size (bytes): 257336967
- Dataset size (bytes): 792249025
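
The size figures above can be turned into two rough per-configuration indicators: average bytes per example, and the ratio of download size to dataset size. The sketch below copies the numbers from the card; the names `STATS` and `summarize` are illustrative only.

```python
# (num_bytes, num_examples, download_size) per config, copied from the card.
STATS = {
    "QA_data": (3918114, 1944, 1881638),
    "default": (54612520, 6735, 15710416),
    "full_qa": (3677460, 1944, 1759570),
    "news6": (71898, 44, 49839),
    "news_data": (792249025, 114323, 257336967),
}


def summarize(stats: dict) -> dict:
    """Compute average example size and download/dataset size ratio for each config."""
    rows = {}
    for name, (num_bytes, num_examples, download_size) in stats.items():
        rows[name] = {
            "avg_bytes_per_example": num_bytes / num_examples,
            "download_ratio": download_size / num_bytes,
        }
    return rows


summary = summarize(STATS)
total_examples = sum(num_examples for _, num_examples, _ in STATS.values())

for name, row in summary.items():
    print(
        f"{name:10s} {row['avg_bytes_per_example']:10.0f} B/example  "
        f"download/dataset = {row['download_ratio']:.2f}"
    )
print("total examples across configs:", total_examples)
```

The total counts every configuration separately, so it would overstate the number of distinct rows if some configurations overlap.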



