kenhktsui/simple_wikipedia_LM_quality_score_v1
收藏Hugging Face2024-01-29 更新2024-06-22 收录
下载链接:
https://hf-mirror.com/datasets/kenhktsui/simple_wikipedia_LM_quality_score_v1
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: id
dtype: string
- name: url
dtype: string
- name: title
dtype: string
- name: text
dtype: string
- name: quality_score_v1
dtype: float64
splits:
- name: train
num_bytes: 228625682
num_examples: 225984
- name: test
num_bytes: 5815940
num_examples: 5943
- name: validation
num_bytes: 6369557
num_examples: 5949
download_size: 140637963
dataset_size: 240811179
task_categories:
- text-generation
language:
- en
---
# Dataset Card for "simple_wikipedia_LM_quality_score_v1"
Adding quality score v1 to [pszemraj/simple_wikipedia_LM](https://huggingface.co/datasets/pszemraj/simple_wikipedia_LM)
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
kenhktsui
原始信息汇总
数据集概述
数据集信息
-
特征列表:
id: 字符串类型url: 字符串类型title: 字符串类型text: 字符串类型quality_score_v1: 浮点数类型
-
数据分割:
train: 228625682 字节,225984 个样本test: 5815940 字节,5943 个样本validation: 6369557 字节,5949 个样本
-
数据集大小:
- 下载大小:140637963 字节
- 数据集大小:240811179 字节
任务类别
- 文本生成
语言
- 英语



