gudleifrr/text-correction-en

Name: gudleifrr/text-correction-en
Creator: gudleifrr
Published: 2024-06-06 06:06:17
License: 暂无描述

Hugging Face2024-06-06 更新2024-06-12 收录

下载链接：

https://hf-mirror.com/datasets/gudleifrr/text-correction-en

下载链接

链接失效反馈

官方服务：

资源简介：

--- dataset_info: features: - name: id dtype: string - name: url dtype: string - name: title dtype: string - name: text dtype: string - name: bad_text dtype: string splits: - name: train num_bytes: 1228951148 num_examples: 783520 download_size: 715033519 dataset_size: 1228951148 configs: - config_name: default data_files: - split: train path: data/train-* ---

The dataset includes multiple features such as id, url, title, text, and bad_text, all of which are string types. The dataset is divided into a training set containing 783520 samples, with a total size of 1228951148 bytes. The download size of the dataset is 715033519 bytes. The dataset configuration is set to default, with training data files located at data/train-* path.

提供机构：

gudleifrr

原始信息汇总

数据集概述

数据集特征

id: 数据类型为字符串。
url: 数据类型为字符串。
title: 数据类型为字符串。
text: 数据类型为字符串。
bad_text: 数据类型为字符串。

数据集分割

train:
- 数据量: 1228951148 字节
- 示例数量: 783520

数据集大小

下载大小: 715033519 字节
数据集总大小: 1228951148 字节

配置

config_name: default
- data_files:
  - split: train
    - path: data/train-*

5,000+

优质数据集

54 个

任务类型

进入经典数据集