five

ola13/wikipedia_citations

收藏
Hugging Face2023-05-24 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/ola13/wikipedia_citations
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: - config_name: default features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: format dtype: string - name: publisher dtype: string - name: last dtype: string - name: first dtype: string - name: archiveurl dtype: string - name: urlstatus dtype: string - name: work dtype: string - name: language dtype: string - name: author dtype: string - name: year dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 29536547204 num_examples: 45750324 download_size: 12683322513 dataset_size: 29536547204 - config_name: 20230301.aa features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train download_size: 45886 dataset_size: 0 - config_name: 20230301.ab features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 387102 num_examples: 857 download_size: 3222122 dataset_size: 387102 - config_name: 20230301.ace features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 4265488 num_examples: 4337 download_size: 3608741 dataset_size: 4265488 - config_name: 20230301.ady features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 1660 num_examples: 4 download_size: 1065537 dataset_size: 1660 - config_name: 20230301.af features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 89889221 num_examples: 159932 download_size: 133044790 dataset_size: 89889221 - config_name: 20230301.ak features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 170161 num_examples: 301 download_size: 692116 dataset_size: 170161 - config_name: 20230301.als features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 10169196 num_examples: 21089 download_size: 60679007 dataset_size: 10169196 - config_name: 20230301.alt features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 2004152 num_examples: 2704 download_size: 3845233 dataset_size: 2004152 - config_name: 20230301.am features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 1016959 num_examples: 1562 download_size: 8450310 dataset_size: 1016959 - config_name: 20230301.ami features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train download_size: 1259913 dataset_size: 0 - config_name: 20230301.an features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 8318957 num_examples: 37082 download_size: 42295559 dataset_size: 8318957 - config_name: 20230301.ang features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 270983 num_examples: 475 download_size: 4849741 dataset_size: 270983 - config_name: 20230301.ar features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 2900899732 num_examples: 4229039 download_size: 1610559727 dataset_size: 2900899732 - config_name: 20230301.arc features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 2384 num_examples: 4 download_size: 1216435 dataset_size: 2384 - config_name: 20230301.ary features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 6452887 num_examples: 10571 download_size: 8557208 dataset_size: 6452887 - config_name: 20230301.arz features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 932036810 num_examples: 1570403 download_size: 239271648 dataset_size: 932036810 - config_name: 20230301.as features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 44514889 num_examples: 60972 download_size: 35918397 dataset_size: 44514889 - config_name: 20230301.ast features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 171210748 num_examples: 334041 download_size: 232707623 dataset_size: 171210748 - config_name: 20230301.atj features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train download_size: 728991 dataset_size: 0 - config_name: 20230301.av features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 2344714 num_examples: 3003 download_size: 8458811 dataset_size: 2344714 - config_name: 20230301.avk features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 135757 num_examples: 332 download_size: 9999475 dataset_size: 135757 - config_name: 20230301.awa features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 889915 num_examples: 1087 download_size: 2383110 dataset_size: 889915 - config_name: 20230301.ay features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 25717 num_examples: 52 download_size: 2602828 dataset_size: 25717 - config_name: 20230301.az features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 305578561 num_examples: 429469 download_size: 255702339 dataset_size: 305578561 - config_name: 20230301.azb features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 72619205 num_examples: 117094 download_size: 104641635 dataset_size: 72619205 - config_name: 20230301.ba features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 86705789 num_examples: 150012 download_size: 99635090 dataset_size: 86705789 - config_name: 20230301.ban features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 31814985 num_examples: 39972 download_size: 16420334 dataset_size: 31814985 - config_name: 20230301.bar features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 6206957 num_examples: 13109 download_size: 36275305 dataset_size: 6206957 - config_name: 20230301.bat-smg features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 91639 num_examples: 166 download_size: 5404604 dataset_size: 91639 - config_name: 20230301.bcl features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 20835602 num_examples: 33256 download_size: 17211482 dataset_size: 20835602 - config_name: 20230301.be features: - name: id dtype: string - name: wiki_id dtype: string - name: wiki_url dtype: string - name: wiki_title dtype: string - name: citation_type dtype: string - name: template dtype: string - name: title dtype: string - name: url dtype: string - name: domain dtype: string - name: archiveurl dtype: string - name: format dtype: string - name: publisher dtype: string - name: work dtype: string - name: isbn dtype: string - name: journal dtype: string - name: volume dtype: string - name: doi dtype: string - name: issue dtype: string - name: newspaper dtype: string splits: - name: train num_bytes: 160244430 num_examples: 255562 download_size: 277205567 dataset_size: 160244430 --- # Dataset Card for "wikipedia_citations" Sample usage: ``` simple = load_dataset("ola13/wikipedia_citations", split="train", language="simple", date="20230301") ``` [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
ola13
原始信息汇总

数据集概述

数据集配置

  • config_name: 多个配置名称,包括 default, 20230301.aa20230301.ay
  • features: 每个配置包含相同的特征列表,包括 id, wiki_id, wiki_url, wiki_title, citation_type, template, title, url, domain, archiveurl, format, publisher, work, isbn, journal, volume, doi, issue, newspaper 等,所有特征的数据类型均为 string

数据集分割

  • splits: 每个配置包含一个名为 train 的分割。
  • num_bytes: 训练集的大小,以字节为单位。
  • num_examples: 训练集中的示例数量。

数据集大小

  • download_size: 数据集下载大小。
  • dataset_size: 数据集实际大小。

数据集详细信息

默认配置

  • config_name: default
  • features: 同上
  • splits:
    • name: train
    • num_bytes: 29536547204
    • num_examples: 45750324
  • download_size: 12683322513
  • dataset_size: 29536547204

其他配置示例

  • config_name: 20230301.aa

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 未提供
    • num_examples: 未提供
  • download_size: 45886

  • dataset_size: 0

  • config_name: 20230301.ab

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 387102
    • num_examples: 857
  • download_size: 3222122

  • dataset_size: 387102

  • config_name: 20230301.ady

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 1660
    • num_examples: 4
  • download_size: 1065537

  • dataset_size: 1660

  • config_name: 20230301.af

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 89889221
    • num_examples: 159932
  • download_size: 133044790

  • dataset_size: 89889221

  • config_name: 20230301.ak

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 170161
    • num_examples: 301
  • download_size: 692116

  • dataset_size: 170161

  • config_name: 20230301.als

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 10169196
    • num_examples: 21089
  • download_size: 60679007

  • dataset_size: 10169196

  • config_name: 20230301.alt

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 2004152
    • num_examples: 2704
  • download_size: 3845233

  • dataset_size: 2004152

  • config_name: 20230301.am

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 1016959
    • num_examples: 1562
  • download_size: 8450310

  • dataset_size: 1016959

  • config_name: 20230301.ami

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 未提供
    • num_examples: 未提供
  • download_size: 1259913

  • dataset_size: 0

  • config_name: 20230301.an

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 8318957
    • num_examples: 37082
  • download_size: 42295559

  • dataset_size: 8318957

  • config_name: 20230301.ang

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 270983
    • num_examples: 475
  • download_size: 4849741

  • dataset_size: 270983

  • config_name: 20230301.ar

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 2900899732
    • num_examples: 4229039
  • download_size: 1610559727

  • dataset_size: 2900899732

  • config_name: 20230301.arc

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 2384
    • num_examples: 4
  • download_size: 1216435

  • dataset_size: 2384

  • config_name: 20230301.ary

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 6452887
    • num_examples: 10571
  • download_size: 8557208

  • dataset_size: 6452887

  • config_name: 20230301.arz

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 932036810
    • num_examples: 1570403
  • download_size: 239271648

  • dataset_size: 932036810

  • config_name: 20230301.as

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 44514889
    • num_examples: 60972
  • download_size: 35918397

  • dataset_size: 44514889

  • config_name: 20230301.ast

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 171210748
    • num_examples: 334041
  • download_size: 232707623

  • dataset_size: 171210748

  • config_name: 20230301.atj

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 未提供
    • num_examples: 未提供
  • download_size: 728991

  • dataset_size: 0

  • config_name: 20230301.av

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 2344714
    • num_examples: 3003
  • download_size: 8458811

  • dataset_size: 2344714

  • config_name: 20230301.avk

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 135757
    • num_examples: 332
  • download_size: 9999475

  • dataset_size: 135757

  • config_name: 20230301.awa

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 889915
    • num_examples: 1087
  • download_size: 2383110

  • dataset_size: 889915

  • config_name: 20230301.ay

  • features: 同上

  • splits:

    • name: train
    • num_bytes: 未提供
    • num_examples: 未提供
  • download_size: 未提供

  • dataset_size: 未提供

以上信息总结了数据集的配置、特征、分割以及大小等关键信息。

5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作