
KETI-AIR/kor_hellaswag

Source: Hugging Face. Updated 2023-12-05; indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/KETI-AIR/kor_hellaswag
Resource description:
---
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
  - split: validation
    path: data/validation-*
  - split: test
    path: data/test-*
dataset_info:
  features:
  - name: data_index_by_user
    dtype: int32
  - name: ind
    dtype: int32
  - name: activity_label
    dtype: string
  - name: ctx_a
    dtype: string
  - name: ctx_b
    dtype: string
  - name: ctx
    dtype: string
  - name: endings
    sequence: string
  - name: source_id
    dtype: string
  - name: split
    dtype: string
  - name: split_type
    dtype: string
  - name: label
    dtype: string
  - name: joined
    sequence: string
  splits:
  - name: train
    num_bytes: 105739666
    num_examples: 39905
  - name: validation
    num_bytes: 27367976
    num_examples: 10042
  - name: test
    num_bytes: 26340397
    num_examples: 10003
  download_size: 69994643
  dataset_size: 159448039
license: mit
---

# Dataset Card for "kor_hellaswag"

[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)

# Source Data Citation Information

```
@inproceedings{zellers2019hellaswag,
    title={HellaSwag: Can a Machine Really Finish Your Sentence?},
    author={Zellers, Rowan and Holtzman, Ari and Bisk, Yonatan and Farhadi, Ali and Choi, Yejin},
    booktitle={Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics},
    year={2019}
}
```
Provider:
KETI-AIR
Summary of original information

Dataset overview

Configuration

  • Default configuration (default)
    • Data file paths:
      • Training set (train): data/train-*
      • Validation set (validation): data/validation-*
      • Test set (test): data/test-*
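
The split-to-path mapping above can be exercised in code. The following is a minimal sketch, assuming the third-party `datasets` package is installed and the Hugging Face Hub is reachable when `load_kor_hellaswag` is called. The names `SPLIT_PATTERNS`, `split_for_file`, and `load_kor_hellaswag` are illustrative helpers, not part of the dataset card.

```python
from fnmatch import fnmatch

REPO_ID = "KETI-AIR/kor_hellaswag"

# Split name -> shard glob, copied from the default configuration.
SPLIT_PATTERNS = {
    "train": "data/train-*",
    "validation": "data/validation-*",
    "test": "data/test-*",
}


def split_for_file(path):
    """Return the split whose glob matches `path`, or None if none matches."""
    for split, pattern in SPLIT_PATTERNS.items():
        if fnmatch(path, pattern):
            return split
    return None


def load_kor_hellaswag(split="validation"):
    """Load one split from the Hub (needs the `datasets` package and network)."""
    if split not in SPLIT_PATTERNS:
        raise ValueError(f"unknown split: {split!r}")
    # Imported lazily so the helpers above work without the dependency.
    from datasets import load_dataset
    return load_dataset(REPO_ID, split=split)
```

For instance, `split_for_file` maps a hypothetical shard name such as `data/train-00000-of-00001.parquet` to `"train"`; the actual shard file names in the repository are not listed on this page.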

Features

  • data_index_by_user: data index (by user), type: int32
  • ind: index, type: int32
  • activity_label: activity label, type: string
  • ctx_a: context A, type: string
  • ctx_b: context B, type: string
  • ctx: context, type: string
  • endings: endings, type: sequence of string
  • source_id: source ID, type: string
  • split: split, type: string
  • split_type: split type, type: string
  • label: label, type: string
  • joined: joined, type: sequence of string
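
The feature list above can be written down as a typed record. This is a sketch under stated assumptions: the field names and types come from the list, but treating `label` as a stringified index into `endings` follows the convention of the original HellaSwag release and is not confirmed on this page. The semantics of `joined` are not documented here, so the sketch leaves that field alone. The record `example` is made-up placeholder data, and `gold_index` and `candidates` are hypothetical helpers.

```python
from typing import List, Optional, TypedDict


class HellaSwagRecord(TypedDict):
    data_index_by_user: int
    ind: int
    activity_label: str
    ctx_a: str
    ctx_b: str
    ctx: str
    endings: List[str]
    source_id: str
    split: str
    split_type: str
    label: str
    joined: List[str]


def gold_index(record: HellaSwagRecord) -> Optional[int]:
    """Parse `label` as an index into `endings`.

    Assumption: `label` holds a stringified index, as in the original
    HellaSwag data. Returns None when it is empty or out of range.
    """
    label = record["label"].strip()
    if not label.isdecimal():
        return None
    idx = int(label)
    return idx if idx < len(record["endings"]) else None


def candidates(record: HellaSwagRecord) -> List[str]:
    """Pair the context with each ending, one string per candidate."""
    return [f'{record["ctx"]} {ending}'.strip() for ending in record["endings"]]


# Made-up placeholder record for illustration; not taken from the dataset.
example: HellaSwagRecord = {
    "data_index_by_user": 0,
    "ind": 0,
    "activity_label": "Playing guitar",
    "ctx_a": "A man picks up a guitar.",
    "ctx_b": "He",
    "ctx": "A man picks up a guitar. He",
    "endings": ["throws it away.", "starts to play.", "eats it.", "flies off."],
    "source_id": "placeholder",
    "split": "train",
    "split_type": "indomain",
    "label": "1",
    "joined": [],
}
```

Calling `candidates(example)` yields one full sentence per ending, which is one common way to build inputs for scoring multiple-choice completions.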

Splits

  • Training set (train)
    • Bytes: 105739666
    • Examples: 39905
  • Validation set (validation)
    • Bytes: 27367976
    • Examples: 10042
  • Test set (test)
    • Bytes: 26340397
    • Examples: 10003

Dataset size

  • Download size: 69994643 bytes
  • Dataset size: 159448039 bytes
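
The reported sizes can be cross-checked with a little arithmetic. The sketch below uses only the numbers listed on this page; the variable names are illustrative. It confirms that the per-split byte counts sum to the reported dataset size and prints each split's share of the examples.

```python
# Per-split statistics, copied from the metadata on this page.
SPLITS = {
    "train": {"num_bytes": 105_739_666, "num_examples": 39_905},
    "validation": {"num_bytes": 27_367_976, "num_examples": 10_042},
    "test": {"num_bytes": 26_340_397, "num_examples": 10_003},
}
DOWNLOAD_SIZE = 69_994_643
DATASET_SIZE = 159_448_039

total_bytes = sum(s["num_bytes"] for s in SPLITS.values())
total_examples = sum(s["num_examples"] for s in SPLITS.values())

# The per-split byte counts add up exactly to the reported dataset size.
assert total_bytes == DATASET_SIZE

for name, stats in SPLITS.items():
    share = stats["num_examples"] / total_examples
    print(f"{name}: {stats['num_examples']} examples ({share:.1%})")

print(f"total examples: {total_examples}")
print(f"download / dataset size ratio: {DOWNLOAD_SIZE / DATASET_SIZE:.2f}")
```

The three splits together hold 59,950 examples, about two thirds of them in the training set, and the download is well under half the size of the materialized dataset.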

License

  • MIT License

Source data citation

@inproceedings{zellers2019hellaswag,
    title={HellaSwag: Can a Machine Really Finish Your Sentence?},
    author={Zellers, Rowan and Holtzman, Ari and Bisk, Yonatan and Farhadi, Ali and Choi, Yejin},
    booktitle={Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics},
    year={2019}
}
