five

Kyle1668/LLM-TTA-Cached-Rewrites

收藏
Hugging Face2024-02-07 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Kyle1668/LLM-TTA-Cached-Rewrites
下载链接
链接失效反馈
官方服务:
资源简介:
--- configs: - config_name: default data_files: - split: boss_sentiment_stabilityai_StableBeluga_13B_tempequals0dot0 path: data/boss_sentiment_stabilityai_StableBeluga_13B_tempequals0dot0-* - split: ag_news_twitter_aug_substitute path: data/ag_news_twitter_aug_substitute-* - split: boss_sentiment_aug_insert path: data/boss_sentiment_aug_insert-* - split: ag_news_twitter_aug_insert path: data/ag_news_twitter_aug_insert-* - split: ag_news_twitter_aug_back_translate path: data/ag_news_twitter_aug_back_translate-* - split: boss_toxicity_stabilityai_StableBeluga_7b_tempequals0dot0 path: data/boss_toxicity_stabilityai_StableBeluga_7b_tempequals0dot0-* - split: ag_news_twitter_stabilityai_StableBeluga_7b_tempequals0dot0 path: data/ag_news_twitter_stabilityai_StableBeluga_7b_tempequals0dot0-* - split: boss_sentiment_aug_back_translate path: data/boss_sentiment_aug_back_translate-* - split: boss_toxicity_aug_substitute path: data/boss_toxicity_aug_substitute-* - split: boss_toxicity_aug_insert path: data/boss_toxicity_aug_insert-* - split: boss_sentiment_aug_substitute path: data/boss_sentiment_aug_substitute-* - split: boss_toxicity_aug_back_translate path: data/boss_toxicity_aug_back_translate-* - split: boss_sentiment_stabilityai_StableBeluga_7b_tempequals0dot0 path: data/boss_sentiment_stabilityai_StableBeluga_7b_tempequals0dot0-* dataset_info: features: - name: prompt_hash dtype: string - name: prompt dtype: string - name: rewrites dtype: string splits: - name: boss_sentiment_stabilityai_StableBeluga_13B_tempequals0dot0 num_bytes: 3703118 num_examples: 2132 - name: ag_news_twitter_aug_substitute num_bytes: 22069756 num_examples: 15200 - name: boss_sentiment_aug_insert num_bytes: 101392185 num_examples: 61580 - name: ag_news_twitter_aug_insert num_bytes: 25877025 num_examples: 15200 - name: ag_news_twitter_aug_back_translate num_bytes: 21078091 num_examples: 15200 - name: boss_toxicity_stabilityai_StableBeluga_7b_tempequals0dot0 num_bytes: 659072364 num_examples: 240078 - name: ag_news_twitter_stabilityai_StableBeluga_7b_tempequals0dot0 num_bytes: 82978276 num_examples: 30400 - name: boss_sentiment_aug_back_translate num_bytes: 75819709 num_examples: 61580 - name: boss_toxicity_aug_substitute num_bytes: 200434523 num_examples: 120032 - name: boss_toxicity_aug_insert num_bytes: 222397157 num_examples: 120032 - name: boss_sentiment_aug_substitute num_bytes: 91318472 num_examples: 61580 - name: boss_toxicity_aug_back_translate num_bytes: 186461827 num_examples: 120032 - name: boss_sentiment_stabilityai_StableBeluga_7b_tempequals0dot0 num_bytes: 291064861 num_examples: 123133 download_size: 714228880 dataset_size: 1983667364 --- # Dataset Card for "LLM-TTA-Cached-Rewrites" [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
Kyle1668
原始信息汇总

数据集概述

数据集配置

  • 默认配置
    • 包含多个数据文件,每个文件对应一个特定的分割(split)。

数据文件详情

  • 分割(split)列表
    • boss_sentiment_stabilityai_StableBeluga_13B_tempequals0dot0
    • ag_news_twitter_aug_substitute
    • boss_sentiment_aug_insert
    • ag_news_twitter_aug_insert
    • ag_news_twitter_aug_back_translate
    • boss_toxicity_stabilityai_StableBeluga_7b_tempequals0dot0
    • ag_news_twitter_stabilityai_StableBeluga_7b_tempequals0dot0
    • boss_sentiment_aug_back_translate
    • boss_toxicity_aug_substitute
    • boss_toxicity_aug_insert
    • boss_sentiment_aug_substitute
    • boss_toxicity_aug_back_translate
    • boss_sentiment_stabilityai_StableBeluga_7b_tempequals0dot0

数据集特征

  • 特征列表
    • prompt_hash:字符串类型
    • prompt:字符串类型
    • rewrites:字符串类型

数据分割详情

  • 分割详情
    • boss_sentiment_stabilityai_StableBeluga_13B_tempequals0dot0
      • 字节数:3703118
      • 样本数:2132
    • ag_news_twitter_aug_substitute
      • 字节数:22069756
      • 样本数:15200
    • boss_sentiment_aug_insert
      • 字节数:101392185
      • 样本数:61580
    • ag_news_twitter_aug_insert
      • 字节数:25877025
      • 样本数:15200
    • ag_news_twitter_aug_back_translate
      • 字节数:21078091
      • 样本数:15200
    • boss_toxicity_stabilityai_StableBeluga_7b_tempequals0dot0
      • 字节数:659072364
      • 样本数:240078
    • ag_news_twitter_stabilityai_StableBeluga_7b_tempequals0dot0
      • 字节数:82978276
      • 样本数:30400
    • boss_sentiment_aug_back_translate
      • 字节数:75819709
      • 样本数:61580
    • boss_toxicity_aug_substitute
      • 字节数:200434523
      • 样本数:120032
    • boss_toxicity_aug_insert
      • 字节数:222397157
      • 样本数:120032
    • boss_sentiment_aug_substitute
      • 字节数:91318472
      • 样本数:61580
    • boss_toxicity_aug_back_translate
      • 字节数:186461827
      • 样本数:120032
    • boss_sentiment_stabilityai_StableBeluga_7b_tempequals0dot0
      • 字节数:291064861
      • 样本数:123133

数据集大小

  • 下载大小:714228880 字节
  • 数据集大小:1983667364 字节
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作