five

hs-knowledge/sbf-enriched

收藏
Hugging Face2023-06-16 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/hs-knowledge/sbf-enriched
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: whoTarget dtype: string - name: intentYN dtype: string - name: sexYN dtype: string - name: sexReason dtype: string - name: offensiveYN dtype: string - name: annotatorGender dtype: string - name: annotatorMinority dtype: string - name: sexPhrase dtype: string - name: speakerMinorityYN dtype: string - name: WorkerId dtype: string - name: HITId dtype: string - name: annotatorPolitics dtype: string - name: annotatorRace dtype: string - name: annotatorAge dtype: string - name: post dtype: string - name: targetMinority dtype: string - name: targetCategory dtype: string - name: targetStereotype dtype: string - name: dataSource dtype: string - name: ner_output struct: - name: entities list: - name: end dtype: int64 - name: kg_results struct: - name: '@context' struct: - name: '@vocab' dtype: string - name: EntitySearchResult dtype: string - name: detailedDescription dtype: string - name: goog dtype: string - name: kg dtype: string - name: resultScore dtype: string - name: '@type' dtype: string - name: itemListElement list: - name: '@type' dtype: string - name: result struct: - name: '@id' dtype: string - name: '@type' sequence: string - name: description dtype: string - name: detailedDescription struct: - name: articleBody dtype: string - name: license dtype: string - name: url dtype: string - name: image struct: - name: contentUrl dtype: string - name: url dtype: string - name: name dtype: string - name: url dtype: string - name: resultScore dtype: float64 - name: wikidata_id dtype: string - name: query_text dtype: string - name: start dtype: int64 - name: text dtype: string - name: type dtype: string - name: labels sequence: string - name: sentence dtype: string - name: tokens sequence: string - name: entities list: - name: '@type' dtype: string - name: end dtype: int64 - name: kg_result struct: - name: '@id' dtype: string - name: '@type' sequence: string - name: description dtype: string - name: detailedDescription struct: - name: articleBody dtype: string - name: license dtype: string - name: url dtype: string - name: image struct: - name: contentUrl dtype: string - name: url dtype: string - name: name dtype: string - name: url dtype: string - name: resultScore dtype: float64 - name: score dtype: float64 - name: similarity dtype: float64 - name: start dtype: int64 - name: text dtype: string - name: type dtype: string - name: wikidata_id dtype: string splits: - name: test num_bytes: 54962629 num_examples: 17501 - name: validation num_bytes: 55627871 num_examples: 16738 - name: train num_bytes: 379229290 num_examples: 112900 download_size: 0 dataset_size: 489819790 --- # Dataset Card for "sbf-enriched" [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
hs-knowledge
原始信息汇总

数据集概述

数据集名称

"sbf-enriched"

数据集大小

  • 总大小: 489819790 字节
  • 下载大小: 0 字节

数据集结构

特征信息

  • whoTarget: 字符串
  • intentYN: 字符串
  • sexYN: 字符串
  • sexReason: 字符串
  • offensiveYN: 字符串
  • annotatorGender: 字符串
  • annotatorMinority: 字符串
  • sexPhrase: 字符串
  • speakerMinorityYN: 字符串
  • WorkerId: 字符串
  • HITId: 字符串
  • annotatorPolitics: 字符串
  • annotatorRace: 字符串
  • annotatorAge: 字符串
  • post: 字符串
  • targetMinority: 字符串
  • targetCategory: 字符串
  • targetStereotype: 字符串
  • dataSource: 字符串
  • ner_output: 结构体
    • entities: 列表
      • end: 整数
      • kg_results: 结构体
        • @context: 结构体
          • @vocab: 字符串
          • EntitySearchResult: 字符串
          • detailedDescription: 字符串
          • goog: 字符串
          • kg: 字符串
          • resultScore: 字符串
        • @type: 字符串
        • itemListElement: 列表
          • @type: 字符串
          • result: 结构体
            • @id: 字符串
            • @type: 序列
            • description: 字符串
            • detailedDescription: 结构体
              • articleBody: 字符串
              • license: 字符串
              • url: 字符串
            • image: 结构体
              • contentUrl: 字符串
              • url: 字符串
            • name: 字符串
            • url: 字符串
          • resultScore: 浮点数
          • wikidata_id: 字符串
        • query_text: 字符串
      • start: 整数
      • text: 字符串
      • type: 字符串
    • labels: 序列
    • sentence: 字符串
    • tokens: 序列
  • entities: 列表
    • @type: 字符串
    • end: 整数
    • kg_result: 结构体
      • @id: 字符串
      • @type: 序列
      • description: 字符串
      • detailedDescription: 结构体
        • articleBody: 字符串
        • license: 字符串
        • url: 字符串
      • image: 结构体
        • contentUrl: 字符串
        • url: 字符串
      • name: 字符串
      • url: 字符串
    • resultScore: 浮点数
    • score: 浮点数
    • similarity: 浮点数
    • start: 整数
    • text: 字符串
    • type: 字符串
    • wikidata_id: 字符串

数据集分割

  • train: 112900 个例子, 379229290 字节
  • validation: 16738 个例子, 55627871 字节
  • test: 17501 个例子, 54962629 字节
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作