hs-knowledge/sbf-enriched
Source: Hugging Face. Updated: 2023-06-16. Indexed: 2024-03-04
Download link:
https://hf-mirror.com/datasets/hs-knowledge/sbf-enriched
Resource description:
---
dataset_info:
features:
- name: whoTarget
dtype: string
- name: intentYN
dtype: string
- name: sexYN
dtype: string
- name: sexReason
dtype: string
- name: offensiveYN
dtype: string
- name: annotatorGender
dtype: string
- name: annotatorMinority
dtype: string
- name: sexPhrase
dtype: string
- name: speakerMinorityYN
dtype: string
- name: WorkerId
dtype: string
- name: HITId
dtype: string
- name: annotatorPolitics
dtype: string
- name: annotatorRace
dtype: string
- name: annotatorAge
dtype: string
- name: post
dtype: string
- name: targetMinority
dtype: string
- name: targetCategory
dtype: string
- name: targetStereotype
dtype: string
- name: dataSource
dtype: string
- name: ner_output
struct:
- name: entities
list:
- name: end
dtype: int64
- name: kg_results
struct:
- name: '@context'
struct:
- name: '@vocab'
dtype: string
- name: EntitySearchResult
dtype: string
- name: detailedDescription
dtype: string
- name: goog
dtype: string
- name: kg
dtype: string
- name: resultScore
dtype: string
- name: '@type'
dtype: string
- name: itemListElement
list:
- name: '@type'
dtype: string
- name: result
struct:
- name: '@id'
dtype: string
- name: '@type'
sequence: string
- name: description
dtype: string
- name: detailedDescription
struct:
- name: articleBody
dtype: string
- name: license
dtype: string
- name: url
dtype: string
- name: image
struct:
- name: contentUrl
dtype: string
- name: url
dtype: string
- name: name
dtype: string
- name: url
dtype: string
- name: resultScore
dtype: float64
- name: wikidata_id
dtype: string
- name: query_text
dtype: string
- name: start
dtype: int64
- name: text
dtype: string
- name: type
dtype: string
- name: labels
sequence: string
- name: sentence
dtype: string
- name: tokens
sequence: string
- name: entities
list:
- name: '@type'
dtype: string
- name: end
dtype: int64
- name: kg_result
struct:
- name: '@id'
dtype: string
- name: '@type'
sequence: string
- name: description
dtype: string
- name: detailedDescription
struct:
- name: articleBody
dtype: string
- name: license
dtype: string
- name: url
dtype: string
- name: image
struct:
- name: contentUrl
dtype: string
- name: url
dtype: string
- name: name
dtype: string
- name: url
dtype: string
- name: resultScore
dtype: float64
- name: score
dtype: float64
- name: similarity
dtype: float64
- name: start
dtype: int64
- name: text
dtype: string
- name: type
dtype: string
- name: wikidata_id
dtype: string
splits:
- name: test
num_bytes: 54962629
num_examples: 17501
- name: validation
num_bytes: 55627871
num_examples: 16738
- name: train
num_bytes: 379229290
num_examples: 112900
download_size: 0
dataset_size: 489819790
---
# Dataset Card for "sbf-enriched"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
Provider:
hs-knowledge
Summary of original information
Dataset overview
Dataset name
"sbf-enriched"
Dataset size
- Total size: 489819790 bytes
- Download size: 0 bytes
Dataset structure
Features
- whoTarget: string
- intentYN: string
- sexYN: string
- sexReason: string
- offensiveYN: string
- annotatorGender: string
- annotatorMinority: string
- sexPhrase: string
- speakerMinorityYN: string
- WorkerId: string
- HITId: string
- annotatorPolitics: string
- annotatorRace: string
- annotatorAge: string
- post: string
- targetMinority: string
- targetCategory: string
- targetStereotype: string
- dataSource: string
- ner_output: struct
- entities: list
- end: integer
- kg_results: struct
- @context: struct
- @vocab: string
- EntitySearchResult: string
- detailedDescription: string
- goog: string
- kg: string
- resultScore: string
- @type: string
- itemListElement: list
- @type: string
- result: struct
- @id: string
- @type: sequence of strings
- description: string
- detailedDescription: struct
- articleBody: string
- license: string
- url: string
- image: struct
- contentUrl: string
- url: string
- name: string
- url: string
- resultScore: float
- wikidata_id: string
- query_text: string
- start: integer
- text: string
- type: string
- labels: sequence of strings
- sentence: string
- tokens: sequence of strings
- entities: list
- @type: string
- end: integer
- kg_result: struct
- @id: string
- @type: sequence of strings
- description: string
- detailedDescription: struct
- articleBody: string
- license: string
- url: string
- image: struct
- contentUrl: string
- url: string
- name: string
- url: string
- resultScore: float
- score: float
- similarity: float
- start: integer
- text: string
- type: string
- wikidata_id: string
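The feature list above is flattened, so the exact nesting is not recoverable from this page. As a minimal sketch, assuming that `start`, `end`, `text`, `type`, `wikidata_id`, and `kg_result` are per-entity fields of the top-level `entities` list, and that `start`/`end` are character offsets into `post`, a record could be reduced to one row per entity like this. The record, its values, and the helper name `linked_entities` are hypothetical and serve only as illustration.

```python
from typing import Any

# Hypothetical record shaped like the feature list; every value is invented.
record: dict[str, Any] = {
    "post": "Alice moved to Paris last year.",
    "entities": [
        {"start": 0, "end": 5, "text": "Alice", "type": "PER",
         "wikidata_id": None, "kg_result": None},
        {"start": 15, "end": 20, "text": "Paris", "type": "LOC",
         "wikidata_id": "Q90",
         "kg_result": {"name": "Paris", "description": "Capital of France"}},
    ],
}


def linked_entities(rec: dict[str, Any]) -> list[dict[str, Any]]:
    """Return one flat row per entity, tolerating missing or null fields."""
    rows = []
    for ent in rec.get("entities") or []:
        # kg_result may be absent or null when no knowledge-graph match exists.
        kg = ent.get("kg_result") or {}
        rows.append({
            "text": ent.get("text"),
            "span": (ent.get("start"), ent.get("end")),
            "type": ent.get("type"),
            "wikidata_id": ent.get("wikidata_id"),
            "kg_name": kg.get("name"),
        })
    return rows


rows = linked_entities(record)
for row in rows:
    print(row)
```

If the offsets assumption holds, `record["post"][start:end]` should reproduce each entity's `text`, which is a quick sanity check to run on real rows before relying on the spans.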
Dataset splits
- train: 112900 examples, 379229290 bytes
- validation: 16738 examples, 55627871 bytes
- test: 17501 examples, 54962629 bytes
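The split figures can be cross-checked against the reported total size with a few lines of arithmetic. The sketch below uses only the numbers listed on this page; the variable names are illustrative.

```python
# Split metadata copied from the listing above.
splits = {
    "train": {"num_examples": 112900, "num_bytes": 379229290},
    "validation": {"num_examples": 16738, "num_bytes": 55627871},
    "test": {"num_examples": 17501, "num_bytes": 54962629},
}
DATASET_SIZE = 489819790  # reported total size in bytes

total_examples = sum(s["num_examples"] for s in splits.values())
total_bytes = sum(s["num_bytes"] for s in splits.values())

# The per-split byte counts should add up to the reported dataset size.
print("total examples:", total_examples)
print("bytes match reported size:", total_bytes == DATASET_SIZE)

for name, s in splits.items():
    share = s["num_examples"] / total_examples
    avg_bytes = s["num_bytes"] / s["num_examples"]
    print(f"{name:<10} {share:6.1%} of examples, ~{avg_bytes:,.0f} bytes per example")
```

The three byte counts sum exactly to the reported 489819790 bytes, and the splits contain 147139 examples in total.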



