hs-knowledge/hatecheck-enriched
收藏Hugging Face2023-06-15 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/hs-knowledge/hatecheck-enriched
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: functionality
dtype: string
- name: case_id
dtype: int64
- name: test_case
dtype: string
- name: label_gold
dtype: string
- name: target_ident
dtype: string
- name: direction
dtype: string
- name: focus_words
dtype: string
- name: focus_lemma
dtype: string
- name: ref_case_id
dtype: float64
- name: ref_templ_id
dtype: float64
- name: templ_id
dtype: int64
- name: case_templ
dtype: string
- name: ner_output
struct:
- name: entities
list:
- name: end
dtype: int64
- name: kg_results
struct:
- name: '@context'
struct:
- name: '@vocab'
dtype: string
- name: EntitySearchResult
dtype: string
- name: detailedDescription
dtype: string
- name: goog
dtype: string
- name: kg
dtype: string
- name: resultScore
dtype: string
- name: '@type'
dtype: string
- name: itemListElement
list:
- name: '@type'
dtype: string
- name: result
struct:
- name: '@id'
dtype: string
- name: '@type'
sequence: string
- name: description
dtype: string
- name: detailedDescription
struct:
- name: articleBody
dtype: string
- name: license
dtype: string
- name: url
dtype: string
- name: image
struct:
- name: contentUrl
dtype: string
- name: url
dtype: string
- name: name
dtype: string
- name: url
dtype: string
- name: resultScore
dtype: float64
- name: wikidata_id
dtype: string
- name: query_text
dtype: string
- name: start
dtype: int64
- name: text
dtype: string
- name: type
dtype: string
- name: labels
sequence: string
- name: sentence
dtype: string
- name: tokens
sequence: string
- name: entities
list:
- name: '@type'
dtype: string
- name: end
dtype: int64
- name: kg_result
struct:
- name: '@id'
dtype: string
- name: '@type'
sequence: string
- name: description
dtype: string
- name: detailedDescription
struct:
- name: articleBody
dtype: string
- name: license
dtype: string
- name: url
dtype: string
- name: image
struct:
- name: contentUrl
dtype: string
- name: url
dtype: string
- name: name
dtype: string
- name: url
dtype: string
- name: resultScore
dtype: float64
- name: score
dtype: float64
- name: similarity
dtype: float64
- name: start
dtype: int64
- name: text
dtype: string
- name: type
dtype: string
- name: wikidata_id
dtype: string
splits:
- name: test
num_bytes: 1647429
num_examples: 3728
download_size: 392671
dataset_size: 1647429
---
# Dataset Card for "hatecheck-enriched"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
hs-knowledge
原始信息汇总
数据集概述
数据集名称
"hatecheck-enriched"
数据集特征
- functionality: 字符串类型
- case_id: 整数类型(int64)
- test_case: 字符串类型
- label_gold: 字符串类型
- target_ident: 字符串类型
- direction: 字符串类型
- focus_words: 字符串类型
- focus_lemma: 字符串类型
- ref_case_id: 浮点数类型(float64)
- ref_templ_id: 浮点数类型(float64)
- templ_id: 整数类型(int64)
- case_templ: 字符串类型
- ner_output: 结构类型,包含以下子特征:
- entities: 列表类型,包含:
- end: 整数类型(int64)
- kg_results: 结构类型,包含:
- @context: 结构类型,包含:
- @vocab: 字符串类型
- EntitySearchResult: 字符串类型
- detailedDescription: 字符串类型
- goog: 字符串类型
- kg: 字符串类型
- resultScore: 字符串类型
- @type: 字符串类型
- itemListElement: 列表类型,包含:
- @type: 字符串类型
- result: 结构类型,包含:
- @id: 字符串类型
- @type: 序列类型(sequence)
- description: 字符串类型
- detailedDescription: 结构类型,包含:
- articleBody: 字符串类型
- license: 字符串类型
- url: 字符串类型
- image: 结构类型,包含:
- contentUrl: 字符串类型
- url: 字符串类型
- name: 字符串类型
- url: 字符串类型
- resultScore: 浮点数类型(float64)
- wikidata_id: 字符串类型
- query_text: 字符串类型
- @context: 结构类型,包含:
- start: 整数类型(int64)
- text: 字符串类型
- type: 字符串类型
- labels: 序列类型(sequence)
- sentence: 字符串类型
- tokens: 序列类型(sequence)
- entities: 列表类型,包含:
- entities: 列表类型,包含:
- @type: 字符串类型
- end: 整数类型(int64)
- kg_result: 结构类型,包含:
- @id: 字符串类型
- @type: 序列类型(sequence)
- description: 字符串类型
- detailedDescription: 结构类型,包含:
- articleBody: 字符串类型
- license: 字符串类型
- url: 字符串类型
- image: 结构类型,包含:
- contentUrl: 字符串类型
- url: 字符串类型
- name: 字符串类型
- url: 字符串类型
- resultScore: 浮点数类型(float64)
- score: 浮点数类型(float64)
- similarity: 浮点数类型(float64)
- start: 整数类型(int64)
- text: 字符串类型
- type: 字符串类型
- wikidata_id: 字符串类型
数据集分割
- test: 数据大小为1647429字节,包含3728个样本。
数据集大小
- 下载大小: 392671字节
- 数据集大小: 1647429字节



