five

pie/cdcp

收藏
Hugging Face2024-01-29 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/pie/cdcp
下载链接
链接失效反馈
官方服务:
资源简介:
# PIE Dataset Card for "CDCP" This is a [PyTorch-IE](https://github.com/ChristophAlt/pytorch-ie) wrapper for the [CDCP Huggingface dataset loading script](https://huggingface.co/datasets/DFKI-SLT/cdcp). ## Data Schema The document type for this dataset is `CDCPDocument` which defines the following data fields: - `text` (str) - `id` (str, optional) - `metadata` (dictionary, optional) and the following annotation layers: - `propositions` (annotation type: `LabeledSpan`, target: `text`) - `relations` (annotation type: `BinaryRelation`, target: `propositions`) - `urls` (annotation type: `Attribute`, target: `propositions`) See [here](https://github.com/ChristophAlt/pytorch-ie/blob/main/src/pytorch_ie/annotations.py) for the annotation type definitions. ## Document Converters The dataset provides document converters for the following target document types: - `pytorch_ie.documents.TextDocumentWithLabeledSpansAndBinaryRelations` See [here](https://github.com/ChristophAlt/pytorch-ie/blob/main/src/pytorch_ie/documents.py) for the document type definitions.
提供机构:
pie
原始信息汇总

PIE Dataset Card for "CDCP"

数据结构

数据集的文档类型为CDCPDocument,定义了以下数据字段:

  • text (str)
  • id (str, 可选)
  • metadata (字典, 可选)

以及以下注释层:

  • propositions (注释类型: LabeledSpan, 目标: text)
  • relations (注释类型: BinaryRelation, 目标: propositions)
  • urls (注释类型: Attribute, 目标: propositions)

文档转换器

数据集提供了以下目标文档类型的文档转换器:

  • pytorch_ie.documents.TextDocumentWithLabeledSpansAndBinaryRelations
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作