Docugami/dfm-csl-large-benchmark
收藏Hugging Face2023-10-04 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Docugami/dfm-csl-large-benchmark
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
language:
- en
size_categories:
- 1K<n<10K
source_datasets:
- original
task_categories:
- text2text-generation
- text-generation
dataset_info:
features:
- name: Text
dtype: string
- name: Ground Truth
dtype: string
- name: docugami/dfm-csl-large
dtype: string
splits:
- name: eval
num_bytes: 1137328
num_examples: 1088
- name: train
num_bytes: 83236
num_examples: 104
download_size: 572546
dataset_size: 1220564
tags:
- docugami
- dfm-csl
- xml-knowledge-graphs
pretty_name: Contextual Semantic Lables (Large)
---
# Contextual Semantic Labels (Large) Benchmark Dataset
Please see [https://github.com/docugami/DFM-benchmarks](https://github.com/docugami/DFM-benchmarks) for more details, eval code, and current scores for different models.
# Using Dataset
Please refer to standard huggingface documentation to use this dataset: [https://huggingface.co/docs/datasets/index](https://huggingface.co/docs/datasets/index)
The [explore.ipynb](./explore.ipynb) notebook has some reference code.
提供机构:
Docugami
原始信息汇总
数据集概述
基本信息
- 许可证: MIT
- 语言: 英语
- 数据规模: 1K < n < 10K
- 数据来源: 原始数据
- 任务类别:
- 文本到文本生成
- 文本生成
数据集特征
- 特征:
- Text: 数据类型为字符串
- Ground Truth: 数据类型为字符串
- docugami/dfm-csl-large: 数据类型为字符串
数据分割
- 分割:
- eval:
- 字节数: 1137328
- 样本数: 1088
- train:
- 字节数: 83236
- 样本数: 104
- eval:
数据集大小
- 下载大小: 572546 字节
- 数据集大小: 1220564 字节
标签
- 标签:
- docugami
- dfm-csl
- xml-knowledge-graphs
数据集名称
- 名称: Contextual Semantic Labels (Large)



