parsee-ai/revenues-example
收藏Hugging Face2024-03-20 更新2024-06-11 收录
下载链接:
https://hf-mirror.com/datasets/parsee-ai/revenues-example
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
task_categories:
- table-question-answering
- question-answering
language:
- en
tags:
- llm
- rag
- finance
- pdf
- document processing
size_categories:
- n<1K
---
# Revenues Sample Dataset
parsee-core version used: 0.1.3.14
This dataset was created on the basis of 15 pages from annual/quarterly filings of major German stock-exchange listed companies (PDF files).
All PDF files are publicly accessible on parsee.ai, to access them copy the "source_identifier" (first column) and paste it in this URL (replace '{SOURCE_IDENTIFIER}' with the actual identifier):
https://app.parsee.ai/documents/view/{SOURCE_IDENTIFIER}
So for example:
https://app.parsee.ai/documents/view/a8f9dc45fc64a66a4d419ddb56399bcb79a74cb8948d35e8bfa06671f8c47318
# Methodology
The dataset was created on [Parsee Cloud](https://app.parsee.ai), where all output was checked by a human and corrected prior to running this code.
All prompts were truncated to a max of 8k tokens, but this should not affect the prompts for this dataset, as the files are just single pages and thus quite small.
# LLM Evaluation
Evaluation results and more can be found on Github:
* Readme: https://github.com/parsee-ai/parsee-datasets/tree/main/datasets/revenues/parsee-loader
* Evaluation results: https://github.com/parsee-ai/parsee-datasets/blob/main/datasets/revenues/parsee-loader/evaluation.ipynb
提供机构:
parsee-ai
原始信息汇总
Revenues Sample Dataset 概述
数据集基本信息
- 许可证: MIT
- 任务类别:
- 表格问答
- 问答
- 语言: 英语
- 标签:
- 大型语言模型(LLM)
- 检索增强生成(RAG)
- 金融
- 文档处理
- 大小类别: 小于1K
数据集描述
- 来源: 基于15页来自德国主要股票交易所上市公司的年度/季度报告(PDF文件)。
- 访问方式: 所有PDF文件可通过parsee.ai公开访问,需使用“source_identifier”在指定URL中替换
{SOURCE_IDENTIFIER}进行访问。
数据集创建方法
- 创建平台: 在Parsee Cloud上创建。
- 质量控制: 所有输出均由人工检查并修正。
- 数据处理: 提示被截断至最多8k tokens,但由于文件仅为单页,因此不会影响本数据集的提示。
评估信息
- 评估结果: 可在GitHub上查看,包括评估结果和更多细节。
- 评估结果链接: https://github.com/parsee-ai/parsee-datasets/blob/main/datasets/revenues/parsee-loader/evaluation.ipynb



