five

The financial narrative summarisation shared task (FNS 2022 & 2023): Datasets

收藏
DataCite Commons2025-11-12 更新2025-04-10 收录
下载链接:
https://edatos.consorciomadrono.es/citation?persistentId=doi:10.21950/WRH0SO
下载链接
链接失效反馈
官方服务:
资源简介:
<p>Financial Narrative Processing (FNP) consists of workshops organized by Lancaster University at international NLP conferences to address various aspects of automatic processing of financial narratives, including automatic summarization. The LLI-UAM participated in 2022 and 2023 by creating Spanish-language datasets for the FNS shared task (evaluating AI systems using the same dataset to compare different approaches).</p> <p>The dataset consists of complete annual reports from companies, chairmen's letters (which are considered summaries of the reports), and a version created by linguists that consists of a summary of the chairmen's letters in fewer than 1,000 words. Based on the dataset, participants train their models to generate summaries similar to the chairman's letter or the simplified version for new evaluation reports that were not shared during training. The evaluation is conducted using the ROUGE metric.</p> <p>The dataset is composed of 262 financial reports taken from the FinT-esp corpus. The reports were originally in PDF format and were converted into plain text, removing tables, footnotes, headers, and retaining only the narrative content. The length of the reports ranges from 40 to 400 pages, with an average of 36,285 words. A total of 262 chairman's letters were extracted, and an additional 262 summary documents were created, each containing fewer than 1,000 words. This publication is about the dataset from the 2022 and 2023 competition.</p> <p>These are txt files containing the full report, their respective chairmen's letters, and the summaries of these letters. They belong to The Financial Narrative Summarisation Shared Task (2022 and 2023).</p>
提供机构:
e-cienciaDatos
创建时间:
2025-03-13
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作