DATATALES

Name: DATATALES
Creator: 新加坡国立大学, 力拓, Sea AI Lab
Published: 2024-10-23 21:30:02
License: 暂无描述

arXiv2024-10-23 更新2024-10-25 收录

下载链接：

https://github.com/yajingyang/DataTales/

下载链接

链接失效反馈

官方服务：

资源简介：

DATATALES是由新加坡国立大学、力拓和Sea AI Lab共同创建的一个用于评估语言模型在数据叙述任务中表现的新基准数据集。该数据集包含4900份金融报告及其对应的市场数据，旨在解决现有基准在实际应用中分析复杂性不足的问题。数据集内容涵盖广泛的市场数据，包括股票、债券、货币和商品等，通过复杂的分析操作生成详细的叙述报告。创建过程包括市场报告收集、句子分类和数据提取与对齐三个主要步骤。DATATALES主要应用于金融领域的自然语言处理任务，旨在提升模型在复杂数据分析和叙述中的准确性和深度。

DATATALES is a novel benchmark dataset jointly created by the National University of Singapore, Rio Tinto, and Sea AI Lab, designed to evaluate the performance of language models on data narration tasks. This dataset contains 4,900 financial reports and their corresponding market data, aiming to address the insufficient analytical complexity of existing benchmarks in real-world applications. The dataset covers a wide range of market data including stocks, bonds, currencies, commodities and more, and generates detailed narrative reports through sophisticated analytical operations. Its creation process involves three main steps: market report collection, sentence classification, and data extraction and alignment. DATATALES is primarily applied to natural language processing tasks in the financial domain, with the goal of enhancing the accuracy and depth of models in complex data analysis and narrative generation.

提供机构：

新加坡国立大学, 力拓, Sea AI Lab

创建时间：

2024-10-23

搜集汇总

数据集介绍

构建方式

DATATALES数据集通过三个关键步骤构建，以确保数据叙述任务的高质量。首先，从多个在线来源收集每日市场报告，涵盖广泛的部门（如股票、国债、货币、商品），并确保分析深度，包括因果分析、趋势分析和预测。其次，使用ChatGPT进行句子级别的分类，将句子分为市场走势、市场背景、外部事件和影响以及预测和建议，确保内容基于表格数据。最后，通过识别常见描述的金融工具，从多个在线数据库中提取相应的表格数据，并进行手动验证，确保数据准确性。

特点

DATATALES数据集的独特之处在于其提供了4.9k份财务报告及其对应的市场数据，强调了深入分析而非简单的信息转换。该数据集包含了复杂的分析操作，如趋势分析、因果分析和预测分析，以及专业术语的使用，模拟了真实世界的数据叙述挑战。此外，数据集还包含了大量的输入数据，要求模型从广泛的数据中选择并准确复制复杂的报告。

使用方法

DATATALES数据集主要用于评估语言模型在数据叙述任务中的表现，特别是在将复杂表格数据转化为清晰叙述的能力。研究者可以使用该数据集来测试和改进模型在零样本和微调设置下的性能，通过比较模型生成的报告与人类创建的报告在事实准确性、洞察力和语言风格方面的差异。此外，数据集还可用于开发新的自动化评估方法，专注于准确性和洞察质量。

背景与挑战

背景概述

DATATALES, introduced by Yajing Yang, Qian Liu, and Min-Yen Kan from the National University of Singapore and Sea AI Lab, is a pioneering benchmark designed to evaluate the proficiency of language models in data narration. This task is crucial for transforming complex tabular data into accessible narratives, which is essential for business decision-making and information accessibility. DATATALES addresses the limitations of existing benchmarks by offering 4.9k financial reports paired with corresponding market data, emphasizing the need for models to create clear narratives and analyze large datasets while understanding specialized financial terminology. The dataset was created to highlight the significant challenges language models face in achieving the necessary precision and analytical depth for proficient data narration, suggesting promising avenues for future model development and evaluation methodologies.

当前挑战

The primary challenge DATATALES addresses is the complexity and analytical depth required for data narration, which surpasses traditional data-to-text tasks. The dataset's creation involved significant challenges, including the need to pair financial reports with comprehensive market data and ensure the reports reflect real-world data narration complexities. The dataset's unique features, such as extensive input data and advanced analytics capabilities, pose significant challenges to state-of-the-art models. These challenges include achieving the required accuracy and analytical depth, integrating large input volumes, and effectively using domain-specific financial lexicon. Additionally, the dataset highlights the need for advanced reasoning capabilities over extensive data and the importance of domain knowledge for sophisticated analytical tasks.

常用场景

经典使用场景

DATATALES数据集的经典使用场景在于评估语言模型在数据叙述任务中的熟练度，特别是将复杂的表格数据转化为易于理解的叙述。该数据集通过提供4.9k份金融报告及其对应的市场数据，展示了模型在创建清晰叙述和分析大数据集时对专业术语的理解能力。

解决学术问题

DATATALES数据集解决了现有基准在捕捉实际应用所需分析复杂性方面的不足。它强调了模型在数据叙述中所需的精确性和分析深度，为未来模型开发和评估方法提供了有前景的方向。

衍生相关工作

DATATALES数据集催生了一系列相关工作，特别是在金融自然语言处理领域。例如，它启发了对欺诈检测、情感分析和问答任务的研究，这些任务需要对市场数据进行深入分析和合成。

以上内容由遇见数据集搜集并总结生成

5,000+

优质数据集

54 个

任务类型

进入经典数据集