yabramuvdi/InformesBanRep
收藏Hugging Face2024-12-20 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/yabramuvdi/InformesBanRep
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含哥伦比亚共和国银行的货币政策报告,数据以表格形式(104 x 2)呈现。每条记录包含两个属性:1. **fecha**:报告发布的年月日(AAA-MM-DD),其中日期不精确,所有日期的日均为该月的第一天;2. **text**:从原始PDF报告中提取的文本,以markdown格式保存,以尽可能保留文档结构(如标题、子标题),并包含图像或表格部分的指示。数据来源于哥伦比亚共和国银行的官方网站,使用Docling工具从PDF中提取文本,未进行任何文本预处理以保留所有信息。
This dataset contains text data from the minutes of monetary policy meetings of the Bank of the Republic of Colombia, formatted as a table (185 x 2). Each record contains two attributes: date (YYYY-MM-DD) and text extracted from the original PDFs. The purpose of the dataset is to increase the number of public databases in Spanish, and the data can be obtained from the official website of the Bank of the Republic of Colombia. The data processing involved using the PyMuPDF tool to extract text from PDFs, with no text preprocessing applied to preserve all possible information.
提供机构:
yabramuvdi



