nuuuwan/cbsl-annual-reports-chunks
Source: Hugging Face. Updated: 2025-10-16. Indexed: 2025-10-25.
Download link:
https://hf-mirror.com/datasets/nuuuwan/cbsl-annual-reports-chunks
Description:
The dataset records document-level information (document type, ID, count, date, and description) together with document metadata and PDF links. It has a single training split of 50,802 examples, totaling 116 MB. It covers multiple languages, and each document is split into multiple chunks, each carrying its text content and the corresponding metadata.
Provider:
nuuuwan
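
Since the description says each document is split into multiple chunks, a common first step is to regroup chunks by document. The following is a minimal sketch, not the provider's own code. It assumes the Hugging Face `datasets` library is installed for the loading step. The column names `doc_id` and `chunk_text` are hypothetical placeholders, because the listing does not name the actual columns; check the dataset's real schema before use.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def load_chunks(repo_id: str = "nuuuwan/cbsl-annual-reports-chunks"):
    """Load the train split from the Hugging Face Hub (needs network access)."""
    # Imported lazily so the grouping helper below works without `datasets`.
    from datasets import load_dataset

    return load_dataset(repo_id, split="train")


def group_chunks_by_doc(
    rows: Iterable[dict],
    doc_key: str = "doc_id",        # hypothetical column name
    text_key: str = "chunk_text",   # hypothetical column name
) -> Dict[str, List[str]]:
    """Collect chunk texts per document, preserving row order."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row[doc_key]].append(row[text_key])
    return dict(grouped)


# Tiny in-memory stand-in for dataset rows (illustrative values only,
# not taken from the real dataset).
sample_rows = [
    {"doc_id": "doc-a", "chunk_text": "first chunk"},
    {"doc_id": "doc-a", "chunk_text": "second chunk"},
    {"doc_id": "doc-b", "chunk_text": "only chunk"},
]
grouped = group_chunks_by_doc(sample_rows)
```

With the real data, `group_chunks_by_doc(load_chunks(), doc_key=..., text_key=...)` would be called with whatever column names the dataset actually exposes.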



