nanonets/long_sparse_unstructured_table
收藏Hugging Face2025-05-07 更新2025-07-05 收录
下载链接:
https://hf-mirror.com/datasets/nanonets/long_sparse_unstructured_table
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是合成的,用于创建具有以下特征的表格:单元格空百分比在40%到70%之间(稀疏),行与列之间没有分隔符(非结构化),表格行数在15到30之间,列数在7到15之间。数据集适用于表格问答任务,并涉及到OCR和IDP技术。
This dataset is synthetically generated to create tables with the following characteristics: an empty cell percentage within the range of 40% to 70% (Sparse), no separator between rows and columns (un-structured), and a number of rows between 15 and 30, and columns between 7 and 15. The dataset is used for table question answering tasks and involves OCR and IDP technologies.
提供机构:
nanonets



