The BriQ Data set
Source: DataCite Commons; updated 2020-08-27; indexed 2024-07-27
Download link:
https://figshare.com/articles/The_BriQ_Data_set/7642055
Description:
This folder contains the annotated corpus in CSV format, organized as follows:
• page-data.csv: contains all the annotated web pages with their HTML content.
• document-data.csv: contains the documents extracted from the web pages, where each document contains a single paragraph and has a set of related tables.
• table-data.csv: contains the tables related to each document, along with the HTML content of each table as extracted from the web page.
• mention-data.csv: contains all the quantity mentions, with ground truth mapping, extracted from the documents.
• mention_table-data.csv: contains the related table for each mention.
• annotations-GT.csv: contains the collected ground truth annotations.
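As a quick way to inspect the folder layout described above, the following is a minimal sketch in Python that reports the header and row count of each listed file. The file names come from the description; everything else is an assumption, since the listing does not document it: comma-delimited files, UTF-8 encoding, a header in the first row, and the helper name `summarize_briq`, which is hypothetical. No column names are assumed.

```python
import csv
import sys
from pathlib import Path

# File names as listed in the dataset description.
BRIQ_FILES = [
    "page-data.csv",
    "document-data.csv",
    "table-data.csv",
    "mention-data.csv",
    "mention_table-data.csv",
    "annotations-GT.csv",
]


def summarize_briq(folder):
    """Return {file name: (header columns, data row count)} for each listed file found in folder.

    Assumptions (not stated in the dataset description): comma delimiter,
    UTF-8 encoding, and a header in the first row.
    """
    # Cells holding whole HTML pages can exceed the csv module's default field size limit.
    csv.field_size_limit(min(sys.maxsize, 2**31 - 1))
    summary = {}
    for name in BRIQ_FILES:
        path = Path(folder) / name
        if not path.is_file():
            continue  # skip files that are missing from this copy of the folder
        with path.open(newline="", encoding="utf-8") as handle:
            reader = csv.reader(handle)
            header = next(reader, [])  # empty list if the file has no rows
            row_count = sum(1 for _ in reader)
        summary[name] = (header, row_count)
    return summary
```

Calling `summarize_briq` on the extracted folder returns the actual column names, which is a reasonable first step before deciding how to join documents, tables, and mentions.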
Provider:
figshare
Created:
2019-01-29



