five

A corpus of tables in full-text biomedical research publications

收藏
DataCite Commons2020-09-18 更新2025-04-09 收录
下载链接:
https://data.csiro.au/collections/#collection/CIcsiro:24172v1
下载链接
链接失效反馈
官方服务:
资源简介:
The corpus was created to serve as a gold standard for two tasks of information extraction from biomedical tables: (1) mapping of table cells into fine-grained entity types, and (2) identification of relations between table cells. The corpus consists of a small set of full-text articles sourced from the Open Access subset of PubMed Central, for which four types of table annotations were created: (1) cell group, which splits each table into sets of homogeneous cell groups; (2) cell type, which represents the mapping of all cells in a homogeneous group into a single fine-grained named entity label; (3) concept, which represents the mapping of utterances inside table cells (i.e., the syntactic heads of the utterances expanded with their modifiers) into their semantic equivalents from a domain vocabulary; and (4) relation, which represents relations between cell groups.
提供机构:
CSIRO
创建时间:
2017-09-04
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作