five

Table understanding proposal and HTML tables

收藏
Mendeley Data2021-03-09 更新2026-04-09 收录
下载链接:
https://data.mendeley.com/datasets/87gr74cr4r/2
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset describes the on-line materials that accompany article "A Coral-Reef Approach to Extracting Information from HTML Tables", by Patricia Jiménez, Juan C. Roldán, and Rafael Corchuelo. The materials are provided in a zip file that contains the following folders: - "DATA": contains the original HTML tables from which to extract the information as well as some data to configurate the different proposals.. - "NOTEBOOK": it is a Jupyter Notebook that provides the python code required to run and test Coraline. There is a "launch.cmd/sh" script that launches the experimentation according to the operating system. There is also a README.txt file and a requirements.txt file. The former contains the instructions to launch the notebook. The latter provides a number of packages that should be installed prior to launch the notebook. Note that the folder called "output" contains the csv files with the results achieved regarding effectiveness and efficiency for every competitor, which are already implemented in the notebook.
创建时间:
2021-03-09
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作