five

Datasets: Molecular Entities as Structured Data on the Web

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://data.mendeley.com/datasets/n9xwfs5fcj
下载链接
链接失效反馈
官方服务:
资源简介:
Internet search engines have remodeled the use of the internet, making it easy to find the content we are interested in. The Web was originally designed to exchange natural language documents. It is difficult for machines to interpret this type of data. Structured data placed on websites solves this problem by allowing search engines to "understand" the content better. This can also be applied to chemical data. We have developed three tools to convert chemical data into structured data. SDFEater allows to convert SDF files, Molstruct converts CSV files and MEgen is a web application that allows entering data in a form. Using our tools, we generated 10 datasets including 5 main datasets (DS1, DS2, DS3, DS4, and DS5) and 5 small datasets (DS1s, DS2s, DS3s, DS4s, and DS5s) consisting of 10 files with one molecule each. They are based on well-known chemical databases (ChEBI, DrugBank, PubChem) as well as other data (WikiData). We make them available in JSON-LD HTML, JSON-LD, RDFa, and Microdata structured data formats. More details about the inputs and outputs as well as how the data is generated can be found in README.txt.
创建时间:
2021-04-21
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作