five

OntoChem PFAS CORE and Patent Files for MetFrag

收藏
NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/6034586
下载链接
链接失效反馈
官方服务:
资源简介:
These are MetFraggable versions of the PFAS lists produced by OntoChem by performing literature mining of the CORE database (27K entries) and Google Patent collections (1.7M entries). The MetFrag versions have undergone filtering to remove entries that prevent MetFrag running (i.e. multiple-entry formulas and certain elements). Each file contains a tag whether the PFAS fits three definitions, A, B and C. The CORE database also contains the number of references in which the PFAS entry was found. Each file is available as CSV or in compressed form. PFAS Definition A: Each compound that contains a CF2 group PFAS Definition B: Each compound that contains a (AH)(AH)(F)C-C(AH)F2 group, where AH groups could be hydrogen or any other atom and the bond between both aliphatic carbon atoms is a single bond PFAS Definition C: Each compound that contains a (R1)(R2)(F)C-C(R3)F2 group is considered a PFAS, where the R groups are any atom except hydrogen and the bond between both aliphatic carbon atoms is a single bond Please note these files are very large (especially patents) and should not be uploaded to MetFragWeb directly - they will be available from the dropdown menu. These are provided for command line users, and any other workflows interested in these files! The patent file is 1.7 million entries and can cause some delay in the command line, compared with smaller database files. Full details are available in this preprint by Barnabas et al (2022) DOI: 10.26434/chemrxiv-2022-nmnnd-v2 Update 20/04/2022: uploaded files with updated CID mappings post-PubChem deposition.
创建时间:
2022-04-21
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作