Profiling the REACH-chemical space with Generative Topographic Mapping

Source: NIAID Data Ecosystem (indexed 2026-03-12)
Download link:
https://zenodo.org/record/3872734
Resource description:
The REACH (Registration, Evaluation, Authorisation and Restriction of Chemicals) regulation requires industry to report hazard data for substances placed on the market. Thanks to the registration procedure initiated in 2007, a large REACH database [1] of well-defined (eco)toxicological properties has been created. The REACH chemical space is defined by the more than 21,500 compounds registered in this database. Given the large number of chemicals and endpoints for which experimental data are available, there is a growing need to visualize this chemical space and to profile a compound by integrating several key properties at once. Here, the data distribution in the REACH chemical space was visualized and analysed with the help of a two-dimensional Generative Topographic Map (GTM). As on a geographical map, each object (compound) on a GTM is shown as a data point, and compounds with similar properties tend to be located close to one another. A third dimension can be added to display the distribution of a given (eco)toxicological property (a so-called “property landscape”), which can then be used to assess that property for new compounds projected onto the map. We report a universal REACH map that accommodates 11 endpoints covering environmental fate and (eco)toxicological properties. This map provides predictions for each property and shows acceptable predictive performance in cross-validation: balanced accuracy ranges from 0.60 to 0.78. Superimposing different property landscapes makes it possible to delineate “areas of interest” populated by molecules with a desirable (eco)toxicological profile.
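
The idea of a property landscape and its use for prediction can be illustrated with a minimal sketch. It assumes that a GTM has already been fitted and that each compound is described by a vector of responsibilities over the map nodes (rows summing to 1). The function names, the toy responsibilities, and the responsibility-weighted class fractions are illustrative assumptions, not the procedure used to build the REACH map. Balanced accuracy is computed here as the mean of the per-class recalls.

```python
import numpy as np

def class_landscape(resp, labels, n_classes=2):
    """Per-node class fractions from responsibilities (hypothetical helper).

    resp:   (n_compounds, n_nodes) array, each row sums to 1
    labels: (n_compounds,) integer class labels in [0, n_classes)
    """
    onehot = np.eye(n_classes)[labels]           # (n_compounds, n_classes)
    mass = resp.T @ onehot                       # class mass accumulated per node
    total = mass.sum(axis=1, keepdims=True)      # total mass per node
    # Nodes with no mass fall back to a uniform class distribution.
    uniform = np.full_like(mass, 1.0 / n_classes)
    return np.divide(mass, total, out=uniform, where=total > 0)

def predict(resp_new, landscape):
    """Assign each projected compound the class with the highest weighted score."""
    return (resp_new @ landscape).argmax(axis=1)

def balanced_accuracy(y_true, y_pred, n_classes=2):
    """Mean recall over the classes that occur in y_true."""
    recalls = [
        np.mean(y_pred[y_true == c] == c)
        for c in range(n_classes)
        if np.any(y_true == c)
    ]
    return float(np.mean(recalls))

# Toy data: four compounds on a two-node map (made-up responsibilities).
resp = np.array([[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.2, 0.8]])
labels = np.array([1, 1, 0, 0])

landscape = class_landscape(resp, labels)
predictions = predict(resp, landscape)
score = balanced_accuracy(labels, predictions)
```

In a real evaluation the landscape would be built from training folds only and the score computed on held-out compounds, as in cross-validation.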
Created:
2020-09-04