Profiling the REACH-chemical space with Generative Topographic Mapping
Source: NIAID Data Ecosystem (indexed 2026-03-12)
Download link:
https://zenodo.org/record/3872734
Description:
The REACH (Registration, Evaluation, Authorisation and Restriction of Chemicals) regulation requires industry to report hazard data for substances placed on the market. Thanks to the registration procedure initiated in 2007, a large REACH database [1] of well-defined (eco)toxicological properties has been created. The REACH-chemical space is defined by the more than 21,500 compounds registered in the REACH database.
Given the large number of chemicals and endpoints for which experimental data are available, there is a growing need both to visualize this chemical space and to profile a compound across several key properties at once.
Here, the data distribution in the REACH-chemical space was visualized and analysed using a 2-dimensional Generative Topographic Map (GTM). As on a geographical map, each object (compound) on a GTM is shown as a data point, and compounds with similar properties tend to lie close to one another. A third dimension can be added to display the distribution of a given (eco)toxicological property (a so-called “property landscape”), which can then be used to assess that property for new compounds projected onto the map.
We report a universal REACH map that accommodates 11 endpoints covering environmental fate and (eco)toxicological properties. The map provides predictions for each property and shows acceptable predictive performance in cross-validation, with balanced accuracy ranging from 0.60 to 0.78. Superimposing different property landscapes makes it possible to delineate “areas of interest” populated by molecules with a desirable (eco)toxicological profile.
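As a rough illustration of how a property landscape can be built and used for prediction, the sketch below assumes that a trained GTM has already produced a responsibility matrix (one row per compound, one column per map node, each row summing to 1). The function names, the toy responsibilities and the 0.5 fallback for empty nodes are all hypothetical choices made for this example; the record does not describe the authors' actual implementation.

```python
import numpy as np

def class_landscape(resp, labels):
    """Responsibility-weighted fraction of positive compounds at each map node.

    resp   : (n_compounds, n_nodes) GTM responsibilities (assumed precomputed)
    labels : (n_compounds,) binary property labels, 1 = positive
    """
    resp = np.asarray(resp, dtype=float)
    labels = np.asarray(labels, dtype=float)
    density = resp.sum(axis=0)      # total responsibility received by each node
    weighted = resp.T @ labels      # responsibility coming from positive compounds
    # Nodes that receive no responsibility are left undefined (NaN).
    return np.divide(weighted, density,
                     out=np.full_like(density, np.nan),
                     where=density > 0)

def predict_from_landscape(resp_new, landscape, empty_value=0.5):
    """Score a projected compound as the responsibility-weighted landscape value."""
    resp_new = np.asarray(resp_new, dtype=float)
    filled = np.nan_to_num(landscape, nan=empty_value)  # assumption: neutral score
    return resp_new @ filled

# Toy data: 3 compounds on a 3-node map (hypothetical values).
resp = [[1.0, 0.0, 0.0],
        [0.5, 0.5, 0.0],
        [0.0, 1.0, 0.0]]
labels = [1, 1, 0]

landscape = class_landscape(resp, labels)
score = predict_from_landscape([0.5, 0.5, 0.0], landscape)
print(landscape, score)
```

Plotting `landscape` over the 2-dimensional node grid would give the "third dimension" described above; a new compound is scored only from the nodes it projects onto.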
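For reference, balanced accuracy for a binary endpoint is the mean of sensitivity and specificity, which makes it less sensitive to class imbalance than plain accuracy. The sketch below is a minimal, dependency-free version with made-up labels; it is not taken from the authors' code.

```python
def balanced_accuracy(y_true, y_pred):
    """Mean of sensitivity and specificity for binary labels (1 = positive)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must be present in y_true")
    sensitivity = tp / (tp + fn)   # fraction of positives recovered
    specificity = tn / (tn + fp)   # fraction of negatives recovered
    return (sensitivity + specificity) / 2

# Hypothetical labels: 3 positives, 5 negatives.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 0, 0, 1]
ba = balanced_accuracy(y_true, y_pred)
print(round(ba, 4))  # sensitivity 2/3, specificity 4/5
```

A value of 0.5 corresponds to random guessing, so the reported range of 0.60 to 0.78 is above chance for every endpoint.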
Created:
2020-09-04



