Molecules in Wikipedia: Analysis of Their Chemical Diversity, Functional Roles, and Popularity

Source: NIAID Data Ecosystem (indexed 2026-05-10)
Download link:
https://figshare.com/articles/dataset/Molecules_in_Wikipedia_Analysis_of_Their_Chemical_Diversity_Functional_Roles_and_Popularity/30917508
Description:
Wikipedia is one of the most widely accessed sources of information worldwide, containing nearly 25,000 molecular entries spanning drugs, natural products, specialty chemicals, and other compounds. Despite its prominence, the chemical content of Wikipedia has not been systematically studied. In this work, we analyzed molecular entries and classified them into use categories, providing a first overview of their roles and applications. Structural diversity was examined using scaffold analysis and UMAP visualization, which revealed well-defined clusters corresponding to major chemical classes. In addition, Wikipedia pageview statistics were analyzed to explore the popularity of molecular entries. These data revealed a strong public focus on CNS-active drugs, recreational substances, and molecules with current medical or cultural relevance, while industrial and specialty chemicals attracted comparatively little attention. Overall, our findings show that Wikipedia offers both a chemically diverse and socially informative perspective on molecules, making it a unique resource at the intersection of chemistry, open data, and public knowledge.
Created:
2025-12-18