five

Combining Suspect Screening with Large Language Model-Based Text Mining to Comprehensively Characterize Organic Compounds in Human Milk Associated with Pregnancy Complications

收藏
NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://figshare.com/articles/dataset/Combining_Suspect_Screening_with_Large_Language_Model-Based_Text_Mining_to_Comprehensively_Characterize_Organic_Compounds_in_Human_Milk_Associated_with_Pregnancy_Complications/30951484
下载链接
链接失效反馈
官方服务:
资源简介:
Chemical exposure contributes to maternal pregnancy complications like gestational hypertension (GH), anemia, and gestational diabetes mellitus (GDM). However, current studies remain fragmented due to limited analysis of compounds, impeding mechanistic insights. Here, we present a novel framework that integrates high-throughput analysis and large language model-based text mining to identify organic compounds while leveraging existing massive data, thereby enabling a comprehensive understanding of pregnancy complication mechanisms and establishing an exposure atlas. Using this approach, we identified five compounds in human milk for the first time, including carbazole and 4,4′-diphenoxybenzophenone, and 35 additional compounds not previously linked to pregnancy complications. We further employed text mining to comprehensively uncover disease-specific chemical signatures based on global data: GH with polycyclic aromatic hydrocarbons (PAHs) and derivatives (e.g., 2-methylnaphthalene and acenaphthene), anemia with nitrogen-containing compounds (e.g., 4-methoxyformanilide), and GDM with long-chain carboxylic acids (e.g., 2,4,7,9-tetramethyldec-5-yne-4,7-diol). Further analysis revealed pathogenic mechanisms: PAHs and derivatives promoted oxidative stress in GH, nitrogen-containing compounds damaged red blood cells in anemia, and long-chain carboxylic acids interfered with mitochondrial function in GDM. These findings construct an atlas of organic compounds associated with pregnancy complications and offer new leads for understanding their environmental origins.
创建时间:
2025-12-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作