five

PubDigest: Replication Data and Additional information

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://doi.org/10.7910/DVN/TWZAGW
下载链接
链接失效反馈
官方服务:
资源简介:
PubDigest can be found at: https://github.com/dansteiert/PubDigest Background: In the modern era, the growth of scientific literature presents a daunting challenge for researchers to keep informed of advancements across multiple disciplines. Objective: We apply natural language processing (NLP) to design a software tool that combs PubMed literature, aiming to pinpoint potential drugs that could be repurposed. Methods: Using NLP, especially term associations through word embeddings, we explored unrecognized relationships between drugs and diseases. To illustrate the utility of this software, we focused on chronic thromboembolic pulmonary hypertension (CTEPH), a rare disease with limited numbers of scientific publications. Results: Our literature analysis identified key clinical features linked to CTEPH using term frequency-inverse document frequency (TF-IDF), a technique measuring a term's significance in a text corpus. This allowed us to map related diseases. One standout was venous thrombosis (VT), which showed strong textual links with CTEPH. Looking deeper, we discovered potential drugs for CTEPH by analyzing literature on both CTEPH and VT. An intriguing find were benzofuran-derivatives, particularly amiodarone, which displayed potential anti-thrombotic properties. In vitro tests confirmed its ability to reduce platelet aggregation significantly by 68% (p=0.018). However, real-world clinical data indicated that CTEPH patients on amiodarone faced a significant 15.9% higher mortality risk (p < 0.001). Conclusion: While NLP offers an innovative approach to interpret scientific literature, especially for drug repurposing, it's crucial to tread with caution. Our exploration with benzofuran-derivatives and CTEPH underscores this point. Thus, blending NLP with hands-on experiments and clinical trials can pave the way for safer drug repurposing, especially for rare diseases like CTEPH.
创建时间:
2024-08-02
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作