five

NCDs Listener: A Social Listening Tool for Non-Communicable Diseases

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://doi.org/10.7910/DVN/HRSM09
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset accompanies the NCDs Listener project, an open-source social listening tool designed to collect, process, and analyze public social media data related to non-communicable diseases (NCDs) The NCDs Listener leverages Natural Language Processing (NLP) techniques, the BERT language model, and Generative AI to extract insights from Thai and English content. Data is collected from publicly available posts using web scraping methods and API integrations, then processed through tokenization, stopword removal, normalization, lemmatization, keyword matching, and classification into thematic categories such as personal experiences, questions, and non-informative content. This dataset includes: 1) Raw and processed Facebook and Reddit comments 2) Model Development Data 3) Data from questionnaire responses of users The dataset is intended for research on public health communication, patient experience, and NCD awareness in online communities. It supports studies in medical sociology, health policy, and computational social science.
创建时间:
2025-08-14
二维码
社区交流群
二维码
科研交流群
商业服务