
Discovering health topics in social media using topic models

Source: Figshare. Updated: 2015-12-02. Indexed: 2026-04-08.
Download link:
https://figshare.com/articles/dataset/Discovering_health_topics_in_social_media_using_topic_models/1007712
Description:
Data set for M. Paul and M. Dredze, "Discovering health topics in social media using topic models". This includes the set of tweets used in the experiments, and the words associated with ailments discovered by the Ailment Topic Aspect Model (ATAM).

Contact: Michael Paul (mpaul39@gmail.com)
Released June 26, 2014

atam.topwords.csv
- The most probable words for each ailment. The first column is the ailment ID. The second column indicates if it is a general (G), symptom (S), or treatment (T) word. The third column is the word. The fourth column is the probability. Words are shown in descending order of probability until 90% of the probability mass is accumulated for each ailment or until probabilities drop below 1.0e-4.

atam.tweets.x.csv (for x=[0-9])
- The tweets used in the study. The first column is the tweet ID. The second column indicates the ailment ID for the ailment sampled for that tweet. (See the atam.topwords.csv file for the most probable words associated with each ailment ID.) Full tweets can be downloaded using the tweet ID through the Twitter API (https://dev.twitter.com/docs/api/1.1).

keywords.txt
- The set of 269 health-related keywords used in our keyword-filtered Twitter stream as part of our dataset.

keywords_x.txt (for x={diseases,symptoms,treatments})
- The set of approximately 20,000 keyphrases crawled from wrongdiagnosis.com describing the names of diseases, symptoms, and treatments and medications. These keyword lists are used to create input for ATAM (which requires phrases to be labeled as symptoms or treatments), and also to initially filter our dataset when constructing our health classifiers.
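
The column layouts described above could be read along the following lines. This is only a minimal sketch: it assumes the files are comma-delimited with no quoting surprises, and the helper names `load_topwords` and `load_tweet_labels` as well as every sample row are hypothetical, invented for illustration rather than taken from the data set.

```python
import csv
import io
from collections import defaultdict


def load_topwords(lines):
    """Group (kind, word, probability) tuples by ailment ID.

    Expects rows shaped like atam.topwords.csv as described above:
    ailment ID, word type (G, S, or T), word, probability.
    """
    ailments = defaultdict(list)
    for row in csv.reader(lines):
        if len(row) < 4:
            continue  # skip blank or malformed rows
        ailment_id, kind, word, prob = (field.strip() for field in row[:4])
        try:
            probability = float(prob)
        except ValueError:
            continue  # skips a header row, if the file happens to have one
        ailments[ailment_id].append((kind, word, probability))
    return dict(ailments)


def load_tweet_labels(lines):
    """Map tweet ID to sampled ailment ID (atam.tweets.x.csv layout).

    Assumes there is no header row; a header would end up in the mapping.
    """
    labels = {}
    for row in csv.reader(lines):
        if len(row) < 2:
            continue  # skip blank or malformed rows
        labels[row[0].strip()] = row[1].strip()
    return labels


# Invented sample rows standing in for the real files.
sample_topwords = io.StringIO(
    "1,G,sick,0.05\n"
    "1,S,cough,0.03\n"
    "1,T,medicine,0.02\n"
    "2,S,headache,0.04\n"
)
sample_tweets = io.StringIO("1001,1\n1002,2\n")

topwords = load_topwords(sample_topwords)
labels = load_tweet_labels(sample_tweets)

# Symptom words for the ailment sampled for the (made-up) tweet 1001.
symptom_words = [word for kind, word, _ in topwords[labels["1001"]] if kind == "S"]
print(symptom_words)
```

With the real files, an open file handle (for example `open("atam.topwords.csv", newline="")`) can be passed in place of the `io.StringIO` samples. IDs are kept as strings here because the description does not state their format.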
Created:
2014-04-25