five

TITAN example code and explanation.

收藏
Figshare2020-08-18 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/TITAN_example_code_and_explanation_/12823113
下载链接
链接失效反馈
官方服务:
资源简介:
Lines 7–13 prepare a sample dataset of twenty potential symptoms and fifty individual respondents to mimic a subset of the data used in this study. For each respondent, 1s and 0s were used randomly for each symptom. A 1 means they did have that symptom, 0 means they did not. Now we have a dataset of fifty respondents and what symptoms they did or did not have. Line 16 creates a randomized list of exposure, one for each of the fifty respondents. In our study, each respondent had a measure of cumulative well density (CWD), an inverse distance weighting (IDW) score, and a measure of estimated annual emissions concentration (AEC). Line 16 creates an exposure variable that ranges from 0 to 50 (no units), with 0 being no exposure and 50 being representative of high exposure, though in our sample there was no limit to how high an exposure measure could go. Line 19 uses titan() to run the TITAN analysis, taking the reported symptoms and exposure values to determine if certain symptoms occur more or less at different levels of exposure. For example, when the exposure measure reaches 12, the model is looking for any symptoms that stand out as occurring more frequently at that exposure level. Indicator values (range 0–100) are used to score each symptom’s relationship to that exposure level, or gradient. A high indicator value shows a strong relationship with the gradient at a certain level. Then, the model determines if that relationship is positive or inverse. In ecological studies, one might study how changes in dissolved oxygen (DO) in a pond ecosystem cause certain species to die off or thrive as levels of DO change. When we begin to see a certain species appear in the pond, we can hypothesize that there may also be a change in DO as well since that species is an indicator of a certain threshold, or level of DO. Lines 22–29 takes information from the TITAN analysis and creates a table. For this table, the rows each represent the different symptoms, while columns are information pertaining to Indicator Value, the frequency of the symptom, p-values, whether the symptom is positively or inversely associated with the gradient, and the z-score. Using these parameters, we begin to filter out symptoms that were infrequent (line 25) and can also filter out insignificant symptoms or symptoms with low z-scores (lines 40–41). The latter two were done in our study but did not make sense for this sample data. Lines 34–36 construct the final plot we used to visualize the results of the TITAN analysis. In the plot, there are ten symptoms positively associated with the gradient with indicator values ranging from 32 to 71. The same goes for the inversely associated symptoms. For the plots in our study, we added additional characteristics like colors to group symptoms into categories and using the width of each bar to represent the frequency of symptoms being reported. (R)
创建时间:
2020-08-18
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作