OAC2021-2 Ancillary data and link to Github
收藏DataCite Commons2025-12-08 更新2026-05-07 收录
下载链接:
https://rdr.ucl.ac.uk/articles/dataset/OAC2021-2_Ancillary_data_and_link_to_Github/28485338/1
下载链接
链接失效反馈官方服务:
资源简介:
Creation of the <b>2021/2 Output Area Classification</b><b> </b><b>[External Github Repo LINK]</b><b>Scripts</b><code>Metadata.R</code> - creating reference tables and glossaries for Census data.<code>Downloading_data.R</code> – downloading 2011 and 2021 Census data from NOMIS using <code>nomisr</code> package.<code>Comparing_Censuses.R</code> - data cleaning and amalgamation.<code>Transforming_Census_data.R</code> – data manipulation and transformation for the classification.<code>NI.R</code> – modelling 2021 data for Northern Ireland.<code>Correlation.R</code> - testing correlation between the variables.<code>Pre-clustering.R</code> - preparing data for clustering.<code>Clustering.R</code> - clustering of the data.<code>Post-clustering.R</code> - creating maps and plots of the cluster solution.<code>Testing_clustering.R</code><code>Clustergrams.ipynb</code> – creating Clustergrams in Python. (Credits: Prof Alex Singleton)<code>Industry.R</code> - loading Industry data<code>Industry_classification.R</code> - creating geodemographic classification with Industry variables<code>Graph_comparisons.R</code> - comparing data with graphs<b>Data</b>List of folders (subfolders & files) in the project:<b>API</b> - Census data downloaded and saved with use of <code>nomisr</code> package.<b>Clean</b> – amalgamated data ready for the analysis.Raw_counts - datasets with raw countsPercentages - datasets transformed into percentagesTransformed - datasets transformed with IHS (analysis-ready)Final_variables - datasets with OAC variables onlyAll_data_clustering - results of the clustering for all investigated datasets.<b>Clustering</b> - datasets with cluster assignment for the UK and centroids.<b>Lookups</b> - reference tables for 2011 and 2021 Census variables.<b>NISRA 2021</b> - 2021 Census data at LGD level for Northern Ireland.<b>Objects</b> - R objects created and stored to ensure consistency of the results or load big files<b>SIR</b> – contingency tables on disability counts by age, utilised for calculation of Standardised Illness Ratio.<b>shapefiles</b> - folder containing shapefiles used for some of the calculations.<b>Plots</b>Bar_plots - Comparison of clusters to the UK (as well as Supergroup and Group averages)Clustergrams – plots used to establish number of clusters at each classification level.<b>Maps</b><br>
提供机构:
University College London
创建时间:
2025-12-08



