TY - JOUR
T1 - Including household effects in Big Data research
T2 - the experience of building a longitudinal residence algorithm using linked administrative data in Wales
AU - Tingay, Karen Susan
AU - Roberts, Matthew
AU - Musselwhite, Charles
N1 - Funding Information:
This project was supported by Farr@CIPHER and the Administrative Data Research Centre Wales, and makes use of the SAIL Databank.
Publisher Copyright:
© The Authors. Open Access under CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/deed.en)
PY - 2018/11/20
Y1 - 2018/11/20
N2 - The effect of the wider social-environment on physical and emotional health has long been an area of study. Extrapolating the impact of the individual's immediate environment, such as living with a smoker or caring for a chronically-ill child, would potentially reduce confounding effects in health-related research. Surveys, including the UK Census, are beginning to collect data on household composition. However, these surveys are expensive, time consuming, and, as such, are only completed by a subsection of the population. Large-scale, linked databanks, such as the SAIL Databank at Swansea University, which hold routinely collected secondary use clinical and administrative datasets, are broader in scope, both in terms of the nature of the data held, and the population. The SAIL databank includes demographic data and a geographic indicator that makes it possible to identify groups of people that share accommodation, and in some cases the familial relationships among them. This paper describes a method for creating households, including considerations for how that information can be securely shared for research purposes. This approach has broad implications in Wales and beyond, opening up possibilities for more detailed population-level research that includes consideration of residential social interactions.
AB - The effect of the wider social-environment on physical and emotional health has long been an area of study. Extrapolating the impact of the individual's immediate environment, such as living with a smoker or caring for a chronically-ill child, would potentially reduce confounding effects in health-related research. Surveys, including the UK Census, are beginning to collect data on household composition. However, these surveys are expensive, time consuming, and, as such, are only completed by a subsection of the population. Large-scale, linked databanks, such as the SAIL Databank at Swansea University, which hold routinely collected secondary use clinical and administrative datasets, are broader in scope, both in terms of the nature of the data held, and the population. The SAIL databank includes demographic data and a geographic indicator that makes it possible to identify groups of people that share accommodation, and in some cases the familial relationships among them. This paper describes a method for creating households, including considerations for how that information can be securely shared for research purposes. This approach has broad implications in Wales and beyond, opening up possibilities for more detailed population-level research that includes consideration of residential social interactions.
KW - Big data
KW - housing
KW - Health
UR - http://www.scopus.com/inward/record.url?scp=85086478221&partnerID=8YFLogxK
U2 - 10.23889/ijpds.v3i1.452
DO - 10.23889/ijpds.v3i1.452
M3 - Article
C2 - 32935012
SN - 2399-4908
VL - 31
JO - International Journal of Population Data Science
JF - International Journal of Population Data Science
IS - 1
M1 - A24
ER -