The ‘art’ of cohort and study construction in administrative datasets: examples from Scotland
Recent News
Recent Outputs
Upcoming Events
Sorry, there are currently no upcoming Events.
Williamson, L. (2017) UK Administrative Data Research Network Annual Research Conference, Royal College of Surgeons, Edinburgh, UK, 1 - 2 June 2017 [SLS]
Other information:
2011_002
2013_005
2013_008
Abstract:
Using specific research case studies I will give an overview as to how as researchers we can have a great research idea, grounded in the relevant literature, but there are problems translating it into a robust research design. Assuming that the area/question cannot be reliably researched using small but rich sample surveys I will present ways in which routine admin data can help, along with the additional challenges of creating the correct cohort to address the research question.
The examples are from the Scottish Longitudinal Study (SLS) which links together routinely collected administrative data for a 5.3% representative sample of the Scottish population (about 270,000 people). It includes a wealth of information from the censuses (1991-2011), vital events registrations (ie births and deaths), and education data from 2007 onwards. The SLS with appropriate permissions can also be linked to health data such as cancer registry and hospital admission data from the NHS in Scotland. The size and scope of the SLS make it an unparalleled resource for analysing a range of socio-economic, demographic and health questions.
I will demonstrate how despite the large number of study members owing to the constraints on various admin data being available centrally for Scotland in systems (ie health data and education data) cohorts have to be carefully considered in order to research outcomes (events/results). Examples include: (1) life-course events for a cohort of SLS women born 1959-1965 followed up from 1991, (2) setting up 2 complex cohorts of SLS members and children of the SLS (COTS) born from 1991 onwards to investigate child development including social status information from family background, and (3) constructing relevant cohort samples to investigate those not in employment, education or training (NEET).
Available online: Link
Download output document: Video (link to YouTube)
Output from project: 2011_002, 2013_005, 2013_008
© 2026 CALLS Hub - Mtc - SMA Login Contact - Output Login
| Cookie | Duration | Description |
|---|---|---|
| cookielawinfo-checkbox-analytics | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics". |
| cookielawinfo-checkbox-functional | 11 months | The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional". |
| cookielawinfo-checkbox-necessary | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary". |
| cookielawinfo-checkbox-others | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other. |
| cookielawinfo-checkbox-performance | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance". |
| viewed_cookie_policy | 11 months | The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data. |
| Cookie | Duration | Description |
|---|---|---|
| __utma | 2 years | Used to distinguish users and sessions. The cookie is created when the javascript library executes and no existing __utma cookies exists. The cookie is updated every time data is sent to Google Analytics. |
| __utmb | 30 minutes | Used to determine new sessions/visits. The cookie is created when the javascript library executes and no existing __utmb cookies exists. The cookie is updated every time data is sent to Google Analytics. |
| __utmc | Not used in ga.js. Set for interoperability with urchin.js. Historically, this cookie operated in conjunction with the __utmb cookie to determine whether the user was in a new session/visit. | |
| __utmt | 10 minutes | Used to throttle request rate. |
| __utmz | 6 months | Stores the traffic source or campaign that explains how the user reached your site. The cookie is created when the javascript library executes and is updated every time data is sent to Google Analytics. |
| _ga | 2 years | Used to distinguish users. |
| _gat | 1 minute | Used to throttle request rate. |
| _gid | 24 hours | Used to distinguish users. |