A simplified approach to generating synthetic data for disclosure control
Recent News
Upcoming Events
Sorry, there are currently no upcoming Events.
Raab, G., Nowok, B. & Dibben, C. (2014) arXiv.org (arXiv:1409.0217v2), [SLS]
Other information:
Abstract:
We describe results on the creation and use of synthetic data that were derived in the context of a project to make synthetic extracts available for users of the UK Longitudinal Studies. Contrary to the existing literature we show that there are circumstances when inferences can be made from fully synthetic data generated from fitted parameters without sampling from their posterior distributions (simple synthesis). The condition that allows this, which we describe as "common-sampling", is that the original sample and the synthetic data can be considered as sampled in the same way from their respective populations. New variance estimators for the analysis of synthetic data are derived when the common-sampling condition is met. It is shown that simple synthesis, with these estimators, provide better estimates than the methods suggested in the literature for fully synthetic data. The results are confirmed by simulations and are illustrated with an example from the Scottish Longitudinal Study.
Available online: arXiv.org
Download output document: Full paper (PDF 288KB)
Output from project: 2013_012
Cookie | Duration | Description |
---|---|---|
cookielawinfo-checkbox-analytics | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics". |
cookielawinfo-checkbox-functional | 11 months | The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional". |
cookielawinfo-checkbox-necessary | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary". |
cookielawinfo-checkbox-others | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other. |
cookielawinfo-checkbox-performance | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance". |
viewed_cookie_policy | 11 months | The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data. |
Cookie | Duration | Description |
---|---|---|
__utma | 2 years | Used to distinguish users and sessions. The cookie is created when the javascript library executes and no existing __utma cookies exists. The cookie is updated every time data is sent to Google Analytics. |
__utmb | 30 minutes | Used to determine new sessions/visits. The cookie is created when the javascript library executes and no existing __utmb cookies exists. The cookie is updated every time data is sent to Google Analytics. |
__utmc | Not used in ga.js. Set for interoperability with urchin.js. Historically, this cookie operated in conjunction with the __utmb cookie to determine whether the user was in a new session/visit. | |
__utmt | 10 minutes | Used to throttle request rate. |
__utmz | 6 months | Stores the traffic source or campaign that explains how the user reached your site. The cookie is created when the javascript library executes and is updated every time data is sent to Google Analytics. |
_ga | 2 years | Used to distinguish users. |
_gat | 1 minute | Used to throttle request rate. |
_gid | 24 hours | Used to distinguish users. |