Practical data synthesis for large samples

Raab, G.M., Nowok, B. & Dibben, C. (2016) Journal of Privacy and Confidentiality 7(3), 67-97. [SLS][ONS LS][NILS][CALLS]

Other information:
Abstract:

We describe results on the creation and use of synthetic data that were derived in the context of a project to make synthetic extracts available for users of the UK Longitudinal Studies. A critical review of existing methods of inference from large synthetic data sets is presented. We introduce new variance estimates for use with large samples of completely synthesised data that do not require them to be generated from the posterior predictive distribution derived from the observed data and can be used with a single synthetic data set. We make recommendations on how to synthesise data based on these findings. An example of synthesising data from the Scottish Longitudinal Study is included to illustrate our results.


SLS project page
ONS LS project page
NILS project page

Available online: Journal of Privacy and Confidentiality
Output from project: 2013_012 (SLS), 30158 (ONS LS), 079 (NILS)

QUICK DATA DICTIONARY SEARCH

Recent News

Upcoming Events

Sorry, there are currently no upcoming Events.

Latest Tweets