Utility Measures for Synthetic Data
Recent News
Recent Outputs
Upcoming Events
Sorry, there are currently no upcoming Events.
Raab, G. (2016) Data Linkage and Anonymisation, Isaac Newton Institute for Mathematical Sciences, Cambridge, 3 November 2016 [SLS][ONS LS][NILS][CALLS]
Other information:
SLS project pageONS LS project pageNILS project page
Abstract:
When synthetic data are produced to overcome potential disclosure they can be used either in place of the original data or, more commonly, to allow researchers to develop code that will ultimately be run on the original data. The utility of synthetic data can be measured by comparing the results of the final analysis with the synthetic and original data. This is not possible until the final analysis is complete. General utility measures that measure the overall differences between the original and synthetic data are more useful for those creating synthetic data. This presentation will discuss two such measures. The first is a propensity score measure originally proposed by Woo et. al., 2009 and the second is one based on comparing tables, suggested by Voas and Williamson, 2001. Their null distributions, when the synthesis model is "correct" will be discussed as well as their practical implementation as part of the synthpop package.
Available online: Link
Output from project: 2013_012 (SLS), 30158 (ONS LS), 079 (NILS)
© 2026 CALLS Hub - Mtc - SMA Login Contact - Output Login
| Cookie | Duration | Description |
|---|---|---|
| cookielawinfo-checkbox-analytics | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics". |
| cookielawinfo-checkbox-functional | 11 months | The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional". |
| cookielawinfo-checkbox-necessary | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary". |
| cookielawinfo-checkbox-others | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other. |
| cookielawinfo-checkbox-performance | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance". |
| viewed_cookie_policy | 11 months | The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data. |
| Cookie | Duration | Description |
|---|---|---|
| __utma | 2 years | Used to distinguish users and sessions. The cookie is created when the javascript library executes and no existing __utma cookies exists. The cookie is updated every time data is sent to Google Analytics. |
| __utmb | 30 minutes | Used to determine new sessions/visits. The cookie is created when the javascript library executes and no existing __utmb cookies exists. The cookie is updated every time data is sent to Google Analytics. |
| __utmc | Not used in ga.js. Set for interoperability with urchin.js. Historically, this cookie operated in conjunction with the __utmb cookie to determine whether the user was in a new session/visit. | |
| __utmt | 10 minutes | Used to throttle request rate. |
| __utmz | 6 months | Stores the traffic source or campaign that explains how the user reached your site. The cookie is created when the javascript library executes and is updated every time data is sent to Google Analytics. |
| _ga | 2 years | Used to distinguish users. |
| _gat | 1 minute | Used to throttle request rate. |
| _gid | 24 hours | Used to distinguish users. |