Showing 1–1 of 1 results for author: Heppenstall, A

  1. arXiv:2203.01363 [pdf, other]

    cs.LG stat.AP

    Faking feature importance: A cautionary tale on the use of differentially-private synthetic data

    Authors: Oscar Giles, Kasra Hosseini, Grigorios Mingas, Oliver Strickson, Louise Bowler, Camila Rangel Smith, Harrison Wilde, Jen Ning Lim, Bilal Mateen, Kasun Amarasinghe, Rayid Ghani, Alison Heppenstall, Nik Lomax, Nick Malleson, Martin O'Reilly, Sebastian Vollmer

    Abstract: Synthetic datasets are often presented as a silver-bullet solution to the problem of privacy-preserving data publishing. However, for many applications, synthetic data has been shown to have limited utility when used to train predictive models. One promising potential application of these data is in the exploratory phase of the machine learning workflow, which involves understanding, engineering a…

    Submitted 2 March, 2022; originally announced March 2022.

    Comments: 27 pages, 8 figures