
Showing 1–11 of 11 results for author: Abowd, J M

Searching in archive stat.
  1. arXiv:2312.14191

    cs.CR econ.EM stat.AP

    Noisy Measurements Are Important, the Design of Census Products Is Much More Important

    Authors: John M. Abowd

    Abstract: McCartan et al. (2023) call for "making differential privacy work for census data users." This commentary explains why the 2020 Census Noisy Measurement Files (NMFs) are not the best focus for that plea. The August 2021 letter from 62 prominent researchers asking for production of the direct output of the differential privacy system deployed for the 2020 Census signaled the engagement of the schol…

    Submitted 1 May, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Journal ref: Harvard Data Science Review, Volume 6, Number 2 (Spring, 2024)

  2. arXiv:2312.11283

    stat.AP cs.CR econ.EM

    The 2010 Census Confidentiality Protections Failed, Here's How and Why

    Authors: John M. Abowd, Tamara Adams, Robert Ashmead, David Darais, Sourya Dey, Simson L. Garfinkel, Nathan Goldschlag, Daniel Kifer, Philip Leclerc, Ethan Lew, Scott Moore, Rolando A. Rodríguez, Ramy N. Tadros, Lars Vilhuber

    Abstract: Using only 34 published tables, we reconstruct five variables (census block, sex, age, race, and ethnicity) in the confidential 2010 Census person records. Using the 38-bin age variable tabulated at the census block level, at most 20.1% of reconstructed records can differ from their confidential source on even a single value for these five variables. Using only published data, an attacker can veri…

    Submitted 18 December, 2023; originally announced December 2023.

  3. arXiv:2312.10863

    cs.CR stat.CO

    Disclosure Avoidance for the 2020 Census Demographic and Housing Characteristics File

    Authors: Ryan Cumings-Menon, Robert Ashmead, Daniel Kifer, Philip Leclerc, Matthew Spence, Pavel Zhuravlev, John M. Abowd

    Abstract: In "The 2020 Census Disclosure Avoidance System TopDown Algorithm," Abowd et al. (2022) describe the concepts and methods used by the Disclosure Avoidance System (DAS) to produce formally private output in support of the 2020 Census data product releases, with a particular focus on the DAS implementation that was used to create the 2020 Census Redistricting Data (P.L. 94-171) Summary File. In this…

    Submitted 17 December, 2023; originally announced December 2023.

  4. arXiv:2310.09398

    cs.CR econ.EM stat.ME

    An In-Depth Examination of Requirements for Disclosure Risk Assessment

    Authors: Ron S. Jarmin, John M. Abowd, Robert Ashmead, Ryan Cumings-Menon, Nathan Goldschlag, Michael B. Hawes, Sallie Ann Keller, Daniel Kifer, Philip Leclerc, Jerome P. Reiter, Rolando A. Rodríguez, Ian Schmutte, Victoria A. Velkoff, Pavel Zhuravlev

    Abstract: The use of formal privacy to protect the confidentiality of responses in the 2020 Decennial Census of Population and Housing has triggered renewed interest and debate over how to measure the disclosure risks and societal benefits of the published data products. Following long-established precedent in economics and statistics, we argue that any proposal for quantifying disclosure risk should be bas…

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 47 pages, 1 table

    Journal ref: PNAS, October 13, 2023, Vol. 120, No. 43

  5. arXiv:2303.00845

    stat.AP cs.CR econ.EM

    $21^{st}$ Century Statistical Disclosure Limitation: Motivations and Challenges

    Authors: John M. Abowd, Michael B. Hawes

    Abstract: This chapter examines the motivations and imperatives for modernizing how statistical agencies approach statistical disclosure limitation for official data product releases. It discusses the implications for agencies' broader data governance and decision-making, and it identifies challenges that agencies will likely face along the way. In conclusion, the chapter proposes some principles and best p…

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: Forthcoming CRC Handbook of Formally Private and Synthetic Data Approaches for Statistical Disclosure Control

  6. arXiv:2209.03310

    cs.CR stat.ME

    Bayesian and Frequentist Semantics for Common Variations of Differential Privacy: Applications to the 2020 Census

    Authors: Daniel Kifer, John M. Abowd, Robert Ashmead, Ryan Cumings-Menon, Philip Leclerc, Ashwin Machanavajjhala, William Sexton, Pavel Zhuravlev

    Abstract: The purpose of this paper is to guide interpretation of the semantic privacy guarantees for some of the major variations of differential privacy, which include pure, approximate, Rényi, zero-concentrated, and $f$ differential privacy. We interpret privacy-loss accounting parameters, frequentist semantics, and Bayesian semantics (including new results). The driving application is the interpretation…

    Submitted 7 September, 2022; originally announced September 2022.

  7. Confidentiality Protection in the 2020 US Census of Population and Housing

    Authors: John M. Abowd, Michael B. Hawes

    Abstract: In an era where external data and computational capabilities far exceed statistical agencies' own resources and capabilities, they face the renewed challenge of protecting the confidentiality of underlying microdata when publishing statistics in very granular form and ensuring that these granular data are used for statistical purposes only. Conventional statistical disclosure limitation methods ar…

    Submitted 27 December, 2022; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: Version 2 corrects a few transcription errors in Tables 2, 3, and 5. Version 3 adds final journal copy edits to the preprint.

    Journal ref: Annual Review of Statistics and Its Application 2023 10:1

  8. arXiv:2204.08986

    cs.CR econ.EM stat.AP

    The 2020 Census Disclosure Avoidance System TopDown Algorithm

    Authors: John M. Abowd, Robert Ashmead, Ryan Cumings-Menon, Simson Garfinkel, Micah Heineck, Christine Heiss, Robert Johns, Daniel Kifer, Philip Leclerc, Ashwin Machanavajjhala, Brett Moran, William Sexton, Matthew Spence, Pavel Zhuravlev

    Abstract: The Census TopDown Algorithm (TDA) is a disclosure avoidance system using differential privacy for privacy-loss accounting. The algorithm ingests the final, edited version of the 2020 Census data and the final tabulation geographic definitions. The algorithm then creates noisy versions of key queries on the data, referred to as measurements, using zero-Concentrated Differential Privacy. Another ke…

    Submitted 19 April, 2022; originally announced April 2022.

  9. arXiv:2112.05822

    econ.GN stat.AP

    U.S. Long-Term Earnings Outcomes by Sex, Race, Ethnicity, and Place of Birth

    Authors: Kevin L. McKinney, John M. Abowd, Hubert P. Janicki

    Abstract: This paper is part of the Global Income Dynamics Project cross-country comparison of earnings inequality, volatility, and mobility. Using data from the U.S. Census Bureau's Longitudinal Employer-Household Dynamics (LEHD) infrastructure files we produce a uniform set of earnings statistics for the U.S. From 1998 to 2019, we find U.S. earnings inequality has increased and volatility has decreased. T…

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: 77 pages, 42 figures

  10. arXiv:2008.00253

    econ.GN stat.AP

    Male Earnings Volatility in LEHD before, during, and after the Great Recession

    Authors: Kevin L. McKinney, John M. Abowd

    Abstract: This paper is part of a coordinated collection of papers on prime-age male earnings volatility. Each paper produces a similar set of statistics for the same reference population using a different primary data source. Our primary data source is the Census Bureau's Longitudinal Employer-Household Dynamics (LEHD) infrastructure files. Using LEHD data from 1998 to 2016, we create a well-defined popula…

    Submitted 1 February, 2022; v1 submitted 1 August, 2020; originally announced August 2020.

    Comments: Revision submitted to JBES with figures included in the text and Appendix added

  11. arXiv:2007.13275

    econ.EM stat.ME

    Total Error and Variability Measures for the Quarterly Workforce Indicators and LEHD Origin-Destination Employment Statistics in OnTheMap

    Authors: Kevin L. McKinney, Andrew S. Green, Lars Vilhuber, John M. Abowd

    Abstract: We report results from the first comprehensive total quality evaluation of five major indicators in the U.S. Census Bureau's Longitudinal Employer-Household Dynamics (LEHD) Program Quarterly Workforce Indicators (QWI): total flow-employment, beginning-of-quarter employment, full-quarter employment, average monthly earnings of full-quarter employees, and total quarterly payroll. Beginning-of-quarte…

    Submitted 26 July, 2020; originally announced July 2020.