Skip to main content

Showing 1–5 of 5 results for author: Koga, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.11526  [pdf, other

    cs.CR

    Measuring Privacy Loss in Distributed Spatio-Temporal Data

    Authors: Tatsuki Koga, Casey Meehan, Kamalika Chaudhuri

    Abstract: Statistics about traffic flow and people's movement gathered from multiple geographical locations in a distributed manner are the driving force powering many applications, such as traffic prediction, demand prediction, and restaurant occupancy reports. However, these statistics are often based on sensitive location data of people, and hence privacy has to be preserved while releasing them. The sta… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: Chrome PDF viewer might not display Figures 3 and 4 properly

  2. arXiv:2310.06237  [pdf, other

    cs.LG cs.CR

    Differentially Private Multi-Site Treatment Effect Estimation

    Authors: Tatsuki Koga, Kamalika Chaudhuri, David Page

    Abstract: Patient privacy is a major barrier to healthcare AI. For confidentiality reasons, most patient data remains in silo in separate hospitals, preventing the design of data-driven healthcare AI systems that need large volumes of patient data to make effective decisions. A solution to this is collective learning across multiple sites through federated learning with differential privacy. However, litera… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 16 pages

  3. arXiv:2307.07477  [pdf, other

    cs.LG cs.CL cs.CR

    Population Expansion for Training Language Models with Private Federated Learning

    Authors: Tatsuki Koga, Congzheng Song, Martin Pelikan, Mona Chitnis

    Abstract: Federated learning (FL) combined with differential privacy (DP) offers machine learning (ML) training with distributed devices and with a formal privacy guarantee. With a large population of devices, FL with DP produces a performant model in a timely manner. However, for applications with a smaller population, not only does the model utility degrade as the DP noise is inversely proportional to pop… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

  4. arXiv:2201.04762  [pdf, ps, other

    cs.CR cs.AI cs.LG

    Privacy Amplification by Subsampling in Time Domain

    Authors: Tatsuki Koga, Casey Meehan, Kamalika Chaudhuri

    Abstract: Aggregate time-series data like traffic flow and site occupancy repeatedly sample statistics from a population across time. Such data can be profoundly useful for understanding trends within a given population, but also pose a significant privacy risk, potentially revealing e.g., who spends time where. Producing a private version of a time-series satisfying the standard definition of Differential… ▽ More

    Submitted 12 January, 2022; originally announced January 2022.

  5. arXiv:1812.01690  [pdf, other

    cs.CV cs.LG stat.ML

    General-to-Detailed GAN for Infrequent Class Medical Images

    Authors: Tatsuki Koga, Naoki Nonaka, Jun Sakuma, Jun Seita

    Abstract: Deep learning has significant potential for medical imaging. However, since the incident rate of each disease varies widely, the frequency of classes in a medical image dataset is imbalanced, leading to poor accuracy for such infrequent classes. One possible solution is data augmentation of infrequent classes using synthesized images created by Generative Adversarial Networks (GANs), but conventio… ▽ More

    Submitted 28 November, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/64