Showing 1–4 of 4 results for author: Ngo, D D

  1. arXiv:2405.19667  [pdf, other]

    cs.LG cs.AI

    Reconciling Model Multiplicity for Downstream Decision Making

    Authors: Ally Yalei Du, Dung Daniel Ngo, Zhiwei Steven Wu

    Abstract: We consider the problem of model multiplicity in downstream decision-making, a setting where two predictive models of equivalent accuracy cannot agree on the best-response action for a downstream loss function. We show that even when the two predictive models approximately agree on their individual predictions almost everywhere, it is still possible for their induced best-response actions to diffe…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 16 pages main body, 6 figures

  2. arXiv:2302.08533  [pdf, other]

    cs.LG cs.DC

    Federated Learning as a Network Effects Game

    Authors: Shengyuan Hu, Dung Daniel Ngo, Shuran Zheng, Virginia Smith, Zhiwei Steven Wu

    Abstract: Federated Learning (FL) aims to foster collaboration among a population of clients to improve the accuracy of machine learning without directly sharing local data. Although there has been rich literature on designing federated learning algorithms, most prior works implicitly assume that all clients are willing to participate in a FL scheme. In practice, clients may not benefit from joining in FL,…

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: 14 pages of main text, 26 pages in total

  3. arXiv:2206.00494  [pdf, ps, other]

    cs.LG

    Incentivizing Combinatorial Bandit Exploration

    Authors: Xinyan Hu, Dung Daniel Ngo, Aleksandrs Slivkins, Zhiwei Steven Wu

    Abstract: Consider a bandit algorithm that recommends actions to self-interested users in a recommendation system. The users are free to choose other actions and need to be incentivized to follow the algorithm's recommendations. While the users prefer to exploit, the algorithm can incentivize them to explore by leveraging the information collected from the previous users. All published work on this problem,…

    Submitted 1 June, 2022; originally announced June 2022.

    Comments: 9 pages of main text, 21 pages in total

  4. arXiv:2202.01292  [pdf, other]

    cs.LG

    Improved Regret for Differentially Private Exploration in Linear MDP

    Authors: Dung Daniel Ngo, Giuseppe Vietri, Zhiwei Steven Wu

    Abstract: We study privacy-preserving exploration in sequential decision-making for environments that rely on sensitive data such as medical records. In particular, we focus on solving the problem of reinforcement learning (RL) subject to the constraint of (joint) differential privacy in the linear MDP setting, where both dynamics and rewards are given by linear functions. Prior work on this problem due to…

    Submitted 22 June, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

    Comments: 13 pages of main text, 30 pages in total; typo corrected, references added