Showing 1–12 of 12 results for author: Dick, T

Searching in archive stat.
  1. arXiv:2302.03115

    cs.LG stat.ML

    Easy Learning from Label Proportions

    Authors: Robert Istvan Busa-Fekete, Heejin Choi, Travis Dick, Claudio Gentile, Andres Munoz Medina

    Abstract: We consider the problem of Learning from Label Proportions (LLP), a weakly supervised classification setup where instances are grouped into "bags", and only the frequency of class labels in each bag is available. The learner's objective, however, is to achieve low task loss at the level of individual instances. Here we propose Easyllp: a flexible and simple-to-implement debiasing approach based on a…

    Submitted 13 February, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

  2. arXiv:2210.11222

    cs.CR cs.AI cs.DS cs.LG stat.ML

    Learning-Augmented Private Algorithms for Multiple Quantile Release

    Authors: Mikhail Khodak, Kareem Amin, Travis Dick, Sergei Vassilvitskii

    Abstract: When applying differential privacy to sensitive data, we can often improve performance using external information such as other sensitive data, public data, or human priors. We propose to use the learning-augmented algorithms (or algorithms with predictions) framework -- previously applied largely to improve time complexity or competitive ratios -- as a powerful way of designing and analyzing priv…

    Submitted 8 May, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: To appear in ICML 2023

  3. arXiv:2208.03291

    stat.AP

    Comparing Unit Trains versus Manifest Trains for the Risk of Rail Transport of Hazardous Materials -- Part II: Application and Case Study

    Authors: Di Kang, Jiaxi Zhao, C. Tyler Dick, Xiang Liu, Zheyong Bian, Steven W. Kirkpatrick, Chen-Yu Lin

    Abstract: Building on the risk analysis methodology presented in the Part I paper, this Part II paper applies that methodology. Five illustrative scenarios were used to analyze the best or worst cases and compare the transportation risk differences between service options using unit trains and manifest trains. The comparison results indicate that if all tank cars are placed at the positions wit…

    Submitted 4 July, 2022; originally announced August 2022.

  4. arXiv:2207.02113

    stat.AP stat.ME

    Comparing Unit Trains versus Manifest Trains for the Risk of Rail Transport of Hazardous Materials -- Part I: Risk Analysis Methodology

    Authors: Di Kang, Jiaxi Zhao, C. Tyler Dick, Xiang Liu, Zheyong Bian, Steven W. Kirkpatrick, Chen-Yu Lin

    Abstract: Transporting hazardous materials (hazmats) using tank cars has more significant economic benefits than other transportation modes. Although railway transportation is roughly four times more fuel-efficient than roadway transportation, a train derailment has greater potential to cause more disastrous consequences than a truck incident. Train types, such as unit train or manifest train (also called m…

    Submitted 4 July, 2022; originally announced July 2022.

  5. arXiv:2012.10602

    cs.LG cs.CR stat.ML

    Scalable and Provably Accurate Algorithms for Differentially Private Distributed Decision Tree Learning

    Authors: Kaiwen Wang, Travis Dick, Maria-Florina Balcan

    Abstract: This paper introduces the first provably accurate algorithms for differentially private, top-down decision tree learning in the distributed setting (Balcan et al., 2012). We propose DP-TopDown, a general privacy preserving decision tree learning algorithm, and present two distributed implementations. Our first method NoisyCounts naturally extends the single machine algorithm by using the Laplace m…

    Submitted 22 February, 2021; v1 submitted 19 December, 2020; originally announced December 2020.

    Comments: In AAAI Workshop on Privacy-Preserving Artificial Intelligence, 2020

  6. arXiv:2006.07281

    cs.LG cs.CE cs.GT stat.ML

    Algorithms and Learning for Fair Portfolio Design

    Authors: Emily Diana, Travis Dick, Hadi Elzayn, Michael Kearns, Aaron Roth, Zachary Schutzman, Saeed Sharifi-Malvajerdi, Juba Ziani

    Abstract: We consider a variation on the classical finance problem of optimal portfolio design. In our setting, a large population of consumers is drawn from some distribution over risk tolerances, and each consumer must be assigned to a portfolio of lower risk than her tolerance. The consumers may also belong to underlying groups (for instance, of demographic properties or wealth), and the goal is to desig…

    Submitted 12 June, 2020; originally announced June 2020.

  7. arXiv:2002.03517

    cs.LG cs.CR stat.ML

    Random Smoothing Might be Unable to Certify $\ell_\infty$ Robustness for High-Dimensional Images

    Authors: Avrim Blum, Travis Dick, Naren Manoj, Hongyang Zhang

    Abstract: We show a hardness result for random smoothing to achieve certified adversarial robustness against attacks in the $\ell_p$ ball of radius $\epsilon$ when $p>2$. Although random smoothing has been well understood for the $\ell_2$ case using the Gaussian distribution, much remains unknown concerning the existence of a noise distribution that works for the case of $p>2$. This has been posed as an open probl…

    Submitted 5 March, 2020; v1 submitted 9 February, 2020; originally announced February 2020.

    Comments: 20 pages, 2 figures; Code is available at https://github.com/hongyanz/TRADES-smoothing

  8. arXiv:1908.02894

    cs.LG stat.ML

    How much data is sufficient to learn high-performing algorithms? Generalization guarantees for data-driven algorithm design

    Authors: Maria-Florina Balcan, Dan DeBlasio, Travis Dick, Carl Kingsford, Tuomas Sandholm, Ellen Vitercik

    Abstract: Algorithms often have tunable parameters that impact performance metrics such as runtime and solution quality. For many algorithms used in practice, no parameter settings admit meaningful worst-case bounds, so the parameters are made available for the user to tune. Alternatively, parameters may be tuned implicitly within the proof of a worst-case approximation ratio or runtime bound. Worst-case in…

    Submitted 25 April, 2021; v1 submitted 7 August, 2019; originally announced August 2019.

  9. arXiv:1907.09137

    cs.LG stat.ML

    Learning piecewise Lipschitz functions in changing environments

    Authors: Maria-Florina Balcan, Travis Dick, Dravyansh Sharma

    Abstract: Optimization in the presence of sharp (non-Lipschitz), unpredictable (w.r.t. time and amount) changes is a challenging and largely unexplored problem of great significance. We consider the class of piecewise Lipschitz functions, which is the most general online setting considered in the literature for the problem, and arises naturally in various combinatorial algorithm selection problems where uti…

    Submitted 6 August, 2020; v1 submitted 22 July, 2019; originally announced July 2019.

  10. arXiv:1904.09014

    cs.LG stat.ML

    Semi-bandit Optimization in the Dispersed Setting

    Authors: Maria-Florina Balcan, Travis Dick, Wesley Pegden

    Abstract: The goal of data-driven algorithm design is to obtain high-performing algorithms for specific application domains using machine learning and data. Across many fields in AI, science, and engineering, practitioners will often fix a family of parameterized algorithms and then optimize those parameters to obtain good performance on example instances from the application domain. In the online setting,…

    Submitted 21 December, 2020; v1 submitted 18 April, 2019; originally announced April 2019.

  11. arXiv:1809.08700

    cs.LG cs.GT stat.ML

    Envy-Free Classification

    Authors: Maria-Florina Balcan, Travis Dick, Ritesh Noothigattu, Ariel D. Procaccia

    Abstract: In classic fair division problems such as cake cutting and rent division, envy-freeness requires that each individual (weakly) prefer his allocation to anyone else's. On a conceptual level, we argue that envy-freeness also provides a compelling notion of fairness for classification tasks. Our technical focus is the generalizability of envy-free classification, i.e., understanding whether a classif…

    Submitted 24 September, 2020; v1 submitted 23 September, 2018; originally announced September 2018.

    Journal ref: Advances in Neural Information Processing Systems, 2019, pp. 1240-1250

  12. arXiv:1512.04848

    cs.LG cs.DS stat.ML

    Data Driven Resource Allocation for Distributed Learning

    Authors: Travis Dick, Mu Li, Venkata Krishna Pillutla, Colin White, Maria-Florina Balcan, Alex Smola

    Abstract: In distributed machine learning, data is dispatched to multiple machines for processing. Motivated by the fact that similar data points often belong to the same or similar classes, and more generally, classification rules of high accuracy tend to be "locally simple but globally complex" (Vapnik & Bottou 1993), we propose data dependent dispatching that takes advantage of such structure. We present…

    Submitted 15 December, 2016; v1 submitted 15 December, 2015; originally announced December 2015.