Showing 1–14 of 14 results for author: McKenzie, D

Searching in archive cs.
  1. arXiv:2307.05750  [pdf, other]

    stat.ML cs.DS cs.LG math.DG

    Fermat Distances: Metric Approximation, Spectral Convergence, and Clustering Algorithms

    Authors: Nicolás García Trillos, Anna Little, Daniel McKenzie, James M. Murphy

    Abstract: We analyze the convergence properties of Fermat distances, a family of density-driven metrics defined on Riemannian manifolds with an associated probability measure. Fermat distances may be defined either on discrete samples from the underlying measure, in which case they are random, or in the continuum setting, in which they are induced by geodesics under a density-distorted Riemannian metric. We…

    Submitted 7 July, 2023; originally announced July 2023.

  2. arXiv:2305.18779  [pdf, other]

    cs.LG math.AP math.OC stat.ML

    It begins with a boundary: A geometric view on probabilistically robust learning

    Authors: Leon Bungert, Nicolás García Trillos, Matt Jacobs, Daniel McKenzie, Đorđe Nikolić, Qingsong Wang

    Abstract: Although deep neural networks have achieved super-human performance on many classification tasks, they often exhibit a worrying lack of robustness towards adversarially generated examples. Thus, considerable effort has been invested into reformulating Empirical Risk Minimization (ERM) into an adversarially robust framework. Recently, attention has shifted towards approaches which interpolate betwe…

    Submitted 30 May, 2023; originally announced May 2023.

  3. arXiv:2301.13395  [pdf, other]

    cs.LG

    Learning to Solve Integer Linear Programs with Davis-Yin Splitting

    Authors: Daniel McKenzie, Samy Wu Fung, Howard Heaton

    Abstract: In many applications, a combinatorial problem must be repeatedly solved with similar, but distinct parameters. Yet, the parameters $w$ are not directly observed; only contextual data $d$ that correlates with $w$ is available. It is tempting to use a neural network to predict $w$ given $d$. However, training such a model requires reconciling the discrete nature of combinatorial optimization with th…

    Submitted 21 March, 2024; v1 submitted 30 January, 2023; originally announced January 2023.

  4. arXiv:2109.13391  [pdf, other]

    math.OC cs.LG

    Curvature-Aware Derivative-Free Optimization

    Authors: Bumsu Kim, HanQin Cai, Daniel McKenzie, Wotao Yin

    Abstract: The paper discusses derivative-free optimization (DFO), which involves minimizing a function without access to gradients or directional derivatives, only function evaluations. Classical DFO methods, which mimic gradient-based methods, such as Nelder-Mead and direct search have limited scalability for high-dimensional problems. Zeroth-order methods have been gaining popularity due to the demands of…

    Submitted 12 April, 2023; v1 submitted 27 September, 2021; originally announced September 2021.

    Comments: 31 pages, 9 figures

    MSC Class: 49M15; 65K05; 68Q25; 90C56

  5. arXiv:2106.00906  [pdf, other]

    cs.LG cs.GT math.OC

    Operator Splitting for Learning to Predict Equilibria in Convex Games

    Authors: Daniel McKenzie, Howard Heaton, Qiuwei Li, Samy Wu Fung, Stanley Osher, Wotao Yin

    Abstract: Systems of competing agents can often be modeled as games. Assuming rationality, the most likely outcomes are given by an equilibrium (e.g. a Nash equilibrium). In many practical settings, games are influenced by context, i.e. additional data beyond the control of any agent (e.g. weather for traffic and fiscal policy for market economies). Often the exact game mechanics are unknown, yet vast amoun…

    Submitted 11 June, 2024; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: To appear in SIMODS

  6. arXiv:2103.12803  [pdf, other]

    cs.LG

    JFB: Jacobian-Free Backpropagation for Implicit Networks

    Authors: Samy Wu Fung, Howard Heaton, Qiuwei Li, Daniel McKenzie, Stanley Osher, Wotao Yin

    Abstract: A promising trend in deep learning replaces traditional feedforward networks with implicit networks. Unlike traditional networks, implicit networks solve a fixed point equation to compute inferences. Solving for the fixed point varies in complexity, depending on provided data and an error tolerance. Importantly, implicit networks may be trained with fixed memory costs in stark contrast to feedforw…

    Submitted 24 December, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

  7. arXiv:2102.10707  [pdf, other]

    math.OC cs.AI cs.LG stat.ML

    A Zeroth-Order Block Coordinate Descent Algorithm for Huge-Scale Black-Box Optimization

    Authors: HanQin Cai, Yuchen Lou, Daniel McKenzie, Wotao Yin

    Abstract: We consider the zeroth-order optimization problem in the huge-scale setting, where the dimension of the problem is so large that performing even basic vector operations on the decision variables is infeasible. In this paper, we propose a novel algorithm, coined ZO-BCD, that exhibits favorable overall query complexity and has a much smaller per-iteration computational complexity. In addition, we di…

    Submitted 11 June, 2021; v1 submitted 21 February, 2021; originally announced February 2021.

    Comments: Accepted to ICML 2021

    Journal ref: Proceedings of the 38th International Conference on Machine Learning, PMLR 139:1193-1203, 2021

  8. arXiv:2012.09385  [pdf, other]

    stat.ML cs.DS cs.LG

    Balancing Geometry and Density: Path Distances on High-Dimensional Data

    Authors: Anna Little, Daniel McKenzie, James Murphy

    Abstract: New geometric and computational analyses of power-weighted shortest-path distances (PWSPDs) are presented. By illuminating the way these metrics balance density and geometry in the underlying data, we clarify their key parameters and discuss how they may be chosen in practice. Comparisons are made with related data-driven metrics, which illustrate the broader role of density in kernel-based unsupe…

    Submitted 7 June, 2021; v1 submitted 16 December, 2020; originally announced December 2020.

    MSC Class: 05C85; 05C80 ACM Class: I.5.3

  9. arXiv:2011.11804  [pdf, other]

    cs.LG cs.SI

    Who killed Lilly Kane? A case study in applying knowledge graphs to crime fiction

    Authors: Mariam Alaverdian, William Gilroy, Veronica Kirgios, Xia Li, Carolina Matuk, Daniel Mckenzie, Tachin Ruangkriengsin, Andrea Bertozzi, Jeffrey Brantingham

    Abstract: We present a preliminary study of a knowledge graph created from season one of the television show Veronica Mars, which follows the eponymous young private investigator as she attempts to solve the murder of her best friend Lilly Kane. We discuss various techniques for mining the knowledge graph for clues and potential suspects. We also discuss best practice for collaboratively constructing knowle…

    Submitted 23 November, 2020; originally announced November 2020.

  10. arXiv:2010.02479  [pdf, other]

    math.OC cs.AI cs.LG

    A One-bit, Comparison-Based Gradient Estimator

    Authors: HanQin Cai, Daniel Mckenzie, Wotao Yin, Zhenliang Zhang

    Abstract: We study zeroth-order optimization for convex functions where we further assume that function evaluations are unavailable. Instead, one only has access to a $\textit{comparison oracle}$, which given two points $x$ and $y$ returns a single bit of information indicating which point has larger function value, $f(x)$ or $f(y)$. By treating the gradient as an unknown signal to be recovered, we show how…

    Submitted 23 April, 2022; v1 submitted 6 October, 2020; originally announced October 2020.

    Journal ref: Applied and Computational Harmonic Analysis, 60 (2022): 242-266

  11. arXiv:2003.13001  [pdf, other]

    math.OC cs.LG

    Zeroth-Order Regularized Optimization (ZORO): Approximately Sparse Gradients and Adaptive Sampling

    Authors: HanQin Cai, Daniel Mckenzie, Wotao Yin, Zhenliang Zhang

    Abstract: We consider the problem of minimizing a high-dimensional objective function, which may include a regularization term, using (possibly noisy) evaluations of the function. Such optimization is also called derivative-free, zeroth-order, or black-box optimization. We propose a new $\textbf{Z}$eroth-$\textbf{O}$rder $\textbf{R}$egularized $\textbf{O}$ptimization method, dubbed ZORO. When the underlying…

    Submitted 30 November, 2021; v1 submitted 29 March, 2020; originally announced March 2020.

    Journal ref: SIAM Journal on Optimization 32, no. 2 (2022): 687-714

  12. arXiv:1905.13345  [pdf, other]

    cs.LG stat.ML

    Power Weighted Shortest Paths for Clustering Euclidean Data

    Authors: Daniel Mckenzie, Steven Damelin

    Abstract: We study the use of power weighted shortest path distance functions for clustering high dimensional Euclidean data, under the assumption that the data is drawn from a collection of disjoint low dimensional manifolds. We argue, theoretically and experimentally, that this leads to higher clustering accuracy. We also present a fast algorithm for computing these distances.

    Submitted 3 September, 2019; v1 submitted 30 May, 2019; originally announced May 2019.

    Comments: 24 pages. Final version. To appear in Foundations of Data Science

    MSC Class: 05C85; 05C80 ACM Class: I.5.3

  13. arXiv:1808.05780  [pdf, other]

    cs.IT cs.SI math.NA

    Compressive Sensing for cut improvement and local clustering

    Authors: Ming-Jun Lai, Daniel Mckenzie

    Abstract: We show how one can phrase the cut improvement problem for graphs as a sparse recovery problem, whence one can use algorithms originally developed for use in compressive sensing (such as SubspacePursuit or CoSaMP) to solve it. We show that this approach to cut improvement is fast, both in theory and practice and moreover enjoys statistical guarantees of success when applied to graphs drawn from pr…

    Submitted 25 February, 2020; v1 submitted 17 August, 2018; originally announced August 2018.

    Comments: 25 pages. Generalizes and improves upon the earlier versions arXiv:1808.05780 and arXiv:1708.09477. To appear in SIMODS

    MSC Class: 68Q25; 68R10; 68U05; 94A12

  14. arXiv:1708.09477  [pdf, other]

    cs.IT cs.LG stat.ML

    A Compressive Sensing Approach to Community Detection with Applications

    Authors: Ming-Jun Lai, Daniel Mckenzie

    Abstract: The community detection problem for graphs asks one to partition the n vertices V of a graph G into k communities, or clusters, such that there are many intracluster edges and few intercluster edges. Of course this is equivalent to finding a permutation matrix P such that, if A denotes the adjacency matrix of G, then PAP^T is approximately block diagonal. As there are k^n possible partitions of n…

    Submitted 20 August, 2018; v1 submitted 30 August, 2017; originally announced August 2017.

    Comments: 39 pages, 10 figures. Version 2, disabled 'showkeys' package. Note that there is an error in the proof of Lemma 5.1. A correct version of this lemma, as well as a greatly improved version of the central algorithm of this paper, is available at: arXiv:1808.05780