Skip to main content

Showing 1–50 of 142 results for author: Wright, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.17739  [pdf, other

    cs.AI cs.HC

    The Widening Gap: The Benefits and Harms of Generative AI for Novice Programmers

    Authors: James Prather, Brent Reeves, Juho Leinonen, Stephen MacNeil, Arisoa S. Randrianasolo, Brett Becker, Bailey Kimmel, Jared Wright, Ben Briggs

    Abstract: Novice programmers often struggle through programming problem solving due to a lack of metacognitive awareness and strategies. Previous research has shown that novices can encounter multiple metacognitive difficulties while programming. Novices are typically unaware of how these difficulties are hindering their progress. Meanwhile, many novices are now programming with generative AI (GenAI), which… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted to ICER 2024

  2. arXiv:2404.15231  [pdf, other

    physics.optics cs.AI

    Direct Zernike Coefficient Prediction from Point Spread Functions and Extended Images using Deep Learning

    Authors: Yong En Kok, Alexander Bentley, Andrew Parkes, Amanda J. Wright, Michael G. Somekh, Michael Pound

    Abstract: Optical imaging quality can be severely degraded by system and sample induced aberrations. Existing adaptive optics systems typically rely on iterative search algorithm to correct for aberrations and improve images. This study demonstrates the application of convolutional neural networks to characterise the optical aberration by directly predicting the Zernike coefficients from two to three phase-… ▽ More

    Submitted 24 April, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: 12 pages, 6 figures, 4 tables

  3. arXiv:2403.10547  [pdf, ps, other

    math.OC cs.AI cs.DS cs.LG

    Robust Second-Order Nonconvex Optimization and Its Application to Low Rank Matrix Sensing

    Authors: Shuyao Li, Yu Cheng, Ilias Diakonikolas, Jelena Diakonikolas, Rong Ge, Stephen J. Wright

    Abstract: Finding an approximate second-order stationary point (SOSP) is a well-studied and fundamental problem in stochastic nonconvex optimization with many applications in machine learning. However, this problem is poorly understood in the presence of outliers, limiting the use of existing nonconvex algorithms in adversarial settings. In this paper, we study the problem of finding SOSPs in the strong c… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  4. arXiv:2402.11173  [pdf, other

    cs.LG cs.CR math.OC

    How to Make the Gradients Small Privately: Improved Rates for Differentially Private Non-Convex Optimization

    Authors: Andrew Lowy, Jonathan Ullman, Stephen J. Wright

    Abstract: We provide a simple and flexible framework for designing differentially private algorithms to find approximate stationary points of non-convex loss functions. Our framework is based on using a private approximate risk minimizer to "warm start" another private algorithm for finding stationary points. We use this framework to obtain improved, and sometimes optimal, rates for several classes of non-c… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  5. arXiv:2402.05071  [pdf, other

    math.OC cs.LG stat.ML

    Extending the Reach of First-Order Algorithms for Nonconvex Min-Max Problems with Cohypomonotonicity

    Authors: Ahmet Alacaoglu, Donghwan Kim, Stephen J. Wright

    Abstract: We focus on constrained, $L$-smooth, nonconvex-nonconcave min-max problems either satisfying $ρ$-cohypomonotonicity or admitting a solution to the $ρ$-weakly Minty Variational Inequality (MVI), where larger values of the parameter $ρ>0$ correspond to a greater degree of nonconvexity. These problem classes include examples in two player reinforcement learning, interaction dominant min-max problems,… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  6. arXiv:2312.09978  [pdf, other

    cs.LG nlin.AO

    Small jet engine reservoir computing digital twin

    Authors: C. J. Wright, N. Biederman, B. Gyovai, D. J. Gauthier, J. P. Wilhelm

    Abstract: Machine learning was applied to create a digital twin of a numerical simulation of a single-scroll jet engine. A similar model based on the insights gained from this numerical study was used to create a digital twin of a JetCat P100-RX jet engine using only experimental data. Engine data was collected from a custom sensor system measuring parameters such as thrust, exhaust gas temperature, shaft s… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  7. arXiv:2311.11046  [pdf

    q-bio.QM cs.LG q-bio.NC

    DenseNet and Support Vector Machine classifications of major depressive disorder using vertex-wise cortical features

    Authors: Vladimir Belov, Tracy Erwin-Grabner, Ling-Li Zeng, Christopher R. K. Ching, Andre Aleman, Alyssa R. Amod, Zeynep Basgoze, Francesco Benedetti, Bianca Besteher, Katharina Brosch, Robin Bülow, Romain Colle, Colm G. Connolly, Emmanuelle Corruble, Baptiste Couvy-Duchesne, Kathryn Cullen, Udo Dannlowski, Christopher G. Davey, Annemiek Dols, Jan Ernsting, Jennifer W. Evans, Lukas Fisch, Paola Fuentes-Claramonte, Ali Saffet Gonul, Ian H. Gotlib , et al. (63 additional authors not shown)

    Abstract: Major depressive disorder (MDD) is a complex psychiatric disorder that affects the lives of hundreds of millions of individuals around the globe. Even today, researchers debate if morphological alterations in the brain are linked to MDD, likely due to the heterogeneity of this disorder. The application of deep learning tools to neuroimaging data, capable of capturing complex non-linear patterns, h… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

  8. arXiv:2311.00678  [pdf, other

    math.OC cs.LG stat.ML

    Complexity of Single Loop Algorithms for Nonlinear Programming with Stochastic Objective and Constraints

    Authors: Ahmet Alacaoglu, Stephen J. Wright

    Abstract: We analyze the complexity of single-loop quadratic penalty and augmented Lagrangian algorithms for solving nonconvex optimization problems with functional equality constraints. We consider three cases, in all of which the objective is stochastic and smooth, that is, an expectation over an unknown distribution that is accessed by sampling. The nature of the equality constraints differs among the th… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  9. arXiv:2311.00488  [pdf, other

    cs.LG cs.CL

    Comparing Optimization Targets for Contrast-Consistent Search

    Authors: Hugo Fry, Seamus Fallows, Ian Fan, Jamie Wright, Nandi Schoots

    Abstract: We investigate the optimization target of Contrast-Consistent Search (CCS), which aims to recover the internal representations of truth of a large language model. We present a new loss function that we call the Midpoint-Displacement (MD) loss function. We demonstrate that for a certain hyper-parameter value this MD loss function leads to a prober with very similar weights to CCS. We further show t… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: Socially Responsible Language Modelling Research (SoLaR) NeurIPS 2023

  10. arXiv:2310.18841  [pdf, ps, other

    math.OC cs.LG

    A randomized algorithm for nonconvex minimization with inexact evaluations and complexity guarantees

    Authors: Shuyao Li, Stephen J. Wright

    Abstract: We consider minimization of a smooth nonconvex function with inexact oracle access to gradient and Hessian (without assuming access to the function value) to achieve approximate second-order optimality. A novel feature of our method is that if an approximate direction of negative curvature is chosen as the step, we choose its sense to be positive or negative with equal probability. We allow gradie… ▽ More

    Submitted 26 March, 2024; v1 submitted 28 October, 2023; originally announced October 2023.

  11. arXiv:2310.11518  [pdf, other

    cs.GT cs.AI cs.LG

    Guarantees for Self-Play in Multiplayer Games via Polymatrix Decomposability

    Authors: Revan MacQueen, James R. Wright

    Abstract: Self-play is a technique for machine learning in multi-agent systems where a learning algorithm learns by interacting with copies of itself. Self-play is useful for generating large quantities of data for learning, but has the drawback that the agents the learner will face post-training may have dramatically different behavior than the learner came to expect by interacting with itself. For the spe… ▽ More

    Submitted 29 November, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: To appear at NeurIPS 2023

  12. arXiv:2310.10039  [pdf, other

    cs.LG eess.SP

    TpopT: Efficient Trainable Template Optimization on Low-Dimensional Manifolds

    Authors: **gkai Yan, Shiyu Wang, Xinyu Rain Wei, Jimmy Wang, Zsuzsanna Márka, Szabolcs Márka, John Wright

    Abstract: In scientific and engineering scenarios, a recurring task is the detection of low-dimensional families of signals or patterns. A classic family of approaches, exemplified by template matching, aims to cover the search space with a dense template bank. While simple and highly interpretable, it suffers from poor computational efficiency due to unfavorable scaling in the signal space dimensionality.… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

  13. arXiv:2310.08870  [pdf, other

    quant-ph cs.CC cs.CR

    A one-query lower bound for unitary synthesis and breaking quantum cryptography

    Authors: Alex Lombardi, Fermi Ma, John Wright

    Abstract: The Unitary Synthesis Problem (Aaronson-Kuperberg 2007) asks whether any $n$-qubit unitary $U$ can be implemented by an efficient quantum algorithm $A$ augmented with an oracle that computes an arbitrary Boolean function $f$. In other words, can the task of implementing any unitary be efficiently reduced to the task of implementing any Boolean function? In this work, we prove a one-query lower b… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  14. arXiv:2310.04006  [pdf, other

    math.OC cs.LG

    Accelerating optimization over the space of probability measures

    Authors: Shi Chen, Qin Li, Oliver Tse, Stephen J. Wright

    Abstract: The acceleration of gradient-based optimization methods is a subject of significant practical and theoretical importance, particularly within machine learning applications. While much attention has been directed towards optimizing within Euclidean space, the need to optimize over spaces of probability measures in machine learning motivates exploration of accelerated gradient methods in this contex… ▽ More

    Submitted 18 June, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

  15. arXiv:2306.04778  [pdf, other

    cs.LG cs.GT

    How to Evaluate Behavioral Models

    Authors: Greg d'Eon, Sophie Greenwood, Kevin Leyton-Brown, James R. Wright

    Abstract: Researchers building behavioral models, such as behavioral game theorists, use experimental data to evaluate predictive models of human behavior. However, there is little agreement about which loss function should be used in evaluations, with error rate, negative log-likelihood, cross-entropy, Brier score, and squared L2 error all being common choices. We attempt to offer a principled answer to th… ▽ More

    Submitted 22 February, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 15 pages (7 pages body + references and appendix). To appear at AAAI 2024

  16. arXiv:2306.02192  [pdf, other

    cs.LG math.NA

    Correcting auto-differentiation in neural-ODE training

    Authors: Yewei Xu, Shi Chen, Qin Li, Stephen J. Wright

    Abstract: Does the use of auto-differentiation yield reasonable updates to deep neural networks that represent neural ODEs? Through mathematical analysis and numerical evidence, we find that when the neural network employs high-order forms to approximate the underlying ODE flows (such as the Linear Multistep Method (LMM)), brute-force computation using auto-differentiation often produces non-converging arti… ▽ More

    Submitted 3 June, 2023; originally announced June 2023.

  17. arXiv:2302.04972  [pdf, ps, other

    cs.LG cs.CR math.OC stat.ML

    Differentially Private Optimization for Smooth Nonconvex ERM

    Authors: Changyu Gao, Stephen J. Wright

    Abstract: We develop simple differentially private optimization algorithms that move along directions of (expected) descent to find an approximate second-order solution for nonconvex ERM. We use line search, mini-batching, and a two-phase strategy to improve the speed and practicality of the algorithm. Numerical experiments demonstrate the effectiveness of these approaches.

    Submitted 9 June, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

  18. arXiv:2301.13146  [pdf, other

    math.NA cs.LG

    Enhancing Neural Network Differential Equation Solvers

    Authors: Matthew J. H. Wright

    Abstract: We motivate the use of neural networks for the construction of numerical solutions to differential equations. We prove that there exists a feed-forward neural network that can arbitrarily minimise an objective function that is zero at the solution of Poisson's equation, allowing us to guarantee that neural network solution estimates can get arbitrarily close to the exact solutions. We also show ho… ▽ More

    Submitted 28 December, 2022; originally announced January 2023.

    Comments: The source code for this project can be found at https://github.com/mjhwright/error-correction

  19. arXiv:2301.07831  [pdf, other

    math.NA cs.MS stat.CO

    Multi-output multilevel best linear unbiased estimators via semidefinite programming

    Authors: M. Croci, K. E. Willcox, S. J. Wright

    Abstract: Multifidelity forward uncertainty quantification (UQ) problems often involve multiple quantities of interest and heterogeneous models (e.g., different grids, equations, dimensions, physics, surrogate and reduced-order models). While computational efficiency is key in this context, multi-output strategies in multilevel/multifidelity methods are either sub-optimal or non-existent. In this paper we e… ▽ More

    Submitted 15 May, 2023; v1 submitted 18 January, 2023; originally announced January 2023.

    Comments: 22 pages, 5 figures, 3 tables

  20. arXiv:2301.00878  [pdf

    astro-ph.IM astro-ph.SR cs.DL physics.data-an physics.space-ph

    Science Platforms for Heliophysics Data Analysis

    Authors: Monica G. Bobra, Will T. Barnes, Thomas Y. Chen, Mark C. M. Cheung, Laura A. Hayes, Jack Ireland, Miho Janvier, Michael S. F. Kirk, James P. Mason, Stuart J. Mumford, Paul J. Wright

    Abstract: We recommend that NASA maintain and fund science platforms that enable interactive and scalable data analysis in order to maximize the scientific return of data collected from space-based instruments.

    Submitted 2 January, 2023; originally announced January 2023.

    Comments: Heliophysics 2050 White Paper

  21. arXiv:2212.08805  [pdf, other

    cs.DC cs.AR cs.PF

    Understanding the Impact of Input Entropy on FPU, CPU, and GPU Power

    Authors: Sridutt Bhalachandra, Brian Austin, Samuel Williams, Nicholas J. Wright

    Abstract: Power is increasingly becoming a limiting resource in high-performance, GPU-accelerated computing systems. Understanding the range and sources of power variation is essential in setting realistic bounds on rack and system peak power, and develo** techniques that minimize energy. While variations arising during manufacturing and other factors like algorithm among others have been previously studi… ▽ More

    Submitted 17 December, 2022; originally announced December 2022.

    ACM Class: C.1.2; C.1.4; C.4

  22. arXiv:2212.05088  [pdf, other

    math.OC cs.LG

    Cyclic Block Coordinate Descent With Variance Reduction for Composite Nonconvex Optimization

    Authors: Xufeng Cai, Chaobing Song, Stephen J. Wright, Jelena Diakonikolas

    Abstract: Nonconvex optimization is central in solving many machine learning problems, in which block-wise structure is commonly encountered. In this work, we propose cyclic block coordinate methods for nonconvex optimization problems with non-asymptotic gradient norm guarantees. Our convergence analysis is based on a gradient Lipschitz condition with respect to a Mahalanobis norm, inspired by a recent prog… ▽ More

    Submitted 27 January, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

  23. arXiv:2211.10495  [pdf, other

    cs.DC cs.NI

    A DPU Solution for Container Overlay Networks

    Authors: Anton Njavro, James Tau, Taylor Groves, Nicholas J. Wright, Richard West

    Abstract: There is an increasing demand to incorporate hybrid environments as part of workflows across edge, cloud, and HPC systems. In a such converging environment of cloud and HPC, containers are starting to play a more prominent role, bringing their networking infrastructure along with them. However, the current body of work shows that container overlay networks, which are often used to connect containe… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: Pre-print version presented at SuperCompCloud workshop at SC22 conference

  24. arXiv:2208.06521  [pdf, other

    cs.GT

    Non-strategic Econometrics (for Initial Play)

    Authors: Daniel Chui, Jason Hartline, James R. Wright

    Abstract: Modelling agent preferences has applications in a range of fields including economics and increasingly, artificial intelligence. These preferences are not always known and thus may need to be estimated from observed behavior, in which case a model is required to map agent preferences to behavior, also known as structural estimation. Traditional models are based on the assumption that agents are pe… ▽ More

    Submitted 28 February, 2023; v1 submitted 12 August, 2022; originally announced August 2022.

  25. arXiv:2207.11583  [pdf, other

    astro-ph.IM cs.LG gr-qc

    Boosting the Efficiency of Parametric Detection with Hierarchical Neural Networks

    Authors: **gkai Yan, Robert Colgan, John Wright, Zsuzsa Márka, Imre Bartos, Szabolcs Márka

    Abstract: Gravitational wave astronomy is a vibrant field that leverages both classic and modern data processing techniques for the understanding of the universe. Various approaches have been proposed for improving the efficiency of the detection scheme, with hierarchical matched filtering being an important strategy. Meanwhile, deep learning methods have recently demonstrated both consistency with matched… ▽ More

    Submitted 23 July, 2022; originally announced July 2022.

  26. arXiv:2205.12031   

    cs.GT cs.AI

    Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections

    Authors: Dustin Morrill, Ryan D'Orazio, Marc Lanctot, James R. Wright, Michael Bowling, Amy R. Greenwald

    Abstract: Hindsight rationality is an approach to playing general-sum games that prescribes no-regret learning dynamics for individual agents with respect to a set of deviations, and further describes jointly rational behavior among multiple agents with mediated equilibria. To develop hindsight rational learning in sequential decision-making settings, we formalize behavioral deviations as a general class of… ▽ More

    Submitted 1 June, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: Please see version 4 of arXiv:2102.06973 (arXiv:2102.06973v4). This submission was a version of that paper with highlighted corrections. After submitting, I figured out that it would be better to submit this report as another version of arXiv:2102.06973

  27. arXiv:2203.10899  [pdf, other

    astro-ph.EP astro-ph.IM cs.CR physics.pop-ph

    The Case for Technosignatures: Why They May Be Abundant, Long-lived, Highly Detectable, and Unambiguous

    Authors: Jason T. Wright, Jacob Haqq-Misra, Adam Frank, Ravi Kopparapu, Manasvi Lingam, Sofia Z. Sheikh

    Abstract: The intuition suggested by the Drake equation implies that technology should be less prevalent than biology in the galaxy. However, it has been appreciated for decades in the SETI community that technosignatures could be more abundant, longer-lived, more detectable, and less ambiguous than biosignatures. We collect the arguments for and against technosignatures' ubiquity and discuss the implicatio… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: Published in ApJ Letters

    Journal ref: 2022 ApJL 927 L30

  28. arXiv:2203.05086  [pdf, other

    astro-ph.IM cs.LG gr-qc

    Detecting and Diagnosing Terrestrial Gravitational-Wave Mimics Through Feature Learning

    Authors: Robert E. Colgan, Zsuzsa Márka, **gkai Yan, Imre Bartos, John N. Wright, Szabolcs Márka

    Abstract: As engineered systems grow in complexity, there is an increasing need for automatic methods that can detect, diagnose, and even correct transient anomalies that inevitably arise and can be difficult or impossible to diagnose and fix manually. Among the most sensitive and complex systems of our civilization are the detectors that search for incredibly small variations in distance caused by gravitat… ▽ More

    Submitted 5 July, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

  29. arXiv:2203.05006  [pdf, other

    cs.CV cs.LG math.OC

    Resource-Efficient Invariant Networks: Exponential Gains by Unrolled Optimization

    Authors: Sam Buchanan, **gkai Yan, Ellie Haber, John Wright

    Abstract: Achieving invariance to nuisance transformations is a fundamental challenge in the construction of robust and reliable vision systems. Existing approaches to invariance scale exponentially with the dimension of the family of transformations, making them unable to cope with natural variabilities in visual data such as changes in pose and perspective. We identify a common limitation of these approac… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

  30. Architectural Optimization and Feature Learning for High-Dimensional Time Series Datasets

    Authors: Robert E. Colgan, **gkai Yan, Zsuzsa Márka, Imre Bartos, Szabolcs Márka, John N. Wright

    Abstract: As our ability to sense increases, we are experiencing a transition from data-poor problems, in which the central issue is a lack of relevant data, to data-rich problems, in which the central issue is to identify a few relevant features in a sea of observations. Motivated by applications in gravitational-wave astrophysics, we study the problem of predicting the presence of transient noise artifact… ▽ More

    Submitted 5 July, 2022; v1 submitted 27 February, 2022; originally announced February 2022.

  31. arXiv:2202.12280  [pdf

    cs.HC

    Tactile Materials in Practice: Understanding the Experiences of Teachers of the Visually Impaired

    Authors: Mahika Phutane, Julie Wright, Brenda Veronica Castro, Lei Shi, Simone R. Stern, Holly M. Lawson, Shiri Azenkot

    Abstract: Teachers of the visually impaired (TVIs) regularly present tactile materials (tactile graphics, 3D models, and real objects) to students with vision impairments. Researchers have been increasingly interested in designing tools to support the use of tactile materials, but we still lack an in-depth understanding of how tactile materials are created and used in practice today. To address this gap, we… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: 35 pages, 6 figures, 3 tables, to be published in TACCESS

  32. arXiv:2201.07684  [pdf, other

    math.OC cs.LG stat.ML

    On the Complexity of a Practical Primal-Dual Coordinate Method

    Authors: Ahmet Alacaoglu, Volkan Cevher, Stephen J. Wright

    Abstract: We prove complexity bounds for the primal-dual algorithm with random extrapolation and coordinate descent (PURE-CD), which has been shown to obtain good practical performance for solving convex-concave min-max problems with bilinear coupling. Our complexity bounds either match or improve the best-known results in the literature for both dense and sparse (strongly)-convex-(strongly)-concave problem… ▽ More

    Submitted 19 January, 2022; originally announced January 2022.

  33. arXiv:2201.01824  [pdf, ps, other

    quant-ph cs.CC cs.DS

    Testing matrix product states

    Authors: Mehdi Soleimanifar, John Wright

    Abstract: Devising schemes for testing the amount of entanglement in quantum systems has played a crucial role in quantum computing and information theory. Here, we study the problem of testing whether an unknown state $|ψ\rangle$ is a matrix product state (MPS) in the property testing model. MPS are a class of physically-relevant quantum states which arise in the study of quantum many-body systems. A quant… ▽ More

    Submitted 5 January, 2022; originally announced January 2022.

    Comments: 30 pages, 2 figures

  34. arXiv:2111.15101  [pdf, other

    cs.LG physics.comp-ph physics.med-ph

    A novel data-driven algorithm to predict anomalous prescription based on patient's feature set

    Authors: Qiongge Li, Jean Wright, Russell Hales, Ranh Voong, Todd McNutt

    Abstract: Appropriate dosing of radiation is crucial to patient safety in radiotherapy. Current quality assurance depends heavily on a peer-review process, where the physicians' peer review on each patient's treatment plan, including dose and fractionation. However, such a process is manual and laborious. Physicians may not identify errors due to time constraints and caseload. We designed a novel prescripti… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

  35. arXiv:2111.08131  [pdf, other

    quant-ph cs.CC math.OA

    Quantum soundness of testing tensor codes

    Authors: Zhengfeng Ji, Anand Natarajan, Thomas Vidick, John Wright, Henry Yuen

    Abstract: A locally testable code is an error-correcting code that admits very efficient probabilistic tests of membership. Tensor codes provide a simple family of combinatorial constructions of locally testable codes that generalize the family of Reed-Muller codes. The natural test for tensor codes, the axis-parallel line vs. point test, plays an essential role in constructions of probabilistically checkab… ▽ More

    Submitted 6 December, 2022; v1 submitted 15 November, 2021; originally announced November 2021.

    Comments: v3: published version

    Journal ref: Discrete Analysis, 2022:17

  36. arXiv:2111.08066  [pdf, other

    cs.LG

    Exploiting Action Impact Regularity and Exogenous State Variables for Offline Reinforcement Learning

    Authors: Vincent Liu, James R. Wright, Martha White

    Abstract: Offline reinforcement learning -- learning a policy from a batch of data -- is known to be hard for general MDPs. These results motivate the need to look at specific classes of MDPs where offline reinforcement learning might be feasible. In this work, we explore a restricted class of MDPs to obtain guarantees for offline reinforcement learning. The key property, which we call Action Impact Regular… ▽ More

    Submitted 3 May, 2023; v1 submitted 15 November, 2021; originally announced November 2021.

    Journal ref: Journal of Artificial Intelligence Research, 77 (2023) 71-101

  37. arXiv:2111.01842  [pdf, other

    math.OC cs.LG

    Coordinate Linear Variance Reduction for Generalized Linear Programming

    Authors: Chaobing Song, Cheuk Yin Lin, Stephen J. Wright, Jelena Diakonikolas

    Abstract: We study a class of generalized linear programs (GLP) in a large-scale setting, which includes simple, possibly nonsmooth convex regularizer and simple convex set constraints. By reformulating (GLP) as an equivalent convex-concave min-max problem, we show that the linear structure in the problem can be used to design an efficient, scalable first-order algorithm, to which we give the name \emph{Coo… ▽ More

    Submitted 6 April, 2023; v1 submitted 2 November, 2021; originally announced November 2021.

    Comments: 39 pages, NeurIPS 2022

  38. arXiv:2111.01254  [pdf, ps, other

    quant-ph cs.CC

    Unique Games hardness of Quantum Max-Cut, and a conjectured vector-valued Borell's inequality

    Authors: Yeongwoo Hwang, Joe Neeman, Ojas Parekh, Kevin Thompson, John Wright

    Abstract: The Gaussian noise stability of a function $f:\mathbb{R}^n \to \{-1, 1\}$ is the expected value of $f(\boldsymbol{x}) \cdot f(\boldsymbol{y})$ over $ρ$-correlated Gaussian random variables $\boldsymbol{x}$ and $\boldsymbol{y}$. Borell's inequality states that for $-1 \leq ρ\leq 0$, this is minimized by the halfspace $f(x) = \mathrm{sign}(x_1)$. In this work, we generalize this result to hold for f… ▽ More

    Submitted 28 September, 2022; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: 76 pages; v3 treats the vector-valued Borell's inequality as a conjecture rather than a theorem, due to an error in previous versions

  39. arXiv:2110.13562  [pdf, other

    cs.CR cs.CY

    Measuring the Effectiveness of Digital Hygiene using Historical DNS Data

    Authors: Oliver Farnan, Gregory Walton, Joss Wright

    Abstract: This paper describes an ongoing experiment evaluating the efficacy of a digital safety intervention in six high-risk, low capacity Civil Society Organisations (CSOs) in Central Asia. The evaluation takes the form of statistical analysis of DNS traffic in each organisation, obtained via security tools installed by researchers. The hypothesis is that the digital safety intervention strengthens the… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

  40. arXiv:2110.01754  [pdf, other

    cs.CV cs.IR

    An Integrated System for Mobile Image-Based Dietary Assessment

    Authors: Zeman Shao, Yue Han, Jiangpeng He, Runyu Mao, Janine Wright, Deborah Kerr, Carol Boushey, Fengqing Zhu

    Abstract: Accurate assessment of dietary intake requires improved tools to overcome limitations of current methods including user burden and measurement error. Emerging technologies such as image-based approaches using advanced machine learning techniques coupled with widely available mobile devices present new opportunities to improve the accuracy of dietary assessment that is cost-effective, convenient an… ▽ More

    Submitted 4 October, 2021; originally announced October 2021.

  41. arXiv:2107.14324  [pdf, other

    stat.ML cs.LG math.OC

    Deep Networks Provably Classify Data on Curves

    Authors: Tingran Wang, Sam Buchanan, Dar Gilboa, John Wright

    Abstract: Data with low-dimensional nonlinear structure are ubiquitous in engineering and scientific problems. We study a model problem with such structure -- a binary classification task that uses a deep fully-connected neural network to classify data drawn from two disjoint smooth curves on the unit sphere. Aside from mild regularity conditions, we place no restrictions on the configuration of the curves.… ▽ More

    Submitted 28 October, 2021; v1 submitted 29 July, 2021; originally announced July 2021.

    Comments: NeurIPS 2021

  42. arXiv:2107.02887  [pdf

    cs.DL astro-ph.IM

    Furthering a Comprehensive SETI Bibliography

    Authors: Julia LaFond, Jason T. Wright, Macy J. Huston

    Abstract: In 2019, Reyes & Wright used the NASA Astrophysics Data System (ADS) to initiate a comprehensive bibliography for SETI accessible to the public. Since then, updates to the library have been incomplete, partly due to the difficulty in managing the large number of false positive publications generated by searching ADS using simple search terms. In preparation for a recent update, the scope of the li… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

    Comments: 7 pages, 3 figures, accepted to JBIS

    Journal ref: JBIS 74 (2021) 252-255

  43. arXiv:2107.00758  [pdf, other

    cs.LG stat.ML

    The Spotlight: A General Method for Discovering Systematic Errors in Deep Learning Models

    Authors: Greg d'Eon, Jason d'Eon, James R. Wright, Kevin Leyton-Brown

    Abstract: Supervised learning models often make systematic errors on rare subsets of the data. When these subsets correspond to explicit labels in the data (e.g., gender, race) such poor performance can be identified straightforwardly. This paper introduces a method for discovering systematic errors that do not correspond to such explicitly labelled subgroups. The key idea is that similar inputs tend to hav… ▽ More

    Submitted 15 October, 2021; v1 submitted 1 July, 2021; originally announced July 2021.

  44. arXiv:2106.09847  [pdf, ps, other

    cs.GT cs.AI econ.TH

    Disinformation, Stochastic Harm, and Costly Effort: A Principal-Agent Analysis of Regulating Social Media Platforms

    Authors: Shehroze Khan, James R. Wright

    Abstract: The spread of disinformation on social platforms is harmful to society. This harm may manifest as a gradual degradation of public discourse; but it can also take the form of sudden dramatic events such as the 2021 insurrection on Capitol Hill. The platforms themselves are in the best position to prevent the spread of disinformation, as they have the best access to relevant data and the expertise t… ▽ More

    Submitted 27 June, 2022; v1 submitted 17 June, 2021; originally announced June 2021.

  45. arXiv:2106.09211  [pdf, other

    cs.LG eess.SP math.OC

    Square Root Principal Component Pursuit: Tuning-Free Noisy Robust Matrix Recovery

    Authors: Junhui Zhang, **gkai Yan, John Wright

    Abstract: We propose a new framework -- Square Root Principal Component Pursuit -- for low-rank matrix recovery from observations corrupted with noise and outliers. Inspired by the square root Lasso, this new formulation does not require prior knowledge of the noise level. We show that a single, universal choice of the regularization parameter suffices to achieve reconstruction error proportional to the (a… ▽ More

    Submitted 28 October, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

  46. arXiv:2105.10446  [pdf, other

    cs.LG cs.CV cs.IT stat.ML

    ReduNet: A White-box Deep Network from the Principle of Maximizing Rate Reduction

    Authors: Kwan Ho Ryan Chan, Yaodong Yu, Chong You, Haozhi Qi, John Wright, Yi Ma

    Abstract: This work attempts to provide a plausible theoretical framework that aims to interpret modern deep (convolutional) networks from the principles of data compression and discriminative representation. We argue that for high-dimensional multi-class data, the optimal linear discriminative representation maximizes the coding rate difference between the whole dataset and the average of all the subsets.… ▽ More

    Submitted 28 November, 2021; v1 submitted 21 May, 2021; originally announced May 2021.

    Comments: This paper integrates previous two manuscripts: arXiv:2006.08558 and arXiv:2010.14765, with significantly improved organization, presentation, and new results; V2 polishes writing and adds citation; V3 polishes writing, adds citation and experiments

  47. arXiv:2104.11079  [pdf, other

    cs.AI cs.CE

    Randomized Algorithms for Scientific Computing (RASC)

    Authors: Aydin Buluc, Tamara G. Kolda, Stefan M. Wild, Mihai Anitescu, Anthony DeGennaro, John Jakeman, Chandrika Kamath, Ramakrishnan Kannan, Miles E. Lopes, Per-Gunnar Martinsson, Kary Myers, Jelani Nelson, Juan M. Restrepo, C. Seshadhri, Draguna Vrabie, Brendt Wohlberg, Stephen J. Wright, Chao Yang, Peter Zwart

    Abstract: Randomized algorithms have propelled advances in artificial intelligence and represent a foundational research area in advancing AI for Science. Future advancements in DOE Office of Science priority areas such as climate science, astrophysics, fusion, advanced materials, combustion, and quantum computing all require randomized algorithms for surmounting challenges of complexity, robustness, and sc… ▽ More

    Submitted 21 March, 2022; v1 submitted 19 April, 2021; originally announced April 2021.

  48. arXiv:2104.03961  [pdf, other

    astro-ph.IM astro-ph.HE cs.LG gr-qc

    Generalized Approach to Matched Filtering using Neural Networks

    Authors: **gkai Yan, Mariam Avagyan, Robert E. Colgan, Doğa Veske, Imre Bartos, John Wright, Zsuzsa Márka, Szabolcs Márka

    Abstract: Gravitational wave science is a pioneering field with rapidly evolving data analysis methodology currently assimilating and inventing deep learning techniques. The bulk of the sophisticated flagship searches of the field rely on the time-tested matched filtering principle within their core. In this paper, we make a key observation on the relationship between the emerging deep learning and the trad… ▽ More

    Submitted 2 February, 2022; v1 submitted 8 April, 2021; originally announced April 2021.

    Comments: 18 pages, 13 figures

  49. arXiv:2103.07562  [pdf, other

    cs.CV

    Towards Learning Food Portion From Monocular Images With Cross-Domain Feature Adaptation

    Authors: Zeman Shao, Shaobo Fang, Runyu Mao, Jiangpeng He, Janine Wright, Deborah Kerr, Carol Jo Boushey, Fengqing Zhu

    Abstract: We aim to estimate food portion size, a property that is strongly related to the presence of food object in 3D space, from single monocular images under real life setting. Specifically, we are interested in end-to-end estimation of food portion size, which has great potential in the field of personal health management. Unlike image segmentation or object recognition where annotation can be obtaine… ▽ More

    Submitted 12 March, 2021; originally announced March 2021.

  50. arXiv:2102.13643  [pdf, other

    math.OC cs.LG math.NA

    Variance Reduction via Primal-Dual Accelerated Dual Averaging for Nonsmooth Convex Finite-Sums

    Authors: Chaobing Song, Stephen J. Wright, Jelena Diakonikolas

    Abstract: We study structured nonsmooth convex finite-sum optimization that appears widely in machine learning applications, including support vector machines and least absolute deviation. For the primal-dual formulation of this problem, we propose a novel algorithm called \emph{Variance Reduction via Primal-Dual Accelerated Dual Averaging (\vrpda)}. In the nonsmooth and general convex setting, \vrpda~has t… ▽ More

    Submitted 7 April, 2021; v1 submitted 26 February, 2021; originally announced February 2021.

    Comments: 33 pages, 18 figures