Showing 1–41 of 41 results for author: Wright, J

Searching in archive stat. Search in all archives.
  1. arXiv:2402.05071  [pdf, other]

    math.OC cs.LG stat.ML

    Extending the Reach of First-Order Algorithms for Nonconvex Min-Max Problems with Cohypomonotonicity

    Authors: Ahmet Alacaoglu, Donghwan Kim, Stephen J. Wright

    Abstract: We focus on constrained, $L$-smooth, nonconvex-nonconcave min-max problems either satisfying $ρ$-cohypomonotonicity or admitting a solution to the $ρ$-weakly Minty Variational Inequality (MVI), where larger values of the parameter $ρ>0$ correspond to a greater degree of nonconvexity. These problem classes include examples in two player reinforcement learning, interaction dominant min-max problems,…

    Submitted 7 February, 2024; originally announced February 2024.

  2. arXiv:2311.00678  [pdf, other]

    math.OC cs.LG stat.ML

    Complexity of Single Loop Algorithms for Nonlinear Programming with Stochastic Objective and Constraints

    Authors: Ahmet Alacaoglu, Stephen J. Wright

    Abstract: We analyze the complexity of single-loop quadratic penalty and augmented Lagrangian algorithms for solving nonconvex optimization problems with functional equality constraints. We consider three cases, in all of which the objective is stochastic and smooth, that is, an expectation over an unknown distribution that is accessed by sampling. The nature of the equality constraints differs among the th…

    Submitted 1 November, 2023; originally announced November 2023.

  3. arXiv:2302.04972  [pdf, ps, other]

    cs.LG cs.CR math.OC stat.ML

    Differentially Private Optimization for Smooth Nonconvex ERM

    Authors: Changyu Gao, Stephen J. Wright

    Abstract: We develop simple differentially private optimization algorithms that move along directions of (expected) descent to find an approximate second-order solution for nonconvex ERM. We use line search, mini-batching, and a two-phase strategy to improve the speed and practicality of the algorithm. Numerical experiments demonstrate the effectiveness of these approaches.

    Submitted 9 June, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

  4. arXiv:2301.07831  [pdf, other]

    math.NA cs.MS stat.CO

    Multi-output multilevel best linear unbiased estimators via semidefinite programming

    Authors: M. Croci, K. E. Willcox, S. J. Wright

    Abstract: Multifidelity forward uncertainty quantification (UQ) problems often involve multiple quantities of interest and heterogeneous models (e.g., different grids, equations, dimensions, physics, surrogate and reduced-order models). While computational efficiency is key in this context, multi-output strategies in multilevel/multifidelity methods are either sub-optimal or non-existent. In this paper we e…

    Submitted 15 May, 2023; v1 submitted 18 January, 2023; originally announced January 2023.

    Comments: 22 pages, 5 figures, 3 tables

  5. arXiv:2201.07684  [pdf, other]

    math.OC cs.LG stat.ML

    On the Complexity of a Practical Primal-Dual Coordinate Method

    Authors: Ahmet Alacaoglu, Volkan Cevher, Stephen J. Wright

    Abstract: We prove complexity bounds for the primal-dual algorithm with random extrapolation and coordinate descent (PURE-CD), which has been shown to obtain good practical performance for solving convex-concave min-max problems with bilinear coupling. Our complexity bounds either match or improve the best-known results in the literature for both dense and sparse (strongly)-convex-(strongly)-concave problem…

    Submitted 19 January, 2022; originally announced January 2022.

  6. arXiv:2111.00104  [pdf, other]

    stat.AP eess.SP stat.ME stat.OT

    Principal Component Pursuit for Pattern Identification in Environmental Mixtures

    Authors: Elizabeth A. Gibson, Junhui Zhang, Jingkai Yan, Lawrence Chillrud, Jaime Benavides, Yanelli Nunez, Julie B. Herbstman, Jeff Goldsmith, John Wright, Marianthi-Anna Kioumourtzoglou

    Abstract: Environmental health researchers often aim to identify sources/behaviors that give rise to potentially harmful exposures. We adapted principal component pursuit (PCP) -- a robust technique for dimensionality reduction in computer vision and signal processing -- to identify patterns in environmental mixtures. PCP decomposes the exposure mixture into a low-rank matrix containing consistent exposure patter…

    Submitted 29 October, 2021; originally announced November 2021.

    Comments: 32 pages, 11 figures, 4 tables

  7. arXiv:2107.14324  [pdf, other]

    stat.ML cs.LG math.OC

    Deep Networks Provably Classify Data on Curves

    Authors: Tingran Wang, Sam Buchanan, Dar Gilboa, John Wright

    Abstract: Data with low-dimensional nonlinear structure are ubiquitous in engineering and scientific problems. We study a model problem with such structure -- a binary classification task that uses a deep fully-connected neural network to classify data drawn from two disjoint smooth curves on the unit sphere. Aside from mild regularity conditions, we place no restrictions on the configuration of the curves.…

    Submitted 28 October, 2021; v1 submitted 29 July, 2021; originally announced July 2021.

    Comments: NeurIPS 2021

  8. arXiv:2107.00758  [pdf, other]

    cs.LG stat.ML

    The Spotlight: A General Method for Discovering Systematic Errors in Deep Learning Models

    Authors: Greg d'Eon, Jason d'Eon, James R. Wright, Kevin Leyton-Brown

    Abstract: Supervised learning models often make systematic errors on rare subsets of the data. When these subsets correspond to explicit labels in the data (e.g., gender, race) such poor performance can be identified straightforwardly. This paper introduces a method for discovering systematic errors that do not correspond to such explicitly labelled subgroups. The key idea is that similar inputs tend to hav…

    Submitted 15 October, 2021; v1 submitted 1 July, 2021; originally announced July 2021.

  9. arXiv:2105.10446  [pdf, other]

    cs.LG cs.CV cs.IT stat.ML

    ReduNet: A White-box Deep Network from the Principle of Maximizing Rate Reduction

    Authors: Kwan Ho Ryan Chan, Yaodong Yu, Chong You, Haozhi Qi, John Wright, Yi Ma

    Abstract: This work attempts to provide a plausible theoretical framework that aims to interpret modern deep (convolutional) networks from the principles of data compression and discriminative representation. We argue that for high-dimensional multi-class data, the optimal linear discriminative representation maximizes the coding rate difference between the whole dataset and the average of all the subsets.…

    Submitted 28 November, 2021; v1 submitted 21 May, 2021; originally announced May 2021.

    Comments: This paper integrates previous two manuscripts: arXiv:2006.08558 and arXiv:2010.14765, with significantly improved organization, presentation, and new results; V2 polishes writing and adds citation; V3 polishes writing, adds citation and experiments

  10. arXiv:2010.14765  [pdf, other]

    cs.LG cs.IT math.OC stat.ML

    Deep Networks from the Principle of Rate Reduction

    Authors: Kwan Ho Ryan Chan, Yaodong Yu, Chong You, Haozhi Qi, John Wright, Yi Ma

    Abstract: This work attempts to interpret modern deep (convolutional) networks from the principles of rate reduction and (shift) invariant classification. We show that the basic iterative gradient ascent scheme for optimizing the rate reduction of learned features naturally leads to a multi-layer deep network, one iteration per layer. The layered architectures, linear and nonlinear operators, and even param…

    Submitted 27 October, 2020; originally announced October 2020.

  11. arXiv:2010.11366  [pdf, ps, other]

    stat.ML cs.LG

    Random Coordinate Underdamped Langevin Monte Carlo

    Authors: Zhiyan Ding, Qin Li, Jianfeng Lu, Stephen J. Wright

    Abstract: The Underdamped Langevin Monte Carlo (ULMC) is a popular Markov chain Monte Carlo sampling method. It requires the computation of the full gradient of the log-density at each iteration, an expensive operation if the dimension of the problem is high. We propose a sampling method called Random Coordinate ULMC (RC-ULMC), which selects a single coordinate at each iteration to be updated and leaves the…

    Submitted 21 October, 2020; originally announced October 2020.

  12. arXiv:2010.01405  [pdf, ps, other]

    stat.ML cs.LG

    Random Coordinate Langevin Monte Carlo

    Authors: Zhiyan Ding, Qin Li, Jianfeng Lu, Stephen J. Wright

    Abstract: Langevin Monte Carlo (LMC) is a popular Markov chain Monte Carlo sampling method. One drawback is that it requires the computation of the full gradient at each iteration, an expensive operation if the dimension of the problem is high. We propose a new sampling method: Random Coordinate LMC (RC-LMC). At each iteration, a single coordinate is randomly selected to be updated by a multiple of the part…

    Submitted 3 October, 2020; originally announced October 2020.

  13. arXiv:2008.11245  [pdf, other]

    stat.ML cs.LG math.OC

    Deep Networks and the Multiple Manifold Problem

    Authors: Sam Buchanan, Dar Gilboa, John Wright

    Abstract: We study the multiple manifold problem, a binary classification task modeled on applications in machine vision, in which a deep fully-connected neural network is trained to separate two low-dimensional submanifolds of the unit sphere. We provide an analysis of the one-dimensional case, proving for a simple manifold configuration that when the network depth $L$ is large relative to certain geometri…

    Submitted 6 May, 2021; v1 submitted 25 August, 2020; originally announced August 2020.

    Comments: ICLR 2021

  14. arXiv:2007.06753  [pdf, other]

    cs.LG cs.CV cs.IT math.OC stat.ML

    From Symmetry to Geometry: Tractable Nonconvex Problems

    Authors: Yuqian Zhang, Qing Qu, John Wright

    Abstract: As science and engineering have become increasingly data-driven, the role of optimization has expanded to touch almost every stage of the data analysis pipeline, from signal and data acquisition to modeling and prediction. The optimization problems encountered in practice are often nonconvex. While challenges vary from problem to problem, one common source of nonconvexity is nonlinearity in the da…

    Submitted 8 July, 2022; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: review paper, 38 pages, 10 figures, revision: correction of typos, adding more discussion on recent advances on deep learning

  15. arXiv:2005.13815  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Adversarial Classification via Distributional Robustness with Wasserstein Ambiguity

    Authors: Nam Ho-Nguyen, Stephen J. Wright

    Abstract: We study a model for adversarial classification based on distributionally robust chance constraints. We show that under Wasserstein ambiguity, the model aims to minimize the conditional value-at-risk of the distance to misclassification, and we explore links to adversarial classification models proposed earlier and to maximum-margin classifiers. We also provide a reformulation of the distributiona…

    Submitted 3 November, 2021; v1 submitted 28 May, 2020; originally announced May 2020.

    Comments: 32 pages

  16. arXiv:2001.06970  [pdf, other]

    cs.LG cs.IT eess.IV math.OC stat.ML

    Finding the Sparsest Vectors in a Subspace: Theory, Algorithms, and Applications

    Authors: Qing Qu, Zhihui Zhu, Xiao Li, Manolis C. Tsakiris, John Wright, René Vidal

    Abstract: The problem of finding the sparsest vector (direction) in a low dimensional subspace can be considered as a homogeneous variant of the sparse recovery problem, which finds applications in robust subspace recovery, dictionary learning, sparse blind deconvolution, and many other problems in signal processing and machine learning. However, in contrast to the classical sparse recovery problem, the mos…

    Submitted 19 January, 2020; originally announced January 2020.

    Comments: QQ and ZZ contributed equally to the work. Invited review paper for IEEE Signal Processing Magazine Special Issue on non-convex optimization for signal processing and machine learning. This article contains 26 pages with 11 figures

  17. arXiv:1912.08756  [pdf, other]

    cs.LG cs.IR stat.ML

    Interleaved Composite Quantization for High-Dimensional Similarity Search

    Authors: Soroosh Khoram, Stephen J Wright, Jing Li

    Abstract: Similarity search retrieves the nearest neighbors of a query vector from a dataset of high-dimensional vectors. As the size of the dataset grows, the cost of performing the distance computations needed to implement a query can become prohibitive. A method often used to reduce this computational cost is quantization of the vector space and location-based encoding of the dataset vectors. These encod…

    Submitted 18 December, 2019; v1 submitted 18 December, 2019; originally announced December 2019.

  18. arXiv:1912.06508  [pdf, other]

    cs.LG math.OC stat.ML

    A Distributed Quasi-Newton Algorithm for Primal and Dual Regularized Empirical Risk Minimization

    Authors: Ching-pei Lee, Cong Han Lim, Stephen J. Wright

    Abstract: We propose a communication- and computation-efficient distributed optimization algorithm using second-order information for solving empirical risk minimization (ERM) problems with a nonsmooth regularization term. Our algorithm is applicable to both the primal and the dual ERM problem. Current second-order and quasi-Newton methods for this problem either do not work well in the distributed setting…

    Submitted 12 December, 2019; originally announced December 2019.

    Comments: arXiv admin note: text overlap with arXiv:1803.01370

  19. arXiv:1911.01486  [pdf, other]

    cs.LG astro-ph.SR eess.IV stat.ML

    Probabilistic Super-Resolution of Solar Magnetograms: Generating Many Explanations and Measuring Uncertainties

    Authors: Xavier Gitiaux, Shane A. Maloney, Anna Jungbluth, Carl Shneider, Paul J. Wright, Atılım Güneş Baydin, Michel Deudon, Yarin Gal, Alfredo Kalaitzis, Andrés Muñoz-Jaramillo

    Abstract: Machine learning techniques have been successfully applied to super-resolution tasks on natural images where visually pleasing results are sufficient. However, in many scientific domains this is not adequate and estimations of errors and uncertainties are crucial. To address this issue we propose a Bayesian framework that decomposes uncertainties into epistemic and aleatoric uncertainties. We test…

    Submitted 4 November, 2019; originally announced November 2019.

  20. arXiv:1908.10959  [pdf, other]

    eess.SP cs.LG eess.IV math.OC stat.ML

    Short-and-Sparse Deconvolution -- A Geometric Approach

    Authors: Yenson Lau, Qing Qu, Han-Wen Kuo, Pengcheng Zhou, Yuqian Zhang, John Wright

    Abstract: Short-and-sparse deconvolution (SaSD) is the problem of extracting localized, recurring motifs in signals with spatial or temporal structure. Variants of this problem arise in applications such as image deblurring, microscopy, neural spike sorting, and more. The problem is challenging in both theory and practice, as natural optimization formulations are nonconvex. Moreover, practical deconvolution…

    Submitted 1 October, 2019; v1 submitted 28 August, 2019; originally announced August 2019.

    Comments: *YL and QQ contributed equally to this work; 30 figures, 45 pages; This version: added an experiment comparing with other methods, corrected typos and added references

  21. arXiv:1906.02435  [pdf, other]

    cs.LG eess.SP stat.CO stat.ML

    Complete Dictionary Learning via $\ell^4$-Norm Maximization over the Orthogonal Group

    Authors: Yuexiang Zhai, Zitong Yang, Zhenyu Liao, John Wright, Yi Ma

    Abstract: This paper considers the fundamental problem of learning a complete (orthogonal) dictionary from samples of sparsely generated signals. Most existing methods solve the dictionary (and sparse representations) based on heuristic algorithms, usually without theoretical guarantees for either optimality or complexity. The recent $\ell^1$-minimization based methods do provide such guarantees but the ass…

    Submitted 6 April, 2021; v1 submitted 6 June, 2019; originally announced June 2019.

  22. arXiv:1904.12417  [pdf, other]

    stat.AP stat.ME

    Kernel Machine and Distributed Lag Models for Assessing Windows of Susceptibility to Environmental Mixtures in Children's Health Studies

    Authors: Ander Wilson, Hsiao-Hsien Leon Hsu, Yueh-Hsiu Mathilda Chiu, Robert O. Wright, Rosalind J. Wright, Brent A. Coull

    Abstract: Exposures to environmental chemicals during gestation can alter health status later in life. Most studies of maternal exposure to chemicals during pregnancy have focused on a single chemical exposure observed at high temporal resolution. Recent research has turned to focus on exposure to mixtures of multiple chemicals, generally observed at a single time point. We consider statistical methods for…

    Submitted 21 September, 2021; v1 submitted 28 April, 2019; originally announced April 2019.

    Journal ref: Ann. Appl. Stat. 16(2): 1090-1110 (June 2022)

  23. arXiv:1806.00338  [pdf, other]

    eess.SP cs.IT math.OC stat.ML

    Structured Local Optima in Sparse Blind Deconvolution

    Authors: Yuqian Zhang, Han-Wen Kuo, John Wright

    Abstract: Blind deconvolution is a ubiquitous problem of recovering two unknown signals from their convolution. Unfortunately, this is an ill-posed problem in general. This paper focuses on the short and sparse blind deconvolution problem, where the one unknown signal is short and the other one is sparsely and randomly supported. This variant captures the structure of the unknown signals in several im…

    Submitted 21 July, 2019; v1 submitted 1 June, 2018; originally announced June 2018.

    Comments: 63 pages, 7 figures

  24. arXiv:1803.01370  [pdf, other]

    math.OC cs.LG stat.ML

    A Distributed Quasi-Newton Algorithm for Empirical Risk Minimization with Nonsmooth Regularization

    Authors: Ching-pei Lee, Cong Han Lim, Stephen J. Wright

    Abstract: We propose a communication- and computation-efficient distributed optimization algorithm using second-order information for solving ERM problems with a nonsmooth regularization term. Current second-order and quasi-Newton methods for this problem either do not work well in the distributed setting or work only for specific regularizers. Our algorithm uses successive quadratic approximations, and we…

    Submitted 26 May, 2018; v1 submitted 4 March, 2018; originally announced March 2018.

    Comments: In the proceedings of The 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

  25. arXiv:1801.08019  [pdf, other]

    cs.LG stat.ML

    Training Set Debugging Using Trusted Items

    Authors: Xuezhou Zhang, Xiaojin Zhu, Stephen J. Wright

    Abstract: Training set bugs are flaws in the data that adversely affect machine learning. The training set is usually too large for manual inspection, but one may have the resources to verify a few trusted items. The set of trusted items may not by itself be adequate for learning, so we propose an algorithm that uses these items to identify bugs in the training set and thus improves learning. Specifical…

    Submitted 24 January, 2018; originally announced January 2018.

    Comments: AAAI 2018

  26. arXiv:1712.00716  [pdf, other]

    stat.CO cs.IT math.NA math.OC stat.ML

    Convolutional Phase Retrieval via Gradient Descent

    Authors: Qing Qu, Yuqian Zhang, Yonina C. Eldar, John Wright

    Abstract: We study the convolutional phase retrieval problem, of recovering an unknown signal $\mathbf x \in \mathbb C^n $ from $m$ measurements consisting of the magnitude of its cyclic convolution with a given kernel $\mathbf a \in \mathbb C^m $. This model is motivated by applications such as channel estimation, optics, and underwater acoustic communication, where the signal of interest is acted on by a…

    Submitted 5 October, 2019; v1 submitted 3 December, 2017; originally announced December 2017.

    Comments: 64 pages, 9 figures, appeared in NeurIPS 2017. Accepted at IEEE Transactions on Information Theory. This is the final (minor) update: fixed typos and grammar issues

  27. Bayesian Distributed Lag Interaction Models to Identify Perinatal Windows of Vulnerability in Children's Health

    Authors: Ander Wilson, Yueh-Hsiu Mathilda Chiu, Hsiao-Hsien Leon Hsu, Robert O. Wright, Rosalind J. Wright, Brent A. Coull

    Abstract: Epidemiological research supports an association between maternal exposure to air pollution during pregnancy and adverse children's health outcomes. Advances in exposure assessment and statistics allow for estimation of both critical windows of vulnerability and exposure effect heterogeneity. Simultaneous estimation of windows of vulnerability and effect heterogeneity can be accomplished by fittin…

    Submitted 17 December, 2016; originally announced December 2016.

    Journal ref: Biostatistics 2007

  28. arXiv:1602.06664  [pdf, other]

    cs.IT math.OC stat.ML

    A Geometric Analysis of Phase Retrieval

    Authors: Ju Sun, Qing Qu, John Wright

    Abstract: Can we recover a complex signal from its Fourier magnitudes? More generally, given a set of $m$ measurements, $y_k = |\mathbf a_k^* \mathbf x|$ for $k = 1, \dots, m$, is it possible to recover $\mathbf x \in \mathbb{C}^n$ (i.e., length-$n$ complex vector)? This generalized phase retrieval (GPR) problem is a fundamental task in various disciplines, and has been the subject of much recent invest…

    Submitted 1 January, 2017; v1 submitted 22 February, 2016; originally announced February 2016.

    Comments: 61 pages, 5 figures. A short version can be found here http://sunju.org/docs/PR_G4_16.pdf . Revised according to reviewers' feedback

    Journal ref: Foundations of Computational Mathematics, 18(5):1131--1198, 2018

  29. arXiv:1511.04777  [pdf, other]

    cs.IT cs.CV math.OC stat.ML

    Complete Dictionary Recovery over the Sphere II: Recovery by Riemannian Trust-region Method

    Authors: Ju Sun, Qing Qu, John Wright

    Abstract: We consider the problem of recovering a complete (i.e., square and invertible) matrix $\mathbf A_0$, from $\mathbf Y \in \mathbb{R}^{n \times p}$ with $\mathbf Y = \mathbf A_0 \mathbf X_0$, provided $\mathbf X_0$ is sufficiently sparse. This recovery problem is central to theoretical understanding of dictionary learning, which seeks a sparse representation for a collection of input signals and fin…

    Submitted 1 September, 2016; v1 submitted 15 November, 2015; originally announced November 2015.

    Comments: The second of two papers based on the report arXiv:1504.06785. Accepted by IEEE Transaction on Information Theory; revised according to the reviewers' comments

    Journal ref: IEEE Trans. Information Theory, 63(2): 885 - 914 (2017)

  30. arXiv:1511.03607  [pdf, other]

    cs.IT cs.CV math.OC stat.ML

    Complete Dictionary Recovery over the Sphere I: Overview and the Geometric Picture

    Authors: Ju Sun, Qing Qu, John Wright

    Abstract: We consider the problem of recovering a complete (i.e., square and invertible) matrix $\mathbf A_0$, from $\mathbf Y \in \mathbb{R}^{n \times p}$ with $\mathbf Y = \mathbf A_0 \mathbf X_0$, provided $\mathbf X_0$ is sufficiently sparse. This recovery problem is central to theoretical understanding of dictionary learning, which seeks a sparse representation for a collection of input signals and fin…

    Submitted 1 September, 2016; v1 submitted 11 November, 2015; originally announced November 2015.

    Comments: Accepted by IEEE Transaction on Information Theory; revised according to the reviewers' comments

    Journal ref: IEEE Trans. Information Theory, 63(2): 853 - 884 (2017)

  31. arXiv:1510.06096  [pdf, other]

    math.OC cs.IT stat.ML

    When Are Nonconvex Problems Not Scary?

    Authors: Ju Sun, Qing Qu, John Wright

    Abstract: In this note, we focus on smooth nonconvex optimization problems that obey: (1) all local minimizers are also global; and (2) around any saddle point or local maximizer, the objective has a negative directional curvature. Concrete applications such as dictionary learning, generalized phase retrieval, and orthogonal tensor decomposition are known to induce such structures. We describe a second-orde…

    Submitted 22 April, 2016; v1 submitted 20 October, 2015; originally announced October 2015.

    Comments: 6 pages, 3 figures. New examples on phase synchronization and community detection added; emphasis on all local minimizers being global added; exposition is polished. This is a concise expository article that avoids much technical rigor. We will make a separate submission with full technical details in future

  32. arXiv:1504.06785  [pdf, other]

    cs.IT cs.CV cs.LG math.OC stat.ML

    Complete Dictionary Recovery over the Sphere

    Authors: Ju Sun, Qing Qu, John Wright

    Abstract: We consider the problem of recovering a complete (i.e., square and invertible) matrix $\mathbf A_0$, from $\mathbf Y \in \mathbb R^{n \times p}$ with $\mathbf Y = \mathbf A_0 \mathbf X_0$, provided $\mathbf X_0$ is sufficiently sparse. This recovery problem is central to the theoretical understanding of dictionary learning, which seeks a sparse representation for a collection of input signals, and…

    Submitted 17 November, 2015; v1 submitted 26 April, 2015; originally announced April 2015.

    Comments: 104 pages, 5 figures. Due to the length constraints of publication, this long paper was subsequently divided into two papers (arXiv:1511.03607 and arXiv:1511.04777). Further updates will be made only to the two papers

    MSC Class: 68P30; 58C05; 94A12; 94A08; 68T05; 90C26; 90C48; 90C55

  33. arXiv:1412.4659  [pdf, other]

    cs.IT cs.CV cs.LG math.OC stat.ML

    Finding a sparse vector in a subspace: Linear sparsity using alternating directions

    Authors: Qing Qu, Ju Sun, John Wright

    Abstract: Is it possible to find the sparsest vector (direction) in a generic subspace $\mathcal{S} \subseteq \mathbb{R}^p$ with $\mathrm{dim}(\mathcal{S})= n < p$? This problem can be considered a homogeneous variant of the sparse recovery problem, and finds connections to sparse dictionary learning, sparse PCA, and many other problems in signal processing and machine learning. In this paper, we focus on a…

    Submitted 19 July, 2016; v1 submitted 15 December, 2014; originally announced December 2014.

    Comments: Accepted by IEEE Trans. Information Theory. The paper has been revised by the reviewers' comments. The proofs have been streamlined

    Journal ref: IEEE Transaction on Information Theory, 62(10):5855 - 5880, 2016

  34. arXiv:1404.7208  [pdf, other]

    stat.OT

    Validating Sample Average Approximation Solutions with Negatively Dependent Batches

    Authors: Jiajie Chen, Cong Han Lim, Peter Z. G. Qian, Jeff Linderoth, Stephen J. Wright

    Abstract: Sample-average approximations (SAA) are a practical means of finding approximate solutions of stochastic programming problems involving an extremely large (or infinite) number of scenarios. SAA can also be used to find estimates of a lower bound on the optimal objective value of the true problem which, when coupled with an upper bound, provides confidence intervals for the true optimal objective v…

    Submitted 6 May, 2014; v1 submitted 28 April, 2014; originally announced April 2014.

  35. arXiv:1403.7588  [pdf, other]

    math.OC cs.CV math.NA stat.ML

    Scalable Robust Matrix Recovery: Frank-Wolfe Meets Proximal Methods

    Authors: Cun Mu, Yuqian Zhang, John Wright, Donald Goldfarb

    Abstract: Recovering matrices from compressive and grossly corrupted observations is a fundamental problem in robust statistics, with rich applications in computer vision and machine learning. In theory, under certain conditions, this problem can be solved in polynomial time via a natural convex relaxation, known as Compressive Principal Component Pursuit (CPCP). However, all existing provable algorithms fo…

    Submitted 29 May, 2017; v1 submitted 29 March, 2014; originally announced March 2014.

    Journal ref: SIAM Journal on Scientific Computing, 2016, Vol. 38, No. 5 : pp. A3291-A3317

  36. arXiv:1307.5870  [pdf, other]

    stat.ML cs.LG

    Square Deal: Lower Bounds and Improved Relaxations for Tensor Recovery

    Authors: Cun Mu, Bo Huang, John Wright, Donald Goldfarb

    Abstract: Recovering a low-rank tensor from incomplete information is a recurring problem in signal processing and machine learning. The most popular convex relaxation of this problem minimizes the sum of the nuclear norms of the unfoldings of the tensor. We show that this approach can be substantially suboptimal: reliably recovering a $K$-way tensor of length $n$ and Tucker rank $r$ from Gaussian measureme…

    Submitted 15 August, 2013; v1 submitted 22 July, 2013; originally announced July 2013.

    Comments: Slight modifications are made in this second version (mainly, Lemma 5)

  37. arXiv:1307.5494  [pdf, other]

    math.NA cs.LG stat.ML

    On GROUSE and Incremental SVD

    Authors: Laura Balzano, Stephen J. Wright

    Abstract: GROUSE (Grassmannian Rank-One Update Subspace Estimation) is an incremental algorithm for identifying a subspace of $\mathbb{R}^n$ from a sequence of vectors in this subspace, where only a subset of components of each vector is revealed at each iteration. Recent analysis has shown that GROUSE converges locally at an expected linear rate, under certain assumptions. GROUSE has a similar flavor to the incrementa…

    Submitted 20 July, 2013; originally announced July 2013.

  38. arXiv:1211.0757  [pdf, other]

    stat.ML cs.CV stat.AP

    Efficient Point-to-Subspace Query in $\ell^1$: Theory and Applications in Computer Vision

    Authors: Ju Sun, Yuqian Zhang, John Wright

    Abstract: Motivated by vision tasks such as robust face and object recognition, we consider the following general problem: given a collection of low-dimensional linear subspaces in a high-dimensional ambient (image) space and a query point (image), efficiently determine the nearest subspace to the query in $\ell^1$ distance. We show in theory that Cauchy random embedding of the objects into significantly-lo…

    Submitted 4 November, 2012; originally announced November 2012.

    Comments: To appear in NIPS workshop on big learning, 2012

  39. arXiv:1208.0432  [pdf, other]

    cs.CV cs.LG stat.ML

    Efficient Point-to-Subspace Query in $\ell^1$ with Application to Robust Object Instance Recognition

    Authors: Ju Sun, Yuqian Zhang, John Wright

    Abstract: Motivated by vision tasks such as robust face and object recognition, we consider the following general problem: given a collection of low-dimensional linear subspaces in a high-dimensional ambient (image) space, and a query point (image), efficiently determine the nearest subspace to the query in $\ell^1$ distance. In contrast to the naive exhaustive search which entails large-scale linear progra…

    Submitted 6 March, 2014; v1 submitted 2 August, 2012; originally announced August 2012.

    Comments: Revised based on reviewers' feedback; one new experiment on synthesized data added; one section discussing the speed up added

    Journal ref: SIAM Journal on Imaging Sciences, 7(4):2105 - 2138, 2014

  40. arXiv:1207.0577  [pdf, ps, other]

    stat.ML cs.LG

    Robust Dequantized Compressive Sensing

    Authors: Ji Liu, Stephen J. Wright

    Abstract: We consider the reconstruction problem in compressed sensing in which the observations are recorded in a finite number of bits. They may thus contain quantization errors (from being rounded to the nearest representable value) and saturation errors (from being outside the range of representable values). Our formulation has an objective of weighted $\ell_2$-$\ell_1$ type, along with constraints that…

    Submitted 10 October, 2013; v1 submitted 3 July, 2012; originally announced July 2012.

  41. arXiv:1104.4385  [pdf, other]

    cs.CV stat.ML

    Convex Approaches to Model Wavelet Sparsity Patterns

    Authors: Nikhil S Rao, Robert D. Nowak, Stephen J. Wright, Nick G. Kingsbury

    Abstract: Statistical dependencies among wavelet coefficients are commonly represented by graphical models such as hidden Markov trees (HMTs). However, in linear inverse problems such as deconvolution, tomography, and compressed sensing, the presence of a sensing or observation matrix produces a linear mixing of the simple Markovian dependency structure. This leads to reconstruction problems that are non-con…

    Submitted 22 April, 2011; originally announced April 2011.