Skip to main content

Showing 101–119 of 119 results for author: Sra, S

.
  1. arXiv:1411.4107  [pdf, ps, other

    math.FA

    Explicit diagonalization of an anti-triangular Cesaró matrix

    Authors: Suvrit Sra

    Abstract: We study a specific "anti-triangular" Cesaró matrix corresponding to a Markov chain. We derive closed forms for all the eigenvalues and eigenvectors of this matrix.

    Submitted 4 April, 2015; v1 submitted 14 November, 2014; originally announced November 2014.

    Comments: Bugfix version (new proof of Prop. 3.5)

  2. arXiv:1411.0589  [pdf, other

    stat.ML math.OC

    Modular proximal optimization for multidimensional total-variation regularization

    Authors: Álvaro Barbero, Suvrit Sra

    Abstract: We study \emph{TV regularization}, a widely used technique for eliciting structured sparsity. In particular, we propose efficient algorithms for computing prox-operators for $\ell_p$-norm TV. The most important among these is $\ell_1$-norm TV, for whose prox-operator we present a new geometric analysis which unveils a hitherto unknown connection to taut-string methods. This connection turns out to… ▽ More

    Submitted 30 December, 2017; v1 submitted 3 November, 2014; originally announced November 2014.

    Comments: 67 pages, 32 figures, new non-iterative fast TV algorithm, extensive new experiments, corresponds to the github proxtv repository now

  3. arXiv:1411.0065  [pdf, ps, other

    math.FA

    Hlawka-Popoviciu inequalities on positive definite tensors

    Authors: Wolfgang Berndt, Suvrit Sra

    Abstract: We prove inequalities on symmetric tensor sums of positive definite operators. In particular, we prove multivariable operator inequalities inspired by generalizations to the well-known Hlawka and Popoviciu inequalities. As corollaries, we obtain generalized Hlawka and Popoviciu inequalities for determinants, permanents, and generalized matrix functions. The new operator inequalities and their coro… ▽ More

    Submitted 15 November, 2014; v1 submitted 1 November, 2014; originally announced November 2014.

    Comments: 9 pages; a new section on tensor Popoviciu inequalities

    MSC Class: 15A15; 15A39; 15A69; 46M05; 47A63

  4. arXiv:1410.4812  [pdf, other

    stat.CO math.OC stat.ML

    Inference and Mixture Modeling with the Elliptical Gamma Distribution

    Authors: Reshad Hosseini, Suvrit Sra, Lucas Theis, Matthias Bethge

    Abstract: We study modeling and inference with the Elliptical Gamma Distribution (EGD). We consider maximum likelihood (ML) estimation for EGD scatter matrices, a task for which we develop new fixed-point algorithms. Our algorithms are efficient and converge to global optima despite nonconvexity. Moreover, they turn out to be much faster than both a well-known iterative algorithm of Kent & Tyler (1991) and… ▽ More

    Submitted 20 December, 2015; v1 submitted 17 October, 2014; originally announced October 2014.

    Comments: 23 pages, 11 figures

    Journal ref: Computational Statistics & Data Analysis 2016, Vol. 101, 29-43

  5. arXiv:1410.1958  [pdf, ps, other

    math.FA

    Completely strong superadditivity of generalized matrix functions

    Authors: Minghua Lin, Suvrit Sra

    Abstract: We prove that generalized matrix functions satisfy a block-matrix strong superadditivity inequality over the cone of positive semidefinite matrices. Our result extends a recent result of Paksoy-Turkmen-Zhang (V. Paksoy, R. Turkmen, F. Zhang, Inequalities of generalized matrix functions via tensor products, Electron. J. Linear Algebra 27 (2014) 332-341.). As an application, we obtain a short proof… ▽ More

    Submitted 7 October, 2014; originally announced October 2014.

    Comments: 6 pages, no figures

    MSC Class: 15A45; 15A69

  6. arXiv:1409.6086  [pdf, other

    stat.ML math.OC

    Parallel and Distributed Block-Coordinate Frank-Wolfe Algorithms

    Authors: Yu-Xiang Wang, Veeranjaneyulu Sadhanala, Wei Dai, Willie Neiswanger, Suvrit Sra, Eric P. Xing

    Abstract: We develop parallel and distributed Frank-Wolfe algorithms; the former on shared memory machines with mini-batching, and the latter in a delayed update framework. Whenever possible, we perform computations asynchronously, which helps attain speedups on multicore machines as well as in distributed environments. Moreover, instead of worst-case bounded delays, our methods only depend (mildly) on \emp… ▽ More

    Submitted 12 February, 2016; v1 submitted 22 September, 2014; originally announced September 2014.

  7. arXiv:1409.2617  [pdf, other

    math.OC stat.ML

    Large-scale randomized-coordinate descent methods with non-separable linear constraints

    Authors: Sashank Reddi, Ahmed Hefny, Carlton Downey, Avinava Dubey, Suvrit Sra

    Abstract: We develop randomized (block) coordinate descent (CD) methods for linearly constrained convex optimization. Unlike most CD methods, we do not assume the constraints to be separable, but let them be coupled linearly. To our knowledge, ours is the first CD method that allows linear coupling constraints, without making the global iteration complexity have an exponential dependence on the number of co… ▽ More

    Submitted 10 June, 2015; v1 submitted 9 September, 2014; originally announced September 2014.

  8. arXiv:1402.0119  [pdf, other

    stat.ML cs.LG

    Randomized Nonlinear Component Analysis

    Authors: David Lopez-Paz, Suvrit Sra, Alex Smola, Zoubin Ghahramani, Bernhard Schölkopf

    Abstract: Classical methods such as Principal Component Analysis (PCA) and Canonical Correlation Analysis (CCA) are ubiquitous in statistics. However, these techniques are only able to reveal linear relationships in data. Although nonlinear variants of PCA and CCA have been proposed, these are computationally prohibitive in the large scale. In a separate strand of recent research, randomized methods have… ▽ More

    Submitted 13 May, 2014; v1 submitted 1 February, 2014; originally announced February 2014.

    Comments: Appearing in ICML 2014

  9. Conic geometric optimisation on the manifold of positive definite matrices

    Authors: Suvrit Sra, Reshad Hosseini

    Abstract: We develop \emph{geometric optimisation} on the manifold of Hermitian positive definite (HPD) matrices. In particular, we consider optimising two types of cost functions: (i) geodesically convex (g-convex); and (ii) log-nonexpansive (LN). G-convex functions are nonconvex in the usual euclidean sense, but convex along the manifold and thus allow global optimisation. LN functions may fail to be even… ▽ More

    Submitted 12 December, 2014; v1 submitted 4 December, 2013; originally announced December 2013.

    Comments: 27 pages; updated version with simplified presentation; 7 figures

    Journal ref: SIAM Journal on Optimization 2015, Vol. 25, No. 1, 713-739

  10. arXiv:1311.7656  [pdf, ps, other

    stat.ML cs.DM math.OC stat.CO stat.ME

    Statistical estimation for optimization problems on graphs

    Authors: Mikhail Langovoy, Suvrit Sra

    Abstract: Large graphs abound in machine learning, data mining, and several related areas. A useful step towards analyzing such graphs is that of obtaining certain summary statistics - e.g., or the expected length of a shortest path between two nodes, or the expected weight of a minimum spanning tree of the graph, etc. These statistics provide insight into the structure of a graph, and they can help predict… ▽ More

    Submitted 29 November, 2013; originally announced November 2013.

    Comments: Paper for the NIPS Workshop on Discrete Optimization for Machine Learning (DISCML) (2011): Uncertainty, Generalization and Feedback

  11. arXiv:1311.4296  [pdf, ps, other

    cs.LG cs.RO math.NA math.OC

    Reflection methods for user-friendly submodular optimization

    Authors: Stefanie Jegelka, Francis Bach, Suvrit Sra

    Abstract: Recently, it has become evident that submodularity naturally captures widely occurring concepts in machine learning, signal processing and computer vision. Consequently, there is need for efficient optimization procedures for submodular functions, especially for minimization problems. While general submodular minimization is challenging, we propose a new method that exploits existing decomposabili… ▽ More

    Submitted 18 November, 2013; originally announced November 2013.

    Comments: Neural Information Processing Systems (NIPS), États-Unis (2013)

  12. arXiv:1204.1437  [pdf, ps, other

    stat.ML cs.LG math.OC

    Fast projections onto mixed-norm balls with applications

    Authors: Suvrit Sra

    Abstract: Joint sparsity offers powerful structural cues for feature selection, especially for variables that are expected to demonstrate a "grouped" behavior. Such behavior is commonly modeled via group-lasso, multitask lasso, and related methods where feature selection is effected via mixed-norms. Several mixed-norm based sparse models have received substantial attention, and for some cases efficient algo… ▽ More

    Submitted 6 April, 2012; originally announced April 2012.

    Comments: Preprint of paper under review

  13. arXiv:1201.4651  [pdf, ps, other

    math.NA

    Explicit eigenvalues of certain scaled trigonometric matrices

    Authors: Suvrit Sra

    Abstract: In a very recent paper "\emph{On eigenvalues and equivalent transformation of trigonometric matrices}" (D. Zhang, Z. Lin, and Y. Liu, LAA 436, 71--78 (2012)), the authors motivated and discussed a trigonometric matrix that arises in the design of finite impulse response (FIR) digital filters. The eigenvalues of this matrix shed light on the FIR filter design, so obtaining them in closed form was i… ▽ More

    Submitted 29 April, 2012; v1 submitted 23 January, 2012; originally announced January 2012.

    Comments: 7 pages; fixed Lemma 2, tightened inequalities

    MSC Class: 15A18; 15A26

  14. arXiv:1110.1773  [pdf, ps, other

    math.FA stat.ML

    Positive definite matrices and the S-divergence

    Authors: Suvrit Sra

    Abstract: Positive definite matrices abound in a dazzling variety of applications. This ubiquity can be in part attributed to their rich geometric structure: positive definite matrices form a self-dual convex cone whose strict interior is a Riemannian manifold. The manifold view is endowed with a "natural" distance function while the conic view is not. Nevertheless, drawing motivation from the conic view, w… ▽ More

    Submitted 27 December, 2013; v1 submitted 8 October, 2011; originally announced October 2011.

    Comments: 24 pages with several new results; a fraction of this paper also appeared at the Neural Information Processing Systems (NIPS) Conference, Dec. 2012

  15. arXiv:1109.0258  [pdf, ps, other

    math.OC stat.ML

    Nonconvex proximal splitting: batch and incremental algorithms

    Authors: Suvrit Sra

    Abstract: Within the unmanageably large class of nonconvex optimization, we consider the rich subclass of nonsmooth problems that have composite objectives---this already includes the extensively studied convex, composite objective problems as a special case. For this subclass, we introduce a powerful, new framework that permits asymptotically non-vanishing perturbations. In particular, we develop perturbat… ▽ More

    Submitted 17 September, 2012; v1 submitted 1 September, 2011; originally announced September 2011.

    Comments: revised version 12 pages, 2 figures; superset of shorter counterpart in NIPS 2012

  16. arXiv:1106.5175  [pdf, other

    stat.ML

    Sparse Inverse Covariance Estimation via an Adaptive Gradient-Based Method

    Authors: Suvrit Sra, Dongmin Kim

    Abstract: We study the problem of estimating from data, a sparse approximation to the inverse covariance matrix. Estimating a sparsity constrained inverse covariance matrix is a key component in Gaussian graphical model learning, but one that is numerically very challenging. We address this challenge by develo** a new adaptive gradient-based method that carefully combines gradient information with an adap… ▽ More

    Submitted 25 June, 2011; originally announced June 2011.

    Comments: 13 pages

  17. arXiv:1104.4422  [pdf, ps, other

    stat.CO math.CA

    The Multivariate Watson Distribution: Maximum-Likelihood Estimation and other Aspects

    Authors: Suvrit Sra, Dmitrii Karp

    Abstract: This paper studies fundamental aspects of modelling data using multivariate Watson distributions. Although these distributions are natural for modelling axially symmetric data (i.e., unit vectors where $\pm \x$ are equivalent), for high-dimensions using them can be difficult. Why so? Largely because for Watson distributions even basic tasks such as maximum-likelihood are numerically challenging. T… ▽ More

    Submitted 25 May, 2012; v1 submitted 22 April, 2011; originally announced April 2011.

    Comments: 24 pages; extensively updated numerical results

    MSC Class: 62H11; 33C15; 26D07

  18. arXiv:0906.4805  [pdf, ps, other

    cs.IT

    A Trivial Observation related to Sparse Recovery

    Authors: Suvrit Sra

    Abstract: We make a trivial modification to the elegant analysis of Garg and Khandekar (\emph{Gradient Descent with Sparsification} ICML 2009) that replaces the standard Restricted Isometry Property (RIP), with another RIP-type property (which could be simpler than the RIP, but we are not sure; it could be as hard as the RIP to check, thereby rendering this little writeup totally worthless).

    Submitted 27 June, 2009; v1 submitted 26 June, 2009; originally announced June 2009.

    Comments: Replaces previous correct but useless version with another correct, but hopefully somewhat less useless version

  19. arXiv:0812.0389  [pdf, ps, other

    cs.DS cs.LG

    Approximation Algorithms for Bregman Co-clustering and Tensor Clustering

    Authors: Stefanie Jegelka, Suvrit Sra, Arindam Banerjee

    Abstract: In the past few years powerful generalizations to the Euclidean k-means problem have been made, such as Bregman clustering [7], co-clustering (i.e., simultaneous clustering of rows and columns of an input matrix) [9,18], and tensor clustering [8,34]. Like k-means, these more general problems also suffer from the NP-hardness of the associated optimization. Researchers have developed approximation… ▽ More

    Submitted 9 November, 2009; v1 submitted 1 December, 2008; originally announced December 2008.

    Comments: 18 pages; improved metric case

    Journal ref: short version in ALT 2009