Skip to main content

Showing 1–6 of 6 results for author: Stamos, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2007.05732  [pdf, other

    cs.LG stat.ML

    Online Parameter-Free Learning of Multiple Low Variance Tasks

    Authors: Giulia Denevi, Dimitris Stamos, Massimiliano Pontil

    Abstract: We propose a method to learn a common bias vector for a growing sequence of low-variance tasks. Unlike state-of-the-art approaches, our method does not require tuning any hyper-parameter. Our approach is presented in the non-statistical setting and can be of two variants. The "aggressive" one updates the bias after each datapoint, the "lazy" one updates the bias only at the end of each task. We de… ▽ More

    Submitted 11 July, 2020; originally announced July 2020.

    Journal ref: Conference on Uncertainty in Artificial Intelligence (UAI) 2020

  2. arXiv:1903.00667  [pdf, ps, other

    cs.LG stat.ML

    Leveraging Low-Rank Relations Between Surrogate Tasks in Structured Prediction

    Authors: Giulia Luise, Dimitris Stamos, Massimiliano Pontil, Carlo Ciliberto

    Abstract: We study the interplay between surrogate methods for structured prediction and techniques from multitask learning designed to leverage relationships between surrogate outputs. We propose an efficient algorithm based on trace norm regularization which, differently from previous methods, does not require explicit knowledge of the coding/decoding functions of the surrogate framework. As a result, our… ▽ More

    Submitted 2 March, 2019; originally announced March 2019.

    Comments: 42 pages, 1 table

  3. arXiv:1803.08089  [pdf, ps, other

    stat.ML cs.LG

    Incremental Learning-to-Learn with Statistical Guarantees

    Authors: Giulia Denevi, Carlo Ciliberto, Dimitris Stamos, Massimiliano Pontil

    Abstract: In learning-to-learn the goal is to infer a learning algorithm that works well on a class of tasks sampled from an unknown meta distribution. In contrast to previous work on batch learning-to-learn, we consider a scenario where tasks are presented sequentially and the algorithm needs to adapt incrementally to improve its performance on future tasks. Key to this setting is for the algorithm to rapi… ▽ More

    Submitted 21 March, 2018; originally announced March 2018.

  4. arXiv:1706.08934  [pdf, ps, other

    cs.LG stat.ML

    Reexamining Low Rank Matrix Factorization for Trace Norm Regularization

    Authors: Carlo Ciliberto, Dimitris Stamos, Massimiliano Pontil

    Abstract: Trace norm regularization is a widely used approach for learning low rank matrices. A standard optimization strategy is based on formulating the problem as one of low rank matrix factorization which, however, leads to a non-convex problem. In practice this approach works well, and it is often computationally faster than standard convex solvers such as proximal gradient methods. Nevertheless, it is… ▽ More

    Submitted 31 July, 2017; v1 submitted 27 June, 2017; originally announced June 2017.

    Comments: 22 pages, 4 figures, 1 Table

  5. arXiv:1601.00449  [pdf, other

    cs.LG stat.ML

    Fitting Spectral Decay with the $k$-Support Norm

    Authors: Andrew M. McDonald, Massimiliano Pontil, Dimitris Stamos

    Abstract: The spectral $k$-support norm enjoys good estimation properties in low rank matrix learning problems, empirically outperforming the trace norm. Its unit ball is the convex hull of rank $k$ matrices with unit Frobenius norm. In this paper we generalize the norm to the spectral $(k,p)$-support norm, whose additional parameter $p$ can be used to tailor the norm to the decay of the spectrum of the und… ▽ More

    Submitted 4 January, 2016; originally announced January 2016.

  6. arXiv:1512.08204  [pdf, other

    cs.LG stat.ML

    New Perspectives on $k$-Support and Cluster Norms

    Authors: Andrew M. McDonald, Massimiliano Pontil, Dimitris Stamos

    Abstract: We study a regularizer which is defined as a parameterized infimum of quadratics, and which we call the box-norm. We show that the k-support norm, a regularizer proposed by [Argyriou et al, 2012] for sparse vector prediction problems, belongs to this family, and the box-norm can be generated as a perturbation of the former. We derive an improved algorithm to compute the proximity operator of the s… ▽ More

    Submitted 27 December, 2015; originally announced December 2015.