Skip to main content

Showing 1–14 of 14 results for author: Bento, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2206.01332  [pdf, other

    stat.ML cs.AI cs.LG

    Optimal Activation Functions for the Random Features Regression Model

    Authors: Jianxin Wang, José Bento

    Abstract: The asymptotic mean squared test error and sensitivity of the Random Features Regression model (RFR) have been recently studied. We build on this work and identify in closed-form the family of Activation Functions (AFs) that minimize a combination of the test error and sensitivity of the RFR under different notions of functional parsimony. We find scenarios under which the optimal AFs are linear,… ▽ More

    Submitted 24 March, 2023; v1 submitted 31 May, 2022; originally announced June 2022.

  2. arXiv:2009.02604  [pdf, other

    math.OC cs.LG stat.ML

    Distributed Optimization, Averaging via ADMM, and Network Topology

    Authors: Guilherme França, José Bento

    Abstract: There has been an increasing necessity for scalable optimization methods, especially due to the explosion in the size of datasets and model complexity in modern machine learning applications. Scalable solvers often distribute the computation over a network of processing units. For simple algorithms such as gradient descent the dependency of the convergence time with the topology of this network is… ▽ More

    Submitted 5 September, 2020; originally announced September 2020.

    Comments: to appear in "Proceedings of the IEEE"

  3. arXiv:2001.11114  [pdf, other

    cs.LG cs.DM math.FA stat.ML

    A Family of Pairwise Multi-Marginal Optimal Transports that Define a Generalized Metric

    Authors: Liang Mi, Azadeh Sheikholeslami, José Bento

    Abstract: The Optimal transport (OT) problem is rapidly finding its way into machine learning. Favoring its use are its metric properties. Many problems admit solutions with guarantees only for objects embedded in metric spaces, and the use of non-metrics can complicate solving them. Multi-marginal OT (MMOT) generalizes OT to simultaneously transporting multiple distributions. It captures important relation… ▽ More

    Submitted 22 December, 2022; v1 submitted 29 January, 2020; originally announced January 2020.

    Comments: Machine Learning (2022)

  4. arXiv:1801.04987  [pdf, other

    cs.LG cs.CC math.OC stat.ML

    On the Complexity of the Weighted Fused Lasso

    Authors: Jose Bento, Ralph Furmaniak, Surjyendu Ray

    Abstract: The solution path of the 1D fused lasso for an $n$-dimensional input is piecewise linear with $\mathcal{O}(n)$ segments (Hoefling et al. 2010 and Tibshirani et al 2011). However, existing proofs of this bound do not hold for the weighted fused lasso. At the same time, results for the generalized lasso, of which the weighted fused lasso is a special case, allow $Ω(3^n)$ segments (Mairal et al. 2012… ▽ More

    Submitted 19 April, 2018; v1 submitted 15 January, 2018; originally announced January 2018.

  5. arXiv:1710.00889  [pdf, other

    stat.ML math.OC

    How is Distributed ADMM Affected by Network Topology?

    Authors: Guilherme França, José Bento

    Abstract: When solving consensus optimization problems over a graph, there is often an explicit characterization of the convergence rate of Gradient Descent (GD) using the spectrum of the graph Laplacian. The same type of problems under the Alternating Direction Method of Multipliers (ADMM) are, however, poorly understood. For instance, simple but important non-strongly-convex consensus problems have not ye… ▽ More

    Submitted 2 October, 2017; originally announced October 2017.

  6. arXiv:1703.03863  [pdf, other

    stat.ML math.DS math.OC

    Tuning Over-Relaxed ADMM

    Authors: Guilherme França, José Bento

    Abstract: The framework of Integral Quadratic Constraints (IQC) reduces the computation of upper bounds on the convergence rate of several optimization algorithms to a semi-definite program (SDP). In the case of over-relaxed Alternating Direction Method of Multipliers (ADMM), an explicit and closed form solution to this SDP was derived in our recent work [1]. The purpose of this paper is twofold. First, we… ▽ More

    Submitted 5 March, 2018; v1 submitted 10 March, 2017; originally announced March 2017.

    Comments: NIPS 2016, Optimizing the Optimizer Workshop

  7. arXiv:1703.03859  [pdf, other

    stat.ML cs.DS cs.IT cs.LG math.OC

    Markov Chain Lifting and Distributed ADMM

    Authors: Guilherme França, José Bento

    Abstract: The time to converge to the steady state of a finite Markov chain can be greatly reduced by a lifting operation, which creates a new Markov chain on an expanded state space. For a class of quadratic objectives, we show an analogous behavior where a distributed ADMM algorithm can be seen as a lifting of Gradient Descent algorithm. This provides a deep insight for its faster convergence rate under o… ▽ More

    Submitted 10 March, 2017; originally announced March 2017.

    Comments: This work was also selected for a talk at NIPS 2016, Optimization for Machine Learning Workshop (OPT 2016)

    Journal ref: IEEE Signal Processing Letters (Volume: 24, Issue: 3, March 2017)

  8. arXiv:1702.07956  [pdf, ps, other

    cs.LG stat.ML

    Generative Adversarial Active Learning

    Authors: Jia-Jie Zhu, José Bento

    Abstract: We propose a new active learning by query synthesis approach using Generative Adversarial Networks (GAN). Different from regular active learning, the resulting algorithm adaptively synthesizes training instances for querying to increase learning speed. We generate queries according to the uncertainty principle, but our idea can work with other active learning principles. We report results from var… ▽ More

    Submitted 15 November, 2017; v1 submitted 25 February, 2017; originally announced February 2017.

  9. An Explicit Rate Bound for the Over-Relaxed ADMM

    Authors: Guilherme França, José Bento

    Abstract: The framework of Integral Quadratic Constraints of Lessard et al. (2014) reduces the computation of upper bounds on the convergence rate of several optimization algorithms to semi-definite programming (SDP). Followup work by Nishihara et al. (2015) applies this technique to the entire family of over-relaxed Alternating Direction Method of Multipliers (ADMM). Unfortunately, they only provide an exp… ▽ More

    Submitted 5 March, 2018; v1 submitted 7 December, 2015; originally announced December 2015.

    Comments: IEEE International Symposium on Information Theory (ISIT), 2016

  10. arXiv:1505.02867  [pdf, other

    cs.LG cs.DS cs.IR stat.ML

    The Boundary Forest Algorithm for Online Supervised and Unsupervised Learning

    Authors: Charles Mathy, Nate Derbinsky, José Bento, Jonathan Rosenthal, Jonathan Yedidia

    Abstract: We describe a new instance-based learning algorithm called the Boundary Forest (BF) algorithm, that can be used for supervised and unsupervised learning. The algorithm builds a forest of trees whose nodes store previously seen examples. It can be shown data points one at a time and updates itself incrementally, hence it is naturally online. Few instance-based algorithms have this property while be… ▽ More

    Submitted 11 May, 2015; originally announced May 2015.

    Comments: 7 pages, 4 figs, 1 page supp. info

    Journal ref: Proc. of the 29th AAAI Conference on Artificial Intelligence (AAAI), 2864-2870. Austin, TX, USA. (2015)

  11. arXiv:1207.6379  [pdf, ps, other

    cs.IR cs.LG stat.ML

    Identifying Users From Their Rating Patterns

    Authors: José Bento, Nadia Fawaz, Andrea Montanari, Stratis Ioannidis

    Abstract: This paper reports on our analysis of the 2011 CAMRa Challenge dataset (Track 2) for context-aware movie recommendation systems. The train dataset comprises 4,536,891 ratings provided by 171,670 users on 23,974$ movies, as well as the household grou**s of a subset of the users. The test dataset comprises 5,450 ratings for which the user label is missing, but the household label is provided. The… ▽ More

    Submitted 26 July, 2012; originally announced July 2012.

    Comments: Winner of the 2011 Challenge on Context-Aware Movie Recommendation (RecSys 2011 - CAMRa2011)

  12. arXiv:1110.1769  [pdf, ps, other

    stat.ML cs.LG physics.data-an

    On the trade-off between complexity and correlation decay in structural learning algorithms

    Authors: José Bento, Andrea Montanari

    Abstract: We consider the problem of learning the structure of Ising models (pairwise binary Markov random fields) from i.i.d. samples. While several methods have been proposed to accomplish this task, their relative merits and limitations remain somewhat obscure. By analyzing a number of concrete examples, we show that low-complexity algorithms often fail when the Markov random field develops long-range co… ▽ More

    Submitted 8 October, 2011; originally announced October 2011.

  13. arXiv:1103.1689  [pdf, other

    cs.IT cs.LG math.ST q-fin.ST stat.ML

    Information Theoretic Limits on Learning Stochastic Differential Equations

    Authors: José Bento, Morteza Ibrahimi, Andrea Montanari

    Abstract: Consider the problem of learning the drift coefficient of a stochastic differential equation from a sample path. In this paper, we assume that the drift is parametrized by a high dimensional vector. We address the question of how long the system needs to be observed in order to learn this vector of parameters. We prove a general lower bound on this time complexity by using a characterization of mu… ▽ More

    Submitted 8 March, 2011; originally announced March 2011.

    Comments: 6 pages, 2 figures, conference version

  14. arXiv:0910.5761  [pdf, ps, other

    stat.ML cond-mat.stat-mech cs.LG

    Which graphical models are difficult to learn?

    Authors: Jose Bento, Andrea Montanari

    Abstract: We consider the problem of learning the structure of Ising models (pairwise binary Markov random fields) from i.i.d. samples. While several methods have been proposed to accomplish this task, their relative merits and limitations remain somewhat obscure. By analyzing a number of concrete examples, we show that low-complexity algorithms systematically fail when the Markov random field develops lo… ▽ More

    Submitted 29 October, 2009; originally announced October 2009.