Skip to main content

Showing 1–9 of 9 results for author: Salehi, F

Searching in archive stat. Search in all archives.
.
  1. arXiv:2210.15323  [pdf, other

    cs.LG math.AP math.OC stat.ML

    Stochastic Mirror Descent in Average Ensemble Models

    Authors: Taylan Kargin, Fariborz Salehi, Babak Hassibi

    Abstract: The stochastic mirror descent (SMD) algorithm is a general class of training algorithms, which includes the celebrated stochastic gradient descent (SGD), as a special case. It utilizes a mirror potential to influence the implicit bias of the training algorithm. In this paper we explore the performance of the SMD iterates on mean-field ensemble models. Our results generalize earlier ones obtained f… ▽ More

    Submitted 27 October, 2022; originally announced October 2022.

  2. arXiv:2010.15379  [pdf, ps, other

    stat.ML cs.LG

    The Performance Analysis of Generalized Margin Maximizer (GMM) on Separable Data

    Authors: Fariborz Salehi, Ehsan Abbasi, Babak Hassibi

    Abstract: Logistic models are commonly used for binary classification tasks. The success of such models has often been attributed to their connection to maximum-likelihood estimators. It has been shown that gradient descent algorithm, when applied on the logistic loss, converges to the max-margin classifier (a.k.a. hard-margin SVM). The performance of the max-margin classifier has been recently analyzed. In… ▽ More

    Submitted 29 October, 2020; originally announced October 2020.

    Comments: ICML 2020 (submitted February 2020)

  3. arXiv:2007.15378  [pdf, ps, other

    cs.LG stat.ML

    Generalization Comparison of Deep Neural Networks via Output Sensitivity

    Authors: Mahsa Forouzesh, Farnood Salehi, Patrick Thiran

    Abstract: Although recent works have brought some insights into the performance improvement of techniques used in state-of-the-art deep-learning models, more work is needed to understand their generalization properties. We shed light on this matter by linking the loss function to the output's sensitivity to its input. We find a rather strong empirical relation between the output sensitivity and the variance… ▽ More

    Submitted 30 July, 2020; originally announced July 2020.

  4. arXiv:1912.00131  [pdf, other

    cs.DC cs.CR cs.LG stat.ML

    Federated Learning with Autotuned Communication-Efficient Secure Aggregation

    Authors: Keith Bonawitz, Fariborz Salehi, Jakub Konečný, Brendan McMahan, Marco Gruteser

    Abstract: Federated Learning enables mobile devices to collaboratively learn a shared inference model while kee** all the training data on a user's device, decoupling the ability to do machine learning from the need to store the data in the cloud. Existing work on federated learning with limited communication demonstrates how random rotation can enable users' model updates to be quantized much more effici… ▽ More

    Submitted 29 November, 2019; originally announced December 2019.

    Comments: 5 pages, 3 figures. To appear at the IEEE Asilomar Conference on Signals, Systems, and Computers 2019

  5. arXiv:1911.00292  [pdf, other

    cs.LG stat.ML

    Learning Hawkes Processes from a Handful of Events

    Authors: Farnood Salehi, William Trouleau, Matthias Grossglauser, Patrick Thiran

    Abstract: Learning the causal-interaction network of multivariate Hawkes processes is a useful task in many applications. Maximum-likelihood estimation is the most common approach to solve the problem in the presence of long observation sequences. However, when only short sequences are available, the lack of data amplifies the risk of overfitting and regularization becomes critical. Due to the challenges of… ▽ More

    Submitted 1 November, 2019; originally announced November 2019.

    Comments: Appearing at NeurIPS 2019

  6. arXiv:1907.01068  [pdf, other

    stat.ML cs.AI cs.LG

    Augmenting and Tuning Knowledge Graph Embeddings

    Authors: Robert Bamler, Farnood Salehi, Stephan Mandt

    Abstract: Knowledge graph embeddings rank among the most successful methods for link prediction in knowledge graphs, i.e., the task of completing an incomplete collection of relational facts. A downside of these models is their strong sensitivity to model hyperparameters, in particular regularizers, which have to be extensively tuned to reach good performance [Kadlec et al., 2017]. We propose an efficient m… ▽ More

    Submitted 1 July, 2019; originally announced July 2019.

    Comments: Published version, Conference on Uncertainty in Artificial Intelligence (UAI 2019)

  7. arXiv:1906.03761  [pdf, other

    stat.ML cs.IT cs.LG math.PR

    The Impact of Regularization on High-dimensional Logistic Regression

    Authors: Fariborz Salehi, Ehsan Abbasi, Babak Hassibi

    Abstract: Logistic regression is commonly used for modeling dichotomous outcomes. In the classical setting, where the number of observations is much larger than the number of parameters, properties of the maximum likelihood estimator in logistic regression are well understood. Recently, Sur and Candes have studied logistic regression in the high-dimensional regime, where the number of observations and param… ▽ More

    Submitted 13 November, 2019; v1 submitted 9 June, 2019; originally announced June 2019.

    Journal ref: Proceedings of NeurIPS 2019

  8. arXiv:1712.03010  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Coordinate Descent with Bandit Sampling

    Authors: Farnood Salehi, Patrick Thiran, L. Elisa Celis

    Abstract: Coordinate descent methods usually minimize a cost function by updating a random decision variable (corresponding to one coordinate) at a time. Ideally, we would update the decision variable that yields the largest decrease in the cost function. However, finding this coordinate would require checking all of them, which would effectively negate the improvement in computational tractability that coo… ▽ More

    Submitted 4 December, 2018; v1 submitted 8 December, 2017; originally announced December 2017.

    Comments: appearing at NeurIPS 2018

  9. arXiv:1708.02544  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Stochastic Optimization with Bandit Sampling

    Authors: Farnood Salehi, L. Elisa Celis, Patrick Thiran

    Abstract: Many stochastic optimization algorithms work by estimating the gradient of the cost function on the fly by sampling datapoints uniformly at random from a training set. However, the estimator might have a large variance, which inadvertently slows down the convergence rate of the algorithms. One way to reduce this variance is to sample the datapoints from a carefully selected non-uniform distributio… ▽ More

    Submitted 9 August, 2017; v1 submitted 8 August, 2017; originally announced August 2017.