Showing 1–16 of 16 results for author: Cortes, C

Searching in archive stat.

  1. arXiv:2407.07140  [pdf, other]

    cs.LG stat.ML

    Cardinality-Aware Set Prediction and Top-$k$ Classification

    Authors: Corinna Cortes, Anqi Mao, Christopher Mohri, Mehryar Mohri, Yutao Zhong

    Abstract: We present a detailed study of cardinality-aware top-$k$ classification, a novel approach that aims to learn an accurate top-$k$ set predictor while maintaining a low cardinality. We introduce a new target loss function tailored to this setting that accounts for both the classification error and the cardinality of the set predicted. To optimize this loss function, we propose two families of surrog…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2403.19625

  2. arXiv:2306.08838  [pdf, other]

    cs.LG cs.CR stat.ML

    Differentially Private Domain Adaptation with Theoretical Guarantees

    Authors: Raef Bassily, Corinna Cortes, Anqi Mao, Mehryar Mohri

    Abstract: In many applications, the labeled data at the learner's disposal is subject to privacy constraints and is relatively limited. To derive a more accurate predictor for the target domain, it is often beneficial to leverage publicly available labeled data from an alternative domain, somewhat close to the target domain. This is the modern problem of supervised domain adaptation from a public source to…

    Submitted 4 February, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

  3. arXiv:2305.05816  [pdf, other]

    cs.LG stat.ML

    Best-Effort Adaptation

    Authors: Pranjal Awasthi, Corinna Cortes, Mehryar Mohri

    Abstract: We study a problem of best-effort adaptation motivated by several applications and considerations, which consists of determining an accurate predictor for a target domain, for which a moderate amount of labeled samples are available, while leveraging information from another domain for which substantially more labeled samples are at one's disposal. We present a new and general discrepancy-based th…

    Submitted 9 May, 2023; originally announced May 2023.

  4. arXiv:2205.12004  [pdf, other]

    quant-ph cs.AI cs.LG stat.ML

    Quantum Kerr Learning

    Authors: Junyu Liu, Changchun Zhong, Matthew Otten, Anirban Chandra, Cristian L. Cortes, Chaoyang Ti, Stephen K Gray, Xu Han

    Abstract: Quantum machine learning is a rapidly evolving field of research that could facilitate important applications for quantum computing and also significantly impact data-driven sciences. In our work, based on various arguments from complexity theory and physics, we demonstrate that a single Kerr mode can provide some "quantum enhancements" when dealing with kernel-based methods. Using kernel properti…

    Submitted 30 November, 2022; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: 20 pages, many figures. v2: significant updates, author added

    Journal ref: Mach. Learn.: Sci. Technol. 4 025003, 2023

  5. arXiv:2008.11036  [pdf, other]

    cs.LG stat.ML

    A Discriminative Technique for Multiple-Source Adaptation

    Authors: Corinna Cortes, Mehryar Mohri, Ananda Theertha Suresh, Ningshan Zhang

    Abstract: We present a new discriminative technique for the multiple-source adaptation, MSA, problem. Unlike previous work, which relies on density estimation for each source domain, our solution only requires conditional probabilities that can easily be accurately estimated from unlabeled data from the source domains. We give a detailed analysis of our new technique, including general guarantees based on R…

    Submitted 12 February, 2021; v1 submitted 25 August, 2020; originally announced August 2020.

  6. arXiv:2008.09490  [pdf, other]

    cs.LG stat.ML

    Beyond Individual and Group Fairness

    Authors: Pranjal Awasthi, Corinna Cortes, Yishay Mansour, Mehryar Mohri

    Abstract: We present a new data-driven model of fairness that, unlike existing static definitions of individual or group fairness, is guided by the unfairness complaints received by the system. Our model supports multiple fairness criteria and takes into account their potential incompatibilities. We consider both a stochastic and an adversarial setting of our model. In the stochastic setting, we show that ou…

    Submitted 21 August, 2020; originally announced August 2020.

  7. arXiv:2006.14950  [pdf, other]

    cs.LG stat.ML

    Relative Deviation Margin Bounds

    Authors: Corinna Cortes, Mehryar Mohri, Ananda Theertha Suresh

    Abstract: We present a series of new and more favorable margin-based learning guarantees that depend on the empirical margin loss of a predictor. We give two types of learning bounds, both distribution-dependent and valid for general families, in terms of the Rademacher complexity or the empirical $\ell_\infty$ covering number of the hypothesis set used. Furthermore, using our relative deviation margin boun…

    Submitted 28 October, 2020; v1 submitted 26 June, 2020; originally announced June 2020.

    Comments: 29 pages

  8. arXiv:2002.07348  [pdf, other]

    cs.LG stat.ML

    Adaptive Region-Based Active Learning

    Authors: Corinna Cortes, Giulia DeSalvo, Claudio Gentile, Mehryar Mohri, Ningshan Zhang

    Abstract: We present a new active learning algorithm that adaptively partitions the input space into a finite number of regions, and subsequently seeks a distinct predictor for each region, both phases actively requesting labels. We prove theoretical guarantees for both the generalization error and the label complexity of our algorithm, and analyze the number of regions defined by the algorithm under some m…

    Submitted 17 February, 2020; originally announced February 2020.

  9. arXiv:1910.08965  [pdf, other]

    cs.LG stat.ML

    Learning GANs and Ensembles Using Discrepancy

    Authors: Ben Adlam, Corinna Cortes, Mehryar Mohri, Ningshan Zhang

    Abstract: Generative adversarial networks (GANs) generate data based on minimizing a divergence between two distributions. The choice of that divergence is therefore critical. We argue that the divergence must take into account the hypothesis set and the loss function used in a subsequent learning task, where the data generated by a GAN serves for training. Taking that structural information into account is…

    Submitted 5 November, 2019; v1 submitted 20 October, 2019; originally announced October 2019.

  10. arXiv:1905.00080  [pdf, other]

    cs.LG stat.ML

    AdaNet: A Scalable and Flexible Framework for Automatically Learning Ensembles

    Authors: Charles Weill, Javier Gonzalvo, Vitaly Kuznetsov, Scott Yang, Scott Yak, Hanna Mazzawi, Eugen Hotaj, Ghassen Jerfel, Vladimir Macko, Ben Adlam, Mehryar Mohri, Corinna Cortes

    Abstract: AdaNet is a lightweight TensorFlow-based (Abadi et al., 2015) framework for automatically learning high-quality ensembles with minimal expert intervention. Our framework is inspired by the AdaNet algorithm (Cortes et al., 2017) which learns the structure of a neural network as an ensemble of subnetworks. We designed it to: (1) integrate with the existing TensorFlow ecosystem, (2) offer sensible de…

    Submitted 30 April, 2019; originally announced May 2019.

  11. arXiv:1804.06518  [pdf, ps, other]

    cs.LG stat.ML

    Online Non-Additive Path Learning under Full and Partial Information

    Authors: Corinna Cortes, Vitaly Kuznetsov, Mehryar Mohri, Holakou Rahmanian, Manfred K. Warmuth

    Abstract: We study the problem of online path learning with non-additive gains, which is a central problem appearing in several applications, including ensemble structured prediction. We present new online algorithms for path learning with non-additive count-based gains for the three settings of full information, semi-bandit and full bandit with very favorable regret guarantees. A key component of our algor…

    Submitted 18 March, 2019; v1 submitted 17 April, 2018; originally announced April 2018.

  12. arXiv:1705.08921  [pdf, other]

    stat.ML cs.LG

    Consistent Kernel Density Estimation with Non-Vanishing Bandwidth

    Authors: Efrén Cruz Cortés, Clayton Scott

    Abstract: Consistency of the kernel density estimator requires that the kernel bandwidth tends to zero as the sample size grows. In this paper we investigate the question of whether consistency is possible when the bandwidth is fixed, if we consider a more general class of weighted KDEs. To answer this question in the affirmative, we introduce the fixed-bandwidth KDE (fbKDE), obtained by solving a quadratic…

    Submitted 29 May, 2017; v1 submitted 24 May, 2017; originally announced May 2017.

    Comments: 17 pages, updated abstract

  13. arXiv:1605.06443  [pdf, other]

    stat.ML cs.LG

    Structured Prediction Theory Based on Factor Graph Complexity

    Authors: Corinna Cortes, Mehryar Mohri, Vitaly Kuznetsov, Scott Yang

    Abstract: We present a general theoretical analysis of structured prediction with a series of new results. We give new data-dependent margin guarantees for structured prediction for a very wide family of loss functions and a general family of hypotheses, with an arbitrary factor graph decomposition. These are the tightest margin bounds known for both standard multi-class and general structured prediction pr…

    Submitted 1 December, 2016; v1 submitted 20 May, 2016; originally announced May 2016.

  14. arXiv:1503.00323  [pdf, other]

    stat.ML cs.LG

    Sparse Approximation of a Kernel Mean

    Authors: E. Cruz Cortés, C. Scott

    Abstract: Kernel means are frequently used to represent probability distributions in machine learning problems. In particular, the well known kernel density estimator and the kernel mean embedding both have the form of a kernel mean. Unfortunately, kernel means are faced with scalability issues. A single point evaluation of the kernel density estimator, for example, requires a computation time linear in the…

    Submitted 1 March, 2015; originally announced March 2015.

  15. arXiv:1205.2653  [pdf]

    cs.LG stat.ML

    L2 Regularization for Learning Kernels

    Authors: Corinna Cortes, Mehryar Mohri, Afshin Rostamizadeh

    Abstract: The choice of the kernel is critical to the success of many learning algorithms but it is typically left to the user. Instead, the training data can be used to learn the kernel by selecting it out of a given family, such as that of non-negative linear combinations of p base kernels, constrained by a trace or L1 regularization. This paper studies the problem of learning kernels with the same family…

    Submitted 9 May, 2012; originally announced May 2012.

    Comments: Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009)

    Report number: UAI-P-2009-PG-109-116

  16. arXiv:1202.3712  [pdf]

    cs.LG stat.ML

    Ensembles of Kernel Predictors

    Authors: Corinna Cortes, Mehryar Mohri, Afshin Rostamizadeh

    Abstract: This paper examines the problem of learning with a finite and possibly large set of p base kernels. It presents a theoretical and empirical analysis of an approach addressing this problem based on ensembles of kernel predictors. This includes novel theoretical guarantees based on the Rademacher complexity of the corresponding hypothesis sets, the introduction and analysis of a learning algorithm b…

    Submitted 14 February, 2012; originally announced February 2012.

    Report number: UAI-P-2011-PG-145-152