Showing 1–9 of 9 results for author: Stern, B

  1. arXiv:2402.07357  [pdf, other]

    stat.ML cs.LG

    Regression Trees for Fast and Adaptive Prediction Intervals

    Authors: Luben M. C. Cabezas, Mateus P. Otto, Rafael Izbicki, Rafael B. Stern

    Abstract: Predictive models make mistakes. Hence, there is a need to quantify the uncertainty associated with their predictions. Conformal inference has emerged as a powerful tool to create statistically valid prediction regions around point predictions, but its naive application to regression problems yields non-adaptive regions. New conformal scores, often relying upon quantile regressors or conditional d…

    Submitted 13 February, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

  2. arXiv:2303.09438  [pdf, other]

    cs.CL cs.SD eess.AS

    Trustera: A Live Conversation Redaction System

    Authors: Evandro Gouvêa, Ali Dadgar, Shahab Jalalvand, Rathi Chengalvarayan, Badrinath Jayakumar, Ryan Price, Nicholas Ruiz, Jennifer McGovern, Srinivas Bangalore, Ben Stern

    Abstract: Trustera is the first functional system that redacts personally identifiable information (PII) in real-time spoken conversations to remove agents' need to hear sensitive information while preserving the naturalness of live customer-agent conversations. As opposed to post-call redaction, audio masking starts as soon as the customer begins speaking to a PII entity. This significantly reduces the risk…

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: 5

  3. arXiv:2301.09671  [pdf, other]

    stat.ME cs.LG stat.ML

    Flexible conditional density estimation for time series

    Authors: Gustavo Grivol, Rafael Izbicki, Alex A. Okuno, Rafael B. Stern

    Abstract: This paper introduces FlexCodeTS, a new conditional density estimator for time series. FlexCodeTS is a flexible nonparametric conditional density estimator, which can be based on an arbitrary regression method. It is shown that FlexCodeTS inherits the rate of convergence of the chosen regression method. Hence, FlexCodeTS can adapt its convergence by employing the regression method that best fits t…

    Submitted 23 January, 2023; originally announced January 2023.

    Comments: 19 pages, 7 figures

    MSC Class: 00-01; 99-00

  4. arXiv:2112.01372  [pdf, other]

    stat.ME cs.LG

    Hierarchical clustering: visualization, feature importance and model selection

    Authors: Luben M. C. Cabezas, Rafael Izbicki, Rafael B. Stern

    Abstract: We propose methods for the analysis of hierarchical clustering that fully use the multi-resolution structure provided by a dendrogram. Specifically, we propose a loss for choosing between clustering methods, a feature importance score and a graphical tool for visualizing the segmentation of features in a dendrogram. Current approaches to these tasks lead to loss of information since they require t…

    Submitted 27 January, 2023; v1 submitted 30 November, 2021; originally announced December 2021.

    Comments: 29 pages, 9 figures, 3 tables

    ACM Class: I.5.3

  5. arXiv:2007.12778  [pdf, other]

    stat.ML cs.LG stat.ME

    CD-split and HPD-split: efficient conformal regions in high dimensions

    Authors: Rafael Izbicki, Gilson Shimizu, Rafael B. Stern

    Abstract: Conformal methods create prediction bands that control average coverage assuming solely i.i.d. data. Although the literature has mostly focused on prediction intervals, more general regions can often better represent uncertainty. For instance, a bimodal target is better represented by the union of two intervals. Such prediction regions are obtained by CD-split, which combines the split method and…

    Submitted 4 October, 2021; v1 submitted 24 July, 2020; originally announced July 2020.

    Comments: 34 pages, 15 figures

    MSC Class: 62G15

  6. arXiv:1910.05575  [pdf, other]

    stat.ME cs.LG stat.ML

    Flexible distribution-free conditional predictive bands using density estimators

    Authors: Rafael Izbicki, Gilson T. Shimizu, Rafael B. Stern

    Abstract: Conformal methods create prediction bands that control average coverage under no assumptions besides i.i.d. data. Besides average coverage, one might also desire to control conditional coverage, that is, coverage for every new testing point. However, without strong assumptions, conditional coverage is unachievable. Given this limitation, the literature has focused on methods with asymptotical cond…

    Submitted 9 December, 2019; v1 submitted 12 October, 2019; originally announced October 2019.

  7. arXiv:1908.00105  [pdf, other]

    stat.ML cs.LG

    Conditional independence testing: a predictive perspective

    Authors: Marco Henrique de Almeida Inácio, Rafael Izbicki, Rafael Bassi Stern

    Abstract: Conditional independence testing is a key problem required by many machine learning and statistics tools. In particular, it is one way of evaluating the usefulness of some features on a supervised prediction problem. We propose a novel conditional independence test in a predictive setting, and show that it achieves better power than competing approaches in several settings. Our approach consists i…

    Submitted 31 July, 2019; originally announced August 2019.

  8. arXiv:1807.03929  [pdf, other]

    stat.ML cs.LG

    Quantification under prior probability shift: the ratio estimator and its extensions

    Authors: Afonso Fernandes Vaz, Rafael Izbicki, Rafael Bassi Stern

    Abstract: The quantification problem consists of determining the prevalence of a given label in a target population. However, one often has access to the labels in a sample from the training population but not in the target population. A common assumption in this situation is that of prior probability shift, that is, once the labels are known, the distribution of the features is the same in the training and…

    Submitted 5 April, 2019; v1 submitted 10 July, 2018; originally announced July 2018.

    Comments: 33 pages, 15 figures

    MSC Class: 62F12; 62G05; 62G08

  9. arXiv:1405.3292  [pdf, other]

    stat.ME cs.LG

    Learning with many experts: model selection and sparsity

    Authors: Rafael Izbicki, Rafael Bassi Stern

    Abstract: Experts classifying data are often imprecise. Recently, several models have been proposed to train classifiers using the noisy labels generated by these experts. How to choose between these models? In such situations, the true labels are unavailable. Thus, one cannot perform model selection using the standard versions of methods such as empirical risk minimization and cross validation. In order to…

    Submitted 13 May, 2014; originally announced May 2014.

    Comments: This is the pre-peer reviewed version

    Journal ref: Izbicki, R., Stern, R. B. "Learning with many experts: Model selection and sparsity." Statistical Analysis and Data Mining 6.6 (2013): 565-577