Showing 1–17 of 17 results for author: Lopes, H

Searching in archive stat.
  1. arXiv:2301.06459  [pdf, other]

    stat.ME

    Sparse Bayesian factor analysis when the number of factors is unknown

    Authors: Sylvia Frühwirth-Schnatter, Darjus Hosszejni, Hedibert Freitas Lopes

    Abstract: There has been increased research interest in the subfield of sparse Bayesian factor analysis with shrinkage priors, which achieve additional sparsity beyond the natural parsimony of factor models. In this spirit, we estimate the number of common factors in the highly implemented sparse latent factor model with spike-and-slab priors on the factor loadings matrix. Our framework leads to a natural…

    Submitted 16 January, 2023; originally announced January 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:1804.04231

    MSC Class: 62H25 (Primary) 62F15; 65C05 (Secondary)

  2. When it counts -- Econometric identification of the basic factor model based on GLT structures

    Authors: Sylvia Frühwirth-Schnatter, Darjus Hosszejni, Hedibert Freitas Lopes

    Abstract: Despite the popularity of factor models with sparse loading matrices, little attention has been given to formally address identifiability of these models beyond standard rotation-based identification such as the positive lower triangular (PLT) constraint. To fill this gap, we review the advantages of variance identification in sparse factor analysis and introduce the generalized lower triangular (…

    Submitted 16 January, 2023; originally announced January 2023.

    MSC Class: 62H25 (Primary) 15A24; 62F15 (Secondary)

  3. arXiv:2105.09512  [pdf, other]

    stat.CO cs.MS math.PR stat.AP

    Uncertainty quantification through Monte Carlo method in a cloud computing setting

    Authors: A. Cunha Jr, R. Nasser, R. Sampaio, H. Lopes, K. Breitman

    Abstract: The Monte Carlo (MC) method is the most common technique used for uncertainty quantification, due to its simplicity and good statistical results. However, its computational cost is extremely high, and, in many cases, prohibitive. Fortunately, the MC algorithm is easily parallelizable, which allows its use in simulations where the computation of a single realization is very costly. This work presen…

    Submitted 20 May, 2021; originally announced May 2021.

    MSC Class: 62D05

    ACM Class: G.3

    Journal ref: Computer Physics Communications, vol. 185, pp. 1355-1363, 2014

  4. arXiv:2009.14296  [pdf, other]

    stat.ME stat.CO stat.ML

    The Illusion of the Illusion of Sparsity: An exercise in prior sensitivity

    Authors: Bruno Fava, Hedibert F. Lopes

    Abstract: The emergence of Big Data raises the question of how to model economic relations when there is a large number of possible explanatory variables. We revisit the issue by comparing the possibility of using dense or sparse models in a Bayesian approach, allowing for variable selection and shrinkage. More specifically, we discuss the results reached by Giannone, Lenza, and Primiceri (2020) through a "…

    Submitted 29 September, 2020; originally announced September 2020.

    Comments: 33 pages, 11 figures

  5. arXiv:2009.14131  [pdf, other]

    stat.ME stat.CO stat.ML

    Dynamic sparsity on dynamic regression models

    Authors: Paloma W. Uribe, Hedibert F. Lopes

    Abstract: In the present work, we consider variable selection and shrinkage for the Gaussian dynamic linear regression within a Bayesian framework. In particular, we propose a novel method that allows for time-varying sparsity, based on an extension of spike-and-slab priors for dynamic models. This is done by assigning appropriate Markov switching priors for the time-varying coefficients' variances, extendi…

    Submitted 29 September, 2020; originally announced September 2020.

    Comments: 31 pages, 5 figures

  6. arXiv:2006.11908  [pdf, ps, other]

    stat.ME

    Decoupling Shrinkage and Selection in Gaussian Linear Factor Analysis

    Authors: Henrique Bolfarine, Carlos M. Carvalho, Hedibert F. Lopes, Jared S. Murray

    Abstract: Factor Analysis is a popular method for modeling dependence in multivariate data. However, determining the number of factors and obtaining a sparse orientation of the loadings are still major challenges. In this paper, we propose a decision-theoretic approach that brings to light the relation between a sparse representation of the loadings and factor dimension. This relation is done through a summ…

    Submitted 24 July, 2021; v1 submitted 21 June, 2020; originally announced June 2020.

    Comments: 22 pages, 7 figures

  7. arXiv:2003.05377  [pdf, other]

    cs.CL cs.IR cs.LG stat.ML

    Brazilian Lyrics-Based Music Genre Classification Using a BLSTM Network

    Authors: Raul de Araújo Lima, Rômulo César Costa de Sousa, Simone Diniz Junqueira Barbosa, Hélio Cortês Vieira Lopes

    Abstract: Organizing songs, albums, and artists into groups with shared similarity can be done with the help of genre labels. In this paper, we present a novel approach for automatically classifying musical genre in Brazilian music using only the song lyrics. This kind of classification remains a challenge in the field of Natural Language Processing. We construct a dataset of 138,368 Brazilian song lyrics distrib…

    Submitted 6 March, 2020; originally announced March 2020.

    Comments: 7 pages, 4 figures, 3 tables

    MSC Class: 68T50 (Primary); 68T05 (Secondary)

    ACM Class: I.2.7; I.2.6

  8. arXiv:1907.03155  [pdf, other]

    stat.ME

    Learning a latent pattern of heterogeneity in the innovation rates of a time series of counts

    Authors: Helton Graziadei, Hedibert F. Lopes, Paulo C. Marques F.

    Abstract: We develop a Bayesian hierarchical semiparametric model for phenomena related to time series of counts. The main feature of the model is its capability to learn a latent pattern of heterogeneity in the distribution of the process innovation rates, which are softly clustered through time with the help of a Dirichlet process placed at the top of the model hierarchy. The probabilistic forecasting cap…

    Submitted 6 July, 2019; originally announced July 2019.

  9. arXiv:1808.09507  [pdf, other]

    stat.ME

    Tree-Based Bayesian Treatment Effect Analysis

    Authors: Pedro Henrique Filipini dos Santos, Hedibert Freitas Lopes

    Abstract: The inclusion of the propensity score as a covariate in Bayesian regression trees for causal inference can reduce the bias in treatment effect estimations, which occurs due to the regularization-induced confounding phenomenon. This study advocates for the use of the propensity score by evaluating it under a full-Bayesian variable selection setting, and the use of Individual Conditional Expectation…

    Submitted 28 August, 2018; originally announced August 2018.

  10. arXiv:1806.05738  [pdf, other]

    stat.CO stat.ML

    Efficient sampling for Gaussian linear regression with arbitrary priors

    Authors: P. Richard Hahn, Jingyu He, Hedibert Lopes

    Abstract: This paper develops a slice sampler for Bayesian linear regression models with arbitrary priors. The new sampler has two advantages over current approaches. One, it is faster than many custom implementations that rely on auxiliary latent variables, if the number of regressors is large. Two, it can be used with any prior with a density function that can be evaluated up to a normalizing constant, ma…

    Submitted 14 June, 2018; originally announced June 2018.

  11. arXiv:1804.04231  [pdf, other]

    stat.ME

    Sparse Bayesian Factor Analysis when the Number of Factors is Unknown

    Authors: Sylvia Fruehwirth-Schnatter, Hedibert Freitas Lopes

    Abstract: Despite the popularity of sparse factor models, little attention has been given to formally address identifiability of these models beyond standard rotation-based identification such as the positive lower triangular constraint. To fill this gap, we provide a counting rule on the number of nonzero factor loadings that is sufficient for achieving uniqueness of the variance decomposition in the facto…

    Submitted 11 April, 2018; originally announced April 2018.

    Comments: 62 pages, 7 figures, 7 tables

  12. arXiv:1710.08901  [pdf]

    econ.EM stat.ML

    Calibration of Machine Learning Classifiers for Probability of Default Modelling

    Authors: Pedro G. Fonseca, Hugo D. Lopes

    Abstract: Binary classification is highly used in credit scoring in the estimation of probability of default. The validation of such predictive models is based both on rank ability, and also on calibration (i.e. how accurately the probabilities output by the model map to the observed probabilities). In this study we cover the current best practices regarding calibration for binary classification, and explor…

    Submitted 24 October, 2017; originally announced October 2017.

    Comments: Keywords: Binary classification, Probability of Default, Calibration, Credit Risk, Isotonic Regression, Platt Scaling

  13. arXiv:1602.08154  [pdf, other]

    stat.CO econ.EM stat.AP stat.ME

    Efficient Bayesian Inference for Multivariate Factor Stochastic Volatility Models

    Authors: Gregor Kastner, Sylvia Frühwirth-Schnatter, Hedibert Freitas Lopes

    Abstract: We discuss efficient Bayesian estimation of dynamic covariance matrices in multivariate time series through a factor stochastic volatility model. In particular, we propose two interweaving strategies (Yu and Meng, Journal of Computational and Graphical Statistics, 20(3), 531-570, 2011) to substantially accelerate convergence and mixing of standard MCMC approaches. Similar to marginal data augmenta…

    Submitted 19 July, 2017; v1 submitted 25 February, 2016; originally announced February 2016.

    Journal ref: Journal of Computational and Graphical Statistics 26(4), 905-917 (2017)

  14. arXiv:1602.08066  [pdf, other]

    stat.AP

    Scalable semiparametric inference for the means of heavy-tailed distributions

    Authors: Matt Taddy, Hedibert Freitas Lopes, Matt Gardner

    Abstract: Heavy tailed distributions present a tough setting for inference. They are also common in industrial applications, particularly with Internet transaction datasets, and machine learners often analyze such data without considering the biases and risks associated with the misuse of standard tools. This paper outlines a procedure for inference about the mean of a (possibly conditional) heavy tailed di…

    Submitted 13 October, 2016; v1 submitted 25 February, 2016; originally announced February 2016.

  15. arXiv:1408.0462  [pdf, other]

    stat.ME

    Shrinkage priors for linear instrumental variable models with many instruments

    Authors: P. Richard Hahn, Hedibert Lopes

    Abstract: This paper addresses the weak instruments problem in linear instrumental variable models from a Bayesian perspective. The new approach has two components. First, a novel predictor-dependent shrinkage prior is developed for the many instruments setting. The prior is constructed based on a factor model decomposition of the matrix of observed instruments, allowing many instruments to be incorporated…

    Submitted 3 August, 2014; originally announced August 2014.

    Comments: 27 pages, 6 figures, 3 tables

  16. Measuring the vulnerability of the Uruguayan population to vector-borne diseases via spatially hierarchical factor models

    Authors: Hedibert F. Lopes, Alexandra M. Schmidt, Esther Salazar, Mariana Gómez, Marcel Achkar

    Abstract: We propose a model-based vulnerability index of the population from Uruguay to vector-borne diseases. We have available measurements of a set of variables at the census tract level of the 19 Departmental capitals of Uruguay. In particular, we propose an index that combines different sources of information via a set of micro-environmental indicators and geographical location in the country. Our ind…

    Submitted 19 March, 2012; originally announced March 2012.

    Comments: Published in the Annals of Applied Statistics (http://www.imstat.org/aoas/) at http://dx.doi.org/10.1214/11-AOAS497 by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS497

    Journal ref: Annals of Applied Statistics 2012, Vol. 6, No. 1, 284-303

  17. Particle Learning and Smoothing

    Authors: Carlos M. Carvalho, Michael S. Johannes, Hedibert F. Lopes, Nicholas G. Polson

    Abstract: Particle learning (PL) provides state filtering, sequential parameter learning and smoothing in a general class of state space models. Our approach extends existing particle methods by incorporating the estimation of static parameters via a fully-adapted filter that utilizes conditional sufficient statistics for parameters and/or states as particles. State smoothing in the presence of parameter un…

    Submitted 4 November, 2010; originally announced November 2010.

    Comments: Published in Statistical Science (http://www.imstat.org/sts/) at http://dx.doi.org/10.1214/10-STS325 by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-STS-STS325

    Journal ref: Statistical Science 2010, Vol. 25, No. 1, 88-106