Showing 1–50 of 64 results for author: Sugasawa, S

Searching in archive stat.
  1. arXiv:2407.01883  [pdf, other]

    stat.ME

    Robust Linear Mixed Models using Hierarchical Gamma-Divergence

    Authors: Shonosuke Sugasawa, Francis K. C. Hui, Alan H. Welsh

    Abstract: Linear mixed models (LMMs), which typically assume normality for both the random effects and error terms, are a popular class of methods for analyzing longitudinal and clustered data. However, such models can be sensitive to outliers, and this can lead to poor statistical results (e.g., biased inference on model parameters and inaccurate prediction of random effects) if the data are contaminated.… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 30 pages (main) + 6 pages (supplement)

  2. arXiv:2404.07586  [pdf, other]

    stat.AP stat.ME

    State-Space Modeling of Shape-constrained Functional Time Series

    Authors: Daichi Hiraki, Yasuyuki Hamura, Kaoru Irie, Shonosuke Sugasawa

    Abstract: Functional time series data frequently appears in economic applications, where the functions of interest are subject to some shape constraints, including monotonicity and convexity, as typical of the estimation of the Lorenz curve. This paper proposes a state-space model for time-varying functions to extract trends and serial dependence from functional time series while imposing the shape constrai… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 34 pages, 7 figures, 6 tables

  3. arXiv:2401.12776  [pdf]

    stat.ME

    Sub-model aggregation for scalable eigenvector spatial filtering: Application to spatially varying coefficient modeling

    Authors: Daisuke Murakami, Shonosuke Sugasawa, Hajime Seya, Daniel A. Griffith

    Abstract: This study proposes a method for aggregating/synthesizing global and local sub-models for fast and flexible spatial regression modeling. Eigenvector spatial filtering (ESF) was used to model spatially varying coefficients and spatial dependence in the residuals by sub-model, while the generalized product-of-experts method was used to aggregate these sub-models. The major advantages of the proposed… ▽ More

    Submitted 24 January, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  4. arXiv:2312.12710  [pdf, other]

    stat.ME

    Semiparametric Copula Estimation for Spatially Correlated Multivariate Mixed Outcomes: Analyzing Visual Sightings of Fin Whales from Line Transect Survey

    Authors: Tomotaka Momozaki, Tomoyuki Nakagawa, Shonosuke Sugasawa, Hiroko Kato Solvang

    Abstract: Multivariate data having both continuous and discrete variables is known as mixed outcomes and has widely appeared in a variety of fields such as ecology, epidemiology, and climatology. In order to understand the probability structure of multivariate data, the estimation of the dependence structure among mixed outcomes is very important. However, when location information is equipped with multivar… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 23 pages, 5 figures

    MSC Class: 62F15 (Primary); 62H11 (Secondary)

  5. arXiv:2309.01404  [pdf, other]

    stat.ME

    Hierarchical Regression Discontinuity Design: Pursuing Subgroup Treatment Effects

    Authors: Shonosuke Sugasawa, Takuya Ishihara, Daisuke Kurisu

    Abstract: Regression discontinuity design (RDD) is widely adopted for causal inference under intervention determined by a continuous variable. While one is interested in treatment effect heterogeneity by subgroups in many applications, RDD typically suffers from small subgroup-wise sample sizes, which makes the estimation results highly instable. To solve this issue, we introduce hierarchical RDD (HRDD), a… ▽ More

    Submitted 19 June, 2024; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: 24 pages

  6. arXiv:2308.11081  [pdf, other]

    stat.ME stat.AP

    An Unbiased Predictor for Skewed Response Variable with Measurement Error in Covariate

    Authors: Sepideh Mosaferi, Malay Ghosh, Shonosuke Sugasawa

    Abstract: We introduce a new small area predictor when the Fay-Herriot normal error model is fitted to a logarithmically transformed response variable, and the covariate is measured with error. This framework has been previously studied by Mosaferi et al. (2023). The empirical predictor given in their manuscript cannot perform uniformly better than the direct estimator. Our proposed predictor in this manusc… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

  7. arXiv:2308.06134  [pdf, other]

    stat.AP

    Predicting COVID-19 hospitalisation using a mixture of Bayesian predictive syntheses

    Authors: Genya Kobayashi, Shonosuke Sugasawa, Yuki Kawakubo, Dongu Han, Taeryon Choi

    Abstract: This paper proposes a novel methodology called the mixture of Bayesian predictive syntheses (MBPS) for multiple time series count data for the challenging task of predicting the numbers of COVID-19 inpatients and isolated cases in Japan and Korea at the subnational-level. MBPS combines a set of predictive models and partitions the multiple time series into clusters based on their contribution to p… ▽ More

    Submitted 19 March, 2024; v1 submitted 11 August, 2023; originally announced August 2023.

  8. arXiv:2308.01704  [pdf, other]

    stat.ME stat.AP

    Similarity-based Random Partition Distribution for Clustering Functional Data

    Authors: Tomoya Wakayama, Shonosuke Sugasawa, Genya Kobayashi

    Abstract: Random partition distribution is a crucial tool for model-based clustering. This study advances the field of random partition in the context of functional spatial data, focusing on the challenges posed by hourly population data across various regions and dates. We propose an extended generalized Dirichlet process, named the similarity-based generalized Dirichlet process (SGDP), to address the limi… ▽ More

    Submitted 22 June, 2024; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: 27 pages

  9. arXiv:2304.07726  [pdf, other]

    stat.ME

    Bayesian Causal Synthesis for Meta-Inference on Heterogeneous Treatment Effects

    Authors: Shonosuke Sugasawa, Kosaku Takanashi, Kenichiro McAlinn, Edoardo M. Airoldi

    Abstract: The estimation of heterogeneous treatment effects in the potential outcome setting is biased when there exists model misspecification or unobserved confounding. As these biases are unobservable, what model to use when remains a critical open question. In this paper, we propose a novel Bayesian methodology to mitigate misspecification and improve estimation via a synthesis of multiple causal estima… ▽ More

    Submitted 8 May, 2024; v1 submitted 16 April, 2023; originally announced April 2023.

    Comments: 30 pages (Main document) + 14 pages (Supplement)

  10. arXiv:2303.00281  [pdf, other]

    stat.ME math.ST

    Posterior Robustness with Milder Conditions: Contamination Models Revisited

    Authors: Yasuyuki Hamura, Kaoru Irie, Shonosuke Sugasawa

    Abstract: Robust Bayesian linear regression is a classical but essential statistical tool. Although novel robustness properties of posterior distributions have been proved recently under a certain class of error distributions, their sufficient conditions are restrictive and exclude several important situations. In this work, we revisit a classical two-component mixture model for response variables, also kno… ▽ More

    Submitted 3 April, 2024; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: 19 pages, 1 figure

  11. Gibbs Sampler for Matrix Generalized Inverse Gaussian Distributions

    Authors: Yasuyuki Hamura, Kaoru Irie, Shonosuke Sugasawa

    Abstract: Sampling from matrix generalized inverse Gaussian (MGIG) distributions is required in Markov Chain Monte Carlo (MCMC) algorithms for a variety of statistical models. However, an efficient sampling scheme for the MGIG distributions has not been fully developed. We here propose a novel blocked Gibbs sampler for the MGIG distributions, based on the Choleski decomposition. We show that the full condit… ▽ More

    Submitted 19 February, 2023; originally announced February 2023.

    Comments: 34 pages, 5 figures

  12. arXiv:2302.04412  [pdf, other]

    stat.AP stat.ME

    Spatiotemporal factor models for functional data with application to population map forecast

    Authors: Tomoya Wakayama, Shonosuke Sugasawa

    Abstract: The proliferation of mobile devices has led to the collection of large amounts of population data. This situation has prompted the need to utilize this rich, multidimensional data in practical applications. In response to this trend, we have integrated functional data analysis (FDA) and factor analysis to address the challenge of predicting hourly population changes across various districts in Tok… ▽ More

    Submitted 6 June, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

  13. arXiv:2212.00984  [pdf, other]

    stat.ME

    Fully Data-driven Normalized and Exponentiated Kernel Density Estimator with Hyvärinen Score

    Authors: Shunsuke Imai, Takuya Koriyama, Shouto Yonekura, Shonosuke Sugasawa, Yoshihiko Nishiyama

    Abstract: We introduce a new deal of kernel density estimation using an exponentiated form of kernel density estimators. The density estimator has two hyperparameters flexibly controlling the smoothness of the resulting density. We tune them in a data-driven manner by minimizing an objective function based on the Hyvärinen score to avoid the optimization involving the intractable normalizing constant due to… ▽ More

    Submitted 13 February, 2024; v1 submitted 2 December, 2022; originally announced December 2022.

  14. arXiv:2211.04666  [pdf, other]

    stat.ME

    Fast and Locally Adaptive Bayesian Quantile Smoothing using Calibrated Variational Approximations

    Authors: Takahiro Onizuka, Shintaro Hashimoto, Shonosuke Sugasawa

    Abstract: Quantiles are useful characteristics of random variables that can provide substantial information on distributions compared with commonly used summary statistics such as means. In this paper, we propose a Bayesian quantile trend filtering method to estimate non-stationary trend of quantiles. We introduce general shrinkage priors to induce locally adaptive Bayesian inference on trends and mixture r… ▽ More

    Submitted 20 October, 2023; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: 51 pages, 7 figures. arXiv admin note: text overlap with arXiv:2202.09534

  15. arXiv:2208.07535  [pdf, other]

    stat.ME

    Semiparametric imputation using latent sparse conditional Gaussian mixtures for multivariate mixed outcomes

    Authors: Shonosuke Sugasawa, Jae Kwang Kim, Kosuke Morikawa

    Abstract: This paper proposes a flexible Bayesian approach to multiple imputation using conditional Gaussian mixtures. We introduce novel shrinkage priors for covariate-dependent mixing proportions in the mixture models to automatically select the suitable number of components used in the imputation step. We develop an efficient sampling algorithm for posterior computation and multiple imputation via Markov… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: 29 pages, 5 figures

  16. arXiv:2208.05121  [pdf, other]

    stat.ME

    Locally Adaptive Bayesian Isotonic Regression using Half Shrinkage Priors

    Authors: Ryo Okano, Yasuyuki Hamura, Kaoru Irie, Shonosuke Sugasawa

    Abstract: Isotonic regression or monotone function estimation is a problem of estimating function values under monotonicity constraints, which appears naturally in many scientific fields. This paper proposes a new Bayesian method with global-local shrinkage priors for estimating monotone function values. Specifically, we introduce half shrinkage priors for positive valued random variables and assign them fo… ▽ More

    Submitted 6 February, 2024; v1 submitted 9 August, 2022; originally announced August 2022.

    Comments: 47 pages

    Journal ref: Scandinavian Journal of Statistics, 2024

  17. arXiv:2207.08384  [pdf, other]

    stat.AP

    Spatio-temporal smoothing, interpolation and prediction of income distributions based on grouped data

    Authors: Genya Kobayashi, Shonosuke Sugasawa, Yuki Kawakubo

    Abstract: In Japan, the Housing and Land Survey (HLS) provides municipality-level grouped data on household incomes. Although these data can be used for effective local policymaking, their analyses are hindered by several challenges, such as limited information attributed to grouping, the presence of non-sampled areas, and the very low frequency of implementing surveys. To address these challenges, we propo… ▽ More

    Submitted 30 June, 2023; v1 submitted 18 July, 2022; originally announced July 2022.

  18. arXiv:2204.09898  [pdf, other]

    stat.ME stat.CO

    Functional Horseshoe Smoothing for Functional Trend Estimation

    Authors: Tomoya Wakayama, Shonosuke Sugasawa

    Abstract: Due to developments in instruments and computers, functional observations are increasingly popular. However, effective methodologies for flexibly estimating the underlying trends with valid uncertainty quantification for a sequence of functional data (e.g. functional time series) are still scarce. In this work, we develop a locally adaptive smoothing method, called functional horseshoe smoothing,… ▽ More

    Submitted 20 September, 2022; v1 submitted 21 April, 2022; originally announced April 2022.

  19. Sparse Bayesian inference on gamma-distributed observations using shape-scale inverse-gamma mixtures

    Authors: Yasuyuki Hamura, Takahiro Onizuka, Shintaro Hashimoto, Shonosuke Sugasawa

    Abstract: In various applications, we deal with high-dimensional positive-valued data that often exhibits sparsity. This paper develops a new class of continuous global-local shrinkage priors tailored to analyzing gamma-distributed observations where most of the underlying means are concentrated around a certain value. Unlike existing shrinkage priors, our new prior is a shape-scale mixture of inverse-gamma… ▽ More

    Submitted 30 November, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: 57 pages, 8 figures

  20. arXiv:2203.05197  [pdf, other]

    stat.ME stat.ML

    Bayesian Spatial Predictive Synthesis

    Authors: Danielle Cabel, Shonosuke Sugasawa, Masahiro Kato, Kosaku Takanashi, Kenichiro McAlinn

    Abstract: Spatial data are characterized by their spatial dependence, which is often complex, non-linear, and difficult to capture with a single model. Significant levels of model uncertainty -- arising from these characteristics -- cannot be resolved by model selection or simple ensemble methods. We address this issue by proposing a novel methodology that captures spatially varying model uncertainty, which… ▽ More

    Submitted 20 January, 2023; v1 submitted 10 March, 2022; originally announced March 2022.

    Comments: 41 pages

  21. On Data Augmentation for Models Involving Reciprocal Gamma Functions

    Authors: Yasuyuki Hamura, Kaoru Irie, Shonosuke Sugasawa

    Abstract: In this paper, we introduce a new and efficient data augmentation approach to the posterior inference of the models with shape parameters when the reciprocal gamma function appears in full conditional densities. Our approach is to approximate full conditional densities of shape parameters by using Gauss's multiplication formula and Stirling's formula for the gamma function, where the approximation… ▽ More

    Submitted 26 August, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: 41 pages, 6 figures

  22. arXiv:2202.09534  [pdf, other]

    stat.ME

    Locally Adaptive Spatial Quantile Smoothing: Application to Monitoring Crime Density in Tokyo

    Authors: Takahiro Onizuka, Shintaro Hashimoto, Shonosuke Sugasawa

    Abstract: Spatial trend estimation under potential heterogeneity is an important problem to extract spatial characteristics and hazards such as criminal activity. By focusing on quantiles, which provide substantial information on distributions compared with commonly used summary statistics such as means, it is often useful to estimate not only the average trend but also the high (low) risk trend additionall… ▽ More

    Submitted 23 October, 2023; v1 submitted 19 February, 2022; originally announced February 2022.

    Comments: 38 pages, 9 figures

  23. arXiv:2111.00964  [pdf, other]

    stat.ME

    Dynamic Spatio-temporal Zero-inflated Poisson Models for Predicting Capelin Distribution in the Barents Sea

    Authors: Shonosuke Sugasawa, Tomoyuki Nakagawa, Hiroko Kato Solvang, Sam Subbey, Salah Alrabeei

    Abstract: We consider modeling and prediction of Capelin distribution in the Barents sea based on zero-inflated count observation data that vary continuously over a specified survey region. The model is a mixture of two components; a one-point distribution at the origin and a Poisson distribution with spatio-temporal intensity, where both intensity and mixing proportions are modeled by some auxiliary variab… ▽ More

    Submitted 19 October, 2022; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: 25 pages

  24. arXiv:2108.11551  [pdf, other]

    stat.ME

    Adaptively Robust Small Area Estimation: Balancing Robustness and Efficiency of Empirical Bayes Confidence Intervals

    Authors: Daisuke Kurisu, Takuya Ishihara, Shonosuke Sugasawa

    Abstract: Empirical Bayes small area estimation based on the well-known Fay-Herriot model may produce unreliable estimates when outlying areas exist. Existing robust methods against outliers or model misspecification are generally inefficient when the assumed distribution is plausible. This paper proposes a simple modification of the standard empirical Bayes methods with adaptively balancing robustness and… ▽ More

    Submitted 27 June, 2022; v1 submitted 25 August, 2021; originally announced August 2021.

    Comments: 20 pages (main text) + 19 pages (supplementary material)

  25. arXiv:2106.15811  [pdf, other]

    stat.ME

    Adaptively Robust Geographically Weighted Regression

    Authors: Shonosuke Sugasawa, Daisuke Murakami

    Abstract: We develop a new robust geographically weighted regression method in the presence of outliers. We embed the standard geographically weighted regression in robust objective function based on $γ$-divergence. A novel feature of the proposed approach is that two tuning parameters that control robustness and spatial smoothness are automatically tuned in a data-dependent manner. Further, the proposed me… ▽ More

    Submitted 14 October, 2021; v1 submitted 30 June, 2021; originally announced June 2021.

    Comments: 22 pages

  26. On Selection Criteria for the Tuning Parameter in Robust Divergence

    Authors: Shonosuke Sugasawa, Shouto Yonekura

    Abstract: While robust divergence such as density power divergence and $γ$-divergence is helpful for robust statistical inference in the presence of outliers, the tuning parameter that controls the degree of robustness is chosen in a rule-of-thumb, which may lead to an inefficient inference. We here propose a selection criterion based on an asymptotic approximation of the Hyvarinen score applied to an unnor… ▽ More

    Submitted 22 June, 2021; originally announced June 2021.

    Comments: 15 pages

  27. arXiv:2106.10503  [pdf, other]

    stat.ME

    Robust Bayesian Modeling of Counts with Zero inflation and Outliers: Theoretical Robustness and Efficient Computation

    Authors: Yasuyuki Hamura, Kaoru Irie, Shonosuke Sugasawa

    Abstract: Count data with zero inflation and large outliers are ubiquitous in many scientific applications. However, posterior analysis under a standard statistical model, such as Poisson or negative binomial distribution, is sensitive to such contamination. This study introduces a novel framework for Bayesian modeling of counts that is robust to both zero inflation and large outliers. In doing so, we intro… ▽ More

    Submitted 8 May, 2024; v1 submitted 19 June, 2021; originally announced June 2021.

    Comments: 32 pages (main text) and 23 pages (supplementary material)

  28. arXiv:2106.06902  [pdf, other]

    stat.ME stat.CO

    Adaptation of the Tuning Parameter in General Bayesian Inference with Robust Divergence

    Authors: Shouto Yonekura, Shonosuke Sugasawa

    Abstract: We introduce a methodology for robust Bayesian estimation with robust divergence (e.g., density power divergence or γ-divergence), indexed by a single tuning parameter. It is well known that the posterior density induced by robust divergence gives highly robust estimators against outliers if the tuning parameter is appropriately and carefully chosen. In a Bayesian framework, one way to find the op… ▽ More

    Submitted 30 June, 2022; v1 submitted 12 June, 2021; originally announced June 2021.

  29. arXiv:2105.07563  [pdf, ps, other]

    stat.ME

    General Unbiased Estimating Equations for Variance Components in Linear Mixed Models

    Authors: Tatsuya Kubokawa, Shonosuke Sugasawa, Hiromasa Tamae, Sanjay Chaudhuri

    Abstract: This paper introduces a general framework for estimating variance components in the linear mixed models via general unbiased estimating equations, which include some well-used estimators such as the restricted maximum likelihood estimator. We derive the asymptotic covariance matrices and second-order biases under general estimating equations without assuming the normality of the underlying distrib… ▽ More

    Submitted 16 May, 2021; originally announced May 2021.

    Comments: 15 pages

  30. arXiv:2104.02456  [pdf, other]

    stat.ME stat.CO

    Trend Filtering for Functional Data

    Authors: Tomoya Wakayama, Shonosuke Sugasawa

    Abstract: Despite increasing accessibility to function data, effective methods for flexibly estimating underlying functional trend are still scarce. We thereby develop functional version of trend filtering for estimating trend of functional data indexed by time or on general graph by extending the conventional trend filtering, a powerful nonparametric trend estimation technique, for scalar data. We formulat… ▽ More

    Submitted 18 February, 2022; v1 submitted 6 April, 2021; originally announced April 2021.

    Comments: 29 pages

  31. arXiv:2011.01493  [pdf, other]

    stat.ME

    Spatially Clustered Regression

    Authors: Shonosuke Sugasawa, Daisuke Murakami

    Abstract: Spatial regression or geographically weighted regression models have been widely adopted to capture the effects of auxiliary information on a response variable of interest over a region. In contrast, relationships between response and auxiliary variables are expected to exhibit complex spatial patterns in many applications. This paper proposes a new approach for spatial regression, called spatiall… ▽ More

    Submitted 28 April, 2021; v1 submitted 3 November, 2020; originally announced November 2020.

    Comments: 28 pages, 6 figures

  32. arXiv:2006.14820  [pdf, other]

    stat.ME

    Parametric Bootstrap Confidence Intervals for the Multivariate Fay-Herriot Model

    Authors: Takumi Saegusa, Shonosuke Sugasawa, Partha Lahiri

    Abstract: The multivariate Fay-Herriot model is quite effective in combining information through correlations among small area survey estimates of related variables or historical survey estimates of the same variable or both. Though the literature on small area estimation is already very rich, construction of second-order efficient confidence intervals from multivariate models have so far received very litt… ▽ More

    Submitted 26 June, 2020; originally announced June 2020.

    Comments: 21 pages

  33. arXiv:2006.06180  [pdf, other]

    stat.ME

    Grouped Generalized Estimating Equations for Longitudinal Data Analysis

    Authors: Tsubasa Ito, Shonosuke Sugasawa

    Abstract: Generalized estimating equation (GEE) is widely adopted for regression modeling for longitudinal data, taking account of potential correlations within the same subjects. Although the standard GEE assumes common regression coefficients among all the subjects, such an assumption may not be realistic when there is potential heterogeneity in regression coefficients among subjects. In this paper, we de… ▽ More

    Submitted 8 July, 2022; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: 59 pages

  34. arXiv:2005.02800  [pdf, other]

    stat.ME

    Log-Regularly Varying Scale Mixture of Normals for Robust Regression

    Authors: Yasuyuki Hamura, Kaoru Irie, Shonosuke Sugasawa

    Abstract: Linear regression with the classical normality assumption for the error distribution may lead to an undesirable posterior inference of regression coefficients due to the potential outliers. This paper considers the finite mixture of two components with thin and heavy tails as the error distribution, which has been routinely employed in applied statistics. For the heavily-tailed component, we intro… ▽ More

    Submitted 9 January, 2021; v1 submitted 6 May, 2020; originally announced May 2020.

    Comments: 62 pages

  35. arXiv:2004.13483  [pdf, other]

    stat.AP

    Predicting Infection of COVID-19 in Japan: State Space Modeling Approach

    Authors: Genya Kobayashi, Shonosuke Sugasawa, Hiromasa Tamae, Takayuki Ozu

    Abstract: The number of confirmed cases of the coronavirus disease (COVID-19) in Japan has been increasing day by day and has had a serious impact on the society especially after the declaration of the state of emergency on April 7, 2020. This study analyzes the real time data from March 1 to April 22, 2020 by adopting a sophisticated statistical modeling tool based on the state space model combined with th… ▽ More

    Submitted 28 April, 2020; originally announced April 2020.

    Comments: 12 pages (main part) + 9 pages (supplement)

  36. arXiv:2004.03751  [pdf, other]

    stat.ME

    Robust Fitting of Mixture Models using Weighted Complete Estimating Equations

    Authors: Shonosuke Sugasawa, Genya Kobayashi

    Abstract: Mixture modeling, which considers the potential heterogeneity in data, is widely adopted for classification and clustering problems. Mixture models can be estimated using the Expectation-Maximization algorithm, which works with the complete estimating equations conditioned by the latent membership variables of the cluster assignment based on the hierarchical expression of mixture models. However,… ▽ More

    Submitted 16 March, 2022; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: 40 pages

  37. arXiv:2003.05611  [pdf, other]

    stat.ME

    Efficient testing and effect size estimation for set-based genetic association inference via semiparametric multilevel mixture modeling: Application to a genome-wide association study of coronary artery disease

    Authors: Shonosuke Sugasawa, Hisashi Noma

    Abstract: In genetic association studies, rare variants with extremely small allele frequency play a crucial role in complex traits, and the set-based testing methods that jointly assess the effects of groups of single nucleotide polymorphisms (SNPs) were developed to improve powers for the association tests. However, the powers of these tests are still severely limited due to the extremely small allele fre… ▽ More

    Submitted 12 March, 2020; originally announced March 2020.

    Comments: 22 pages

  38. arXiv:2001.08465  [pdf, other]

    stat.ME

    Shrinkage with Robustness: Log-Adjusted Priors for Sparse Signals

    Authors: Yasuyuki Hamura, Kaoru Irie, Shonosuke Sugasawa

    Abstract: We introduce a new class of distributions named log-adjusted shrinkage priors for the analysis of sparse signals, which extends the three parameter beta priors by multiplying an additional log-term to their densities. The proposed prior has density tails that are heavier than even those of the Cauchy distribution and realizes the tail-robustness of the Bayes estimator, while keeping the strong shr… ▽ More

    Submitted 26 January, 2020; v1 submitted 23 January, 2020; originally announced January 2020.

    Comments: 40 pages

  39. Robust Bayesian Regression with Synthetic Posterior

    Authors: Shintaro Hashimoto, Shonosuke Sugasawa

    Abstract: Although linear regression models are fundamental tools in statistical science, the estimation results can be sensitive to outliers. While several robust methods have been proposed in frequentist frameworks, statistical inference is not necessarily straightforward. We here propose a Bayesian approach to robust inference on linear regression models using synthetic posterior distributions based on… ▽ More

    Submitted 26 May, 2020; v1 submitted 2 October, 2019; originally announced October 2019.

    Comments: 23 pages, 5 figures

  40. arXiv:1909.02878  [pdf, other]

    stat.ME

    Bayesian Semiparametric Modeling of Response Mechanism for Nonignorable Missing Data

    Authors: Shonosuke Sugasawa, Kosuke Morikawa, Keisuke Takahata

    Abstract: Statistical inference with nonresponse is quite challenging, especially when the response mechanism is nonignorable. In this case, the validity of statistical inference depends on untestable correct specification of the response model. To avoid the misspecification, we propose semiparametric Bayesian estimation in which an outcome model is parametric, but the response model is semiparametric in th… ▽ More

    Submitted 14 January, 2021; v1 submitted 6 September, 2019; originally announced September 2019.

    Comments: 25 pages; The title has been changed from "Bayesian semiparametric estimation under nonignorable nonresponse"

  41. arXiv:1908.06772  [pdf, other]

    stat.ME

    Bayesian approach to Lorenz curve using time series grouped data

    Authors: Genya Kobayashi, Yuta Yamauchi, Kazuhiko Kakamu, Yuki Kawakubo, Shonosuke Sugasawa

    Abstract: This study is concerned with estimating the inequality measures associated with the underlying hypothetical income distribution from the times series grouped data on the Lorenz curve. We adopt the Dirichlet pseudo likelihood approach where the parameters of the Dirichlet likelihood are set to the differences between the Lorenz curve of the hypothetical income distribution for the consecutive incom… ▽ More

    Submitted 19 August, 2019; originally announced August 2019.

  42. arXiv:1907.01333  [pdf, other]

    stat.ME

    On Global-local Shrinkage Priors for Count Data

    Authors: Yasuyuki Hamura, Kaoru Irie, Shonosuke Sugasawa

    Abstract: Global-local shrinkage prior has been recognized as useful class of priors which can strongly shrink small signals towards prior means while keeping large signals unshrunk. Although such priors have been extensively discussed under Gaussian responses, we intensively encounter count responses in practice in which the previous knowledge of global-local shrinkage priors cannot be directly imported. I… ▽ More

    Submitted 16 August, 2020; v1 submitted 2 July, 2019; originally announced July 2019.

    Comments: 28 pages (main text) + 14 pages (supplementary material)

  43. arXiv:1906.08428  [pdf, other]

    stat.ME

    Improved Confidence Regions in Meta-analysis of Diagnostic Test Accuracy

    Authors: Tsubasa Ito, Shonosuke Sugasawa

    Abstract: Meta-analyses of diagnostic test accuracy (DTA) studies have been gathering attention in research in clinical epidemiology and health technology development, and bivariate random-effects model is becoming a standard tool. However, standard inference methods usually underestimate statistical errors and possibly provide highly overconfident results under realistic situations since they ignore the va… ▽ More

    Submitted 18 June, 2020; v1 submitted 19 June, 2019; originally announced June 2019.

    Comments: 15 pages (main text) + 10 pages (supplementary material)

  44. arXiv:1906.04398  [pdf, other]

    stat.ME

    An Approximate Bayesian Approach to Model-assisted Survey Estimation with Many Auxiliary Variables

    Authors: Shonosuke Sugasawa, Jae Kwang Kim

    Abstract: Model-assisted estimation with complex survey data is an important practical problem in survey sampling. When there are many auxiliary variables, selecting significant variables associated with the study variable would be necessary to achieve efficient estimation of population parameters of interest. In this paper, we formulate a regularized regression estimator in the framework of Bayesian infere…

    Submitted 31 March, 2020; v1 submitted 11 June, 2019; originally announced June 2019.

    Comments: 37 pages

  45. arXiv:1905.01582  [pdf, other]

    stat.ME

    Efficient screening of predictive biomarkers for individual treatment selection

    Authors: Shonosuke Sugasawa, Hisashi Noma

    Abstract: The development of molecular diagnostic tools to achieve individualized medicine requires identifying predictive biomarkers associated with subgroups of individuals who might receive beneficial or harmful effects from different available treatments. However, due to the large number of candidate biomarkers in the large-scale genetic and molecular studies, and complex relationships among clinical ou…

    Submitted 17 January, 2020; v1 submitted 4 May, 2019; originally announced May 2019.

    Comments: 22 pages

  46. arXiv:1904.11109  [pdf, other]

    stat.ME

    Estimation and inference for area-wise spatial income distributions from grouped data

    Authors: Shonosuke Sugasawa, Genya Kobayashi, Yuki Kawakubo

    Abstract: Estimating income distributions plays an important role in the measurement of inequality and poverty over space. The existing literature on income distributions predominantly focuses on estimating an income distribution for a country or a region separately and the simultaneous estimation of multiple income distributions has not been discussed in spite of its practical importance. In this work, we…

    Submitted 3 July, 2019; v1 submitted 24 April, 2019; originally announced April 2019.

    Comments: 25 pages

  47. arXiv:1804.00888  [pdf, other]

    stat.ME

    Grouped Heterogeneous Mixture Modeling for Clustered Data

    Authors: Shonosuke Sugasawa

    Abstract: Clustered data is ubiquitous in a variety of scientific fields. In this paper, we propose a flexible and interpretable modeling approach, called grouped heterogeneous mixture modeling, for clustered data, which models cluster-wise conditional distributions by mixtures of latent conditional distributions common to all the clusters. In the model, we assume that clusters are divided into a finite numb…

    Submitted 6 February, 2020; v1 submitted 3 April, 2018; originally announced April 2018.

    Comments: 34 pages

  48. arXiv:1711.06393  [pdf, other]

    stat.ME

    A Unified Method for Improved Inference in Random-effects Meta-analysis

    Authors: Shonosuke Sugasawa, Hisashi Noma

    Abstract: Random-effects meta-analyses have been widely applied in evidence synthesis for various types of medical studies. However, standard inference methods (e.g. restricted maximum likelihood estimation) usually underestimate statistical errors and possibly provide highly overconfident results under realistic situations; for instance, coverage probabilities of confidence intervals can be substantially b…

    Submitted 9 May, 2019; v1 submitted 16 November, 2017; originally announced November 2017.

    Comments: 29 pages

  49. arXiv:1705.04136  [pdf, other]

    stat.ME

    Adaptively Transformed Mixed Model Prediction of General Finite Population Parameters

    Authors: Shonosuke Sugasawa, Tatsuya Kubokawa

    Abstract: For estimating area-specific parameters (quantities) in a finite population, a mixed model prediction approach is attractive. However, this approach strongly depends on the normality assumption of the response values although we often encounter a non-normal case in practice. In such a case, transforming observations to make them suitable for normality assumption is a useful tool, but the problem o…

    Submitted 11 June, 2018; v1 submitted 11 May, 2017; originally announced May 2017.

    Comments: 32 pages

  50. arXiv:1704.08440  [pdf, other]

    stat.ME

    On Bootstrap Averaging Empirical Bayes Estimators

    Authors: Shonosuke Sugasawa

    Abstract: Parametric empirical Bayes (EB) estimators have been widely used in a variety of fields, including small area estimation and disease mapping. Since the EB estimator is constructed by plugging in the estimator of parameters in prior distributions, it might perform poorly if the estimator of parameters is unstable. This can happen when the number of samples is small or moderate. This paper suggests bootstrap…

    Submitted 27 April, 2017; originally announced April 2017.

    Comments: 10 pages