Showing 1–16 of 16 results for author: Gu, C

  1. arXiv:2403.01403  [pdf, other]

    stat.CO stat.ME

    Greedy selection of optimal location of sensors for uncertainty reduction in seismic moment tensor inversion

    Authors: Ben Mansour Dia, Michael Fehler, SanLinn I. Kaka, Andrea Scarinci, Umair bin Waheed, Chen Gu

    Abstract: We address an optimal sensor placement problem through Bayesian experimental design for seismic full waveform inversion for the recovery of the associated moment tensor. The objective is that of optimally choosing the location of the sensors (stations) from which to collect the observed data. The Shannon expected information gain is used as the objective function to search for the optimal network…

    Submitted 3 March, 2024; originally announced March 2024.

  2. arXiv:2401.16235  [pdf]

    cs.LG stat.AP

    Player Pressure Map -- A Novel Representation of Pressure in Soccer for Evaluating Player Performance in Different Game Contexts

    Authors: Chaoyi Gu, Jiaming Na, Yisheng Pei, Varuna De Silva

    Abstract: In soccer, contextual player performance metrics are invaluable to coaches. For example, the ability to perform under pressure during matches distinguishes the elite from the average. Appropriate pressure metric enables teams to assess players' performance accurately under pressure and design targeted training scenarios to address their weaknesses. The primary objective of this paper is to leverag…

    Submitted 7 March, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

  3. arXiv:2303.13323  [pdf, other]

    stat.ML cs.LG cs.MA

    Deep Generative Multi-Agent Imitation Model as a Computational Benchmark for Evaluating Human Performance in Complex Interactive Tasks: A Case Study in Football

    Authors: Chaoyi Gu, Varuna De Silva

    Abstract: Evaluating the performance of human is a common need across many applications, such as in engineering and sports. When evaluating human performance in completing complex and interactive tasks, the most common way is to use a metric having been proved efficient for that context, or to use subjective measurement techniques. However, this can be an error prone and unreliable process since static metr…

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: 8 pages, 10 figures

  4. arXiv:2204.04840  [pdf, other]

    stat.ME stat.AP

    Nonparametric Bayes Differential Analysis of Multigroup DNA Methylation Data

    Authors: Chiyu Gu, Veerabhadran Baladandayuthapani, Subharup Guha

    Abstract: DNA methylation datasets in cancer studies are comprised of measurements on a large number of genomic locations called cytosine-phosphate-guanine (CpG) sites with complex correlation structures. A fundamental goal of these studies is the development of statistical techniques that can identify disease genomic signatures across multiple patient groups defined by different experimental or biological…

    Submitted 4 May, 2023; v1 submitted 10 April, 2022; originally announced April 2022.

  5. arXiv:2012.06093  [pdf, other]

    stat.ME

    A flexible sensitivity analysis approach for unmeasured confounding with multiple treatments and a binary outcome with application to SEER-Medicare lung cancer data

    Authors: Liangyuan Hu, Jungang Zou, Chenyang Gu, Jiayi Ji, Michael Lopez, Minal Kale

    Abstract: In the absence of a randomized experiment, a key assumption for drawing causal inference about treatment effects is the ignorable treatment assignment. Violations of the ignorability assumption may lead to biased treatment effect estimates. Sensitivity analysis helps gauge how causal conclusions will be altered in response to the potential magnitude of departure from the ignorability assumption. H…

    Submitted 13 August, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

    Comments: 36 pages, 12 figures, 9 tables

  6. arXiv:2008.07687  [pdf, other]

    stat.ME cs.CY stat.ML

    Estimation of causal effects of multiple treatments in healthcare database studies with rare outcomes

    Authors: Liangyuan Hu, Chenyang Gu

    Abstract: The preponderance of large-scale healthcare databases provide abundant opportunities for comparative effectiveness research. Evidence necessary to making informed treatment decisions often relies on comparing effectiveness of multiple treatment options on outcomes of interest observed in a small number of individuals. Causal inference with multiple treatments and rare outcomes is a subject that ha…

    Submitted 2 October, 2020; v1 submitted 17 August, 2020; originally announced August 2020.

    Comments: 15 pages, 3 tables, 2 figures

  7. arXiv:2005.13988  [pdf, ps, other]

    stat.ME

    Composition Estimation via Shrinkage

    Authors: Chong Gu

    Abstract: In this note, we explore a simple approach to composition estimation, using penalized likelihood density estimation on a nominal discrete domain. Practical issues such as smoothing parameter selection and the use of prior information are investigated in simulations, and a theoretical analysis is attempted. The method has been implemented in a pair of R functions for use by practitioners.

    Submitted 28 May, 2020; originally announced May 2020.

  8. arXiv:2001.06483  [pdf, other]

    stat.ME stat.AP

    Estimation of Causal Effects of Multiple Treatments in Observational Studies with a Binary Outcome

    Authors: Liangyuan Hu, Chenyang Gu, Michael Lopez, Jiayi Ji, Juan Wisnivesky

    Abstract: There is a dearth of robust methods to estimate the causal effects of multiple treatments when the outcome is binary. This paper uses two unique sets of simulations to propose and evaluate the use of Bayesian Additive Regression Trees (BART) in such settings. First, we compare BART to several approaches that have been proposed for continuous outcomes, including inverse probability of treatment wei…

    Submitted 16 January, 2020; originally announced January 2020.

    Comments: 3 figures, 3 tables. arXiv admin note: text overlap with arXiv:1901.04312

  9. arXiv:2001.04643  [pdf, other]

    cs.LG cs.SD eess.AS eess.SP stat.ML

    DDSP: Differentiable Digital Signal Processing

    Authors: Jesse Engel, Lamtharn Hantrakul, Chenjie Gu, Adam Roberts

    Abstract: Most generative models of audio directly generate samples in one of two domains: time or frequency. While sufficient to express any signal, these representations are inefficient, as they do not utilize existing knowledge of how sound is generated and perceived. A third approach (vocoders/synthesizers) successfully incorporates strong domain knowledge of signal processing and perception, but has be…

    Submitted 14 January, 2020; originally announced January 2020.

  10. arXiv:1912.09859  [pdf, ps, other]

    cs.LG cs.NI eess.SP stat.ML

    Lightweight and Unobtrusive Data Obfuscation at IoT Edge for Remote Inference

    Authors: Dixing Xu, Mengyao Zheng, Linshan Jiang, Chaojie Gu, Rui Tan, Peng Cheng

    Abstract: Executing deep neural networks for inference on the server-class or cloud backend based on data generated at the edge of Internet of Things is desirable due primarily to the limited compute power of edge devices and the need to protect the confidentiality of the inference neural networks. However, such a remote inference scheme incurs concerns regarding the privacy of the inference data transmitte…

    Submitted 25 March, 2020; v1 submitted 20 December, 2019; originally announced December 2019.

    Comments: This paper has been accepted by IEEE Internet of Things Journal, Special Issue on Artificial Intelligence Powered Edge Computing for Internet of Things

  11. arXiv:1909.09804  [pdf, other]

    cs.CR cs.LG stat.ML

    Challenges of Privacy-Preserving Machine Learning in IoT

    Authors: Mengyao Zheng, Dixing Xu, Linshan Jiang, Chaojie Gu, Rui Tan, Peng Cheng

    Abstract: The Internet of Things (IoT) will be a main data generation infrastructure for achieving better system intelligence. However, the extensive data collection and processing in IoT also engender various privacy concerns. This paper provides a taxonomy of the existing privacy-preserving machine learning approaches developed in the context of cloud computing and discusses the challenges of applying the…

    Submitted 21 September, 2019; originally announced September 2019.

    Comments: In First International Workshop on Challenges in Artificial Intelligence and Machine Learning (AIChallengeIoT'19) November 10-13, 2019. 7 pages

  12. arXiv:1906.10098  [pdf, other]

    stat.AP physics.geo-ph

    Bayesian waveform-based calibration of high-pressure acoustic emission systems with ball drop measurements

    Authors: Chen Gu, Ulrich Mok, Youssef M. Marzouk, Germán A Prieto Gomez, Farrokh Sheibani, J. Brian Evans, Bradford H. Hager

    Abstract: Acoustic emission (AE) is a widely used technology to study source mechanisms and material properties during high-pressure rock failure experiments. It is important to understand the physical quantities that acoustic emission sensors measure, as well as the response of these sensors as a function of frequency. This study calibrates the newly built AE system in the MIT Rock Physics Laboratory using…

    Submitted 8 January, 2020; v1 submitted 24 June, 2019; originally announced June 2019.

    Journal ref: Geophysical Journal International, 2019, ggz568

  13. arXiv:1904.12787  [pdf, other]

    cs.LG stat.ML

    Graph Matching Networks for Learning the Similarity of Graph Structured Objects

    Authors: Yujia Li, Chenjie Gu, Thomas Dullien, Oriol Vinyals, Pushmeet Kohli

    Abstract: This paper addresses the challenging problem of retrieval and matching of graph structured objects, and makes two key contributions. First, we demonstrate how Graph Neural Networks (GNN), which have emerged as an effective model for various supervised prediction problems defined on structured data, can be trained to produce embedding of graphs in vector spaces that enables efficient similarity rea…

    Submitted 12 May, 2019; v1 submitted 29 April, 2019; originally announced April 2019.

    Comments: Accepted as a conference paper at ICML 2019

  14. arXiv:1901.04312   

    stat.ME

    The Estimation of Causal Effects of Multiple Treatments in Observational Studies Using Bayesian Additive Regression Trees

    Authors: Chenyang Gu, Michael J. Lopez, Liangyuan Hu

    Abstract: There is currently a dearth of appropriate methods to estimate the causal effects of multiple treatments when the outcome is binary. For such settings, we propose the use of nonparametric Bayesian modeling, Bayesian Additive Regression Trees (BART). We conduct an extensive simulation study to compare BART to several existing, propensity score-based methods and to identify its operating characteris…

    Submitted 27 February, 2020; v1 submitted 11 January, 2019; originally announced January 2019.

    Comments: This article has been replaced by "Estimation of Causal Effects of Multiple Treatments in Observational Studies with a Binary Outcome" (arXiv:2001.06483 [stat.ME])

  15. Development of a Common Patient Assessment Scale across the Continuum of Care: A Nested Multiple Imputation Approach

    Authors: Chenyang Gu, Roee Gutman

    Abstract: Evaluating and tracking patients' functional status through the post-acute care continuum requires a common instrument. However, different post-acute service providers such as nursing homes, inpatient rehabilitation facilities and home health agencies rely on different instruments to evaluate patients' functional status. These instruments assess similar functional status domains, but they comprise…

    Submitted 17 July, 2018; v1 submitted 14 April, 2018; originally announced April 2018.

  16. arXiv:1710.10713   

    stat.ME

    Nonparametric Bayes Differential Analysis for Dependent Multigroup Data with Application to DNA Methylation Analyses in Cancer

    Authors: Chiyu Gu, Veerabhadran Baladandayuthapani, Subharup Guha

    Abstract: Modern cancer genomics datasets involve widely varying sizes and scales, measurement variables, and correlation structures. A fundamental analytical goal in these high-throughput studies is the development of general statistical techniques that can cleanly sift the signal from noise in identifying disease-specific genomic signatures across a set of experimental or biological conditions. We propose…

    Submitted 10 April, 2022; v1 submitted 29 October, 2017; originally announced October 2017.

    Comments: An article with overlapping content but different focus has been submitted, thus this article is no longer appropriate