Skip to main content

Showing 1–8 of 8 results for author: Bacallado, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2403.02979  [pdf, other

    stat.ME math.ST stat.AP

    Regularised Canonical Correlation Analysis: graphical lasso, biplots and beyond

    Authors: Lennie Wells, Kumar Thurimella, Sergio Bacallado

    Abstract: Recent developments in regularized Canonical Correlation Analysis (CCA) promise powerful methods for high-dimensional, multiview data analysis. However, justifying the structural assumptions behind many popular approaches remains a challenge, and features of realistic biological datasets pose practical difficulties that are seldom discussed. We propose a novel CCA estimator rooted in an assumption… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 83 pages, 27 figures

    MSC Class: 62H20 (Primary) 62H12; 62P10 (Secondary) ACM Class: G.3

  2. arXiv:2210.09211  [pdf, other

    stat.ML cs.LG

    Conditional Neural Processes for Molecules

    Authors: Miguel Garcia-Ortegon, Andreas Bender, Sergio Bacallado

    Abstract: Neural processes (NPs) are models for transfer learning with properties reminiscent of Gaussian Processes (GPs). They are adept at modelling data consisting of few observations of many related functions on the same input space and are trained by minimizing a variational objective, which is computationally much less expensive than the Bayesian updating required by GPs. So far, most studies of NPs h… ▽ More

    Submitted 23 February, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

  3. arXiv:2110.15486  [pdf, other

    stat.ML cs.LG q-bio.BM

    DOCKSTRING: easy molecular docking yields better benchmarks for ligand design

    Authors: Miguel García-Ortegón, Gregor N. C. Simm, Austin J. Tripp, José Miguel Hernández-Lobato, Andreas Bender, Sergio Bacallado

    Abstract: The field of machine learning for drug discovery is witnessing an explosion of novel methods. These methods are often benchmarked on simple physicochemical properties such as solubility or general druglikeness, which can be readily computed. However, these properties are poor representatives of objective functions in drug design, mainly because they do not depend on the candidate's interaction wit… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

  4. arXiv:2004.07743  [pdf, other

    stat.AP stat.ME

    BETS: The dangers of selection bias in early analyses of the coronavirus disease (COVID-19) pandemic

    Authors: Qingyuan Zhao, Nianqiao Ju, Sergio Bacallado, Rajen D. Shah

    Abstract: The coronavirus disease 2019 (COVID-19) has quickly grown from a regional outbreak in Wuhan, China to a global pandemic. Early estimates of the epidemic growth and incubation period of COVID-19 may have been biased due to sample selection. Using detailed case reports from 14 locations in and outside mainland China, we obtained 378 Wuhan-exported cases who left Wuhan before an abrupt travel quarant… ▽ More

    Submitted 24 September, 2020; v1 submitted 16 April, 2020; originally announced April 2020.

    Comments: 33 pages, 8 figures, 5 tables; Accepted for publication in The Annals of Applied Statistics on 24th September, 2020

    MSC Class: 62P10; 62F15

  5. arXiv:1806.11370  [pdf, other

    stat.AP

    Bayesian Uncertainty Directed Trial Designs

    Authors: Steffen Ventz, Matteo Cellamare, Sergio Bacallado, Lorenzo Trippa

    Abstract: Most Bayesian response-adaptive designs unbalance randomization rates towards the most promising arms with the goal of increasing the number of positive treatment outcomes during the study, even though the primary aim of the trial is different. We discuss Bayesian uncertainty directed designs (BUD), a class of Bayesian designs in which the investigator specifies an information measure tailored to… ▽ More

    Submitted 29 June, 2018; originally announced June 2018.

  6. arXiv:1711.01241  [pdf, other

    stat.ME stat.AP

    Bayesian Mixed Effects Models for Zero-inflated Compositions in Microbiome Data Analysis

    Authors: Boyu Ren, Sergio Bacallado, Stefano Favaro, Tommi Vatanen, Curtis Huttenhower, Lorenzo Trippa

    Abstract: Detecting associations between microbial compositions and sample characteristics is one of the most important tasks in microbiome studies. Most of the existing methods apply univariate models to single microbial species separately, with adjustments for multiple hypothesis testing. We propose a Bayesian analysis for a generalized mixed effects linear model tailored to this application. The marginal… ▽ More

    Submitted 24 August, 2019; v1 submitted 3 November, 2017; originally announced November 2017.

  7. arXiv:1710.08045  [pdf, other

    cs.IR cs.LG stat.ML

    Sequential Matrix Completion

    Authors: Annie Marsden, Sergio Bacallado

    Abstract: We propose a novel algorithm for sequential matrix completion in a recommender system setting, where the $(i,j)$th entry of the matrix corresponds to a user $i$'s rating of product $j$. The objective of the algorithm is to provide a sequential policy for user-product pair recommendation which will yield the highest possible ratings after a finite time horizon. The algorithm uses a Gamma process fa… ▽ More

    Submitted 22 October, 2017; originally announced October 2017.

    Comments: 10 pages, 6 figures

  8. Bayesian Nonparametric Ordination for the Analysis of Microbial Communities

    Authors: Boyu Ren, Sergio Bacallado, Stefano Favaro, Susan Holmes, Lorenzo Trippa

    Abstract: Human microbiome studies use sequencing technologies to measure the abundance of bacterial species or Operational Taxonomic Units (OTUs) in samples of biological material. Typically the data are organized in contingency tables with OTU counts across heterogeneous biological samples. In the microbial ecology community, ordination methods are frequently used to investigate latent factors or clusters… ▽ More

    Submitted 20 January, 2017; v1 submitted 19 January, 2016; originally announced January 2016.