Showing 1–5 of 5 results for author: Lock, E F

Searching in archive cs.
  1. arXiv:2102.13278  [pdf, other]

    stat.ML cs.LG q-bio.QM stat.ME

    sJIVE: Supervised Joint and Individual Variation Explained

    Authors: Elise F. Palzer, Christine Wendt, Russell Bowler, Craig P. Hersh, Sandra E. Safo, Eric F. Lock

    Abstract: Analyzing multi-source data, which are multiple views of data on the same subjects, has become increasingly common in molecular biomedical research. Recent methods have sought to uncover underlying structure and relationships within and/or between the data sources, and other methods have sought to build a predictive model for an outcome using all sources. However, existing methods that do both are…

    Submitted 25 February, 2021; originally announced February 2021.

    Comments: 23 pages, 8 tables, 3 figures

  2. arXiv:2010.03111  [pdf, other]

    stat.ME cs.LG stat.ML

    Bayesian Distance Weighted Discrimination

    Authors: Eric F. Lock

    Abstract: Distance weighted discrimination (DWD) is a linear discrimination method that is particularly well-suited for classification tasks with high-dimensional data. The DWD coefficients minimize an intuitive objective function, which can be solved very efficiently using state-of-the-art optimization techniques. However, DWD has not yet been cast into a model-based framework for statistical inference. In th…

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: 27 pages, 8 figures

  3. arXiv:2002.02601  [pdf, other]

    stat.ML cs.LG q-bio.QM stat.AP stat.ME

    Bidimensional linked matrix factorization for pan-omics pan-cancer analysis

    Authors: Eric F. Lock, Jun Young Park, Katherine A. Hoadley

    Abstract: Several modern applications require the integration of multiple large data matrices that have shared rows and/or columns. For example, cancer studies that integrate multiple omics platforms across multiple types of cancer, pan-omics pan-cancer analysis, have extended our knowledge of molecular heterogeneity beyond what was observed in single tumor and single platform studies. However, these studies…

    Submitted 7 April, 2022; v1 submitted 6 February, 2020; originally announced February 2020.

    Comments: 26 pages, 5 figures

    Journal ref: Annals of Applied Statistics 2022, Vol. 16, No. 1, 193-215

  4. arXiv:1906.03722  [pdf, other]

    stat.ML cs.LG q-bio.QM stat.ME

    Integrative Factorization of Bidimensionally Linked Matrices

    Authors: Jun Young Park, Eric F. Lock

    Abstract: Advances in molecular "omics" technologies have motivated new methodology for the integration of multiple sources of high-content biomedical data. However, most statistical methods for integrating multiple data matrices only consider data shared vertically (one cohort on multiple platforms) or horizontally (different cohorts on a single platform). This is limiting for data that take the form of b…

    Submitted 9 June, 2019; originally announced June 2019.

    Comments: 27 pages, 4 figures

    Journal ref: Biometrics, 2019

  5. Bayesian Consensus Clustering

    Authors: Eric F. Lock, David B. Dunson

    Abstract: The task of clustering a set of objects based on multiple sources of data arises in several modern applications. We propose an integrative statistical model that permits a separate clustering of the objects for each data source. These separate clusterings adhere loosely to an overall consensus clustering, and hence they are not independent. We describe a computationally scalable Bayesian framework…

    Submitted 28 February, 2013; originally announced February 2013.

    Comments: 32 pages, 13 figures

    Journal ref: Bioinformatics 29 (2013) 2610-2616