Showing 1–2 of 2 results for author: Korem, T

Searching in archive cs.
  1. arXiv:2406.01652

    stat.ME cs.LG q-bio.QM

    Distributional bias compromises leave-one-out cross-validation

    Authors: George I. Austin, Itsik Pe'er, Tal Korem

    Abstract: Cross-validation is a common method for estimating the predictive performance of machine learning models. In a data-scarce regime, where one typically wishes to maximize the number of instances used for training the model, an approach called "leave-one-out cross-validation" is often used. In this design, a separate model is built for predicting each data instance after training on all other instan…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 20 pages, 5 figures, supplementary information

  2. arXiv:2405.19221

    q-bio.QM cs.LG

    Domain adaptation in small-scale and heterogeneous biological datasets

    Authors: Seyedmehdi Orouji, Martin C. Liu, Tal Korem, Megan A. K. Peters

    Abstract: Machine learning techniques are steadily becoming more important in modern biology, and are used to build predictive models, discover patterns, and investigate biological problems. However, models trained on one dataset are often not generalizable to other datasets from different cohorts or laboratories, due to differences in the statistical properties of these datasets. These could stem from tech…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: main manuscript + supplement