Skip to main content

Showing 1–1 of 1 results for author: Brower-Sinning, R A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2209.03345  [pdf, other

    cs.SE

    Data Leakage in Notebooks: Static Detection and Better Processes

    Authors: Chenyang Yang, Rachel A Brower-Sinning, Grace A. Lewis, Christian Kästner

    Abstract: Data science pipelines to train and evaluate models with machine learning may contain bugs just like any other code. Leakage between training and test data can lead to overestimating the model's accuracy during offline evaluations, possibly leading to deployment of low-quality models in production. Such leakage can happen easily by mistake or by following poor practices, but may be tedious and cha… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.