Skip to main content

Showing 1–4 of 4 results for author: Villar, S

Searching in archive astro-ph. Search in all archives.
.
  1. arXiv:2405.18095  [pdf, other

    stat.ML astro-ph.IM cs.LG physics.data-an

    Is machine learning good or bad for the natural sciences?

    Authors: David W. Hogg, Soledad Villar

    Abstract: Machine learning (ML) methods are having a huge impact across all of the sciences. However, ML has a strong ontology - in which only the data exist - and a strong epistemology - in which a model is considered good if it performs well on held-out training data. These philosophies are in strong conflict with both standard practices and key philosophies in the natural sciences. Here we identify some… ▽ More

    Submitted 31 May, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: A Position Paper accepted for publication in the 2024 International Conference on Machine Learning (ICML)

  2. arXiv:2310.12528  [pdf, other

    astro-ph.IM cs.LG

    Constructing Impactful Machine Learning Research for Astronomy: Best Practices for Researchers and Reviewers

    Authors: D. Huppenkothen, M. Ntampaka, M. Ho, M. Fouesneau, B. Nord, J. E. G. Peek, M. Walmsley, J. F. Wu, C. Avestruz, T. Buck, M. Brescia, D. P. Finkbeiner, A. D. Goulding, T. Kacprzak, P. Melchior, M. Pasquato, N. Ramachandra, Y. -S. Ting, G. van de Ven, S. Villar, V. A. Villar, E. Zinger

    Abstract: Machine learning has rapidly become a tool of choice for the astronomical community. It is being applied across a wide range of wavelengths and problems, from the classification of transients to neural network emulators of cosmological simulations, and is shifting paradigms about how we generate and report scientific results. At the same time, this class of method comes with its own set of best pr… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 14 pages, 3 figures; submitted to the Bulletin of the American Astronomical Society

  3. arXiv:2301.13724  [pdf, other

    stat.ML astro-ph.IM cs.LG math-ph physics.data-an

    Towards fully covariant machine learning

    Authors: Soledad Villar, David W. Hogg, Weichi Yao, George A. Kevrekidis, Bernhard Schölkopf

    Abstract: Any representation of data involves arbitrary investigator choices. Because those choices are external to the data-generating process, each choice leads to an exact symmetry, corresponding to the group of transformations that takes one possible representation to another. These are the passive symmetries; they include coordinate freedom, gauge symmetry, and units covariance, all of which have led t… ▽ More

    Submitted 28 June, 2023; v1 submitted 31 January, 2023; originally announced January 2023.

    Comments: substantial revision from v1; submitted to TMLR

  4. arXiv:2101.07256  [pdf, other

    physics.data-an astro-ph.IM cs.LG

    Fitting very flexible models: Linear regression with large numbers of parameters

    Authors: David W. Hogg, Soledad Villar

    Abstract: There are many uses for linear fitting; the context here is interpolation and denoising of data, as when you have calibration data and you want to fit a smooth, flexible function to those data. Or you want to fit a flexible function to de-trend a time series or normalize a spectrum. In these contexts, investigators often choose a polynomial basis, or a Fourier basis, or wavelets, or something equa… ▽ More

    Submitted 15 January, 2021; originally announced January 2021.

    Comments: all code used to make the figures is available at https://github.com/davidwhogg/FlexibleLinearModels