Showing 1–4 of 4 results for author: Caulk, R A

  1. arXiv:2406.10258  [pdf, other]

    cs.CL

    Curating Grounded Synthetic Data with Global Perspectives for Equitable AI

    Authors: Elin Törnquist, Robert Alexander Caulk

    Abstract: The development of robust AI models relies heavily on the quality and variety of training data available. In fields where data scarcity is prevalent, synthetic data generation offers a vital solution. In this paper, we introduce a novel approach to creating synthetic datasets, grounded in real-world diversity and enriched through strategic diversification. We synthesize data using a comprehensive…

    Submitted 18 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    ACM Class: I.2.7

  2. arXiv:2309.16743  [pdf, other]

    cs.LG cs.AI cs.DC

    High Throughput Training of Deep Surrogates from Large Ensemble Runs

    Authors: Lucas Meyer, Marc Schouler, Robert Alexander Caulk, Alejandro Ribés, Bruno Raffin

    Abstract: Recent years have seen a surge in deep learning approaches to accelerate numerical solvers, which provide faithful but computationally intensive simulations of the physical world. These deep surrogates are generally trained in a supervised manner from limited amounts of data slowly generated by the same solver they intend to accelerate. We propose an open-source framework that enables the online t…

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: The International Conference for High Performance Computing, Networking, Storage, and Analysis, Nov 2023, Denver, CO, United States

  3. arXiv:2309.15207  [pdf, other]

    cs.LG

    Balancing Computational Efficiency and Forecast Error in Machine Learning-based Time-Series Forecasting: Insights from Live Experiments on Meteorological Nowcasting

    Authors: Elin Törnquist, Wagner Costa Santos, Timothy Pogue, Nicholas Wingle, Robert A. Caulk

    Abstract: Machine learning for time-series forecasting remains a key area of research. Despite successful application of many machine learning techniques, relating computational efficiency to forecast error remains an under-explored domain. This paper addresses this topic through a series of real-time experiments to quantify the relationship between computational cost and forecast error using meteorological…

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: 26 pages

    ACM Class: I.2; J.2

  4. arXiv:2306.16133  [pdf, other]

    cs.AI cs.DC physics.comp-ph

    Training Deep Surrogate Models with Large Scale Online Learning

    Authors: Lucas Meyer, Marc Schouler, Robert Alexander Caulk, Alejandro Ribés, Bruno Raffin

    Abstract: The spatiotemporal resolution of Partial Differential Equations (PDEs) plays important roles in the mathematical description of the world's physical phenomena. In general, scientists and engineers solve PDEs numerically by the use of computationally demanding solvers. Recently, deep learning algorithms have emerged as a viable alternative for obtaining fast solutions for PDEs. Models are usually t…

    Submitted 28 June, 2023; originally announced June 2023.