Showing 1–12 of 12 results for author: Egele, R

  1. arXiv:2404.09703  [pdf, other]

    cs.LG stat.ML

    AI Competitions and Benchmarks: Dataset Development

    Authors: Romain Egele, Julio C. S. Jacques Junior, Jan N. van Rijn, Isabelle Guyon, Xavier Baró, Albert Clapés, Prasanna Balaprakash, Sergio Escalera, Thomas Moeslund, Jun Wan

    Abstract: Machine learning is now used in many applications thanks to its ability to predict, generate, or discover patterns from large quantities of data. However, the process of collecting and transforming data for practical use is intricate. Even in today's digital era, where substantial data is generated daily, it is uncommon for it to be readily usable; most often, it necessitates meticulous manual dat…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Preprint version of the 3rd Chapter of the book: Competitions and Benchmarks, the science behind the contests (https://sites.google.com/chalearn.org/book/home)

  2. arXiv:2404.05768  [pdf, other]

    cs.LG physics.ao-ph stat.ML

    Streamlining Ocean Dynamics Modeling with Fourier Neural Operators: A Multiobjective Hyperparameter and Architecture Optimization Approach

    Authors: Yixuan Sun, Ololade Sowunmi, Romain Egele, Sri Hari Krishna Narayanan, Luke Van Roekel, Prasanna Balaprakash

    Abstract: Training an effective deep learning model to learn ocean processes involves careful choices of various hyperparameters. We leverage the advanced search algorithms for multiobjective optimization in DeepHyper, a scalable hyperparameter optimization software, to streamline the development of neural networks tailored for ocean modeling. The focus is on optimizing Fourier neural operators (FNOs), a da…

    Submitted 10 April, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

  3. arXiv:2404.04111  [pdf, other]

    cs.LG

    The Unreasonable Effectiveness Of Early Discarding After One Epoch In Neural Network Hyperparameter Optimization

    Authors: Romain Egele, Felix Mohr, Tom Viering, Prasanna Balaprakash

    Abstract: To reach high performance with deep learning, hyperparameter optimization (HPO) is essential. This process is usually time-consuming due to costly evaluations of neural networks. Early discarding techniques limit the resources granted to unpromising candidates by observing the empirical learning curves and canceling neural network training as soon as the lack of competitiveness of a candidate beco…

    Submitted 5 April, 2024; originally announced April 2024.

  4. arXiv:2312.12705  [pdf, other]

    cs.DC cs.AI

    Optimizing Distributed Training on Frontier for Large Language Models

    Authors: Sajal Dash, Isaac Lyngaas, Junqi Yin, Xiao Wang, Romain Egele, Guojing Cong, Feiyi Wang, Prasanna Balaprakash

    Abstract: Large language models (LLMs) have demonstrated remarkable success as foundational models, benefiting various downstream applications through fine-tuning. Recent studies on loss scaling have demonstrated the superior performance of larger LLMs compared to their smaller counterparts. Nevertheless, training LLMs with billions of parameters poses significant challenges and requires considerable comput…

    Submitted 21 December, 2023; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: Edited the abstract to better communicate the scope of the work

  5. arXiv:2309.14936  [pdf, other]

    cs.LG cs.DC

    Parallel Multi-Objective Hyperparameter Optimization with Uniform Normalization and Bounded Objectives

    Authors: Romain Egele, Tyler Chang, Yixuan Sun, Venkatram Vishwanath, Prasanna Balaprakash

    Abstract: Machine learning (ML) methods offer a wide range of configurable hyperparameters that have a significant influence on their performance. While accuracy is a commonly used performance objective, in many settings, it is not sufficient. Optimizing the ML models with respect to multiple objectives such as accuracy, confidence, fairness, calibration, privacy, latency, and memory consumption is becoming…

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: Preprint with appendices

  6. arXiv:2307.15422  [pdf, other]

    cs.LG

    Is One Epoch All You Need For Multi-Fidelity Hyperparameter Optimization?

    Authors: Romain Egele, Isabelle Guyon, Yixuan Sun, Prasanna Balaprakash

    Abstract: Hyperparameter optimization (HPO) is crucial for fine-tuning machine learning models but can be computationally expensive. To reduce costs, Multi-fidelity HPO (MF-HPO) leverages intermediate accuracy levels in the learning process and discards low-performing models early on. We compared various representative MF-HPO methods against a simple baseline on classical benchmark data. The baseline involv…

    Submitted 26 September, 2023; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: 5 pages, with extended appendices

  7. arXiv:2302.09748  [pdf, other]

    cs.LG math.DS

    Quantifying uncertainty for deep learning based forecasting and flow-reconstruction using neural architecture search ensembles

    Authors: Romit Maulik, Romain Egele, Krishnan Raghavan, Prasanna Balaprakash

    Abstract: Classical problems in computational physics such as data-driven forecasting and signal reconstruction from sparse sensors have recently seen an explosion in deep neural network (DNN) based algorithmic approaches. However, most DNN models do not provide uncertainty estimates, which are crucial for establishing the trustworthiness of these techniques in downstream decision making tasks and scenarios…

    Submitted 19 February, 2023; originally announced February 2023.

  8. HPC Storage Service Autotuning Using Variational-Autoencoder-Guided Asynchronous Bayesian Optimization

    Authors: Matthieu Dorier, Romain Egele, Prasanna Balaprakash, Jaehoon Koo, Sandeep Madireddy, Srinivasan Ramesh, Allen D. Malony, Rob Ross

    Abstract: Distributed data storage services tailored to specific applications have grown popular in the high-performance computing (HPC) community as a way to address I/O and storage challenges. These services offer a variety of specific interfaces, semantics, and data representations. They also expose many tuning parameters, making it difficult for their users to find the best configuration for a given wor…

    Submitted 3 October, 2022; originally announced October 2022.

    Comments: Accepted at IEEE Cluster 2022

  9. arXiv:2207.00479  [pdf, other]

    cs.LG

    Asynchronous Decentralized Bayesian Optimization for Large Scale Hyperparameter Optimization

    Authors: Romain Egele, Isabelle Guyon, Venkatram Vishwanath, Prasanna Balaprakash

    Abstract: Bayesian optimization (BO) is a promising approach for hyperparameter optimization of deep neural networks (DNNs), where each model training can take minutes to hours. In BO, a computationally cheap surrogate model is employed to learn the relationship between parameter configurations and their performance such as accuracy. Parallel BO methods often adopt single manager/multiple workers strategies…

    Submitted 26 September, 2023; v1 submitted 1 July, 2022; originally announced July 2022.

  10. arXiv:2110.13511  [pdf, other]

    cs.LG

    AutoDEUQ: Automated Deep Ensemble with Uncertainty Quantification

    Authors: Romain Egele, Romit Maulik, Krishnan Raghavan, Bethany Lusch, Isabelle Guyon, Prasanna Balaprakash

    Abstract: Deep neural networks are powerful predictors for a variety of tasks. However, they do not capture uncertainty directly. Using neural network ensembles to quantify uncertainty is competitive with approaches based on Bayesian neural networks while benefiting from better computational scalability. However, building ensembles of neural networks is a challenging task because, in addition to choosing th…

    Submitted 4 July, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

  11. arXiv:2010.16358  [pdf, other]

    cs.LG cs.NE stat.ML

    AgEBO-Tabular: Joint Neural Architecture and Hyperparameter Search with Autotuned Data-Parallel Training for Tabular Data

    Authors: Romain Egele, Prasanna Balaprakash, Venkatram Vishwanath, Isabelle Guyon, Zhengying Liu

    Abstract: Develo** high-performing predictive models for large tabular data sets is a challenging task. The state-of-the-art methods are based on expert-developed model ensembles from different supervised learning methods. Recently, automated machine learning (AutoML) is emerging as a promising approach to automate predictive model development. Neural architecture search (NAS) is an AutoML approach that g… ▽ More

    Submitted 26 October, 2021; v1 submitted 30 October, 2020; originally announced October 2020.

  12. Scalable Reinforcement-Learning-Based Neural Architecture Search for Cancer Deep Learning Research

    Authors: Prasanna Balaprakash, Romain Egele, Misha Salim, Stefan Wild, Venkatram Vishwanath, Fangfang Xia, Tom Brettin, Rick Stevens

    Abstract: Cancer is a complex disease, the understanding and treatment of which are being aided through increases in the volume of collected data and in the scale of deployed computing power. Consequently, there is a growing need for the development of data-driven and, in particular, deep learning methods for various tasks such as cancer diagnosis, detection, prognosis, and prediction. Despite recent succes…

    Submitted 31 August, 2019; originally announced September 2019.

    Comments: SC '19: IEEE/ACM International Conference on High Performance Computing, Networking, Storage and Analysis, November 17–22, 2019, Denver, CO