Showing 1–7 of 7 results for author: Gutiérrez, R L

  1. arXiv:2404.12498

    cs.LG cs.AI eess.SY

    A Configurable Pythonic Data Center Model for Sustainable Cooling and ML Integration

    Authors: Avisek Naug, Antonio Guillen, Ricardo Luna Gutierrez, Vineet Gundecha, Sahand Ghorbanpour, Sajad Mousavi, Ashwin Ramesh Babu, Soumyendu Sarkar

    Abstract: There have been growing discussions on estimating and subsequently reducing the operational carbon footprint of enterprise data centers. The design and intelligent control for data centers have an important impact on data center carbon footprint. In this paper, we showcase PyDCM, a Python library that enables extremely fast prototyping of data center design and applies reinforcement learning-enabl…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: NeurIPS 2023 Workshop on Tackling Climate Change with Machine Learning https://www.climatechange.ai/papers/neurips2023/15. arXiv admin note: substantial text overlap with arXiv:2310.03906

  2. arXiv:2310.18679

    cs.CL cs.AI cs.LG

    N-Critics: Self-Refinement of Large Language Models with Ensemble of Critics

    Authors: Sajad Mousavi, Ricardo Luna Gutiérrez, Desik Rengarajan, Vineet Gundecha, Ashwin Ramesh Babu, Avisek Naug, Antonio Guillen, Soumyendu Sarkar

    Abstract: We propose a self-correction mechanism for Large Language Models (LLMs) to mitigate issues such as toxicity and fact hallucination. This method involves refining model outputs through an ensemble of critics and the model's own feedback. Drawing inspiration from human behavior, we explore whether LLMs can emulate the self-correction process observed in humans who often engage in self-reflection and…

    Submitted 8 November, 2023; v1 submitted 28 October, 2023; originally announced October 2023.

    Journal ref: NeurIPS 2023 Workshop on Robustness of Few-shot and Zero-shot Learning in Foundation Models 2023 (NeurIPS 2023)

  3. RTDK-BO: High Dimensional Bayesian Optimization with Reinforced Transformer Deep kernels

    Authors: Alexander Shmakov, Avisek Naug, Vineet Gundecha, Sahand Ghorbanpour, Ricardo Luna Gutierrez, Ashwin Ramesh Babu, Antonio Guillen, Soumyendu Sarkar

    Abstract: Bayesian Optimization (BO), guided by Gaussian process (GP) surrogates, has proven to be an invaluable technique for efficient, high-dimensional, black-box optimization, a critical problem inherent to many applications such as industrial design and scientific computing. Recent contributions have introduced reinforcement learning (RL) to improve the optimization performance on both single function…

    Submitted 8 November, 2023; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: 2023 IEEE 19th International Conference on Automation Science and Engineering (CASE)

  4. PyDCM: Custom Data Center Models with Reinforcement Learning for Sustainability

    Authors: Avisek Naug, Antonio Guillen, Ricardo Luna Gutiérrez, Vineet Gundecha, Dejan Markovikj, Lekhapriya Dheeraj Kashyap, Lorenz Krause, Sahand Ghorbanpour, Sajad Mousavi, Ashwin Ramesh Babu, Soumyendu Sarkar

    Abstract: The increasing global emphasis on sustainability and reducing carbon emissions is pushing governments and corporations to rethink their approach to data center design and operation. Given their high energy consumption and exponentially large computational workloads, data centers are prime candidates for optimizing power consumption, especially in areas such as cooling and IT energy usage. A signif…

    Submitted 26 March, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: The 10th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation (BuildSys '23), November 15-16, 2023, Istanbul, Turkey

    Journal ref: 2023 BuildSys '23: Proceedings of the 10th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation

  5. arXiv:2107.02603

    cs.AI cs.LG

    Meta-Reinforcement Learning for Heuristic Planning

    Authors: Ricardo Luna Gutierrez, Matteo Leonetti

    Abstract: In Meta-Reinforcement Learning (meta-RL) an agent is trained on a set of tasks to prepare for and learn faster in new, unseen, but related tasks. The training tasks are usually hand-crafted to be representative of the expected distribution of test tasks and hence all used in training. We show that given a set of training tasks, learning can be both faster and more effective (leading to better perf…

    Submitted 6 July, 2021; originally announced July 2021.

    Comments: ICAPS 2021

  6. arXiv:2011.01054

    cs.LG

    Information-theoretic Task Selection for Meta-Reinforcement Learning

    Authors: Ricardo Luna Gutierrez, Matteo Leonetti

    Abstract: In Meta-Reinforcement Learning (meta-RL) an agent is trained on a set of tasks to prepare for and learn faster in new, unseen, but related tasks. The training tasks are usually hand-crafted to be representative of the expected distribution of test tasks and hence all used in training. We show that given a set of training tasks, learning can be both faster and more effective (leading to better perf…

    Submitted 1 July, 2021; v1 submitted 2 November, 2020; originally announced November 2020.

    Comments: Published at NeurIPS 2020

  7. arXiv:1906.06178

    cs.LG cs.AI stat.ML

    Curriculum Learning for Cumulative Return Maximization

    Authors: Francesco Foglino, Christiano Coletto Christakou, Ricardo Luna Gutierrez, Matteo Leonetti

    Abstract: Curriculum learning has been successfully used in reinforcement learning to accelerate the learning process, through knowledge transfer between tasks of increasing complexity. Critical tasks, in which suboptimal exploratory actions must be minimized, can benefit from curriculum learning, and its ability to shape exploration through transfer. We propose a task sequencing algorithm maximizing the cu…

    Submitted 13 June, 2019; originally announced June 2019.

    Comments: Proceedings of the 28th International Joint Conference on Artificial Intelligence (IJCAI-19). arXiv admin note: text overlap with arXiv:1901.11478