Skip to main content

Showing 1–9 of 9 results for author: Nekoei, H

.
  1. arXiv:2405.01616  [pdf, other

    q-bio.BM cs.AI cs.LG

    Generative Active Learning for the Search of Small-molecule Protein Binders

    Authors: Maksym Korablyov, Cheng-Hao Liu, Moksh Jain, Almer M. van der Sloot, Eric Jolicoeur, Edward Ruediger, Andrei Cristian Nica, Emmanuel Bengio, Kostiantyn Lapchevskyi, Daniel St-Cyr, Doris Alexandra Schuetz, Victor Ion Butoi, Jarrid Rector-Brooks, Simon Blackburn, Leo Feng, Hadi Nekoei, SaiKrishna Gottipati, Priyesh Vijayan, Prateek Gupta, Ladislav Rampášek, Sasikanth Avancha, Pierre-Luc Bacon, William L. Hamilton, Brooks Paige, Sanchit Misra , et al. (9 additional authors not shown)

    Abstract: Despite substantial progress in machine learning for scientific discovery in recent years, truly de novo design of small molecules which exhibit a property of interest remains a significant challenge. We introduce LambdaZero, a generative active learning approach to search for synthesizable molecules. Powered by deep reinforcement learning, LambdaZero learns to search over the vast space of molecu… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  2. arXiv:2404.14620  [pdf, other

    cs.LG cs.CY

    Fairness Incentives in Response to Unfair Dynamic Pricing

    Authors: Jesse Thibodeau, Hadi Nekoei, Afaf Taïk, Janarthanan Rajendran, Golnoosh Farnadi

    Abstract: The use of dynamic pricing by profit-maximizing firms gives rise to demand fairness concerns, measured by discrepancies in consumer groups' demand responses to a given pricing strategy. Notably, dynamic pricing may result in buyer distributions unreflective of those of the underlying population, which can be problematic in markets where fair representation is socially desirable. To address this, p… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  3. arXiv:2308.10284  [pdf, other

    cs.LG cs.AI cs.MA

    Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi

    Authors: Hadi Nekoei, Xutong Zhao, Janarthanan Rajendran, Miao Liu, Sarath Chandar

    Abstract: Cooperative Multi-agent Reinforcement Learning (MARL) algorithms with Zero-Shot Coordination (ZSC) have gained significant attention in recent years. ZSC refers to the ability of agents to coordinate zero-shot (without additional interaction experience) with independently trained agents. While ZSC is crucial for cooperative MARL agents, it might not be possible for complex tasks and changing envir… ▽ More

    Submitted 20 August, 2023; originally announced August 2023.

  4. arXiv:2302.02792  [pdf, other

    cs.LG

    Dealing With Non-stationarity in Decentralized Cooperative Multi-Agent Deep Reinforcement Learning via Multi-Timescale Learning

    Authors: Hadi Nekoei, Akilesh Badrinaaraayanan, Amit Sinha, Mohammad Amini, Janarthanan Rajendran, Aditya Mahajan, Sarath Chandar

    Abstract: Decentralized cooperative multi-agent deep reinforcement learning (MARL) can be a versatile learning framework, particularly in scenarios where centralized training is either not possible or not practical. One of the critical challenges in decentralized deep MARL is the non-stationarity of the learning environment when multiple agents are learning concurrently. A commonly used and efficient scheme… ▽ More

    Submitted 17 August, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

  5. arXiv:2301.02593  [pdf, other

    cs.MA cs.AI cs.LG eess.SY

    Multi-Agent Reinforcement Learning for Fast-Timescale Demand Response of Residential Loads

    Authors: Vincent Mai, Philippe Maisonneuve, Tianyu Zhang, Hadi Nekoei, Liam Paull, Antoine Lesage-Landry

    Abstract: To integrate high amounts of renewable energy resources, electrical power grids must be able to cope with high amplitude, fast timescale variations in power generation. Frequency regulation through demand response has the potential to coordinate temporally flexible loads, such as air conditioners, to counteract these variations. Existing approaches for discrete control with dynamic constraints str… ▽ More

    Submitted 6 January, 2023; originally announced January 2023.

    Comments: Presented as an extended abstract at AAMAS 2023

  6. arXiv:2103.03216  [pdf, other

    cs.LG cs.AI cs.MA

    Continuous Coordination As a Realistic Scenario for Lifelong Learning

    Authors: Hadi Nekoei, Akilesh Badrinaaraayanan, Aaron Courville, Sarath Chandar

    Abstract: Current deep reinforcement learning (RL) algorithms are still highly task-specific and lack the ability to generalize to new environments. Lifelong learning (LLL), however, aims at solving multiple tasks sequentially by efficiently transferring and using knowledge between tasks. Despite a surge of interest in lifelong RL in recent years, the lack of a realistic testbed makes robust evaluation of L… ▽ More

    Submitted 14 June, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

    Comments: 19 pages with supplementary materials. Added results for Lifelong RL methods and some future work. Accepted to ICML 2021

  7. arXiv:2102.08501  [pdf, other

    cs.LG stat.ML

    DEUP: Direct Epistemic Uncertainty Prediction

    Authors: Salem Lahlou, Moksh Jain, Hadi Nekoei, Victor Ion Butoi, Paul Bertin, Jarrid Rector-Brooks, Maksym Korablyov, Yoshua Bengio

    Abstract: Epistemic Uncertainty is a measure of the lack of knowledge of a learner which diminishes with more evidence. While existing work focuses on using the variance of the Bayesian posterior due to parameter uncertainty as a measure of epistemic uncertainty, we argue that this does not capture the part of lack of knowledge induced by model misspecification. We discuss how the excess risk, which is the… ▽ More

    Submitted 3 February, 2023; v1 submitted 16 February, 2021; originally announced February 2021.

  8. arXiv:2011.00901  [pdf, other

    stat.ME physics.comp-ph physics.data-an stat.ML

    Sampling Algorithms, from Survey Sampling to Monte Carlo Methods: Tutorial and Literature Review

    Authors: Benyamin Ghojogh, Hadi Nekoei, Aydin Ghojogh, Fakhri Karray, Mark Crowley

    Abstract: This paper is a tutorial and literature review on sampling algorithms. We have two main types of sampling in statistics. The first type is survey sampling which draws samples from a set or population. The second type is sampling from probability distribution where we have a probability density or mass function. In this paper, we cover both types of sampling. First, we review some required backgrou… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: The first three authors contributed equally to this work

  9. arXiv:2007.03158  [pdf, other

    cs.LG cs.AI stat.ML

    The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning

    Authors: Harm van Seijen, Hadi Nekoei, Evan Racah, Sarath Chandar

    Abstract: Deep model-based Reinforcement Learning (RL) has the potential to substantially improve the sample-efficiency of deep RL. While various challenges have long held it back, a number of papers have recently come out reporting success with deep model-based methods. This is a great development, but the lack of a consistent metric to evaluate such methods makes it difficult to compare various approaches… ▽ More

    Submitted 3 December, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

    Comments: NeurIPS 2020, code: https://github.com/chandar-lab/LoCA