Skip to main content

Showing 1–6 of 6 results for author: Madjiheurem, S

.
  1. arXiv:2406.07234  [pdf

    cs.LG

    OPFData: Large-scale datasets for AC optimal power flow with topological perturbations

    Authors: Sean Lovett, Miha Zgubic, Sofia Liguori, Sephora Madjiheurem, Hamish Tomlinson, Sophie Elster, Chris Apps, Sims Witherspoon, Luis Piloto

    Abstract: Solving the AC optimal power flow problem (AC-OPF) is critical to the efficient and safe planning and operation of power grids. Small efficiency improvements in this domain have the potential to lead to billions of dollars of cost savings, and significant reductions in emissions from fossil fuel generators. Recent work on data-driven solution methods for AC-OPF shows the potential for large speed… ▽ More

    Submitted 18 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2403.17660  [pdf, other

    cs.LG

    CANOS: A Fast and Scalable Neural AC-OPF Solver Robust To N-1 Perturbations

    Authors: Luis Piloto, Sofia Liguori, Sephora Madjiheurem, Miha Zgubic, Sean Lovett, Hamish Tomlinson, Sophie Elster, Chris Apps, Sims Witherspoon

    Abstract: Optimal Power Flow (OPF) refers to a wide range of related optimization problems with the goal of operating power systems efficiently and securely. In the simplest setting, OPF determines how much power to generate in order to minimize costs while meeting demand for power and satisfying physical and operational constraints. In even the simplest case, power grid operators use approximations of the… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  3. arXiv:2007.01839  [pdf, other

    cs.LG cs.AI stat.ML

    Expected Eligibility Traces

    Authors: Hado van Hasselt, Sephora Madjiheurem, Matteo Hessel, David Silver, André Barreto, Diana Borsa

    Abstract: The question of how to determine which states and actions are responsible for a certain outcome is known as the credit assignment problem and remains a central research question in reinforcement learning and artificial intelligence. Eligibility traces enable efficient credit assignment to the recent sequence of states and actions experienced by the agent, but not to counterfactual sequences that c… ▽ More

    Submitted 8 February, 2021; v1 submitted 3 July, 2020; originally announced July 2020.

    Comments: AAAI, distinguished paper award

  4. arXiv:1910.10277  [pdf, other

    cs.LG stat.ML

    State2vec: Off-Policy Successor Features Approximators

    Authors: Sephora Madjiheurem, Laura Toni

    Abstract: A major challenge in reinforcement learning (RL) is the design of agents that are able to generalize across tasks that share common dynamics. A viable solution is meta-reinforcement learning, which identifies common structures among past tasks to be then generalized to new tasks (meta-test). In meta-training, the RL agent learns state representations that encode prior information from a set of tas… ▽ More

    Submitted 22 October, 2019; originally announced October 2019.

  5. arXiv:1901.05351  [pdf, ps, other

    cs.LG stat.ML

    Representation Learning on Graphs: A Reinforcement Learning Application

    Authors: Sephora Madjiheurem, Laura Toni

    Abstract: In this work, we study value function approximation in reinforcement learning (RL) problems with high dimensional state or action spaces via a generalized version of representation policy iteration (RPI). We consider the limitations of proto-value functions (PVFs) at accurately approximating the value function in low dimensions and we highlight the importance of features learning for an improved l… ▽ More

    Submitted 17 January, 2019; v1 submitted 16 January, 2019; originally announced January 2019.

  6. arXiv:1609.00439  [pdf, other

    cs.HC

    Qualitative Framing of Financial Incentives - A Case of Emotion Annotation

    Authors: Sephora Madjiheurem, Valentina Sintsova, Pearl Pu

    Abstract: Online labor platforms, such as the Amazon Mechanical Turk, provide an effective framework for eliciting responses to judgment tasks. Previous work has shown that workers respond best to financial incentives, especially to extra bonuses. However, most of the tested incentives involve describing the bonus conditions in formulas instead of plain English. We believe that different incentives given in… ▽ More

    Submitted 1 September, 2016; originally announced September 2016.

    Comments: Work-in-Progress Paper in the 4th AAAI Conference on Human Computation and Crowdsourcing (HCOMP 2016)

    ACM Class: H.1.2; H.5.3