Skip to main content

Showing 1–8 of 8 results for author: Reuel, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.06987  [pdf, other

    cs.CY

    Position Paper: Technical Research and Talent is Needed for Effective AI Governance

    Authors: Anka Reuel, Lisa Soder, Ben Bucknall, Trond Arne Undheim

    Abstract: In light of recent advancements in AI capabilities and the increasingly widespread integration of AI systems into society, governments worldwide are actively seeking to mitigate the potential harms and risks associated with these technologies through regulation and other governance tools. However, there exist significant gaps between governance aspirations and the current state of the technical to… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 9 pages, 3 figures, Proceedings of the 41 st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  2. arXiv:2406.04554  [pdf, ps, other

    cs.CY

    Generative AI Needs Adaptive Governance

    Authors: Anka Reuel, Trond Arne Undheim

    Abstract: Because of the speed of its development, broad scope of application, and its ability to augment human performance, generative AI challenges the very notions of governance, trust, and human agency. The technology's capacity to mimic human knowledge work, feedback loops including significant uptick in users, research, investor, policy, and media attention, data and compute resources, all lead to rap… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  3. arXiv:2405.19522  [pdf

    cs.AI

    Artificial Intelligence Index Report 2024

    Authors: Nestor Maslej, Loredana Fattorini, Raymond Perrault, Vanessa Parli, Anka Reuel, Erik Brynjolfsson, John Etchemendy, Katrina Ligett, Terah Lyons, James Manyika, Juan Carlos Niebles, Yoav Shoham, Russell Wald, Jack Clark

    Abstract: The 2024 Index is our most comprehensive to date and arrives at an important moment when AI's influence on society has never been more pronounced. This year, we have broadened our scope to more extensively cover essential trends such as technical advancements in AI, public perceptions of the technology, and the geopolitical dynamics surrounding its development. Featuring more original data than ev… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  4. arXiv:2405.06909  [pdf, ps, other

    cs.LG cs.AI cs.CY

    Fairness in Reinforcement Learning: A Survey

    Authors: Anka Reuel, Devin Ma

    Abstract: While our understanding of fairness in machine learning has significantly progressed, our understanding of fairness in reinforcement learning (RL) remains nascent. Most of the attention has been on fairness in one-shot classification tasks; however, real-world, RL-enabled systems (e.g., autonomous vehicles) are much more complicated in that agents operate in dynamic environments over a long period… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: 10 pages

    ACM Class: A.1; I.2

  5. arXiv:2401.03408  [pdf, other

    cs.AI cs.CL cs.CY cs.MA

    Escalation Risks from Language Models in Military and Diplomatic Decision-Making

    Authors: Juan-Pablo Rivera, Gabriel Mukobi, Anka Reuel, Max Lamparth, Chandler Smith, Jacquelyn Schneider

    Abstract: Governments are increasingly considering integrating autonomous AI agents in high-stakes military and foreign-policy decision-making, especially with the emergence of advanced generative AI models like GPT-4. Our work aims to scrutinize the behavior of multiple AI agents in simulated wargames, specifically focusing on their predilection to take escalatory actions that may exacerbate multilateral c… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: 10 pages body, 57 pages appendix, 46 figures, 11 tables

    Journal ref: The 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT 24), June 3-6, 2024, Rio de Janeiro, Brazil

  6. arXiv:2308.15514  [pdf, other

    cs.AI

    International Governance of Civilian AI: A Jurisdictional Certification Approach

    Authors: Robert Trager, Ben Harack, Anka Reuel, Allison Carnegie, Lennart Heim, Lewis Ho, Sarah Kreps, Ranjit Lall, Owen Larter, Seán Ó hÉigeartaigh, Simon Staffell, José Jaime Villalobos

    Abstract: This report describes trade-offs in the design of international governance arrangements for civilian artificial intelligence (AI) and presents one approach in detail. This approach represents the extension of a standards, licensing, and liability regime to the global level. We propose that states establish an International AI Organization (IAIO) to certify state jurisdictions (not firms or AI proj… ▽ More

    Submitted 11 September, 2023; v1 submitted 29 August, 2023; originally announced August 2023.

  7. arXiv:2304.07249  [pdf, other

    cs.CY

    How to design an AI ethics board

    Authors: Jonas Schuett, Anka Reuel, Alexis Carlier

    Abstract: Organizations that develop and deploy artificial intelligence (AI) systems need to take measures to reduce the associated risks. In this paper, we examine how AI companies could design an AI ethics board in a way that reduces risks from AI. We identify five high-level design choices: (1) What responsibilities should the board have? (2) What should its legal structure be? (3) Who should sit on the… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: 21 pages, 2 figures, 2 tables

  8. arXiv:2302.12461  [pdf, other

    cs.LG cs.AI cs.CL

    Analyzing And Editing Inner Mechanisms Of Backdoored Language Models

    Authors: Max Lamparth, Anka Reuel

    Abstract: Poisoning of data sets is a potential security threat to large language models that can lead to backdoored models. A description of the internal mechanisms of backdoored language models and how they process trigger inputs, e.g., when switching to toxic language, has yet to be found. In this work, we study the internal representations of transformer-based backdoored language models and determine ea… ▽ More

    Submitted 3 May, 2024; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: Final version accepted at FAccT 24

    Journal ref: The 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT 24), June 3-6, 2024, Rio de Janeiro, Brazil