Skip to main content

Showing 1–17 of 17 results for author: Heim, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.10799  [pdf

    cs.CY cs.LG

    Training Compute Thresholds: Features and Functions in AI Governance

    Authors: Lennart Heim

    Abstract: This paper examines the use of training compute thresholds as a tool for governing artificial intelligence (AI) systems. We argue that compute thresholds serve as a valuable trigger for further evaluation of AI models, rather than being the sole determinant of the regulation. Key advantages of compute thresholds include their correlation with model capabilities and risks, quantifiability, ease of… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Working paper

  2. arXiv:2405.10295  [pdf

    cs.CY cs.AI cs.HC

    Societal Adaptation to Advanced AI

    Authors: Jamie Bernardi, Gabriel Mukobi, Hilary Greaves, Lennart Heim, Markus Anderljung

    Abstract: Existing strategies for managing risks from advanced AI systems often focus on affecting what AI systems are developed and how they diffuse. However, this approach becomes less feasible as the number of developers of advanced AI grows, and impedes beneficial use-cases as well as harmful ones. In response, we urge a complementary approach: increasing societal adaptation to advanced AI, that is, red… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  3. arXiv:2404.02675  [pdf, other

    cs.CY cs.AI

    Responsible Reporting for Frontier AI Development

    Authors: Noam Kolt, Markus Anderljung, Joslyn Barnhart, Asher Brass, Kevin Esvelt, Gillian K. Hadfield, Lennart Heim, Mikel Rodriguez, Jonas B. Sandbrink, Thomas Woodside

    Abstract: Mitigating the risks from frontier AI systems requires up-to-date and reliable information about those systems. Organizations that develop and deploy frontier systems have significant access to such information. By reporting safety-critical information to actors in government, industry, and civil society, these organizations could improve visibility into new and emerging risks posed by frontier sy… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  4. arXiv:2403.08501  [pdf, other

    cs.CY

    Governing Through the Cloud: The Intermediary Role of Compute Providers in AI Regulation

    Authors: Lennart Heim, Tim Fist, Janet Egan, Sihao Huang, Stephen Zekany, Robert Trager, Michael A Osborne, Noa Zilberman

    Abstract: As jurisdictions around the world take their first steps toward regulating the most powerful AI systems, such as the EU AI Act and the US Executive Order 14110, there is a growing need for effective enforcement mechanisms that can verify compliance and respond to violations. We argue that compute providers should have legal obligations and ethical responsibilities associated with AI development an… ▽ More

    Submitted 26 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: v2: Fixing affiliations, formatting errors, and vector graphics

  5. arXiv:2402.08797  [pdf, other

    cs.CY

    Computing Power and the Governance of Artificial Intelligence

    Authors: Girish Sastry, Lennart Heim, Haydn Belfield, Markus Anderljung, Miles Brundage, Julian Hazell, Cullen O'Keefe, Gillian K. Hadfield, Richard Ngo, Konstantin Pilz, George Gor, Emma Bluemke, Sarah Shoker, Janet Egan, Robert F. Trager, Shahar Avin, Adrian Weller, Yoshua Bengio, Diane Coyle

    Abstract: Computing power, or "compute," is crucial for the development and deployment of artificial intelligence (AI) capabilities. As a result, governments and companies have started to leverage compute as a means to govern AI. For example, governments are investing in domestic compute capacity, controlling the flow of compute to competing countries, and subsidizing compute access to certain sectors. Howe… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Figures can be accessed at: https://github.com/lheim/CPGAI-Figures

  6. arXiv:2401.13138  [pdf, other

    cs.CY cs.AI

    Visibility into AI Agents

    Authors: Alan Chan, Carson Ezell, Max Kaufmann, Kevin Wei, Lewis Hammond, Herbie Bradley, Emma Bluemke, Nitarshan Rajkumar, David Krueger, Noam Kolt, Lennart Heim, Markus Anderljung

    Abstract: Increased delegation of commercial, scientific, governmental, and personal activities to AI agents -- systems capable of pursuing complex goals with limited supervision -- may exacerbate existing societal risks and introduce new risks. Understanding and mitigating these risks involves critically evaluating existing governance structures, revising and adapting these structures where needed, and ens… ▽ More

    Submitted 17 May, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: Accepted to ACM Conference on Fairness, Accountability, and Transparency (ACM FAccT 2024)

  7. arXiv:2401.02452  [pdf, other

    cs.CY cs.AI

    The Compute Divide in Machine Learning: A Threat to Academic Contribution and Scrutiny?

    Authors: Tamay Besiroglu, Sage Andrus Bergerson, Amelia Michael, Lennart Heim, Xueyun Luo, Neil Thompson

    Abstract: There are pronounced differences in the extent to which industrial and academic AI labs use computing resources. We provide a data-driven survey of the role of the compute divide in sha** machine learning research. We show that a compute divide has coincided with a reduced representation of academic-only research teams in compute intensive research topics, especially foundation models. We argue… ▽ More

    Submitted 8 January, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

  8. arXiv:2311.15377  [pdf, other

    cs.CY

    Increased Compute Efficiency and the Diffusion of AI Capabilities

    Authors: Konstantin Pilz, Lennart Heim, Nicholas Brown

    Abstract: Training advanced AI models requires large investments in computational resources, or compute. Yet, as hardware innovation reduces the price of compute and algorithmic advances make its use more efficient, the cost of training an AI model to a given performance falls over time - a concept we describe as increasing compute efficiency. We find that while an access effect increases the number of acto… ▽ More

    Submitted 13 February, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

  9. arXiv:2311.02651  [pdf

    cs.CY cs.AI

    Compute at Scale: A Broad Investigation into the Data Center Industry

    Authors: Konstantin Pilz, Lennart Heim

    Abstract: This report characterizes the data center industry and its importance for AI development. Data centers are industrial facilities that efficiently provide compute at scale and thus constitute the engine rooms of today's digital economy. As large-scale AI training and inference become increasingly computationally expensive, they are dominantly executed from this designated infrastructure. Key featur… ▽ More

    Submitted 22 November, 2023; v1 submitted 5 November, 2023; originally announced November 2023.

  10. arXiv:2310.13625  [pdf, other

    cs.CY

    Oversight for Frontier AI through a Know-Your-Customer Scheme for Compute Providers

    Authors: Janet Egan, Lennart Heim

    Abstract: To address security and safety risks stemming from highly capable artificial intelligence (AI) models, we propose that the US government should ensure compute providers implement Know-Your-Customer (KYC) schemes. Compute - the computational power and infrastructure required to train and run these AI models - is emerging as a node for oversight. KYC, a standard developed by the banking sector to id… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  11. arXiv:2308.15514  [pdf, other

    cs.AI

    International Governance of Civilian AI: A Jurisdictional Certification Approach

    Authors: Robert Trager, Ben Harack, Anka Reuel, Allison Carnegie, Lennart Heim, Lewis Ho, Sarah Kreps, Ranjit Lall, Owen Larter, Seán Ó hÉigeartaigh, Simon Staffell, José Jaime Villalobos

    Abstract: This report describes trade-offs in the design of international governance arrangements for civilian artificial intelligence (AI) and presents one approach in detail. This approach represents the extension of a standards, licensing, and liability regime to the global level. We propose that states establish an International AI Organization (IAIO) to certify state jurisdictions (not firms or AI proj… ▽ More

    Submitted 11 September, 2023; v1 submitted 29 August, 2023; originally announced August 2023.

  12. arXiv:2305.07153  [pdf, other

    cs.CY

    Towards best practices in AGI safety and governance: A survey of expert opinion

    Authors: Jonas Schuett, Noemi Dreksler, Markus Anderljung, David McCaffary, Lennart Heim, Emma Bluemke, Ben Garfinkel

    Abstract: A number of leading AI companies, including OpenAI, Google DeepMind, and Anthropic, have the stated goal of building artificial general intelligence (AGI) - AI systems that achieve or exceed human performance across a wide range of cognitive tasks. In pursuing this goal, they may develop and deploy AI systems that pose particularly significant risks. While they have already taken some measures to… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: 38 pages, 8 figures, 8 tables

  13. arXiv:2211.04325  [pdf, other

    cs.LG cs.AI cs.CL cs.CV cs.CY

    Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning

    Authors: Pablo Villalobos, Jaime Sevilla, Lennart Heim, Tamay Besiroglu, Marius Hobbhahn, Anson Ho

    Abstract: We analyze the growth of dataset sizes used in machine learning for natural language processing and computer vision, and extrapolate these using two methods; using the historical growth rate and estimating the compute-optimal dataset size for future predicted compute budgets. We investigate the growth in data usage by estimating the total stock of unlabeled data available on the internet over the… ▽ More

    Submitted 25 October, 2022; originally announced November 2022.

  14. arXiv:2210.04610  [pdf, other

    cs.AI cs.CR cs.CV cs.CY cs.LG

    Red-Teaming the Stable Diffusion Safety Filter

    Authors: Javier Rando, Daniel Paleka, David Lindner, Lennart Heim, Florian Tramèr

    Abstract: Stable Diffusion is a recent open-source image generation model comparable to proprietary models such as DALLE, Imagen, or Parti. Stable Diffusion comes with a safety filter that aims to prevent generating explicit images. Unfortunately, the filter is obfuscated and poorly documented. This makes it hard for users to prevent misuse in their applications, and to understand the filter's limitations a… ▽ More

    Submitted 10 November, 2022; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: ML Safety Workshop NeurIPS 2022

  15. arXiv:2207.02852  [pdf, other

    cs.LG cs.AI cs.CL cs.CY

    Machine Learning Model Sizes and the Parameter Gap

    Authors: Pablo Villalobos, Jaime Sevilla, Tamay Besiroglu, Lennart Heim, Anson Ho, Marius Hobbhahn

    Abstract: We study trends in model size of notable machine learning systems over time using a curated dataset. From 1950 to 2018, model size in language models increased steadily by seven orders of magnitude. The trend then accelerated, with model size increasing by another five orders of magnitude in just 4 years from 2018 to 2022. Vision models grew at a more constant pace, totaling 7 orders of magnitude… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

  16. Compute Trends Across Three Eras of Machine Learning

    Authors: Jaime Sevilla, Lennart Heim, Anson Ho, Tamay Besiroglu, Marius Hobbhahn, Pablo Villalobos

    Abstract: Compute, data, and algorithmic advances are the three fundamental factors that guide the progress of modern Machine Learning (ML). In this paper we study trends in the most readily quantified factor - compute. We show that before 2010 training compute grew in line with Moore's law, doubling roughly every 20 months. Since the advent of Deep Learning in the early 2010s, the scaling of training compu… ▽ More

    Submitted 9 March, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

    Journal ref: 2022 International Joint Conference on Neural Networks (IJCNN), Padua, Italy, 2022, pp. 1-8

  17. arXiv:2104.10645  [pdf, other

    cs.LG

    Measuring what Really Matters: Optimizing Neural Networks for TinyML

    Authors: Lennart Heim, Andreas Biri, Zhongnan Qu, Lothar Thiele

    Abstract: With the surge of inexpensive computational and memory resources, neural networks (NNs) have experienced an unprecedented growth in architectural and computational complexity. Introducing NNs to resource-constrained devices enables cost-efficient deployments, widespread availability, and the preservation of sensitive data. This work addresses the challenges of bringing Machine Learning to MCUs, wh… ▽ More

    Submitted 21 April, 2021; originally announced April 2021.