
Showing 1–1 of 1 results for author: Madkour, N

  1. arXiv:2405.10986

    cs.CR cs.AI cs.CY cs.LG

    Benchmark Early and Red Team Often: A Framework for Assessing and Managing Dual-Use Hazards of AI Foundation Models

    Authors: Anthony M. Barrett, Krystal Jackson, Evan R. Murphy, Nada Madkour, Jessica Newman

    Abstract: A concern about cutting-edge or "frontier" AI foundation models is that an adversary may use the models for preparing chemical, biological, radiological, nuclear (CBRN), cyber, or other attacks. At least two methods can identify foundation models with potential dual-use capability; each has advantages and disadvantages: A. Open benchmarks (based on openly available questions and answers), which a…

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 62 pages