Skip to main content

Showing 1–2 of 2 results for author: Liu, A B

.
  1. arXiv:2403.03218  [pdf, other

    cs.LG cs.AI cs.CL cs.CY

    The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

    Authors: Nathaniel Li, Alexander Pan, Anjali Gopal, Summer Yue, Daniel Berrios, Alice Gatti, Justin D. Li, Ann-Kathrin Dombrowski, Shashwat Goel, Long Phan, Gabriel Mukobi, Nathan Helm-Burger, Rassin Lababidi, Lennart Justen, Andrew B. Liu, Michael Chen, Isabelle Barrass, Oliver Zhang, Xiaoyuan Zhu, Rishub Tamirisa, Bhrugu Bharathi, Adam Khoja, Zhenqi Zhao, Ariel Herbert-Voss, Cort B. Breuer , et al. (32 additional authors not shown)

    Abstract: The White House Executive Order on Artificial Intelligence highlights the risks of large language models (LLMs) empowering malicious actors in develo** biological, cyber, and chemical weapons. To measure these risks of malicious use, government institutions and major AI labs are develo** evaluations for hazardous capabilities in LLMs. However, current evaluations are private, preventing furthe… ▽ More

    Submitted 15 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: See the project page at https://wmdp.ai

  2. Refined height pairing

    Authors: Bruno Kahn, with an appendix by Qing Liu

    Abstract: For a $d$-dimensional smooth projective variety $X$ over the function field of a smooth variety $B$ over a field $k$ and for $i\ge 0$, we define a subgroup $CH^i(X)^{(0)}$ of $CH^i(X)$ and construct a "refined height pairing" \[CH^i(X)^{(0)}\times CH^{d+1-i}(X)^{(0)}\to CH^1(B)\] in the category of abelian groups modulo isogeny. For $i=1,d$, $CH^i(X)^{(0)}$ is the group of cycles numerically equiv… ▽ More

    Submitted 6 December, 2023; v1 submitted 1 September, 2020; originally announced September 2020.

    Comments: To appear in Alg. & Number theory. Added after Def. 2.2: Even if it is not apparent anymore, this definition was inspired by [8, Assumption 2] and [5, 1.2]

    Journal ref: Alg. Number Th. 18 (2024) 1039-1079