Search | arXiv e-print repository

Ten Hard Problems in Artificial Intelligence We Must Get Right

Authors: Gavin Leech, Simson Garfinkel, Misha Yagudin, Alexander Briand, Aleksandr Zhuravlev

Abstract: We explore the AI2050 "hard problems" that block the promise of AI and cause AI risks: (1) developing general capabilities of the systems; (2) assuring the performance of AI systems and their training processes; (3) aligning system goals with human goals; (4) enabling great applications of AI in real life; (5) addressing economic disruptions; (6) ensuring the participation of all; (7) at the same time ensuring socially responsible deployment; (8) addressing any geopolitical disruptions that AI causes; (9) promoting sound governance of the technology; and (10) managing the philosophical disruptions for humans living in the age of AI. For each problem, we outline the area, identify significant recent work, and suggest ways forward. [Note: this paper reviews literature through January 2023.]

Submitted 19 April, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

Comments: 75 + 19 pages

arXiv:2308.10248 [pdf, other]

Activation Addition: Steering Language Models Without Optimization

Authors: Alexander Matt Turner, Lisa Thiergart, Gavin Leech, David Udell, Juan J. Vazquez, Ulisse Mini, Monte MacDiarmid

Abstract: Reliably controlling the behavior of large language models is a pressing open problem. Existing methods include supervised finetuning, reinforcement learning from human feedback, prompt engineering and guided decoding. We instead investigate activation engineering: modifying activations at inference-time to predictably alter model behavior. We bias the forward pass with a 'steering vector' implicitly specified through natural language. Past work learned these steering vectors; our Activation Addition (ActAdd) method instead computes them by taking activation differences resulting from pairs of prompts. We demonstrate ActAdd on a range of LLMs (LLaMA-3, OPT, GPT-2, and GPT-J), obtaining SOTA on detoxification and negative-to-positive sentiment control. Our approach yields inference-time control over high-level properties of output like topic and sentiment while preserving performance on off-target tasks. ActAdd takes far less compute and implementation effort than finetuning or RLHF, allows users control through natural language, and its computational overhead (as a fraction of inference time) appears stable or improving over increasing model size.

Submitted 4 June, 2024; v1 submitted 20 August, 2023; originally announced August 2023.
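
The core idea in the abstract above (a steering vector computed as an activation difference between a pair of prompts, then added during the forward pass) can be sketched in a few lines. This is a toy illustration only, not the authors' implementation: the activation vectors, the function names `steering_vector` and `add_steering`, and the scaling coefficient `coeff` are all assumptions made for the example, and no real language model is involved.

```python
from typing import List

Vector = List[float]


def steering_vector(act_pos: Vector, act_neg: Vector, coeff: float) -> Vector:
    """Scaled difference between activations from a contrasting prompt pair.

    act_pos and act_neg stand in for one layer's activations on two
    prompts (hypothetical values; a real model would supply them).
    """
    return [coeff * (p - n) for p, n in zip(act_pos, act_neg)]


def add_steering(activation: Vector, steer: Vector) -> Vector:
    """Bias a forward-pass activation by adding the steering vector."""
    return [a + s for a, s in zip(activation, steer)]


# Hypothetical activations at one layer for a contrasting prompt pair.
act_pos = [1.0, 0.0, 0.5]
act_neg = [0.0, 1.0, 0.5]

steer = steering_vector(act_pos, act_neg, coeff=2.0)

# Hypothetical activation for the user's prompt at the same layer.
steered = add_steering([0.5, 0.5, 0.5], steer)
```

Nothing is trained here: the steering vector comes from two forward passes and a subtraction, which is the sense in which the abstract contrasts this approach with learned steering vectors.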

arXiv:2305.11022 [pdf, other]

Massively Parallel Reweighted Wake-Sleep

Authors: Thomas Heap, Gavin Leech, Laurence Aitchison

Abstract: Reweighted wake-sleep (RWS) is a machine learning method for performing Bayesian inference in a very general class of models. RWS draws $K$ samples from an underlying approximate posterior, then uses importance weighting to provide a better estimate of the true posterior. RWS then updates its approximate posterior towards the importance-weighted estimate of the true posterior. However, recent work [Chatterjee and Diaconis, 2018] indicates that the number of samples required for effective importance weighting is exponential in the number of latent variables. Attaining such a large number of importance samples is intractable in all but the smallest models. Here, we develop massively parallel RWS, which circumvents this issue by drawing $K$ samples of all $n$ latent variables, and individually reasoning about all $K^n$ possible combinations of samples. While reasoning about $K^n$ combinations might seem intractable, the required computations can be performed in polynomial time by exploiting conditional independencies in the generative model. We show considerable improvements over standard "global" RWS, which draws $K$ samples from the full joint.

Submitted 18 May, 2023; originally announced May 2023.
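
The importance-weighting step that the abstract above builds on can be sketched as follows. This is a minimal illustration of standard self-normalized importance weighting with $K$ samples, not the massively parallel method and not the wake or sleep updates. The toy model (latent $z \sim N(0, 1)$, observation $x \mid z \sim N(z, 1)$), the choice of the prior as the approximate posterior, and all function names are assumptions chosen so that the exact posterior mean, $x/2$, is known.

```python
import math
import random
from typing import List


def log_normal_pdf(x: float, mean: float, std: float) -> float:
    """Log density of a univariate Gaussian."""
    var = std * std
    return -0.5 * math.log(2.0 * math.pi * var) - (x - mean) ** 2 / (2.0 * var)


def normalized_importance_weights(x_obs: float, samples: List[float]) -> List[float]:
    """Self-normalized weights w_k proportional to p(x, z_k) / q(z_k)."""
    log_w = []
    for z in samples:
        # Toy generative model: z ~ N(0, 1), x | z ~ N(z, 1).
        log_joint = log_normal_pdf(z, 0.0, 1.0) + log_normal_pdf(x_obs, z, 1.0)
        # Toy approximate posterior q(z): the prior N(0, 1).
        log_q = log_normal_pdf(z, 0.0, 1.0)
        log_w.append(log_joint - log_q)
    # Subtract the maximum before exponentiating for numerical stability.
    max_log_w = max(log_w)
    unnormalized = [math.exp(lw - max_log_w) for lw in log_w]
    total = sum(unnormalized)
    return [u / total for u in unnormalized]


rng = random.Random(0)  # fixed seed so the run is reproducible
K = 20000
x_obs = 2.0
samples = [rng.gauss(0.0, 1.0) for _ in range(K)]
weights = normalized_importance_weights(x_obs, samples)

# Importance-weighted estimate of the posterior mean; the exact value
# for this toy model is x_obs / 2.
posterior_mean_estimate = sum(w * z for w, z in zip(weights, samples))
```

With a single latent variable, a few thousand samples are plenty. The difficulty the abstract describes arises when there are many latent variables and all of them are sampled jointly.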

arXiv:2303.00779 [pdf]

Timed material self-assembly controlled by circadian clock proteins

Authors: Gregor Leech, Lauren Melcher, Michelle Chiu, Maya Nugent, Lily Burton, Janet Kang, Soo Ji Kim, Sourav Roy, Leila Farhadi, Jennifer L. Ross, Moumita Das, Michael J. Rust, Rae M. Robertson-Anderson

Abstract: Active biological molecules present a powerful, yet largely untapped, opportunity to impart autonomous regulation to materials. Because these systems can function robustly to regulate when and where chemical reactions occur, they have the ability to bring complex, life-like behavior to synthetic materials. Here, we achieve this design feat by using functionalized circadian clock proteins, KaiB and KaiC, to engineer time-dependent crosslinking of colloids. The resulting material self-assembles with programmable kinetics, producing macroscopic changes in material properties, via molecular assembly of KaiB-KaiC complexes. We show that colloid crosslinking depends strictly on the phosphorylation state of KaiC, with kinetics that are synced with KaiB-KaiC complexing. Our microscopic image analyses and computational models indicate that the stability of colloidal super-structures depends sensitively on the number of Kai complexes per colloid connection. Consistent with our model predictions, a high concentration stabilizes the material against dissolution after a robust self-assembly phase, while a low concentration allows circadian oscillation of material structure. This work introduces the concept of harnessing biological timers to control synthetic materials; and, more generally, opens the door to using protein-based reaction networks to endow synthetic systems with life-like functional properties.

Submitted 20 March, 2024; v1 submitted 1 March, 2023; originally announced March 2023.

Comments: 5 figures + SI

arXiv:2302.04081 [pdf, other]

Decision trees compensate for model misspecification

Authors: Hugh Panton, Gavin Leech, Laurence Aitchison

Abstract: The best-performing models in ML are not interpretable. If we can explain why they outperform, we may be able to replicate these mechanisms and obtain both interpretability and performance. One example is decision trees and their descendants, gradient boosting machines (GBMs). These perform well in the presence of complex interactions, with tree depth governing the order of interactions. However, interactions cannot fully account for the depth of trees found in practice. We confirm 5 alternative hypotheses about the role of tree depth in performance in the absence of true interactions, and present results from experiments on a battery of datasets. Part of the success of tree models is due to their robustness to various forms of misspecification. We present two methods for robust generalized linear models (GLMs) addressing the composite and mixed response scenarios.

Submitted 8 February, 2023; originally announced February 2023.

arXiv:2104.04113 [pdf]

Active Cytoskeletal Composites Display Emergent Tunable Contractility and Restructuring

Authors: Gloria Lee, Gregor Leech, Pancy Lwin, Jonathan Michel, Christopher Currie, Michael J. Rust, Jennifer L. Ross, Ryan J. McGorty, Moumita Das, Rae M. Robertson-Anderson

Abstract: The cytoskeleton is a model active matter system that controls diverse cellular processes from division to motility. While both active actomyosin dynamics and actin-microtubule interactions are key to the cytoskeleton's versatility and adaptability, an understanding of their interplay is lacking. Here, we couple microscale experiments with mechanistic modeling to elucidate how connectivity, rigidity, and force-generation affect emergent material properties in in vitro composites of actin, tubulin, and myosin. We use time-resolved differential dynamic microscopy and spatial image autocorrelation to show that ballistic contraction occurs in composites with sufficient flexibility and motor density, but that a critical fraction of microtubules is necessary to sustain controlled dynamics. Our active double-network models reveal that percolated actomyosin networks are essential for contraction, but that networks with comparable actin and microtubule densities can uniquely resist mechanical stresses while simultaneously supporting substantial restructuring. Our findings provide a much-needed blueprint for designing cytoskeleton-inspired materials that couple tunability with resilience and adaptability.

Submitted 8 April, 2021; originally announced April 2021.

arXiv:2009.11677 [pdf, other]

Legally grounded fairness objectives

Authors: Dylan Holden-Sim, Gavin Leech, Laurence Aitchison

Abstract: Recent work has identified a number of formally incompatible operational measures for the unfairness of a machine learning (ML) system. As these measures all capture intuitively desirable aspects of a fair system, choosing "the one true" measure is not possible, and instead a reasonable approach is to minimize a weighted combination of measures. However, this simply raises the question of how to choose the weights. Here, we formulate Legally Grounded Fairness Objectives (LGFO), which uses signals from the legal system to non-arbitrarily measure the social cost of a specific degree of unfairness. The LGFO is the expected damages under a putative lawsuit that might be awarded to those who were wrongly classified, in the sense that the ML system made a decision different to that which would have been made under the court's preferred measure. Notably, the two quantities necessary to compute the LGFO, the court's preferences about fairness measures, and the expected damages, are unknown but well-defined, and can be estimated by legal advice. Further, as the damages awarded by the legal system are designed to measure and compensate for the harm caused to an individual by an unfair classification, the LGFO aligns closely with society's estimate of the social cost.

Submitted 24 September, 2020; originally announced September 2020.
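
One possible reading of the "expected damages" quantity described in the abstract above is sketched below. It is an illustrative interpretation only, not the paper's formulation: the function name `expected_damages`, the flat per-person damages figure, the measure names, and the representation of the court's preferences as a probability over candidate fairness measures are all assumptions.

```python
from typing import Dict, List


def expected_damages(
    decisions: List[int],
    decisions_by_measure: Dict[str, List[int]],
    measure_probs: Dict[str, float],
    damages_per_error: float,
) -> float:
    """Expected total damages over the court's possible preferred measures.

    decisions: what the ML system actually decided for each person.
    decisions_by_measure: the decisions each fairness measure would have
        required (hypothetical inputs).
    measure_probs: estimated probability that the court prefers each
        measure (hypothetical, e.g. taken from legal advice).
    damages_per_error: assumed flat award per wrongly classified person.
    """
    total = 0.0
    for measure, prob in measure_probs.items():
        preferred = decisions_by_measure[measure]
        # A person counts as wrongly classified if the system's decision
        # differs from the one required by the court's preferred measure.
        n_wrong = sum(1 for d, p in zip(decisions, preferred) if d != p)
        total += prob * n_wrong * damages_per_error
    return total


system_decisions = [1, 0, 1, 1]
required = {
    "measure_a": [1, 0, 0, 1],  # disagrees with the system on 1 person
    "measure_b": [0, 0, 1, 0],  # disagrees with the system on 2 people
}
court_preference = {"measure_a": 0.5, "measure_b": 0.5}

objective = expected_damages(system_decisions, required, court_preference, 1000.0)
```

In this toy setup the weighting between fairness measures comes from the estimated court preferences rather than from hand-picked weights, which is the motivation the abstract gives.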

arXiv:2007.13454 [pdf, other]

How Robust are the Estimated Effects of Nonpharmaceutical Interventions against COVID-19?

Authors: Mrinank Sharma, Sören Mindermann, Jan Markus Brauner, Gavin Leech, Anna B. Stephenson, Tomáš Gavenčiak, Jan Kulveit, Yee Whye Teh, Leonid Chindelevitch, Yarin Gal

Abstract: To what extent are effectiveness estimates of nonpharmaceutical interventions (NPIs) against COVID-19 influenced by the assumptions our models make? To answer this question, we investigate 2 state-of-the-art NPI effectiveness models and propose 6 variants that make different structural assumptions. In particular, we investigate how well NPI effectiveness estimates generalise to unseen countries, and their sensitivity to unobserved factors. Models that account for noise in disease transmission compare favourably. We further evaluate how robust estimates are to different choices of epidemiological parameters and data. Focusing on models that assume transmission noise, we find that previously published results are remarkably robust across these variables. Finally, we mathematically ground the interpretation of NPI effectiveness estimates when certain common assumptions do not hold.

Submitted 20 December, 2020; v1 submitted 27 July, 2020; originally announced July 2020.

Journal ref: NeurIPS 2020, Advances in Neural Information Processing Systems 33

Showing 1–8 of 8 results for author: Leech, G