Skip to main content

Showing 1–27 of 27 results for author: Turner, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.06681  [pdf, other

    cs.CL cs.AI cs.LG

    Steering Llama 2 via Contrastive Activation Addition

    Authors: Nina Rimsky, Nick Gabrieli, Julian Schulz, Meg Tong, Evan Hubinger, Alexander Matt Turner

    Abstract: We introduce Contrastive Activation Addition (CAA), an innovative method for steering language models by modifying their activations during forward passes. CAA computes "steering vectors" by averaging the difference in residual stream activations between pairs of positive and negative examples of a particular behavior, such as factual versus hallucinatory responses. During inference, these steerin… ▽ More

    Submitted 6 March, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

  2. arXiv:2310.08043  [pdf, other

    cs.AI

    Understanding and Controlling a Maze-Solving Policy Network

    Authors: Ulisse Mini, Peli Grietzer, Mrinank Sharma, Austin Meek, Monte MacDiarmid, Alexander Matt Turner

    Abstract: To understand the goals and goal representations of AI systems, we carefully study a pretrained reinforcement learning policy that solves mazes by navigating to a range of target squares. We find this network pursues multiple context-dependent goals, and we further identify circuits within the network that correspond to one of these goals. In particular, we identified eleven channels that track th… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: 46 pages

  3. arXiv:2309.05440  [pdf

    cs.DC

    Emissions and energy efficiency on large-scale high performance computing facilities: ARCHER2 UK national supercomputing service case study

    Authors: Adrian Jackson, Alan Simpson, Andrew Turner

    Abstract: Large supercomputing facilities are critical to research in many areas that impact on decisions such as how to address the current climate emergency. For example, climate modelling, renewable energy facility design and new battery technologies. However, these systems themselves are a source of large amounts of emissions due to the embodied emissions associated with their construction, transport, a… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  4. arXiv:2308.13561  [pdf, other

    cs.HC cs.CV

    Project Aria: A New Tool for Egocentric Multi-Modal AI Research

    Authors: Jakob Engel, Kiran Somasundaram, Michael Goesele, Albert Sun, Alexander Gamino, Andrew Turner, Arjang Talattof, Arnie Yuan, Bilal Souti, Brighid Meredith, Cheng Peng, Chris Sweeney, Cole Wilson, Dan Barnes, Daniel DeTone, David Caruso, Derek Valleroy, Dinesh Ginjupalli, Duncan Frost, Edward Miller, Elias Mueggler, Evgeniy Oleinik, Fan Zhang, Guruprasad Somasundaram, Gustavo Solaira , et al. (49 additional authors not shown)

    Abstract: Egocentric, multi-modal data as available on future augmented reality (AR) devices provides unique challenges and opportunities for machine perception. These future devices will need to be all-day wearable in a socially acceptable form-factor to support always available, context-aware and personalized AI applications. Our team at Meta Reality Labs Research built the Aria device, an egocentric, mul… ▽ More

    Submitted 1 October, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

  5. arXiv:2308.10248  [pdf, other

    cs.CL cs.LG

    Activation Addition: Steering Language Models Without Optimization

    Authors: Alexander Matt Turner, Lisa Thiergart, Gavin Leech, David Udell, Juan J. Vazquez, Ulisse Mini, Monte MacDiarmid

    Abstract: Reliably controlling the behavior of large language models is a pressing open problem. Existing methods include supervised finetuning, reinforcement learning from human feedback, prompt engineering and guided decoding. We instead investigate activation engineering: modifying activations at inference-time to predictably alter model behavior. We bias the forward pass with a 'steering vector' implici… ▽ More

    Submitted 4 June, 2024; v1 submitted 20 August, 2023; originally announced August 2023.

  6. arXiv:2303.11731  [pdf, other

    cs.DC

    Automated service monitoring in the deployment of ARCHER2

    Authors: Kieran Leach, Philip Cass, Steven Robson, Eimantas Kazakevicius, Martin Lafferty, Andrew Turner, Alan Simpson

    Abstract: The ARCHER2 service, a CPU based HPE Cray EX system with 750,080 cores (5,860 nodes), has been deployed throughout 2020 and 2021, going into full service in December of 2021. A key part of the work during this deployment was the integration of ARCHER2 into our local monitoring systems. As ARCHER2 was one of the very first large-scale EX deployments, this involved close collaboration and developmen… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 7 pages

    ACM Class: C.5.1; C.4

  7. arXiv:2302.02477  [pdf, other

    cs.LG eess.SP q-bio.QM

    Offline Learning of Closed-Loop Deep Brain Stimulation Controllers for Parkinson Disease Treatment

    Authors: Qitong Gao, Stephen L. Schimdt, Afsana Chowdhury, Guangyu Feng, Jennifer J. Peters, Katherine Genty, Warren M. Grill, Dennis A. Turner, Miroslav Pajic

    Abstract: Deep brain stimulation (DBS) has shown great promise toward treating motor symptoms caused by Parkinson's disease (PD), by delivering electrical pulses to the Basal Ganglia (BG) region of the brain. However, DBS devices approved by the U.S. Food and Drug Administration (FDA) can only deliver continuous DBS (cDBS) stimuli at a fixed amplitude; this energy inefficient operation reduces battery lifet… ▽ More

    Submitted 15 March, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

    Comments: Accepted to International Conference on Cyber Physical Systems (ICCPS) 2023

  8. arXiv:2206.13477  [pdf, other

    cs.AI

    Parametrically Retargetable Decision-Makers Tend To Seek Power

    Authors: Alexander Matt Turner, Prasad Tadepalli

    Abstract: If capable AI agents are generally incentivized to seek power in service of the objectives we specify for them, then these systems will pose enormous risks, in addition to enormous benefits. In fully observable environments, most reward functions have an optimal policy which seeks power by kee** options open and staying alive. However, the real world is neither fully observable, nor must trained… ▽ More

    Submitted 11 October, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: 10-page main paper, 36 pages total, poster at NeurIPS 2022

  9. arXiv:2206.11831  [pdf, other

    cs.AI

    On Avoiding Power-Seeking by Artificial Intelligence

    Authors: Alexander Matt Turner

    Abstract: We do not know how to align a very intelligent AI agent's behavior with human interests. I investigate whether -- absent a full solution to this AI alignment problem -- we can build smart AI agents which have limited impact on the world, and which do not autonomously seek power. In this thesis, I introduce the attainable utility preservation (AUP) method. I demonstrate that AUP produces conservati… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

    Comments: 287 pages, PhD thesis

  10. arXiv:2206.11812  [pdf, other

    cs.AI

    Formalizing the Problem of Side Effect Regularization

    Authors: Alexander Matt Turner, Aseem Saxena, Prasad Tadepalli

    Abstract: AI objectives are often hard to specify properly. Some approaches tackle this problem by regularizing the AI's side effects: Agents must weigh off "how much of a mess they make" with an imperfectly specified proxy objective. We propose a formal criterion for side effect regularization via the assistance game framework. In these games, the agent solves a partially observable Markov decision process… ▽ More

    Submitted 8 November, 2022; v1 submitted 23 June, 2022; originally announced June 2022.

    Comments: 14 pages, accepted to ML Safety Workshop at NeurIPS 2022. Alexander Turner and Aseem Saxena contributed equally

  11. arXiv:2206.11061  [pdf, other

    cs.DB cs.AI cs.LO

    An Ontological Approach to Analysing Social Service Provisioning

    Authors: Mark S. Fox, Bart Gajderowicz, Daniela Rosu, Alina Turner, Lester Lyu

    Abstract: This paper introduces ontological concepts required to evaluate and manage the coverage of social services in a Smart City context. Here, we focus on the perspective of key stakeholders, namely social purpose organizations and the clients they serve. The Compass ontology presented here extends the Common Impact Data Standard by introducing new concepts related to key dimensions: the who (Stakehold… ▽ More

    Submitted 24 June, 2022; v1 submitted 20 June, 2022; originally announced June 2022.

    Comments: Update: corrected email, header text

  12. arXiv:2203.07989  [pdf, ps, other

    cs.LG stat.ML

    Approximability and Generalisation

    Authors: Andrew J. Turner, Ata Kabán

    Abstract: Approximate learning machines have become popular in the era of small devices, including quantised, factorised, hashed, or otherwise compressed predictors, and the quest to explain and guarantee good generalisation abilities for such methods has just begun. In this paper we study the role of approximability in learning, both in the full precision and the approximated settings of the predictor that… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

    Comments: 25 pages

  13. arXiv:2202.07590  [pdf, other

    hep-th cs.LG

    Identifying equivalent Calabi--Yau topologies: A discrete challenge from math and physics for machine learning

    Authors: Vishnu Jejjala, Washington Taylor, Andrew Turner

    Abstract: We review briefly the characteristic topological data of Calabi--Yau threefolds and focus on the question of when two threefolds are equivalent through related topological data. This provides an interesting test case for machine learning methodology in discrete mathematics problems motivated by physics.

    Submitted 15 February, 2022; originally announced February 2022.

    Comments: 6 pages, 3 figures; Contribution to proceedings of 2021 Nankai symposium on Mathematical Dialogues in celebration of S. S. Chern's 110th anniversary

    Report number: MIT-CTP-5406

  14. arXiv:2009.11806  [pdf, other

    cs.PF

    Investigating Applications on the A64FX

    Authors: Adrian Jackson, Michèle Weiland, Nick Brown, Andrew Turner, Mark Parsons

    Abstract: The A64FX processor from Fujitsu, being designed for computational simulation and machine learning applications, has the potential for unprecedented performance in HPC systems. In this paper, we evaluate the A64FX by benchmarking against a range of production HPC platforms that cover a number of processor technologies. We investigate the performance of complex scientific applications across multip… ▽ More

    Submitted 24 September, 2020; originally announced September 2020.

  15. arXiv:2006.06547  [pdf, other

    cs.AI

    Avoiding Side Effects in Complex Environments

    Authors: Alexander Matt Turner, Neale Ratzlaff, Prasad Tadepalli

    Abstract: Reward function specification can be difficult. Rewarding the agent for making a widget may be easy, but penalizing the multitude of possible negative side effects is hard. In toy environments, Attainable Utility Preservation (AUP) avoided side effects by penalizing shifts in the ability to achieve randomly generated goals. We scale this approach to large, randomly generated environments based on… ▽ More

    Submitted 22 October, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: Accepted as spotlight paper at NeurIPS 2020. 10 pages main paper; 19 pages with appendices

  16. arXiv:2001.01707  [pdf

    cs.LG eess.IV stat.ML

    Meta-modal Information Flow: A Method for Capturing Multimodal Modular Disconnectivity in Schizophrenia

    Authors: Haleh Falakshahi, Victor M. Vergara, **gyu Liu, Daniel H. Mathalon, Judith M. Ford, James Voyvodic, Bryon A. Mueller, Aysenil Belger, Sarah McEwen, Steven G. Potkin, Adrian Preda, Hooman Rokham, **g Sui, Jessica A. Turner, Sergey Plis, Vince D. Calhoun

    Abstract: Objective: Multimodal measurements of the same phenomena provide complementary information and highlight different perspectives, albeit each with their own limitations. A focus on a single modality may lead to incorrect inferences, which is especially important when a studied phenomenon is a disease. In this paper, we introduce a method that takes advantage of multimodal data in addressing the hyp… ▽ More

    Submitted 6 January, 2020; originally announced January 2020.

    Journal ref: IEEE Transactions on Biomedical Engineering, 2019

  17. arXiv:1912.02771  [pdf, other

    stat.ML cs.CR cs.LG

    Label-Consistent Backdoor Attacks

    Authors: Alexander Turner, Dimitris Tsipras, Aleksander Madry

    Abstract: Deep neural networks have been demonstrated to be vulnerable to backdoor attacks. Specifically, by injecting a small number of maliciously constructed inputs into the training set, an adversary is able to plant a backdoor into the trained model. This backdoor can then be activated during inference by a backdoor trigger to fully control the model's behavior. While such attacks are very effective, t… ▽ More

    Submitted 6 December, 2019; v1 submitted 5 December, 2019; originally announced December 2019.

  18. arXiv:1912.01683  [pdf, other

    cs.AI

    Optimal Policies Tend to Seek Power

    Authors: Alexander Matt Turner, Logan Smith, Rohin Shah, Andrew Critch, Prasad Tadepalli

    Abstract: Some researchers speculate that intelligent reinforcement learning (RL) agents would be incentivized to seek resources and power in pursuit of their objectives. Other researchers point out that RL agents need not have human-like power-seeking instincts. To clarify this discussion, we develop the first formal theory of the statistical tendencies of optimal policies. In the context of Markov decisio… ▽ More

    Submitted 28 January, 2023; v1 submitted 3 December, 2019; originally announced December 2019.

    Comments: Accepted to NeurIPS 2021 as spotlight paper. 12 pages, 44 pages with appendices. Since the 2021 acceptance, we updated the paper to point out that optimal policies can be qualitatively divorced from real-world learned policies

  19. arXiv:1906.03891  [pdf, other

    cs.DC cs.PF

    Analysis of parallel I/O use on the UK national supercomputing service, ARCHER using Cray LASSi and EPCC SAFE

    Authors: Andrew Turner, Dominic Sloan-Murphy, Karthee Sivalingam, Harvey Richardson, Julian Kunkel

    Abstract: In this paper, we describe how we have used a combination of the LASSi tool (developed by Cray) and the SAFE software (developed by EPCC) to collect and analyse Lustre I/O performance data for all jobs running on the UK national supercomputing service, ARCHER; and to provide reports on I/O usage for users in our standard reporting framework. We also present results from analysis of parallel I/O us… ▽ More

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: 15 pages, 19 figures, 5 tables, 2019 Cray User Group Meeting (CUG) , Montreal, Canada

  20. arXiv:1904.04250  [pdf, other

    cs.DC

    Evaluating the Arm Ecosystem for High Performance Computing

    Authors: Adrian Jackson, Andrew Turner, Michele Weiland, Nick Johnson, Olly Perks, Mark Parsons

    Abstract: In recent years, Arm-based processors have arrived on the HPC scene, offering an alternative the existing status quo, which was largely dominated by x86 processors. In this paper, we evaluate the Arm ecosystem, both the hardware offering and the software stack that is available to users, by benchmarking a production HPC platform that uses Marvell's ThunderX2 processors. We investigate the performa… ▽ More

    Submitted 8 April, 2019; originally announced April 2019.

    Comments: 18 pages, accepted at PASC19, 1 figure

  21. Conservative Agency via Attainable Utility Preservation

    Authors: Alexander Matt Turner, Dylan Hadfield-Menell, Prasad Tadepalli

    Abstract: Reward functions are easy to misspecify; although designers can make corrections after observing mistakes, an agent pursuing a misspecified reward function can irreversibly change the state of its environment. If that change precludes optimization of the correctly specified reward function, then correction is futile. For example, a robotic factory assistant could break expensive equipment due to a… ▽ More

    Submitted 10 June, 2020; v1 submitted 25 February, 2019; originally announced February 2019.

    Comments: Published in AI, Ethics, and Society 2020

  22. arXiv:1810.08677  [pdf, other

    cs.CL stat.ML

    A neural network to classify metaphorical violence on cable news

    Authors: Matthew A. Turner

    Abstract: I present here an experimental system for identifying and annotating metaphor in corpora. It is designed to plug in to Metacorps, an experimental web app for annotating metaphor. As Metacorps users annotate metaphors, the system will use user annotations as training data. When the system is confident, it will suggest an identification and an annotation. Once approved by the user, this becomes more… ▽ More

    Submitted 19 October, 2018; originally announced October 2018.

    Comments: 6 pages, 1 figure, 1 table

  23. arXiv:1805.12152  [pdf, other

    stat.ML cs.CV cs.LG cs.NE

    Robustness May Be at Odds with Accuracy

    Authors: Dimitris Tsipras, Shibani Santurkar, Logan Engstrom, Alexander Turner, Aleksander Madry

    Abstract: We show that there may exist an inherent tension between the goal of adversarial robustness and that of standard generalization. Specifically, training robust models may not only be more resource-consuming, but also lead to a reduction of standard accuracy. We demonstrate that this trade-off between the standard accuracy of a model and its robustness to adversarial perturbations provably exists in… ▽ More

    Submitted 9 September, 2019; v1 submitted 30 May, 2018; originally announced May 2018.

    Comments: ICLR'19

  24. Sequential Data Mining using Correlation Matrix Memory

    Authors: Sanil Shanker KP, Aaron Turner, Elizabeth Sherly, Jim Austin

    Abstract: This paper proposes a method for sequential data mining using correlation matrix memory. Here, we use the concept of the Logical Match to mine the indices of the sequential pattern. We demonstrate the uniqueness of the method with both the artificial and the real datum taken from NCBI databank.

    Submitted 6 July, 2014; originally announced July 2014.

    Comments: Networking and Information Technology (ICNIT), 2010 International Conference on

  25. arXiv:1309.1101  [pdf, ps, other

    cs.DC cs.SE

    Simplifying the Development, Use and Sustainability of HPC Software

    Authors: Jeremy Cohen, Chris Cantwell, Neil Chue Hong, David Moxey, Malcolm Illingworth, Andrew Turner, John Darlington, Spencer Sherwin

    Abstract: Develo** software to undertake complex, compute-intensive scientific processes requires a challenging combination of both specialist domain knowledge and software development skills to convert this knowledge into efficient code. As computational platforms become increasingly heterogeneous and newer types of platform such as Infrastructure-as-a-Service (IaaS) cloud computing become more widely ac… ▽ More

    Submitted 4 September, 2013; originally announced September 2013.

    Comments: 4 page position paper, submission to WSSSPE13 workshop

  26. arXiv:1209.5922  [pdf

    cs.DB q-bio.NC

    Towards structured sharing of raw and derived neuroimaging data across existing resources

    Authors: D. B. Keator, K. Helmer, J. Steffener, J. A. Turner, T. G. M. Van Erp, S. Gadde, N. Ashish, G. A. Burns, B. N. Nichols, S. S. Ghosh

    Abstract: Data sharing efforts increasingly contribute to the acceleration of scientific discovery. Neuroimaging data is accumulating in distributed domain-specific databases and there is currently no integrated access mechanism nor an accepted format for the critically important meta-data that is necessary for making use of the combined, available neuroimaging data. In this manuscript, we present work from… ▽ More

    Submitted 6 March, 2013; v1 submitted 26 September, 2012; originally announced September 2012.

  27. arXiv:cs/0607072  [pdf

    cs.HC

    Effect of Interface Style in Peer Review Comments for UML Designs

    Authors: Scott A. Turner, Manuel A. Perez-Quinones, Stephen H. Edwards

    Abstract: This paper presents our evaluation of using a Tablet-PC to provide peer-review comments in the first year Computer Science course. Our exploration consisted of an evaluation of how students write comments on other students' assignments using three different methods: pen and paper, a Tablet-PC, and a desktop computer. Our ultimate goal is to explore the effect that interface style (Tablet vs. Des… ▽ More

    Submitted 14 July, 2006; originally announced July 2006.

    Comments: 8 pages, 7 figures

    ACM Class: H.1; H.4; H.5