Skip to main content

Showing 1–10 of 10 results for author: Saunders, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2206.05802  [pdf, other

    cs.CL cs.LG

    Self-critiquing models for assisting human evaluators

    Authors: William Saunders, Catherine Yeh, Jeff Wu, Steven Bills, Long Ouyang, Jonathan Ward, Jan Leike

    Abstract: We fine-tune large language models to write natural language critiques (natural language critical comments) using behavioral cloning. On a topic-based summarization task, critiques written by our models help humans find flaws in summaries that they would have otherwise missed. Our models help find naturally occurring flaws in both model and human written summaries, and intentional flaws in summari… ▽ More

    Submitted 13 June, 2022; v1 submitted 12 June, 2022; originally announced June 2022.

  2. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  3. arXiv:2112.09332  [pdf, other

    cs.CL cs.AI cs.LG

    WebGPT: Browser-assisted question-answering with human feedback

    Authors: Reiichiro Nakano, Jacob Hilton, Suchir Balaji, Jeff Wu, Long Ouyang, Christina Kim, Christopher Hesse, Shantanu Jain, Vineet Kosaraju, William Saunders, Xu Jiang, Karl Cobbe, Tyna Eloundou, Gretchen Krueger, Kevin Button, Matthew Knight, Benjamin Chess, John Schulman

    Abstract: We fine-tune GPT-3 to answer long-form questions using a text-based web-browsing environment, which allows the model to search and navigate the web. By setting up the task so that it can be performed by humans, we are able to train models on the task using imitation learning, and then optimize answer quality with human feedback. To make human evaluation of factual accuracy easier, models must coll… ▽ More

    Submitted 1 June, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

    Comments: 32 pages

  4. arXiv:2110.06674  [pdf, other

    cs.CY cs.AI cs.CL

    Truthful AI: Develo** and governing AI that does not lie

    Authors: Owain Evans, Owen Cotton-Barratt, Lukas Finnveden, Adam Bales, Avital Balwit, Peter Wills, Luca Righetti, William Saunders

    Abstract: In many contexts, lying -- the use of verbal falsehoods to deceive -- is harmful. While lying has traditionally been a human affair, AI systems that make sophisticated verbal statements are becoming increasingly prevalent. This raises the question of how we should limit the harm caused by AI "lies" (i.e. falsehoods that are actively selected for). Human truthfulness is governed by social norms and… ▽ More

    Submitted 13 October, 2021; originally announced October 2021.

    ACM Class: I.2.0

  5. arXiv:2107.03374  [pdf, other

    cs.LG

    Evaluating Large Language Models Trained on Code

    Authors: Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde de Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, Alex Ray, Raul Puri, Gretchen Krueger, Michael Petrov, Heidy Khlaaf, Girish Sastry, Pamela Mishkin, Brooke Chan, Scott Gray, Nick Ryder, Mikhail Pavlov, Alethea Power, Lukasz Kaiser, Mohammad Bavarian, Clemens Winter , et al. (33 additional authors not shown)

    Abstract: We introduce Codex, a GPT language model fine-tuned on publicly available code from GitHub, and study its Python code-writing capabilities. A distinct production version of Codex powers GitHub Copilot. On HumanEval, a new evaluation set we release to measure functional correctness for synthesizing programs from docstrings, our model solves 28.8% of the problems, while GPT-3 solves 0% and GPT-J sol… ▽ More

    Submitted 14 July, 2021; v1 submitted 7 July, 2021; originally announced July 2021.

    Comments: corrected typos, added references, added authors, added acknowledgements

  6. arXiv:2007.10054  [pdf, other

    cs.DC physics.comp-ph

    Parallel Performance of ARM ThunderX2 for Atomistic Simulation Algorithms

    Authors: William Robert Saunders, James Grant, Eike Hermann Müller

    Abstract: Atomistic simulation drives scientific advances in modern material science and accounts for a significant proportion of wall time on High Performance Computing facilities. It is important that algorithms are efficient and implementations are performant in a continuously diversifying hardware landscape. Furthermore, they have to be portable to make best use of the available computing resource. In… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

    Comments: 10 pages, 3 figures, 1 tables; submitted to EAHPC-2020 (Embracing Arm: a journey of porting and optimization to the latest Arm-based processors 2020)

  7. arXiv:1905.04065  [pdf, other

    physics.comp-ph cond-mat.mtrl-sci cs.DC

    Fast electrostatic solvers for kinetic Monte Carlo simulations

    Authors: William Robert Saunders, James Grant, Eike Hermann Müller, Ian Thompson

    Abstract: Kinetic Monte Carlo (KMC) is an important computational tool in physics and chemistry. In contrast to standard Monte Carlo, KMC permits the description of time dependent dynamical processes and is not restricted to systems in equilibrium. Recently KMC has been applied successfully in modelling of novel energy materials such as Lithium-ion batteries and solar cells. We consider general solid state… ▽ More

    Submitted 1 March, 2020; v1 submitted 10 May, 2019; originally announced May 2019.

    Comments: 26 pages, 19 figures, 7 tables; accepted for publication in Computer Physics Communications

    MSC Class: 78M16; 82C80; 82D37; 65Y05; 65Y20 ACM Class: J.2; G.4; D.1.3; D.2.11

  8. arXiv:1708.01135  [pdf, other

    cs.DC cs.SE physics.comp-ph

    Long range forces in a performance portable Molecular Dynamics framework

    Authors: William R. Saunders, James Grant, Eike H. Müller

    Abstract: Molecular Dynamics (MD) codes predict the fundamental properties of matter by following the trajectories of a collection of interacting model particles. To exploit diverse modern manycore hardware, efficient codes must use all available parallelism. At the same time they need to be portable and easily extendible by the domain specialist (physicist/chemist) without detailed knowledge of this hardwa… ▽ More

    Submitted 3 August, 2017; originally announced August 2017.

    Comments: 9 pages, 3 figures, submitted to ParCo 2017 Parallel Computing Conference

    ACM Class: D.1.3; D.2.11; J.2; G.4

  9. arXiv:1707.05173  [pdf, other

    cs.AI cs.LG cs.NE

    Trial without Error: Towards Safe Reinforcement Learning via Human Intervention

    Authors: William Saunders, Girish Sastry, Andreas Stuhlmueller, Owain Evans

    Abstract: AI systems are increasingly applied to complex tasks that involve interaction with humans. During training, such systems are potentially dangerous, as they haven't yet learned to avoid actions that could cause serious harm. How can an AI system explore and learn without making a single mistake that harms humans or otherwise causes serious damage? For model-free reinforcement learning, having a hum… ▽ More

    Submitted 17 July, 2017; originally announced July 2017.

  10. arXiv:1704.03329  [pdf, other

    cs.DC cs.SE physics.comp-ph

    A Domain Specific Language for Performance Portable Molecular Dynamics Algorithms

    Authors: William R. Saunders, James Grant, Eike H. Müller

    Abstract: Developers of Molecular Dynamics (MD) codes face significant challenges when adapting existing simulation packages to new hardware. In a continuously diversifying hardware landscape it becomes increasingly difficult for scientists to be experts both in their own domain (physics/chemistry/biology) and specialists in the low level parallelisation and optimisation of their codes. To address this chal… ▽ More

    Submitted 13 November, 2017; v1 submitted 11 April, 2017; originally announced April 2017.

    Comments: 24 pages, 12 figures, 11 tables, accepted for publication in Computer Physics Communications on 12 Nov 2017

    ACM Class: D.1.3, D.2.11, J.2, G.4