Showing 1–2 of 2 results for author: Green, B P

Search v0.5.6 released 2020-02-24

arXiv:2311.17017 [pdf, other]

cs.CY cs.AI

Foundational Moral Values for AI Alignment

Authors: Betty Li Hou, Brian Patrick Green

Abstract: Solving the AI alignment problem requires having clear, defensible values towards which AI systems can align. Currently, targets for alignment remain underspecified and do not seem to be built from a philosophically robust structure. We begin the discussion of this problem by presenting five core, foundational values, drawn from moral philosophy and built on the requisites for human existence: sur… ▽ More Solving the AI alignment problem requires having clear, defensible values towards which AI systems can align. Currently, targets for alignment remain underspecified and do not seem to be built from a philosophically robust structure. We begin the discussion of this problem by presenting five core, foundational values, drawn from moral philosophy and built on the requisites for human existence: survival, sustainable intergenerational existence, society, education, and truth. We show that these values not only provide a clearer direction for technical alignment work, but also serve as a framework to highlight threats and opportunities from AI systems to both obtain and sustain these values. △ Less

Submitted 28 November, 2023; originally announced November 2023.

Comments: AI meets Moral Philosophy and Moral Psychology Workshop, 37th Conference on Neural Information Processing Systems (NeurIPS 2023)
arXiv:2301.03740 [pdf, other]

cs.CY cs.AI

A Multi-Level Framework for the AI Alignment Problem

Authors: Betty Li Hou, Brian Patrick Green

Abstract: AI alignment considers how we can encode AI systems in a way that is compatible with human values. The normative side of this problem asks what moral values or principles, if any, we should encode in AI. To this end, we present a framework to consider the question at four levels: Individual, Organizational, National, and Global. We aim to illustrate how AI alignment is made up of value alignment p… ▽ More AI alignment considers how we can encode AI systems in a way that is compatible with human values. The normative side of this problem asks what moral values or principles, if any, we should encode in AI. To this end, we present a framework to consider the question at four levels: Individual, Organizational, National, and Global. We aim to illustrate how AI alignment is made up of value alignment problems at each of these levels, where values at each level affect the others and effects can flow in either direction. We outline key questions and considerations of each level and demonstrate an application of this framework to the topic of AI content moderation. △ Less

Submitted 9 January, 2023; originally announced January 2023.

Comments: ML Safety Workshop, 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

Search v0.5.6 released 2020-02-24