Abstract
Building artificial intelligence (AI) that aligns with human values is an unsolved problem. Here we developed a human-in-the-loop research pipeline called Democratic AI, in which reinforcement learning is used to design a social mechanism that humans prefer by majority. A large group of humans played an online investment game that involved deciding whether to keep a monetary endowment or to share it with others for collective benefit. Shared revenue was returned to players under two different redistribution mechanisms, one designed by the AI and the other by humans. The AI discovered a mechanism that redressed initial wealth imbalance, sanctioned free riders and successfully won the majority vote. By optimizing for human preferences, Democratic AI offers a proof of concept for value-aligned policy innovation.
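The investment game described above can be sketched as a simple public-goods round: players choose how much of their endowment to contribute, the pooled contributions grow, and the pot is redistributed under some mechanism. The sketch below is purely illustrative; the multiplier value, the unequal endowments, and the proportional redistribution rule are stand-in assumptions, not the paper's actual parameters (the paper compares an AI-learned rule against human-designed baselines).

```python
# Hypothetical sketch of the investment game from the abstract.
# The multiplier, endowments, and proportional redistribution rule
# are illustrative assumptions, not the study's actual values.

def play_round(endowments, contributions, multiplier=1.6):
    """Pool contributions, grow the pot, and redistribute it.

    Redistribution here is a simple stand-in: each player's share of
    the grown pot is proportional to their contribution. The learned
    mechanism in the paper replaces this rule.
    """
    total = sum(contributions)
    pot = total * multiplier
    payoffs = []
    for endowment, contribution in zip(endowments, contributions):
        share = pot * (contribution / total) if total > 0 else 0.0
        payoffs.append(endowment - contribution + share)
    return payoffs

# Three players with unequal endowments; the third free-rides.
print(play_round([10, 5, 2], [5, 5, 0]))  # → [13.0, 8.0, 2.0]
```

Under this stand-in rule, the free rider keeps only their endowment; the paper's AI-designed mechanism likewise sanctioned free riding, while also redressing initial wealth imbalance.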
Koster, Balaguer et al. show that an AI system can learn a redistribution policy that humans prefer to alternatives in an incentivized game.
Details
1 DeepMind, London, UK (GRID:grid.498210.6) (ISNI:0000 0004 5999 1726)
2 University of Exeter, Department of Economics and Institute for Data Science and Artificial Intelligence, Exeter, UK (GRID:grid.8391.3) (ISNI:0000 0004 1936 8024)
3 DeepMind, London, UK (GRID:grid.498210.6) (ISNI:0000 0004 5999 1726); University College London, Gatsby Computational Neuroscience Unit, London, UK (GRID:grid.83440.3b) (ISNI:0000000121901201)
4 DeepMind, London, UK (GRID:grid.498210.6) (ISNI:0000 0004 5999 1726); University of Oxford, Department of Experimental Psychology, Oxford, UK (GRID:grid.4991.5) (ISNI:0000 0004 1936 8948)