It appears you don't have support to open PDFs in this web browser. To view this file, Open with your PDF reader
Abstract
Since Alan Turing envisioned artificial intelligence, technical progress has often been measured by the ability to defeat humans in zero-sum encounters (e.g., Chess, Poker, or Go). Less attention has been given to scenarios in which human–machine cooperation is beneficial but non-trivial, such as scenarios in which human and machine preferences are neither fully aligned nor fully in conflict. Cooperation does not require sheer computational power, but instead is facilitated by intuition, cultural norms, emotions, signals, and pre-evolved dispositions. Here, we develop an algorithm that combines a state-of-the-art reinforcement-learning algorithm with mechanisms for signaling. We show that this algorithm can cooperate with people and other algorithms at levels that rival human cooperation in a variety of two-player repeated stochastic games. These results indicate that general human–machine cooperation is achievable using a non-trivial, but ultimately simple, set of algorithmic mechanisms.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details






1 Computer Science Department, Brigham Young University, Provo, UT, USA
2 Khalifa University of Science and Technology, Masdar Institute, Abu Dhabi, United Arab Emirates
3 UVA Digital Himalaya Project, University of Virginia, Charlottesville, VA, USA
4 British University in Dubai, Dubai, United Arab Emirates; School of Informatics, University of Edinburgh, Edinburgh, UK
5 Toulouse School of Economics (TSM-Research), Centre National de la Recherche Scientifique, University of Toulouse Capitole, Toulouse, France
6 The Media Lab, Massachusetts Institute of Technology, Cambridge, MA, USA
7 Department of Psychology and Social Behavior, University of California, Irvine, CA, USA
8 The Media Lab, Massachusetts Institute of Technology, Cambridge, MA, USA; Institute for Data, Systems and Society, Massachusetts Institute of Technology, Cambridge, MA, USA