Abstract

Current learning machines have successfully solved hard application problems, reaching high accuracy and displaying seemingly intelligent behavior. Here we apply recent techniques for explaining decisions of state-of-the-art learning machines and analyze various tasks from computer vision and arcade games. This showcases a spectrum of problem-solving behaviors ranging from naive and short-sighted, to well-informed and strategic. We observe that standard performance evaluation metrics can be oblivious to distinguishing these diverse problem solving behaviors. Furthermore, we propose our semi-automated Spectral Relevance Analysis that provides a practically effective way of characterizing and validating the behavior of nonlinear learning machines. This helps to assess whether a learned model indeed delivers reliably for the problem that it was conceived for. Furthermore, our work intends to add a voice of caution to the ongoing excitement about machine intelligence and pledges to evaluate and judge some of these recent successes in a more nuanced manner.

Nonlinear machine learning methods have good predictive ability but the lack of transparency of the algorithms can limit their use. Here the authors investigate how these methods approach learning in order to assess the dependability of their decision making.

Details

Title
Unmasking Clever Hans predictors and assessing what machines really learn
Author
Lapuschkin Sebastian 1 ; Wäldchen Stephan 2 ; Binder, Alexander 3 ; Montavon Grégoire 2 ; Samek Wojciech 1 ; Klaus-Robert, Müller 4 

 Fraunhofer Heinrich Hertz Institute, Department of Video Coding & Analytics, Berlin, Germany (GRID:grid.435231.2) (ISNI:0000 0004 0495 5488) 
 Technische Universität Berlin, Department of Electrical Engineering and Computer Science, Berlin, Germany (GRID:grid.6734.6) (ISNI:0000 0001 2292 8254) 
 Singapore University of Technology and Design, ISTD Pillar, Singapore, Singapore (GRID:grid.263662.5) (ISNI:0000 0004 0500 7631) 
 Technische Universität Berlin, Department of Electrical Engineering and Computer Science, Berlin, Germany (GRID:grid.6734.6) (ISNI:0000 0001 2292 8254); Korea University, Department of Brain and Cognitive Engineering, Seoul, Republic of Korea (GRID:grid.222754.4) (ISNI:0000 0001 0840 2678); Max Planck Institut für Informatik, Saarbrücken, Germany (GRID:grid.419528.3) (ISNI:0000 0004 0491 9823) 
Publication year
2019
Publication date
Dec 2019
Publisher
Nature Publishing Group
e-ISSN
20411723
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2190076363
Copyright
This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.