Abstract

The usefulness of machine learning algorithms has led to their widespread adoption prior to the development of a conceptual framework for making sense of them. One common response to this situation is to say that machine learning suffers from a “black box problem.” That is, machine learning algorithms are “opaque” to human users, failing to be “interpretable” or “explicable” in terms that would render categorization procedures “understandable.” The purpose of this paper is to challenge the widespread agreement about the existence and importance of a black box problem. The first section argues that “interpretability” and cognates lack precise meanings when applied to algorithms. This makes the concepts difficult to use when trying to solve the problems that have motivated the call for interpretability (etc.). Furthermore, since there is no adequate account of the concepts themselves, it is not possible to assess whether particular technical features supply formal definitions of those concepts. The second section argues that there are ways of being a responsible user of these algorithms that do not require interpretability (etc.). In many cases in which a black box problem is cited, interpretability is a means to a further end such as justification or non-discrimination. Since addressing these problems need not involve something that looks like an “interpretation” (etc.) of an algorithm, the focus on interpretability artificially constrains the solution space by characterizing one possible solution as the problem itself. Where possible, discussion should be reformulated in terms of the ends of interpretability.

Details

Title
Against Interpretability: a Critical Examination of the Interpretability Problem in Machine Learning
Author
Krishnan, Maya 1   VIAFID ORCID Logo 

 All Souls College, Oxford, UK (GRID:grid.4991.5) (ISNI:0000 0004 1936 8948) 
Pages
487-502
Publication year
2020
Publication date
Sep 2020
Publisher
Springer Nature B.V.
ISSN
22105433
e-ISSN
22105441
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2272482130
Copyright
© The Author(s) 2019. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.