Abstract

The usefulness of machine learning algorithms has led to their widespread adoption prior to the development of a conceptual framework for making sense of them. One common response to this situation is to say that machine learning suffers from a “black box problem.” That is, machine learning algorithms are “opaque” to human users, failing to be “interpretable” or “explicable” in terms that would render categorization procedures “understandable.” The purpose of this paper is to challenge the widespread agreement about the existence and importance of a black box problem. The first section argues that “interpretability” and cognates lack precise meanings when applied to algorithms. This makes the concepts difficult to use when trying to solve the problems that have motivated the call for interpretability (etc.). Furthermore, since there is no adequate account of the concepts themselves, it is not possible to assess whether particular technical features supply formal definitions of those concepts. The second section argues that there are ways of being a responsible user of these algorithms that do not require interpretability (etc.). In many cases in which a black box problem is cited, interpretability is a means to a further end such as justification or non-discrimination. Since addressing these problems need not involve something that looks like an “interpretation” (etc.) of an algorithm, the focus on interpretability artificially constrains the solution space by characterizing one possible solution as the problem itself. Where possible, discussion should be reformulated in terms of the ends of interpretability.

Details

Title

Against Interpretability: a Critical Examination of the Interpretability Problem in Machine Learning

Author

Krishnan, Maya¹

¹ All Souls College, Oxford, UK (GRID:grid.4991.5) (ISNI:0000 0004 1936 8948)

Pages

487-502

Publication year

2020

Publication date

Sep 2020

Publisher

Springer Nature B.V.

ISSN

2210-5433

e-ISSN

2210-5441

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1007/s13347-019-00372-9

ProQuest document ID

2272482130

© The Author(s) 2019. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
