Content area

Abstract

This paper proposes a systematic benchmarking method called BenchMetrics to analyze and compare the robustness of binary classification performance metrics based on the confusion matrix for a crisp classifier. BenchMetrics, introducing new concepts such as meta-metrics (metrics about metrics) and metric space, has been tested on fifteen well-known metrics including balanced accuracy, normalized mutual information, Cohen’s Kappa, and Matthews correlation coefficient (MCC), along with two recently proposed metrics, optimized precision and index of balanced accuracy in the literature. The method formally presents a pseudo-universal metric space where all the permutations of confusion matrix elements yielding the same sample size are calculated. It evaluates the metrics and metric spaces in a two-staged benchmark based on our proposed eighteen new criteria and finally ranks the metrics by aggregating the criteria results. The mathematical evaluation stage analyzes metrics’ equations, specific confusion matrix variations, and corresponding metric spaces. The second stage, including seven novel meta-metrics, evaluates the robustness aspects of metric spaces. We interpreted each benchmarking result and comparatively assessed the effectiveness of BenchMetrics with the limited comparison studies in the literature. The results of BenchMetrics have demonstrated that widely used metrics have significant robustness issues, and MCC is the most robust and recommended metric for binary classification performance evaluation.

Details

Title
BenchMetrics: a systematic benchmarking method for binary classification performance metrics
Author
Gürol, Canbek 1 ; Taskaya Temizel Tugba 2 ; Sagiroglu Seref 3 

 ASELSAN, Ankara, Turkey (GRID:grid.432264.5) (ISNI:0000 0004 0410 4608); Middle East Technical University, Informatics Institute, Ankara, Turkey (GRID:grid.6935.9) (ISNI:0000 0001 1881 7391) 
 Middle East Technical University, Informatics Institute, Ankara, Turkey (GRID:grid.6935.9) (ISNI:0000 0001 1881 7391) 
 Gazi University, Computer Engineering Department, Ankara, Turkey (GRID:grid.25769.3f) (ISNI:0000 0001 2169 7132) 
Pages
14623-14650
Publication year
2021
Publication date
Nov 2021
Publisher
Springer Nature B.V.
ISSN
09410643
e-ISSN
14333058
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2585228537
Copyright
© The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2021.