Abstract

Quantifying predictive uncertainty in deep neural networks is a challenging and yet unsolved problem. Existing quantification approaches can be categorized into two lines. Bayesian methods provide a complete uncertainty quantification theory but are often not scalable to large-scale models. Along another line, non-Bayesian methods have good scalability and can quantify uncertainty with high quality. The most remarkable idea in this line is Deep Ensemble, but it is limited in practice due to its expensive computational cost. Thus, we propose HatchEnsemble to improve the efficiency and practicality of Deep Ensemble. The main idea is to use function-preserving transformations, ensuring HatchNets to inherit the knowledge learned by a single model called SeedNet. This process is called hatching, and HatchNet can be obtained by continuously widening the SeedNet. Based on our method, two different hatches are proposed, respectively, for ensembling the same and different architecture networks. To ensure the diversity of models, we also add random noises to parameters during hatching. Experiments on both clean and corrupted datasets show that HatchEnsemble can give a competitive prediction performance and better-calibrated uncertainty quantification in a shorter time compared with baselines.

Details

Title
HatchEnsemble: an efficient and practical uncertainty quantification method for deep neural networks
Author
Xia Yufeng 1 ; Zhang, Jun 2 ; Jiang Tingsong 3 ; Gong Zhiqiang 3 ; Yao, Wen 3   VIAFID ORCID Logo  ; Feng, Ling 4 

 National University of Defense Technology, College of Aerospace Science and Engineering, Changsha, China (GRID:grid.412110.7) (ISNI:0000 0000 9548 2110) 
 Tsinghua University, Department of Computer Science and Technology, Beijing, China (GRID:grid.12527.33) (ISNI:0000 0001 0662 3178); Chinese Academy of Military Science, National Innovation Institute of Defense Technology, Beijing, China (GRID:grid.500274.4) 
 Chinese Academy of Military Science, National Innovation Institute of Defense Technology, Beijing, China (GRID:grid.500274.4) 
 Tsinghua University, Department of Computer Science and Technology, Beijing, China (GRID:grid.12527.33) (ISNI:0000 0001 0662 3178) 
Pages
2855-2869
Publication year
2021
Publication date
Dec 2021
Publisher
Springer Nature B.V.
ISSN
21994536
e-ISSN
21986053
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2588792624
Copyright
© The Author(s) 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.