It appears you don't have support to open PDFs in this web browser. To view this file, Open with your PDF reader
Abstract
Modern neural networks (NNs) often do not generalize well in the presence of a ‘covariate shift’; that is, in situations where the training and test data distributions differ, but the conditional distribution of classification labels given the data remains unchanged. In such cases, NN generalization can be reduced to a problem of learning more robust, domain-invariant features. Domain adaptation (DA) methods include a broad range of techniques aimed at achieving this; however, these methods have struggled with the need for extensive hyperparameter tuning, which then incurs significant computational costs. In this work, we introduce SInkhorn Dynamic Domain Adaptation (SIDDA), an out-of-the-box DA training algorithm built upon the Sinkhorn divergence, that can achieve effective domain alignment with minimal hyperparameter tuning and computational overhead. We demonstrate the efficacy of our method on multiple simulated and real datasets of varying complexity, including simple shapes, handwritten digits, real astronomical observations, and remote sensing data. These datasets exhibit covariate shifts due to noise, blurring, differences between telescopes, and variations in imaging wavelengths. SIDDA is compatible with a variety of NN architectures, and it works particularly well in improving classification accuracy and model calibration when paired with symmetry-aware equivariant NNs (ENNs). We find that SIDDA consistently enhances the generalization capabilities of NNs, achieving up to a
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details
; Patel, Purvik 2 ; Nord, Brian D 3
; Walmsley, Mike 4 ; Ćiprijanović, Aleksandra 5
1 Department of Physics, Northeastern University , Boston, MA, 02115, United States of America; NSF AI Institute for Artificial Intelligence & Fundamental Interactions (IAIFI) , Cambridge, MA, 02139, United States of America; Fermi National Accelerator Laboratory , Batavia, IL, 60510, United States of America
2 Khoury College of Computer Science, Northeastern University , Boston, MA, 02115, United States of America
3 Fermi National Accelerator Laboratory , Batavia, IL, 60510, United States of America; Department of Astronomy and Astrophysics, University of Chicago , Chicago, IL, 60637, United States of America; Kavli Institute for Cosmological Physics, University of Chicago , Chicago, IL, 60637, United States of America
4 Dunlap Institute for Astronomy & Astrophysics, University of Toronto , Toronto, ON M5S 3H4, Canada; Jodrell Bank Centre for Astrophysics, Department of Physics & Astronomy, University of Manchester , Manchester M13 9PL, United Kingdom
5 Fermi National Accelerator Laboratory , Batavia, IL, 60510, United States of America; Department of Astronomy and Astrophysics, University of Chicago , Chicago, IL, 60637, United States of America; NSF-Simons AI Institute for the Sky (SkAI) , 172 E. Chestnut St., Chicago, IL 60611, United States of America




