Full text

Turn on search term navigation

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Blood oxygen saturation (SpO2) is an essential physiological parameter for evaluating a person’s health. While conventional SpO2 measurement devices like pulse oximeters require skin contact, advanced computer vision technology can enable remote SpO2 monitoring through a regular camera without skin contact. In this paper, we propose novel deep learning models to measure SpO2 remotely from facial videos and evaluate them using a public benchmark database, VIPL-HR. We utilize a spatial–temporal representation to encode SpO2 information recorded by conventional RGB cameras and directly pass it into selected convolutional neural networks to predict SpO2. The best deep learning model achieves 1.274% in mean absolute error and 1.71% in root mean squared error, which exceed the international standard of 4% for an approved pulse oximeter. Our results significantly outperform the conventional analytical Ratio of Ratios model for contactless SpO2 measurement. Results of sensitivity analyses of the influence of spatial–temporal representation color spaces, subject scenarios, acquisition devices, and SpO2 ranges on the model performance are reported with explainability analyses to provide more insights for this emerging research field.

Details

Title
Contactless Blood Oxygen Saturation Estimation from Facial Videos Using Deep Learning
Author
Chun-Hong, Cheng 1   VIAFID ORCID Logo  ; Yuen, Zhikun 2 ; Chen, Shutao 3 ; Kwan-Long, Wong 3 ; Jing-Wei, Chin 3 ; Chan, Tsz-Tai 3 ; So, Richard H Y 4 

 Department of Electrical and Electronic Engineering, Imperial College London, London SW7 2AZ, UK 
 Department of Computer Science, University of Ottawa, Ottawa, ON K1H 8M5, Canada; [email protected] 
 PanopticAI, Hong Kong Science and Technology Parks, New Territories, Hong Kong, China; [email protected] (S.C.); [email protected] (K.-L.W.); [email protected] (J.-W.C.); [email protected] (T.-T.C.); [email protected] (R.H.Y.S.) 
 PanopticAI, Hong Kong Science and Technology Parks, New Territories, Hong Kong, China; [email protected] (S.C.); [email protected] (K.-L.W.); [email protected] (J.-W.C.); [email protected] (T.-T.C.); [email protected] (R.H.Y.S.); Department of Industrial Engineering and Decision Analytics, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China 
First page
251
Publication year
2024
Publication date
2024
Publisher
MDPI AG
e-ISSN
23065354
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2991000106
Copyright
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.