Content area

Abstract

The QR factorisation is a cornerstone of numerical linear algebra, essential for solving overdetermined linear systems, eigenvalue problems, and various scientific computing tasks. However, computing it for ill-conditioned tall-and-skinny (TS) matrices on large-scale distributed-memory systems, particularly those with multiple GPUs, presents significant challenges in balancing numerical stability, high performance, and efficient communication. Traditional Householder-based QR methods provide numerical stability but perform poorly on TS matrices due to their reliance on memory-bound kernels. This paper introduces a novel algorithm for computing the QR factorisation of ill-conditioned TS matrices based on CholeskyQR methods. Although CholeskyQR is fast, it typically fails due to severe loss of orthogonality for ill-conditioned inputs. To solve this, our new algorithm, mCQRGSI+, combines the speed of CholeskyQR with stabilising techniques from the Gram–Schmidt process. It is specifically optimised for distributed multi-GPU systems, using adaptive strategies to balance computation and communication. Our analysis shows the method achieves accuracy comparable to Householder QR, even for extremely ill-conditioned matrices (condition numbers up to 1016). Scaling experiments demonstrate speedups of up to 12× over ScaLAPACK and 16× over SLATE’s CholeskyQR2. This work delivers a method that is both robust and highly parallel, advancing the state-of-the-art for this challenging class of problems.

Details

1009240
Business indexing term
Title
Scalable QR Factorisation of Ill-Conditioned Tall-and-Skinny Matrices on Distributed GPU Systems
Author
Mijić Nenad 1   VIAFID ORCID Logo  ; Kaushik Abhiram 2   VIAFID ORCID Logo  ; Živković Dario 1   VIAFID ORCID Logo  ; Davidović Davor 1   VIAFID ORCID Logo 

 Centre for Informatics and Computing, Ruđer Bošković Institute, Bijenička Cesta 54, 10000 Zagreb, Croatia; [email protected] (N.M.); [email protected] (A.K.); [email protected] (D.Ž.) 
 Centre for Informatics and Computing, Ruđer Bošković Institute, Bijenička Cesta 54, 10000 Zagreb, Croatia; [email protected] (N.M.); [email protected] (A.K.); [email protected] (D.Ž.), Department of Physics, University of Jyväskylä, P.O. Box 35, 40014 Jyväskylä, Finland, Helsinki Institute of Physics, University of Helsinki, P.O. Box 64, 00014 Helsinki, Finland 
Publication title
Volume
13
Issue
22
First page
3608
Number of pages
22
Publication year
2025
Publication date
2025
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
Publication subject
e-ISSN
22277390
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-11-11
Milestone dates
2025-10-08 (Received); 2025-11-06 (Accepted)
Publication history
 
 
   First posting date
11 Nov 2025
ProQuest document ID
3275542003
Document URL
https://www.proquest.com/scholarly-journals/scalable-qr-factorisation-ill-conditioned-tall/docview/3275542003/se-2?accountid=208611
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-11-26
Database
ProQuest One Academic