Content area

Abstract

Exponential advances in computational power have fueled advances in many disciplines, and biology is no exception. High-Performance Computing (HPC) is gaining traction as one of the essential tools in scientific research. Further advances to exascale capabilities will necessitate more energy-efficient hardware. In this article, we present our efforts to improve the efficiency of genome assembly on ARM-based HPC systems. We use vectorization to optimize the popular genome assembly pipeline of minimap2, miniasm, and Racon. We compare different implementations using the Scalable Vector Extension (SVE) instruction set architecture and evaluate their performance in different aspects. Additionally, we compare the performance of autovectorization to hand-tuned code with intrinsics. Lastly, we present the design of a CPU dispatcher included in the Racon consensus module that enables the automatic selection of the fastest instruction set supported by the utilized CPU. Our findings provide a promising direction for further optimization of genome assembly on ARM-based HPC systems.

Details

1009240
Location
Title
First Steps towards Efficient Genome Assembly on ARM-Based HPC
Publication title
Volume
13
Issue
1
First page
39
Publication year
2024
Publication date
2024
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
Publication subject
e-ISSN
20799292
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2023-12-20
Milestone dates
2023-11-15 (Received); 2023-12-18 (Accepted)
Publication history
 
 
   First posting date
20 Dec 2023
ProQuest document ID
2912642981
Document URL
https://www.proquest.com/scholarly-journals/first-steps-towards-efficient-genome-assembly-on/docview/2912642981/se-2?accountid=208611
Copyright
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2024-10-14
Database
2 databases
  • Coronavirus Research Database
  • ProQuest One Academic