Abstract

The multicore evolution has stimulated renewed interest in scaling up applications on shared-memory multiprocessors, significantly improving the scalability of many applications. But this scalability is limited to a single node; programmers therefore still have to redesign applications to scale out over multiple nodes. This paper revisits the design and implementation of distributed shared memory (DSM) as a way to scale out applications optimized for non-uniform memory access (NUMA) architectures over a well-connected cluster. It presents MAGI, an efficient DSM system that provides a transparent shared address space with scalable performance on a cluster with fast network interfaces. MAGI is unique in that it presents a NUMA abstraction to fully harness the multicore resources in each node through hierarchical synchronization and memory management. MAGI also exploits the memory access patterns of big-data applications and leverages a set of optimizations for remote direct memory access (RDMA) to reduce the number of page faults and the cost of the coherence protocol. MAGI has been implemented as a user-space library with pthread-compatible interfaces and can run existing multithreaded applications with minimal modifications. We deployed MAGI over an 8-node RDMA-enabled cluster. Experimental evaluation shows that MAGI achieves up to 9.25x speedup compared with an unoptimized implementation, leading to scalable performance for large-scale data-intensive applications.

Details

Title
Scaling out NUMA-Aware Applications with RDMA-Based Distributed Shared Memory
Author
Hong, Yang (1); Zheng, Yang (1); Yang, Fan (1); Zang, Bin-Yu (1); Guan, Hai-Bing (1); Chen, Hai-Bo (1)

(1) Shanghai Jiao Tong University, Shanghai Key Laboratory for Scalable Computing Systems, Shanghai, China (GRID:grid.16821.3c) (ISNI:0000 0004 0368 8293)
Volume
34
Issue
1
Pages
94-112
Publication year
2019
Publication date
Jan 2019
Publisher
Springer Nature B.V.
Place of publication
Beijing
Country of publication
Netherlands
ISSN
1000-9000
e-ISSN
1860-4749
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
Online publication date
2019-01-18
Milestone dates
2019-01-11 (Registration); 2018-06-13 (Received); 2018-11-21 (Revision received)
First posting date
18 Jan 2019
ProQuest document ID
2918609996
Document URL
https://www.proquest.com/scholarly-journals/scaling-out-numa-aware-applications-with-rdma/docview/2918609996/se-2?accountid=208611
Copyright
© Springer Science+Business Media, LLC, part of Springer Nature 2019.
Last updated
2024-12-12
Database
ProQuest One Academic