Content area

Abstract

NAND flash memory has been widely adopted as the primary data storage medium in data centers. However, the inherent characteristic of out-of-place updates in NAND flash necessitates garbage collection (GC) operations on NAND flash-based solid-state drives (SSDs), aimed at reclaiming flash blocks occupied by invalid data. GC processes entail additional read and write operations, which can lead to the blocking of user requests, thereby increasing the tail latency. Moreover, frequent execution of GC operations is prone to induce more pages to be written, further reducing the lifetime of SSDs. In light of these challenges, we introduce an innovative GC scheme, termed SplitGC. This scheme leverages the records of data redundancy gathered during periodic read scrub operations within the SSD. By analyzing these features of data duplication, SplitGC enhances the selection strategy for the victim block. Furthermore, it bifurcates the migration of valid data pages into two phases: non-duplicate pages follow standard relocation procedures, whereas the movement of duplicate pages is scheduled during idle periods of the SSD. The experiment results show that our scheme reduces tail latency induced by GC by 8% to 83% at the 99.99th percentile and significantly decreases the amount of valid page migration by 38% to 67% compared with existing schemes.

Details

1009240
Business indexing term
Title
Exploiting Data Duplication to Reduce Data Migration in Garbage Collection Inside SSD
Author
Nie Shiqiang 1 ; Niu Jie 1   VIAFID ORCID Logo  ; Yang Chaoyun 1 ; Zhang, Peng 2 ; Yang, Qiong 2 ; Wang, Dong 2 ; Wu, Weiguo 1   VIAFID ORCID Logo 

 School of Computer Science and Technology, Xi’an Jiaotong University, Xi’an 710049, China; [email protected] (S.N.); 
 Xi’an Aeronautics Computing Technique Research Institute, Xi’an 710065, China 
Publication title
Volume
14
Issue
9
First page
1873
Publication year
2025
Publication date
2025
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
Publication subject
e-ISSN
20799292
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-05-04
Milestone dates
2025-04-15 (Received); 2025-05-03 (Accepted)
Publication history
 
 
   First posting date
04 May 2025
ProQuest document ID
3203193386
Document URL
https://www.proquest.com/scholarly-journals/exploiting-data-duplication-reduce-migration/docview/3203193386/se-2?accountid=208611
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-05-13
Database
ProQuest One Academic