Abstract

“Cuohu Bazi” (CHBZ) is an ancient sorghum variety collected from the fields of China, known for its agronomic traits like dwarf stature, early maturation. In this study, we present the first telomere-to-telomere (T2T) and gap-free genome assembly of CHBZ using PacBio HiFi reads, Oxford Nanopore Technologies, and Hi-C data. The assembled genome comprises 724.85 Mb, effectively resolving all 3,913 gaps that were present in the previous sorghum BTx623 reference genome. Notably, the T2T assembly captures 10 centromeres and all 20 telomeres, providing strong support for their integrity. This assembly is of high quality in terms of contiguity (contig N50: 71.1 Mb), completeness (BUSCO score: 99.01%, k-mer completeness: 98.88%), and correctness (QV: 61.60). Repetitive sequences accounted for 70.41% of the genome and a total of 32,855 protein-coding genes have been annotated. Furthermore, 161 CHBZ-specific presence/absence variants genes have been identified when comparing to BTx623 genome. This study provides valuable insights for future research on sorghum genetics, genomics, and evolutionary history.

Details

Title
Telomere-to-telomere genome assembly of sorghum
Author
Li, Meng 1 ; Chen, Chunhai 2 ; Wang, Haigang 1 ; Qin, Huibin 1 ; Hou, Sen 1 ; Yang, Xukui 2 ; Jian, Jianbo 2 ; Gao, Peng 3 ; Liu, Minxuan 4 ; Mu, Zhixin 1 

 Ministry of Agriculture and Rural Affairs, Center for Agricultural Genetic Resources Research, Shanxi Agricultural University, Key Laboratory of Crop Gene Resources and Germplasm Enhancement on Loess Plateau, Taiyuan, China (GRID:grid.418524.e) (ISNI:0000 0004 0369 6250) 
 BGI Genomics, Shenzhen, China (GRID:grid.418524.e) 
 BGI, Shenzhen, China (GRID:grid.21155.32) (ISNI:0000 0001 2034 1839) 
 Chinese Academy of Agricultural Sciences, Institute of Crop Sciences, Beijing, China (GRID:grid.410727.7) (ISNI:0000 0001 0526 1937) 
Pages
835
Publication year
2024
Publication date
2024
Publisher
Nature Publishing Group
e-ISSN
20524463
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3087453302
Copyright
© The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.