Full text

Turn on search term navigation

© 2025 Zhou et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Tuberculosis is a major public health threat resulting in more than one million lives lost every year. Many challenges exist to defeat this deadly infectious disease which address the importance of a thorough understanding of the biology of the causative agent Mycobacterium tuberculosis (MTB). We generated a non-redundant pangenome of 420 epidemic MTB strains from China including 344 Lineage 2 strains, 69 Lineage 4 strains, six Lineage 3 strains, and one Lineage 1 strain. We estimate that MTB strains have a pangenome of 4,278 genes encoding 4,183 proteins, of which 3,438 are core genes. However, due to 99,694 interruptions in 2,447 coding genes, we can only confidently confirm 1,651 of these genes are translated in all samples. Of these interruptions, 67,315 (67.52%) could be classified by various genetic variations detected by currently available tools, and more than half of them are due to structural variations, mostly small indels. Assuming a proportion of these interruptions are artifacts, the number of active core genes would still be much lower than 3,438. We further described differential evolutionary patterns of genes under the influences of selective pressure, population structure and purifying selection. While selective pressure is ubiquitous among these coding genes, evolutionary adaptations are concentrated in 1,310 genes. Genes involved in cell wall biogenesis are under the strongest selective pressure, while the biological process of disruption of host organelles indicates the direction of the most intensive positive selection. This study provides a comprehensive view on the genetic diversity and evolutionary patterns of coding genes in MTB which may deepen our understanding of its epidemiology and pathogenicity.

Details

Title
Understanding the epidemiology and pathogenesis of Mycobacterium tuberculosis with non-redundant pangenome of epidemic strains in China
Author
Zhou, Yang  VIAFID ORCID Logo  ; Anthony, Richard; Wang, Shengfen; Xia, Hui; Ou, Xichao; Zhao, Bing; Song, Yuanyuan; Yang, Zheng; He, Ping; Liu, Dongxin; Zhao, Yanlin; Dick van Soolingen
First page
e0324152
Section
Research Article
Publication year
2025
Publication date
May 2025
Publisher
Public Library of Science
e-ISSN
19326203
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3205744382
Copyright
© 2025 Zhou et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.