Full text

Turn on search term navigation

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Simple Summary

Bacteria increase their genetic diversity by acquiring genetic material from a variety of organisms across different species, a process known as horizontal gene transfer. Recently, there has been an increasing emphasis in the scientific literature on the importance of HGT for the evolution of eukaryotes such as fungi, animals, and plants. However, this observation presents a paradox since eukaryotes have developed mechanisms to prevent HGT. Our work proposes an alternative scenario to explain the abundance of reported cases of HGT in eukaryotes. We suggest that the misidentification of HGT candidates in eukaryotes is due to the lack of similar sequences (genes or proteins) in public databases. To support this hypothesis, we designed specific databases to identify potential HGT candidates within a particular group of fungi. Our results demonstrate that fewer HGT candidates are detected when more similar sequences exist for these candidates in databases. This finding holds significant relevance as public databases continue to grow; consequently, newly available information may refute several previously identified instances of HGT. Our experiments lead us to conclude that database imbalances need consideration before asserting new occurrences of HGT in eukaryotes.

Abstract

Horizontal gene transfer (HGT) is a widely acknowledged phenomenon in prokaryotes for generating genetic diversity. However, the impact of this process in eukaryotes, particularly interdomain HGT, is a topic of debate. Although there have been observed biases in interdomain HGT detection, little exploration has been conducted on the effects of imbalanced databases. In our study, we conducted experiments to assess how different databases affect the detection of interdomain HGT using proteomes from the Pezizomycotina fungal subphylum as our focus group. Our objective was to simulate the database imbalance commonly found in public biological databases, where bacterial and eukaryotic sequences are unevenly represented, and demonstrate that an increase in uploaded eukaryotic sequences leads to a decrease in predicted HGTs. For our experiments, four databases with varying proportions of eukaryotic sequences but consistent proportions of bacterial sequences were utilized. We observed a significant reduction in detected interdomain HGT candidates as the proportion of eukaryotes increased within the database. Our data suggest that the imbalance in databases bias the interdomain HGT detection and highlights challenges associated with confirming the presence of interdomain HGT among Pezizomycotina fungi and potentially other groups within Eukarya.

Details

Title
Database Bias in the Detection of Interdomain Horizontal Gene Transfer Events in Pezizomycotina
Author
Aguirre-Carvajal, Kevin 1   VIAFID ORCID Logo  ; Munteanu, Cristian R 2   VIAFID ORCID Logo  ; Armijos-Jaramillo, Vinicio 3   VIAFID ORCID Logo 

 Department of Computer Science and Information Technologies, Faculty of Computer Science, University of A Coruña, Campus Elviña s/n, 15071 Coruña, Spain; [email protected] (K.A.-C.); [email protected] (C.R.M.); Bio-Cheminformatics Research Group, Universidad de Las Américas, Quito 170513, Ecuador 
 Department of Computer Science and Information Technologies, Faculty of Computer Science, University of A Coruña, Campus Elviña s/n, 15071 Coruña, Spain; [email protected] (K.A.-C.); [email protected] (C.R.M.) 
 Bio-Cheminformatics Research Group, Universidad de Las Américas, Quito 170513, Ecuador; Carrera de Ingeniería en Biotecnología, Facultad de Ingeniería y Ciencias Aplicadas, Universidad de Las Américas, Quito 170513, Ecuador 
First page
469
Publication year
2024
Publication date
2024
Publisher
MDPI AG
e-ISSN
20797737
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3084740298
Copyright
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.