Abstract

A shortcoming of most correlation distance methods based on the composition vectors without alignment developed for phylogenetic analysis using complete genomes is that the "distances" are not proper distance metrics in the strict mathematical sense. In this paper we propose two new correlation-related distance metrics to replace the old one in our dynamical language approach. Four genome datasets are employed to evaluate the effects of this replacement from a biological point of view. We find that the two proper distance metrics yield trees with the same or similar topologies as/to those using the old "distance" and agree with the tree of life based on 16S rRNA in a majority of the basic branches. Hence the two proper correlation-related distance metrics proposed here improve our dynamical language approach for phylogenetic analysis.

Details

Title
Proper Distance Metrics for Phylogenetic Analysis Using Complete Genomes without Sequence Alignment
Author
Yu, Zu-Guo; Zhan, Xiao-Wen; Han, Guo-Sheng; Wang, Roger W; Anh, Vo; Chu, Ka Hou
Pages
1141-1154
Publication year
2010
Publication date
2010
Publisher
MDPI AG
ISSN
16616596
e-ISSN
14220067
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
1526141659
Copyright
Copyright MDPI AG 2010