Abstract

Even though repeat masking using custom designed libraries significantly improves the genome annotation and gene prediction, such libraries for palm trees are yet to be designed and made accessible to the researchers. In this study, a repeat library was designed and validated for use in coconut and related palm genomes. Coconut genome with chromosome-level assembly was used to design independent libraries for tall and dwarf ecotypes, which were subsequently merged. Efficiency of the combined de novo library in genome annotation and gene prediction was assessed in comparison with the conventional libraries (Dfam+RepBase), using RepeatMasker. De novo library had 76.3 % efficiency in coconut genomes compared to 3.51 % in custom libraries and number of genes predicted was reduced from an average of 193,099 to 31,022. In date palm, oil pam and sago palm also, combined library gave higher repeat masking and reduced the number of genes predicted. The de novo library can be accessed at http://www.kau.in/repeat-libraries.

Details

Title
An efficient repeat masking library for the genomic data of coconut and related trees
Author
Prasad, Shri Hari 1 ; Mathew, Deepu 1 ; Ravisankar Valsalan 1 

 Bioinformatics Centre, Kerala Agricultural University, Thrissur – 680 656, India 
Pages
114-117
Publication year
2023
Publication date
2023
Publisher
De Gruyter Poland
ISSN
00375349
e-ISSN
25098934
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3157767434
Copyright
© 2023. This work is published under http://creativecommons.org/licenses/by-nc-nd/3.0 (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.