It appears you don't have support to open PDFs in this web browser. To view this file, Open with your PDF reader
Abstract
Metagenomic binning is an essential technique for genome-resolved characterization of uncultured microorganisms in various ecosystems but hampered by the low efficiency of binning tools in adequately recovering metagenome-assembled genomes (MAGs). Here, we introduce BASALT (Binning Across a Series of Assemblies Toolkit) for binning and refinement of short- and long-read sequencing data. BASALT employs multiple binners with multiple thresholds to produce initial bins, then utilizes neural networks to identify core sequences to remove redundant bins and refine non-redundant bins. Using the same assemblies generated from Critical Assessment of Metagenome Interpretation (CAMI) datasets, BASALT produces up to twice as many MAGs as VAMB, DASTool, or metaWRAP. Processing assemblies from a lake sediment dataset, BASALT produces ~30% more MAGs than metaWRAP, including 21 unique class-level prokaryotic lineages. Functional annotations reveal that BASALT can retrieve 47.6% more non-redundant opening-reading frames than metaWRAP. These results highlight the robust handling of metagenomic sequencing data of BASALT.
Binning is an essential step in genome-resolved metagenomic analysis in which assembled contigs originating from the same source population are clustered. However it is challenging, especially for low abundance microbial species. Here the authors introduce a toolkit that integrates multiple prominent binning tools and AI for efficient and high-resolution recovery of non-redundant bins from short- and long-read metagenomic sequencing datasets.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details
; Yuan, Li 2 ; Lian, Chun-Ang 1
; Lin, Bin 3 ; Chen, Jie 2 ; Mu, Rong 4 ; Qiao, Xuejiao 4 ; Zhang, Liyu 4 ; Xu, Zheng 5 ; Fan, Lu 6
; Zhang, Yunzeng 7 ; Wang, Shanquan 8
; Li, Junyi 9
; Cao, Huiluo 10 ; Li, Bing 11
; Chen, Baowei 12 ; Song, Chi 13
; Liu, Yongxin 14
; Shi, Lili 15 ; Tian, Yonghong 2
; Ni, Jinren 16
; Zhang, Tong 17
; Zhou, Jizhong 18
; Zhuang, Wei-Qin 19 ; Yu, Ke 1
1 Peking University, Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Shenzhen, China (GRID:grid.11135.37) (ISNI:0000 0001 2256 9319); Peking University, AI for Science (AI4S)-Preferred Program, Shenzhen, China (GRID:grid.11135.37) (ISNI:0000 0001 2256 9319)
2 Peking University, AI for Science (AI4S)-Preferred Program, Shenzhen, China (GRID:grid.11135.37) (ISNI:0000 0001 2256 9319); Peking University, School of Electronic and Computer Engineering, Shenzhen, China (GRID:grid.11135.37) (ISNI:0000 0001 2256 9319); Peng Cheng Laboratory, Shenzhen, China (GRID:grid.508161.b) (ISNI:0000 0005 0389 1328)
3 Peking University, School of Electronic and Computer Engineering, Shenzhen, China (GRID:grid.11135.37) (ISNI:0000 0001 2256 9319)
4 Peking University, Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Shenzhen, China (GRID:grid.11135.37) (ISNI:0000 0001 2256 9319)
5 Southern University of Sciences and Technology Yantian Hospital, Shenzhen, China (GRID:grid.263817.9) (ISNI:0000 0004 1773 1790); Chinese Academy of Sciences, Institute of Biomedicine and Biotechnology, Shenzhen Institute of Advanced Technology, Shenzhen, China (GRID:grid.9227.e) (ISNI:0000000119573309)
6 Southern University of Science and Technology (SUSTech), Department of Ocean Science and Engineering, Shenzhen, China (GRID:grid.263817.9) (ISNI:0000 0004 1773 1790)
7 Yangzhou University, Joint International Research Laboratory of Agriculture and Agri-Product Safety, the Ministry of Education of China, Yangzhou, China (GRID:grid.268415.c)
8 Sun Yat-Sen University, Environmental Microbiomics Research Center, School of Environmental Science and Engineering, Guangzhou, China (GRID:grid.12981.33) (ISNI:0000 0001 2360 039X)
9 Harbin Institute of Technology (Shenzhen), School of Computer Science and Technology, Shenzhen, China (GRID:grid.19373.3f) (ISNI:0000 0001 0193 3564)
10 University of Hong Kong, Department of Microbiology, Hong Kong, China (GRID:grid.194645.b) (ISNI:0000 0001 2174 2757)
11 Tsinghua University, Shenzhen International Graduate School, Shenzhen, China (GRID:grid.12527.33) (ISNI:0000 0001 0662 3178)
12 Sun Yat-sen University, Guangdong Provincial Key Laboratory of Marine Resources and Coastal Engineering, School of Marine Sciences, Zhuhai, China (GRID:grid.12981.33) (ISNI:0000 0001 2360 039X)
13 Chengdu University of Traditional Chinese Medicine, Institute of Herbgenomics, Chengdu, China (GRID:grid.411304.3) (ISNI:0000 0001 0376 205X); Ltd, Wuhan Benagen Technology Co., Wuhan, China (GRID:grid.411304.3)
14 Chinese Academy of Agricultural Sciences, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Shenzhen, China (GRID:grid.410727.7) (ISNI:0000 0001 0526 1937)
15 Peking University, AI for Science (AI4S)-Preferred Program, Shenzhen, China (GRID:grid.11135.37) (ISNI:0000 0001 2256 9319); Peking University Shenzhen Graduate School, State Key Laboratory of Chemical Oncogenomics, School of Chemical Biology and Biotechnology, Shenzhen, China (GRID:grid.11135.37) (ISNI:0000 0001 2256 9319)
16 Peking University, Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Shenzhen, China (GRID:grid.11135.37) (ISNI:0000 0001 2256 9319); Peking University, College of Environmental Sciences and Engineering, Key Laboratory of Water and Sediment Sciences, Ministry of Education, Beijing, China (GRID:grid.11135.37) (ISNI:0000 0001 2256 9319)
17 University of Hong Kong, Department of Civil Engineering, Hong Kong, China (GRID:grid.194645.b) (ISNI:0000 0001 2174 2757)
18 University of Oklahoma, Institute for Environmental Genomics, Norman, USA (GRID:grid.266900.b) (ISNI:0000 0004 0447 0018)
19 University of Auckland, Department of Civil and Environmental Engineering, Faculty of Engineering, Auckland, New Zealand (GRID:grid.9654.e) (ISNI:0000 0004 0372 3343)




