Benchmarking de novo assembly and binning for time-series metagenomic data generated from drinking water systems

This abstract has open access
Abstract Summary

Assembling genomes from metagenomic short-read data can be challenging due to sequencing errors, depth of sequencing coverage and the unknown and uneven complexity of microbial communities. Reconstructing high-quality genomes is a crucial part of the metagenomic workflow as subsequent ecological and metabolic inferences depend on the accuracy, quality, and completeness of genomes. Genomes are often reconstructed by assembling all samples together (co-assembly) or creating individual assemblies. Co-assembly is a computationally intensive approach that involves the pooling of multiple samples to permit higher sequence coverage, which allows for the identification of populations that are present at lower abundances. In contrast, single sample assembly is computationally less intensive; however, lower sequence coverage resulting from this approach makes genome reconstruction difficult, as coverage heuristics to accurately disentangle repetitive sequences and differentiate between strain variants cannot be properly applied. In this study, we assessed the performance of a combination of selected assembly and binning strategies for time-series drinking water metagenomic data that were collected over 6 months, to determine the combination of assembly/binning approaches that result in the highest quality and quantity metagenome-assembled genomes (MAGs). Our findings suggest that the metaSPAdes co-assembly strategies had the best performance as it produced larger assemblies that were less fragmented, encompassing a lower proportion short contigs (N50 = 5.85 ± 0.63 kbp) that retained at least 80% of the sequence data. Furthermore, the MAGs that were reconstructed from the metaSPAdes co-assembly strategies, using CONCOCT and MetaBAT2, produced the highest quantity good-quality MAGs with mapping rates of at least 60%. These assembly/binning approaches also permitted the identification of unique MAGs and the differentiation between strain variants. Overall, this study suggest that multiple assembly and binning approaches are required to assist in the recovery of a greater proportion high-quality MAGs.

Abstract ID :
MEWE129
Submission Type
Your topic most closely relates to:
Average Rating
8/10
Upload presentation and handouts (max 3) :
If the file does not load, click here to open/download the file.
Northeastern University
United States Department of Agriculture (USDA) Agriculture Research Service
Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences
Northeastern University
Northeastern
Northeastern University

Abstracts With Same Type

Abstract ID
Abstract Title
Abstract Topic
Submission Type
Primary Author
MEWE87
Poster Session 3: Microbial processes in water systems and engineering
Poster Presentation
Ms. Katherine Vilardi
MEWE22
Poster Session 5: Systems microbiology approaches
Poster Presentation
Dr. Seow Wah How
MEWE152
Poster Session 5: Systems microbiology approaches
Poster Presentation
Mr. Rui Xiao
MEWE171
8. (Waste)water-based epidemiology, microbial risk assessment
Poster Presentation
Mrs. Bianca Costa
MEWE59
Poster Session 3: Microbial processes in water systems and engineering
Poster Presentation
Ms. Caroline Schleich
MEWE61
Poster Session 3: Microbial processes in water systems and engineering
Poster Presentation
Ms. Maria Takman
MEWE66
Poster Session 3: Microbial processes in water systems and engineering
Poster Presentation
Ms. Olga Sójka
183 visits