
Whole metagenome shotgun (WMS) sequencing involves randomly breaking up DNA from an environmental sample into small fragments, sequencing these fragments, and then using computational methods to reconstruct the original genomes present in the sample.
We will analyse WMS data using two main approaches:
Reads profiling means trying to assign some information (usually taxonomy, sometimes functions) to each read in the dataset. Then, by aggregating the information from all reads, we can try to infer the composition of the microbial community in the sample.
When assemblying the raw reads, we will obtain a set of contigs - longer sequences that represent (parts of) genomes present in the sample. Each contig represents a fragment of one of the genomes in the sample. The process of binning can then be used to group contigs that likely belong to the same genome, based on sequence composition and coverage patterns. If a “bin” is of sufficient quality and completeness, it can be referred to as a metagenome-assembled genome (MAG).
❓ Do you expect to identify more species with a de novo approach, or via reads profiling?
Each MAG can be treated as a draft genome produced by a single isolate shotgun, but we have to expect a lower quality due to contamination (contigs coming from other species), reduced completeness (several missing parts).