Instructor TBA
Instructor TBA

Microbiome research typically involves the study of the collective genetic material of microorganisms from a wide variety of natural environments. As a result of the efforts carried out by scientists across this diverse and expanding research field, millions of microbiome datasets have been collected and made publicly available. Detailed analysis of collected data is increasingly redefining our understanding of microbiology by providing unique insights into microbial community composition, the processes performed by the microbes and their relationships with their surroundings and each other.

MGnify (Mitchell AL, et al., 2020) is a platform that provides free assembly, analysis and archiving of microbiome data derived from sequencing microbial populations that are present in various environments. The resource accepts user-submitted data and provides standardised pipelines that offer taxonomic and functional analysis of microbiome datasets.

Inside of the Workshop

In this two day remote tutorial, you will explore common approaches to analysing and annotating contigs produced from a metagenomics assembly. The course will be a mixture of introductory lectures, followed by hands-on practical sessions. Due to time constraints, you will investigate pre-calculated examples or use a web browser to explore outputs via the MGnify website. By the end of the course, you should understand how to process contigs, functionally and taxonomically characterise the contigs, and be able to generate metagenome assembled genomes from your assemblies.


Prerequisites & description

Part 1 – Next steps after metagenomic assembly

  • Lecture: Metagenomics sequence quality control, and assembly
    ○ What constitutes a good assembly?
    ○ How we estimate assembly quality
    ○ Host decontamination
    ○ Submitting to ENA
  • Hands-on: Assessing quality, and decontaminating host DNA in assemblies
    ○ Prerequisites: Installed Bandage, up-to-date web browser (e.g. Chrome), and ability to install docker containers.

Part 2 – Annotating bacterial contigs

  • Lecture: Overview of MGnify annotation pipeline
    ○ Getting assembly data into MGnify
    ○ Taxonomic assignment
    ○ Functional characterisation
    ○ Pathways/systems
  • Hands-on: Accessing annotations via the MGnify website
    ○ Prerequisites: Up-to-date web browser (e.g. Chrome), ability to install docker containers.

Part 3 – Annotating viral contigs

  • Lecture: Viral contig annotation using the VIRify pipeline
    ○ VirSorter/VirFinder
    ○ ViPhoG HMMs for taxonomic assignment
    ○ Other tools/issues
  • Hands-on: Investigating annotation outputs, on the command line and in MGnify website
    ○ Prerequisites: Up-to-date web browser (e.g. Chrome), ability to install docker containers.

Part 4 – Metagenome assembled genomes (MAGs)

  • Lecture: Generation of metagenome assembled genomes (MAGs) from assemblies
    ○ How to generate MAGs
    ○ Assessment of quality (MIGMAGs)
    ○ Taxonomic assignment
    ○ Dereplication of MAGs across samples
  • Hands-on: Generating MAGs, assessing genome quality and taxonomic assignment
    ○ Prerequisites: Up-to-date web browser (e.g. Chrome), ability to install docker containers.


Mitchell AL, et al., MGnify: the microbiome analysis resource in 2020. Nucleic Acids Res. 2020 Jan 8;48(D1):D570-D578. doi: 10.1093/nar/gkz1035.