The Roslin Institute, Edinburgh
Abstract 2018-2019: Reproducible Snakemake pipelines for metagenomic analysis in farm animals
Metagenomic analysis is generally time consuming and requires the implementation of multiple tools. A pipeline stitching the different steps together and allowing the jobs to run in parallel would be time-saving and guarantee the exact same process for every sample. With a subset of an Illumina HiSeq 3000 dataset, generated from 359 European pig herds’ and poultry flocks’ gut microbiomes, a bioinformatics pipeline for metagenomic analysis was developed. Using conda environments and Snakemake, all steps from quality control to genome binning and annotation were incorporated. Because of their significance in the food chain, detection of antimicrobial resistance genes was included as well. By inspecting pathways, enzymes and CAZymes, a clearer understanding of the gut microbiome is intended. The pipeline was designed for use with metagenomic, paired-end, short read sequences. User-friendliness was guaranteed by using conda. Apart from anaconda, no installations are required. The pipeline and accompanying files can be downloaded from https://github.com/Juliepvr/snakemake_metagenomic. Results from the test run can be inspected at https://github.com/Juliepvr/pigNL_vs_pigES.
Tags: bioinformatics |
Address
Easter Bush
EH25 9RG Edinburgh
United Kingdom |
Contacts
Traineeship supervisor
Michael Watson
07990827831 mick.watson@roslin.ed.ac.uk |