Microbial species comprise strains with largely different set of genes and functional potential. Identifying microbial strains and characterizing their genes is thus essential for pathogen discovery, epidemiology and population genomics. Here we present a novel computational strain-level metagenomic profiling tool, called PanPhlAn, for identifying the gene composition and in-vivo transcriptional activity of individual strains from metagenomic and metatranscriptomic samples. PanPhlAn enables both the identification of known organisms and the characterization of previously unseen strains. Applied to the 2011 German E. coli outbreak, we demonstrate the ability of PanPhlAn to recognize outbreak strains and identify their associated virulence and resistance factors. Based on almost two thousand samples, PanPhlAn produced the largest strain-level, culture-free population genomic study of human-associated microbial species. In a large cohort of preterm infants, PanPhlAn enabled the identification of disease-associated strain-level genetic biomarkers. PanPhlAn is available at http://segatalab.cibio.unitn.it/tools/panphlan.

Scholz, M.U.; Ward, D.V.; Pasolli, E.; Tolio, T.; Zolfo, M.; Asnicar, F.; Truong, D.T.; Tett, A.; Morrow, L.A.; Segata, N. (2016). Pangenome-based computational metagenomic profiling enables strain-level culture-free epidemiology and population genomics studies. In: ECCB 2016: 15th European Conference on Computational Biology, September 3-7, 2016, The Hague, Netherlands. url: http://www.ebi.ac.uk/eccb/2016/sessions/ht05/index.html handle: http://hdl.handle.net/10449/35602

Pangenome-based computational metagenomic profiling enables strain-level culture-free epidemiology and population genomics studies

Scholz, Matthias Uwe;
2016-01-01

Abstract

Microbial species comprise strains with largely different set of genes and functional potential. Identifying microbial strains and characterizing their genes is thus essential for pathogen discovery, epidemiology and population genomics. Here we present a novel computational strain-level metagenomic profiling tool, called PanPhlAn, for identifying the gene composition and in-vivo transcriptional activity of individual strains from metagenomic and metatranscriptomic samples. PanPhlAn enables both the identification of known organisms and the characterization of previously unseen strains. Applied to the 2011 German E. coli outbreak, we demonstrate the ability of PanPhlAn to recognize outbreak strains and identify their associated virulence and resistance factors. Based on almost two thousand samples, PanPhlAn produced the largest strain-level, culture-free population genomic study of human-associated microbial species. In a large cohort of preterm infants, PanPhlAn enabled the identification of disease-associated strain-level genetic biomarkers. PanPhlAn is available at http://segatalab.cibio.unitn.it/tools/panphlan.
Metagenomics
Strain-level metagenomics
Pangenome
Metatranscriptomics
Population genomics
Epidemiology
2016
Scholz, M.U.; Ward, D.V.; Pasolli, E.; Tolio, T.; Zolfo, M.; Asnicar, F.; Truong, D.T.; Tett, A.; Morrow, L.A.; Segata, N. (2016). Pangenome-based computational metagenomic profiling enables strain-level culture-free epidemiology and population genomics studies. In: ECCB 2016: 15th European Conference on Computational Biology, September 3-7, 2016, The Hague, Netherlands. url: http://www.ebi.ac.uk/eccb/2016/sessions/ht05/index.html handle: http://hdl.handle.net/10449/35602
File in questo prodotto:
File Dimensione Formato  
HT05 – Pangenome-based computational me...population genomics studies « ECCB2016.pdf

accesso aperto

Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 1.07 MB
Formato Adobe PDF
1.07 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10449/35602
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact