Characterization of the core and accessory genomes of Pseudomonas aeruginosa using bioinformatic tools Spine and AGEnt

Egon A. Ozer*, Jonathan P. Allen, Alan R. Hauser

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

132 Scopus citations

Abstract

Background: Pseudomonas aeruginosa is an important opportunistic pathogen responsible for many infections in hospitalized and immunocompromised patients. Previous reports estimated that approximately 10% of its 6.6 Mbp genome varies from strain to strain and is therefore referred to as "accessory genome" Elements within the accessory genome of P. aeruginosa have been associated with differences in virulence and antibiotic resistance. As whole genome sequencing of bacterial strains becomes more widespread and cost-effective, methods to quickly and reliably identify accessory genomic elements in newly sequenced P. aeruginosa genomes will be needed. Results: We developed a bioinformatic method for identifying the accessory genome of P. aeruginosa. First, the core genome was determined based on sequence conserved among the completed genomes of twelve reference strains using Spine, a software program developed for this purpose. The core genome was 5.84 Mbp in size and contained 5,316 coding sequences. We then developed an in silico genome subtraction program named AGEnt to filter out core genomic sequences from P. aeruginosa whole genomes to identify accessory genomic sequences of these reference strains. This analysis determined that the accessory genome of P. aeruginosa ranged from 6.9-18.0% of the total genome, was enriched for genes associated with mobile elements, and was comprised of a majority of genes with unknown or unclear function. Using these genomes, we showed that AGEnt performed well compared to other publically available programs designed to detect accessory genomic elements. We then demonstrated the utility of the AGEnt program by applying it to the draft genomes of two previously unsequenced P. aeruginosa strains, PA99 and PA103. Conclusions: The P. aeruginosa genome is rich in accessory genetic material. The AGEnt program accurately identified the accessory genomes of newly sequenced P. aeruginosa strains, even when draft genomes were used. As P. aeruginosa genomes become available at an increasingly rapid pace, this program will be useful in cataloging the expanding accessory genome of this bacterium and in discerning correlations between phenotype and accessory genome makeup. The combination of Spine and AGEnt should be useful in defining the accessory genomes of other bacterial species as well.

Original languageEnglish (US)
Article number737
JournalBMC Genomics
Volume15
Issue number1
DOIs
StatePublished - Aug 29 2014

Funding

The authors thank Mark Mandel and Sudhir Penugonda for discussions and guidance. We would like to thank Lisa DeShong Sadzewicz, Luke Tallon, and staff at the University of Maryland School of Medicine Institute for Genome Sciences for NGS sequencing, as well as Sébastien Boisvert at the Université Laval, Québec, Canada for assistance with genomic assembly software. These studies were supported by the American Cancer Society (MRSG-13-220-01 – MPC, E.O.) and the National Institutes of Health (F32AI089068 and T32AI007476, E.O.; T32AI007476 and F32AI108247-01, J.P.A.; AI053674, AI075191, AI099269, AI04831 and AI088286, A.R.H.).

Keywords

  • AGEnt
  • Accessory genome
  • Comparative genomics
  • Core genome
  • Pseudomonas aeruginosa
  • Spine
  • Whole-genome sequencing

ASJC Scopus subject areas

  • Genetics
  • Biotechnology

Fingerprint

Dive into the research topics of 'Characterization of the core and accessory genomes of Pseudomonas aeruginosa using bioinformatic tools Spine and AGEnt'. Together they form a unique fingerprint.

Cite this