ArticlePDF Available

Draft Genome Sequences of Helicobacter pylori Strains HPARG63 and HPARG8G, Cultured from Patients with Chronic Gastritis and Gastric Ulcer Disease

Authors:
  • Institute Infection Diseases" Carlos G. Malbran"
  • Alchemab Therapeutics

Abstract

Helicobacter pylori colonizes the human gastric mucosa, leading to a spectrum of gastric diseases in susceptible populations. Here we announce the draft genome sequences of strains HPARG8G and HPARG63. The data for both genome sequences provide insights regarding the diversity in gene content and rearrangement of the genomic islands commonly harbored by H. pylori.
Draft Genome Sequences of Helicobacter pylori Strains HPARG63 and
HPARG8G, Cultured from Patients with Chronic Gastritis and Gastric
Ulcer Disease
Rita Inés Armitano,
a
Gerardo Zerbetto De Palma,
a
Mario José Matteo,
a
Santiago Revale,
b
Soledad Romero,
b
Germán Matías Traglia,
a
Mariana Catalano
a
Instituto de Microbiología y Parasitología Médica (IMPAM, UBA-CONICET), Facultad de Medicina, Universidad de Buenos Aires-Consejo Nacional de Investigaciones
Científicas y Tecnológicas, Buenos Aires, Argentina
a
; Instituto de Agrobiotecnología Rosario (INDEAR)-CONICET, CCT-CONICET, Rosario, Argentina
b
Helicobacter pylori colonizes the human gastric mucosa, leading to a spectrum of gastric diseases in susceptible populations.
Here we announce the draft genome sequences of strains HPARG8G and HPARG63. The data for both genome sequences pro-
vide insights regarding the diversity in gene content and rearrangement of the genomic islands commonly harbored by H. pylori.
Received 7 August 2013 Accepted 9 August 2013 Published 5 September 2013
Citation Armitano RI, Zerbetto De Palma G, Matteo MJ, Revale S, Romero S, Traglia GM, Catalano M. 2013. Draft genome sequences of Helicobacter pylori strains HPARG63 and
HPARG8G, cultured from patients with chronic gastritis and gastric ulcer disease. Genome Announc. 1(5):e00700-13. doi:10.1128/genomeA.00700-13.
Copyright © 2013 Armitano et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 Unported license.
Address correspondence to Mariana Catalano, catalano@fmed.uba.ar.
Helicobacter pylori strain HPARG8G was isolated from a patient
with gastric ulcer disease and strain HPARG63 was recovered
from a patient with chronic gastritis. Here we report the draft genome
sequences of both strains. A total of 785,600 reads with an average
length of 435.02 nucleotides, corresponding to 33-fold coverage for
strain HPARG8G and 100-fold coverage for HPARG63, were ob-
tained with a 454 GS titanium pyrosequencer. De novo assembly
was done using 454 Newbler version 2.6, yielding 47 contigs for
HPARG8G and 34 contigs for HPARG63. The genomes were anno-
tated using the RAST server (1) and manually curated. Mauve (2) was
used to predict the order and orientation of the contigs, and Artemis
(3) was employed to glean details of the two genomes in comparison
with the closest overall neighbors predicted by RAST.
The Helicobacter pylori HPARG8G genome sequence com-
prises 1,596,552 bp and 1,568 coding sequences (RAST) as well as
36 tRNAs and 2 rRNAs. The average GC content was 38.98%.
The HPARG63 strain genome sequence contains approximately
1,668,716 bp and 1,623 coding sequences and also has 36 tRNAs
and 2 rRNAs, with an average GC content of 38.79%. The H. py-
lori P12 strain was predicted by RAST as one of the closest overall
neighbors of HPARG8G and HPARG63. Both genomes displayed
a conserved repertoire of housekeeping genes corresponding to
various metabolic pathways. The two genomes each showed a
complete cag pathogenicity island (cag PAI), with the complex
rearrangement in the region delimited by dapB and murI genes
described in the Mongolian gerbil-colonizing strain H. pylori B8
(4). In HPARG8G and HPARG63, the cagA gene was located
12,726 bp and 11,982 bp downstream from the cagB-cag1 region,
respectively. Consequently, the complete cag PAI gene order
might not be as highly conserved as had been proposed previously
(5). The HPARG63 strain contained two transposon of plasticity
zone (TnPZ) elements (6). TnPZ type 1b lacked the genes
HPP12_455 through HPP12_457 found in the same element in
strain P12; the HPP12_459 through HPP12_461 genes from the
type IV secretion system TFS4 were also absent (7). In both strains,
this TnPZ was inserted in a pseudogene similar to hp0464 of strain
26695 (7). In addition, the HPARG63 element harbored the
virD2-virD4,virB11-virB9, and topA genes with the highest ho-
mology to genes of other H. pylori strains. The second element was
a TnPZ type 2 remnant located in an unusual chromosomal inser-
tion site, containing few type IV secretion system TFS3 genes. The
strain HPARG8G carried only xerD,jhp0959,jhp0949, and
jhp0948 genes from the TnPZ type 1b left end (6) and hp0444,
hp0445, and hp0446 genes from the TFS4, inserted in a methyl-
transferase type II system. Our findings support the hypothesis
that although the TnPZs are genomic elements transferable as a
whole, they often contain only subsets of the genes present on the
complete islands (6,7).
Nucleotide sequence accession numbers. The first versions of
the HPARG8G and HPARG63 sequences are accessible at the
European Nucleotide Archive (http://www.ebi.ac.uk/ena/) under
accession numbers CBKZ010000001 through CBKZ010000047
and CBKY010000001 through CBKY010000034.
ACKNOWLEDGMENTS
This project was supported by grants BID 1728 OC-AR-PICT 2010-1492
provided by the Agencia Nacional de Promoción Científica y Tecnológica
and M064 from the Universidad de Buenos Aires (UBACyT-2010-2012).
REFERENCES
1. Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma
K, Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman
AL, Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch
GD, Reich C, Stevens R, Vassieva O, Vonstein V, Wilke A, Zagnitko O.
2008. The RAST server: rapid annotations using subsystems technology.
BMC Genomics 9:75. doi:10.1186/1471-2164-9-75.
2. Rissman AI, Mau B, Biehl BS, Darling AE, Glasner JD, Perna NT. 2009.
Reordering contigs of draft genomes using the Mauve aligner. Bioinformat-
ics 25:2071–2073.
3. Abbott JC, Aanensen DM, Rutherford K, Butcher S, Spratt BG. 2005.
WebACT—an online companion for the Artemis Comparison Tool. Bioin-
formatics 21:3665–3666.
Genome AnnouncementsSeptember/October 2013 Volume 1 Issue 5 e00700-13 genomea.asm.org 1
4. Farnbacher M, Jahns T, Willrodt D, Daniel R, Haas R, Goesmann A,
Kurtz S, Rieder G. 2010. Sequencing, annotation, and comparative ge-
nome analysis of the gerbil-adapted Helicobacter pylori strain B8. BMC
Genomics 11:335. doi:10.1186/1471-2164-11-335.
5. Olbermann P, Josenhans C, Moodley Y, Uhr M, Stamer C, Vauterin M,
Suerbaum S, Achtman M, Linz B. 2010. A global overview of the genetic
and functional diversity in the Helicobacter pylori cag pathogenicity island.
PLoS Genet. 6:e1001069. doi:10.1371/journal.pgen.1001069.
6. Kersulyte D, Lee W, Subramaniam D, Anant S, Herrera P, Cabrera L,
Balqui J, Barabas O, Kalia A, Gilman RH, Berg DE. 2009. Helicobacter
pylori’s plasticity zones are novel transposable elements. PLoS One 4:e6859.
doi:10.1371/journal.pone.0006859.
7. Fischer W, Windhager L, Rohrer S, Zeiller M, Karnholz A, Hoffmann R,
Zimmer R, Haas R. 2010. Strain-specific genes of Helicobacter pylori: ge-
nome evolution driven by a novel type IV secretion system and genomic
island transfer. Nucleic Acids Res. 38:6089 6101.
Armitano et al.
Genome Announcements2genomea.asm.org September/October 2013 Volume 1 Issue 5 e00700-13
... These populations generally correspond to their geographical origins. Up to date, hundreds of strains have been sequenced or being sequenced [9][10][11][12] . The purpose of this review is to compare the genomic differences of H. pylori from different populations, and discuss their implications in the geographical variation of the prevalence of the infection. ...
... To explore the genomic differences between H. pylori strains, we have determined the genome sequence of H. pylori strain D33 isolated from a Chinese patient with gastric cancer. Whole-genome sequencing was performed as described previously [9,10] . Briefly, the raw reads were trimmed and filtered, yielding a total of 2364383 reads with an average length of 260 nucleotides. ...
Article
Full-text available
Helicobacter pylori (H. pylori), a major pathogen colonizing the human stomach, shows great genetic variation. Comparative analysis of strains from different H. pylori populations revealed that the genome size of strains from East Asia decreased to 1.60 Mbp, which is significantly smaller than that from Europe or Africa. In parallel with the genome reduction, the number of protein coding genes was decreased, and the guanine-cytosine content was lowered to 38.9%. Elimination of non-essential genes by mutations is likely to be a major cause of the genome reduction. Bacteria with a small genome cost less energy. Thus, H. pylori strains from East Asia may have proliferation and growth advantages over those from Western countries. This could result in enhanced capacity of bacterial spreading. Therefore, the reduced genome size potentially contributes to the high prevalence of H. pylori in East Asia.
Article
Full-text available
The Helicobacter pylori cag pathogenicity island (cagPAI) encodes a type IV secretion system. Humans infected with cagPAI-carrying H. pylori are at increased risk for sequelae such as gastric cancer. Housekeeping genes in H. pylori show considerable genetic diversity; but the diversity of virulence factors such as the cagPAI, which transports the bacterial oncogene CagA into host cells, has not been systematically investigated. Here we compared the complete cagPAI sequences for 38 representative isolates from all known H. pylori biogeographic populations. Their gene content and gene order were highly conserved. The phylogeny of most cagPAI genes was similar to that of housekeeping genes, indicating that the cagPAI was probably acquired only once by H. pylori, and its genetic diversity reflects the isolation by distance that has shaped this bacterial species since modern humans migrated out of Africa. Most isolates induced IL-8 release in gastric epithelial cells, indicating that the function of the Cag secretion system has been conserved despite some genetic rearrangements. More than one third of cagPAI genes, in particular those encoding cell-surface exposed proteins, showed signatures of diversifying (Darwinian) selection at more than 5% of codons. Several unknown gene products predicted to be under Darwinian selection are also likely to be secreted proteins (e.g. HP0522, HP0535). One of these, HP0535, is predicted to code for either a new secreted candidate effector protein or a protein which interacts with CagA because it contains two genetic lineages, similar to cagA. Our study provides a resource that can guide future research on the biological roles and host interactions of cagPAI proteins, including several whose function is still unknown.
Article
Full-text available
The Mongolian gerbils are a good model to mimic the Helicobacter pylori-associated pathogenesis of the human stomach. In the current study the gerbil-adapted strain B8 was completely sequenced, annotated and compared to previous genomes, including the 73 supercontigs of the parental strain B128. The complete genome of H. pylori B8 was manually curated gene by gene, to assign as much function as possible. It consists of a circular chromosome of 1,673,997 bp and of a small plasmid of 6,032 bp carrying nine putative genes. The chromosome contains 1,711 coding sequences, 293 of which are strain-specific, coding mainly for hypothetical proteins, and a large plasticity zone containing a putative type-IV-secretion system and coding sequences with unknown function. The cag-pathogenicity island is rearranged such that the cagA-gene is located 13,730 bp downstream of the inverted gene cluster cagB-cag1. Directly adjacent to the cagA-gene, there are four hypothetical genes and one variable gene with a different codon usage compared to the rest of the H. pylori B8-genome. This indicates that these coding sequences might be acquired via horizontal gene transfer.The genome comparison of strain B8 to its parental strain B128 delivers 425 unique B8-proteins. Due to the fact that strain B128 was not fully sequenced and only automatically annotated, only 12 of these proteins are definitive singletons that might have been acquired during the gerbil-adaptation process of strain B128. Our sequence data and its analysis provide new insight into the high genetic diversity of H. pylori-strains. We have shown that the gerbil-adapted strain B8 has the potential to build, possibly by a high rate of mutation and recombination, a dynamic pool of genetic variants (e.g. fragmented genes and repetitive regions) required for the adaptation-processes. We hypothesize that these variants are essential for the colonization and persistence of strain B8 in the gerbil stomach during in ammation.
Article
Full-text available
The availability of multiple bacterial genome sequences has revealed a surprising extent of variability among strains of the same species. The human gastric pathogen Helicobacter pylori is known as one of the most genetically diverse species. We have compared the genome sequence of the duodenal ulcer strain P12 and six other H. pylori genomes to elucidate the genetic repertoire and genome evolution mechanisms of this species. In agreement with previous findings, we estimate that the core genome comprises about 1200 genes and that H. pylori possesses an open pan-genome. Strain-specific genes are preferentially located at potential genome rearrangement sites or in distinct plasticity zones, suggesting two different mechanisms of genome evolution. The P12 genome contains three plasticity zones, two of which encode type IV secretion systems and have typical features of genomic islands. We demonstrate for the first time that one of these islands is capable of self-excision and horizontal transfer by a conjugative process. We also show that excision is mediated by a protein of the XerD family of tyrosine recombinases. Thus, in addition to its natural transformation competence, conjugative transfer of genomic islands has to be considered as an important source of genetic diversity in H. pylori.
Article
Full-text available
Genes present in only certain strains of a bacterial species can strongly affect cellular phenotypes and evolutionary potentials. One segment that seemed particularly rich in strain-specific genes was found by comparing the first two sequenced Helicobacter pylori genomes (strains 26695 and J99) and was named a "plasticity zone". We studied the nature and evolution of plasticity zones by sequencing them in five more Helicobacter strains, determining their locations in additional strains, and identifying them in recently released genome sequences. They occurred as discrete units, inserted at numerous chromosomal sites, and were usually flanked by direct repeats of 5'AAGAATG, a sequence generally also present in one copy at unoccupied sites in other strains. This showed that plasticity zones are transposable elements, to be called TnPZs. Each full length TnPZ contained a cluster of type IV protein secretion genes (tfs3), a tyrosine recombinase family gene ("xerT"), and a large (>or=2800 codon) orf encoding a protein with helicase and DNA methylase domains, plus additional orfs with no homology to genes of known function. Several TnPZ types were found that differed in gene arrangement or DNA sequence. Our analysis also indicated that the first-identified plasticity zones (in strains 26695 and J99) are complex mosaics of TnPZ remnants, formed by multiple TnPZ insertions, and spontaneous and transposable element mediated deletions. Tests using laboratory-generated deletions showed that TnPZs are not essential for viability, but identified one TnPZ that contributed quantitatively to bacterial growth during mouse infection and another that affected synthesis of proinflammatory cytokines in cell culture. We propose that plasticity zone genes are contained in conjugative transposons (TnPZs) or remnants of them, that TnPZ insertion is mediated by XerT recombinase, and that some TnPZ genes affect bacterial phenotypes and fitness.
Article
Full-text available
Mauve Contig Mover provides a new method for proposing the relative order of contigs that make up a draft genome based on comparison to a complete or draft reference genome. A novel application of the Mauve aligner and viewer provides an automated reordering algorithm coupled with a powerful drill-down display allowing detailed exploration of results. Availability: The software is available for download at http://gel.ahabs.wisc.edu/mauve. Contact: rissman@wisc.edu Supplementary information: Supplementary data are available at Bioinformatics online and http://gel.ahabs.wisc.edu
Article
Full-text available
Unlabelled: WebACT is an online resource which enables the rapid provision of simultaneous BLAST comparisons between up to five genomic sequences in a format amenable for visualization with the well-known Artemis Comparison Tool (ACT). Comparisons can be generated on-the-fly using sequences directly retrieved via EMBL database queries, or by entering or uploading user sequences. Furthermore, pre-computed comparisons are available between all publicly available, completed prokaryotic genomes and plasmids currently contained within the Genome Reviews database (372 sequences, representing 175 different species). The system is designed to minimize the volume of downloaded data and maximize performance. Genome sequences, annotation and pre-computed comparisons are stored in a relational database allowing flexible querying based on user-defined sequence regions, from whole genome to a defined region flanking a specified gene. Comparison and sequence files, whether computed online or retrieved from the database of pre-computed genome comparisons, can be viewed online using ACT and are available for download. Availability: Freely accessible at http://www.webact.org. Supplementary information: User guide and worked examples are available at http://www.webact.org/WebACT/docs.
Article
Full-text available
The number of prokaryotic genome sequences becoming available is growing steadily and is growing faster than our ability to accurately annotate them. We describe a fully automated service for annotating bacterial and archaeal genomes. The service identifies protein-encoding, rRNA and tRNA genes, assigns functions to the genes, predicts which subsystems are represented in the genome, uses this information to reconstruct the metabolic network and makes the output easily downloadable for the user. In addition, the annotated genome can be browsed in an environment that supports comparative analysis with the annotated genomes maintained in the SEED environment. The service normally makes the annotated genome available within 12-24 hours of submission, but ultimately the quality of such a service will be judged in terms of accuracy, consistency, and completeness of the produced annotations. We summarize our attempts to address these issues and discuss plans for incrementally enhancing the service. By providing accurate, rapid annotation freely to the community we have created an important community resource. The service has now been utilized by over 120 external users annotating over 350 distinct genomes.
The RAST server: rapid annotations using subsystems technology
  • R K Aziz
  • D Bartels
  • A A Best
  • M Dejongh
  • T Disz
  • R A Edwards
  • K Formsma
  • S Gerdes
  • E M Glass
  • M Kubal
  • F Meyer
  • G J Olsen
  • R Olson
  • A L Osterman
  • R A Overbeek
  • L K Mcneil
  • D Paarmann
  • T Paczian
  • B Parrello
  • G D Pusch
  • C Reich
  • R Stevens
  • O Vassieva
  • V Vonstein
  • A Wilke
  • O Zagnitko
Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman AL, Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch GD, Reich C, Stevens R, Vassieva O, Vonstein V, Wilke A, Zagnitko O. 2008. The RAST server: rapid annotations using subsystems technology. BMC Genomics 9:75. doi:10.1186/1471-2164-9-75.
Sequencing, annotation, and comparative genome analysis of the gerbil-adapted Helicobacter pylori strain B8
  • M Farnbacher
  • T Jahns
  • D Willrodt
  • R Daniel
  • R Haas
  • A Goesmann
  • S Kurtz
  • G Rieder
Farnbacher M, Jahns T, Willrodt D, Daniel R, Haas R, Goesmann A, Kurtz S, Rieder G. 2010. Sequencing, annotation, and comparative genome analysis of the gerbil-adapted Helicobacter pylori strain B8. BMC Genomics 11:335. doi:10.1186/1471-2164-11-335.
A global overview of the genetic and functional diversity in the Helicobacter pylori cag pathogenicity island
  • P Olbermann
  • C Josenhans
  • Y Moodley
  • M Uhr
  • C Stamer
  • M Vauterin
  • S Suerbaum
  • M Achtman
  • B Linz
Olbermann P, Josenhans C, Moodley Y, Uhr M, Stamer C, Vauterin M, Suerbaum S, Achtman M, Linz B. 2010. A global overview of the genetic and functional diversity in the Helicobacter pylori cag pathogenicity island. PLoS Genet. 6:e1001069. doi:10.1371/journal.pgen.1001069.