ArticlePDF Available

Comparative genomics reveals mechanism for short-term and long-term clonal transitions in pandemic Vibrio cholerae

Authors:
  • EzBiome Inc

Abstract and Figures

Vibrio cholerae, the causative agent of cholera, is a bacterium autochthonous to the aquatic environment, and a serious public health threat. V. cholerae serogroup O1 is responsible for the previous two cholera pandemics, in which classical and El Tor biotypes were dominant in the sixth and the current seventh pandemics, respectively. Cholera researchers continually face newly emerging and reemerging pathogenic clones carrying diverse combinations of phenotypic and genotypic properties, which significantly hampered control of the disease. To elucidate evolutionary mechanisms governing genetic diversity of pandemic V. cholerae, we compared the genome sequences of 23 V. cholerae strains isolated from a variety of sources over the past 98 years. The genome-based phylogeny revealed 12 distinct V. cholerae lineages, of which one comprises both O1 classical and El Tor biotypes. All seventh pandemic clones share nearly identical gene content. Using analogy to influenza virology, we define the transition from sixth to seventh pandemic strains as a "shift" between pathogenic clones belonging to the same O1 serogroup, but from significantly different phyletic lineages. In contrast, transition among clones during the present pandemic period is characterized as a "drift" between clones, differentiated mainly by varying composition of laterally transferred genomic islands, resulting in emergence of variants, exemplified by V. cholerae O139 and V. cholerae O1 El Tor hybrid clones. Based on the comparative genomics it is concluded that V. cholerae undergoes extensive genetic recombination via lateral gene transfer, and, therefore, genome assortment, not serogroup, should be used to define pathogenic V. cholerae clones.
Content may be subject to copyright.
Comparative genomics reveals mechanism for
short-term and long-term clonal transitions
in pandemic
Vibrio cholerae
Jongsik Chun
a,b,c
, Christopher J. Grim
b
, Nur A. Hasan
d,e
, Je Hee Lee
a,c
, Seon Young Choi
a,c
, Bradd J. Haley
d
, Elisa Taviani
d
,
Yoon-Seong Jeon
c
, Dong Wook Kim
c
, Jae-Hak Lee
a
, Thomas S. Brettin
f
, David C. Bruce
f
, Jean F. Challacombe
f
,
J. Chris Detter
f
, Cliff S. Han
f
, A. Christine Munk
f
, Olga Chertkov
f
, Linda Meincke
f
, Elizabeth Saunders
f
,
Ronald A. Walters
g
, Anwar Huq
d
, G. Balakrish Nair
h
, and Rita R. Colwell
b,d,i,1
aSchool of Biological Sciences and Institute of Microbiology, Seoul National University, Seoul 151-742, Republic of Korea; bCenter for Bioinformatics and
Computational Biology, University of Maryland Institute for Advanced Computer Studies, and dMaryland Pathogen Research Institute, University of
Maryland, College Park, MD 20742; cInternational Vaccine Institute, Seoul 151-818, Republic of Korea; eInternational Center for Diarrheal Disease
Research, Bangladesh, Dhaka-1000, Bangladesh; fBioscience Division, Department of Energy Joint Genome Institute, Los Alamos National Laboratory,
Los Alamos, NM 87545; gPacific Northwest National Laboratory, Richland, WA 99352; hNational Institute of Cholera and Enteric Diseases,
Beliaghata, Kolkata 700 010, India; and iJohns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205
Contributed by Rita R. Colwell, July 21, 2009 (sent for review May 19, 2009)
Vibrio cholerae, the causative agent of cholera, is a bacterium
autochthonous to the aquatic environment, and a serious public
health threat. V. cholerae serogroup O1 is responsible for the
previous two cholera pandemics, in which classical and El Tor
biotypes were dominant in the sixth and the current seventh
pandemics, respectively. Cholera researchers continually face
newly emerging and reemerging pathogenic clones carrying di-
verse combinations of phenotypic and genotypic properties, which
significantly hampered control of the disease. To elucidate evolu-
tionary mechanisms governing genetic diversity of pandemic V.
cholerae, we compared the genome sequences of 23 V. cholerae
strains isolated from a variety of sources over the past 98 years. The
genome-based phylogeny revealed 12 distinct V. cholerae lineages,
of which one comprises both O1 classical and El Tor biotypes. All
seventh pandemic clones share nearly identical gene content.
Using analogy to influenza virology, we define the transition from
sixth to seventh pandemic strains as a ‘‘shift’’ between pathogenic
clones belonging to the same O1 serogroup, but from significantly
different phyletic lineages. In contrast, transition among clones
during the present pandemic period is characterized as a ‘‘drift’’
between clones, differentiated mainly by varying composition of
laterally transferred genomic islands, resulting in emergence of
variants, exemplified by V. cholerae O139 and V. cholerae O1 El Tor
hybrid clones. Based on the comparative genomics it is concluded
that V. cholerae undergoes extensive genetic recombination via
lateral gene transfer, and, therefore, genome assortment, not
serogroup, should be used to define pathogenic V. cholerae clones.
genomic islands cholera toxin prophage lateral gene transfer
Vibrio cholerae, a bacterium autochthonous to the aquatic
environment, is the causative agent of cholera, a severe, water y,
life-threatening diarrheal disease. Historically, cholera bacteria
have been serogrouped based on their somatic O antigens, with
200 serogroups identified to date (1). Although strains from many
of the serogroups of V. cholerae have caused either individual cases
of mild gastroenteritis or local outbreaks of gastroenteritis, only the
toxigenic strains of serogroups O1 and O139 have been identified
as agents of cholera epidemics. Genes coding for cholera toxin,
ctxAB, and other virulence factors have been shown to reside in
bacteriophages and various mobile genetic elements. In addition, V.
cholerae serogroup O1 is differentiated into two biotypes, classical
and El Tor, by a combination of biochemical traits and sensitivity
to specific bacteriophages (2).
Throughout human history cholera pandemics have been re-
corded with seven such pandemics characterized over the past
hundred or more years. Today the disease remains endemic only in
developing countries, even though V. cholerae is native to estuaries
and river systems throughout the world (3). Isolates of the sixth
pandemic were almost exclusively of the O1 classical biotype,
whereas the current (seventh) pandemic is dominated by V. cholerae
O1 El Tor biotype, a transition occurring between 1905 and 1961.
The six pandemics previous to the current pandemic are considered
to have originated in the Indian subcontinent, whereas the seventh
pandemic strain was first isolated in the Indonesian island of
Sulawesi in 1961, and subsequently in Asia, Africa, and Latin
America.
Over the last 20 years, several new epidemic lineages of V.
cholerae O1 El Tor have emerged or reemerged. In 1992, a new
serogroup of V. cholerae, O139, was identified as the cause of
epidemic cholera in India and Bangladesh (4). That is, both V.
cholerae O1 El Tor and O139 consistently have been isolated where
the major cholera epidemics have occurred since 1992, although V.
cholerae O139 appears still to be restricted to Asia. Additionally, V.
cholerae ‘‘hybrid’’ O1 El Tor variants that carry the classical type
CTX prophage, or produce classical type cholera toxin subunit B
have been isolated repeatedly in Bangladesh (5, 6) and Mozam-
bique (7). These new variants have replaced the prototype seventh
pandemic V. cholerae O1 El Tor strains in Asia and Africa, with
respect to frequency of isolation from clinical cases of cholera.
It is clear that the dynamics of V. cholerae, like other enteric
pathogens, involve extensive lateral gene transfer via transduction,
conjugation, and transformation (2, 8, 9). However, the evolution-
ary history of this bacterium remains to be documented. Here, we
compare the genome sequences of 23 V. cholerae strains (Table 1),
representing diverse serogroups isolated at various times over the
past 98 years from a variety of sources and geographical locations.
We conclude that the current pandemic is caused by strains
belonging to a single phyletic line, diversified mainly by lateral gene
transfer occurring in the natural environment.
Results and Discussion
Phylogeny and Gene Content of
Vibrio cholerae
.Phylogenetic anal-
ysis, accomplished using 1.4 million bp of orthologous protein-
coding regions for 23 V. cholerae strains, revealed 12 distinct
Author contributions: J.C., C.J.G., R.A.W., A.H., G.B.N. and R.R.C. designed research; N.A.H.,
T.S.B., D.C.B., J.F.C., J.C.D., C.S.H., A.C.M., O.C., L.M., and E.S. performed research; J.C.,
C.J.G., N.A.H., J.H.L., S.Y.C., B.J.H., E.T., Y.-S.J., D.W.K., and J.-H.L analyzed data; and J.C. and
R.R.C. wrote the paper.
The authors declare no conflict of interest.
1To whom correspondence should be addressed. E-mail: rcolwell@umiacs.umd.edu.
This article contains supporting information online at www.pnas.org/cgi/content/full/
0907787106/DCSupplemental.
15442–15447
PNAS
September 8, 2009
vol. 106
no. 36 www.pnas.orgcgidoi10.1073pnas.0907787106
phyletic lineages. Strains belonging to non-O1/non-O139 sero-
groups from various sources showed significant genomic diversity
(Fig. 1A). In fact, each unique phyletic line adds 206 new genes to
the pan-genome of V. cholerae, on average (See SI Text and Fig. S1).
In contrast, all V. cholerae serogroup O1 strains, except for two,
comprised a monophyletic clade, designated V. cholerae phylocore
genome (PG) clade. Strains of both the sixth and seventh pandemics
are concluded to have evolved from a common ancestor of this PG
clade.
Twelve strains of the PG clade were further divided into two
subgroups, as shown in the phylogenetic tree constructed using
2.6 million bp alignment (Fig. 1 Aand B). The PG-1 subclade is
comprised of most of the V. cholerae O1 El Tor strains and one V.
cholerae O139 strain, whereas the PG-2 subclade contains strains of
V. cholerae O1 classical and O37 serogroups. Interestingly, all
clinical isolates associated with the current seventh cholera pan-
demic formed a very tight, monophyletic clade within the PG-1
subclade, which we have designated the seventh pandemic (7P)
clade (Fig. 1C). V. cholerae O1 El Tor and O139 strains isolated
from the Indian subcontinent and Africa epidemics during 1975 to
2004 are located in the 7P clade. We use the terms shift and drift
in a manner similar in some respects to their use in studies of the
influenza virus. In particular, we use shift to refer to long-term
accumulation of numerous base pair mutations whereas drift to
refer to short-term changes resulting from horizontal acquisition of
genomic islands.
O Serogrouping in the Context of Genome Evolution. The lipopoly-
saccharide (LPS) of V. cholerae consists of three major regions: lipid
A, core oligosaccharide (OS), and O antigen. V. cholerae synthesizes
Table 1. Characteristics of Vibrio cholerae strains analyzed in this study
Strain
Genome
code Serogroup Biotype
Geographical
Origin
Source of
isolation
Year of
isolation
Sequencing
status*
No. of
contigs Accession
N16961 VCN16961 O1 Inaba El Tor Bangladesh Clinical 1975 Complete 2 AE003852/AE003853
RC9 VCRC9 O1 Ogawa El Tor Kenya Clinical 1985 S/4/E 11 ACHX00000000
MJ-1236 VCMJ1236 O1 Inaba El Tor Matlab, Bangladesh Clinical 1994 Complete 2 CP001485/CP001486
B33 VCB33 O1 Ogawa El Tor Beira, Mozambique Clinical 2004 S/4/E 17 ACHZ00000000
CIRS 101 VCCIRS101 O1 Inaba El Tor Dhaka, Bangladesh Clinical 2002 S/4/E 18 ACVW00000000
MO10 VCMO10 O139 Madras, India Clinical 1992 S/4 84 AAKF03000000
2740–80 VC274080 O1 Inaba El Tor US Gulf Coast Water 1980 Sanger 257 AAUT01000000
BX330286 VCBX330286 O1 Inaba El Tor Australia Water 1986 Complete 8 ACIA00000000
MAK757 VCMAK757 O1 Ogawa El Tor Celebes Islands Clinical 1937 Sanger 206 AAUS00000000
NCTC 8457 VC8457 O1 Inaba El Tor Saudi Arabia Clinical 1910 Sanger 390 AAWD01000000
O395 VCO395 O1 Ogawa Classical India Clinical 1965 Complete 2 CP000626/CP000627
V52 VCV52 O37 Sudan Clinical 1968 Sanger 268 AAKJ02000000
12129(1) VC12129 O1 Inaba El Tor Australia Water 1985 S/4/E 12 ACFQ00000000
TM 11079-80 VCTM11079 O1 Ogawa El Tor Brazil Sewage 1980 S/4/E 35 ACHW00000000
VL426 VCVL426 non-O1/O139 Albensis Maidstone, Kent, UK Water Unknown Complete 5 ACHV00000000
TMA21 VCTMA21 non-O1/O139 Brazil Seawater 1982 S/4/E 20 ACHY00000000
1587 VC1587 O12 Lima, Peru Clinical 1994 Sanger 254 AAUR01000000
RC385 VCRC385 O135 Chesapeake Bay Plankton 1998 Sanger 550 AAKH02000000
MZO-2 VCMZO2 O14 Bangladesh Clinical 2001 Sanger 162 AAWF01000000
V51 VCV51 O141 USA Clinical 1987 Sanger 360 AAKI02000000
MZO-3 VCMZO3 O37 Bangladesh Clinical 2001 Sanger 292 AAUU01000000
AM-19226 VCAM19226 O39 Bangladesh Clinical 2001 Sanger 154 AATY01000000
623–39 VC62339 non-O1/O139 Bangladesh Water 2002 Sanger 314 AAWG00000000
*Sanger, Draft assemblies by Sanger sequencing; S/4, Sanger sequencing and 454 pyrosequencing were combined; S/4/E, S/4 followed by quality improvement
by standard genome sequencing procedures.
O135 RC385
biovar albensis
VL426
O141
V51
non-O1/O139
623-39
O1 El Tor
12129(1)
O14 MZO-2
O12 1587
O37 MZO-3
Non-O1/O139
TMA21
O39
AM-19226
O1 El Tor
TM11079-80
Vibrio cholerae
Phylocore Genome (PG)
clade
PG-1
subclade
PG-2
subclade
O37
V52
O1 classical
O395
0.002
O37 V52
O1 classical O395
O1 El Tor 2740-80
O1 El Tor MAK757
O1 El Tor BX330286
O1 El Tor NCTC 8457
7th Pandemic (7P)
clade
0.0005
100
100
100
100
Genome “Shift”
c
Genome “Drift”
A
O1 El Tor RC9
O1 El Tor N16961
O139 MO10
O1 El Tor B33
O1 El Tor MJ-1236
100
100
99
0.000005
O1 El Tor CIRS 101
100
B
C
Fig. 1. Neighbor-joining trees showing phylogenetic relationships of 23 V. cholerae strains representing diverse serogroups. (A) All V. cholerae strains based
on 1,676 genes (1,370,469 bp). (B) Phylocore genome (PG) clade based on 2,663 genes (2,567,393 bp). (C) Seventh pandemic (7P) clade based on 3,364 genes
(3,291,577 bp). Bootstrap supports, as percentage, are indicated at the branching points. Bars represent the numbers of substitution per site, respectively. Only
orthologous genes showing 95% nucleotide sequence similarity to those of V. cholerae N16961 were selected. The tree was rooted using Vibrio vulnificus YJ016
and Vibrio parahaemolyticus RIMD 2210633.
Chun et al. PNAS
September 8, 2009
vol. 106
no. 36
15443
MICROBIOLOGY
core OS and O antigen using wav and wb* gene clusters, respectively
(10, 11). Molecular phylogeny and genetic organization of the wav
and wb* gene clusters are summarized in SI Text, and Figs. S2 and
S3.
In contrast to the limited diversity observed in the wav gene
cluster (5 major types), 11 different types of wb* gene clusters were
observed among the 23 strains. Phylogeny and genetic organization,
based on the whole genome (Fig. 1), core OS, and O antigen gene
clusters (Fig. S2A), clearly indicate both core OS and O antigen
gene clusters have been horizontally transferred. The relatively
stable gene order (synteny) of the core OS gene cluster suggests that
it transfers as an entity. In contrast, the region coding for the O
antigen is comprised of combinations of several smaller gene sets
of different origin, leading to a remarkable diversity of the various
O antigens seen in nature (Fig. S2B). This finding is in good
agreement with the study in ref. 12 showing that the gene cluster
coding for the O139 antigen is similar to that of V. cholerae
serogroup O22, where substitution of a part of the cluster occurred,
but not a deletion.
Genome phylogeny (Fig. 1A) revealed that strains of O1 sero-
group are found in three different phyletic lineages, namely the PG
clade, and the V. cholerae O1 El Tor 12129 (1) and TM11079-80
strains, in which the coding region for the O1 antigen is nearly
identical. It is concluded that the O1 antigen phenotype arose by
lateral gene exchange at least three times in the evolution of V.
cholerae presented here. Furthermore, we hypothesize that the
ancestor of the PG clade possessed a combination of the type 1 core
OS and the O1 antigen gene clusters, giving rise to the present 12
V. cholerae PG strains, including the two V. cholerae non-O1 strains
(V52 and MO10). The latter two became different serogroups by
gene replacement, via lateral gene transfer, with strain V52 receiv-
ing both type 1 core OS and O37 antigen gene clusters from a V.
cholerae O37 strain and V. cholerae MO10 receiving only the V.
cholerae O139 antigen gene cluster from an unknown source, most
likely a variant of the V. cholerae O22 serogroup (12).
The V. cholerae O1 strains not belonging to the PG group, V.
cholerae 12129 (1) and TM11079-80, are environmental isolates
from Australia isolated in 1985 (13), and from Brazil, isolated in
1980 (14). They showed the typical El Tor phenotype, but unlike
other V. cholerae O1 El Tor strains in the PG-1 subclade, lack the
two major virulence-related genomic islands, i.e., CTX prophage
containing ctxAB and Vibrio pathogenicity island-1 (VPI-1) con-
taining genes for biosynthesis of the toxin coregulated pilin (TCP).
By comparing genome phylogenies based on the whole genome
(Fig. 1) and gene clusters coding for the core OS (Fig. S2A) and O1
antigen (Fig. S2C), it is clear that genesis of these nontoxigenic V.
cholerae O1 El Tor strains can be attributed to independent lateral
gene transfer events, most probably transfer of only the O1 antigen
gene cluster, but not the core OS region.
Four O serogroup conversions, from non-O1 to O1 (twice), O1
to O139, and O1 to O37, were detected among the 23 V. cholerae.
Several previous studies suggested such conversions take place in
nature (11, 14, 15), and chitin-induced natural transformation has
been proposed as the mechanism in the natural environment (16).
V. cholerae O1 to O139 serogroup conversion by a single-step
exchange of large fragments of DNA was demonstrated in a
microcosm experiment (9), and is supported by the conclusion of
this study that O serogroup conversion occurs frequently in nature.
Mobility of the O phenotype in V. cholerae was first proposed by
Colwell et al. (17), and the cumulative results of both in vivo and in
vitro experiments are compelling. Given the inconsistency between
O serogroup typing and genome-based phylogeny, we conclude
that, at the very least, the term ‘‘O1 El Tor’’ is both misleading and
inaccurate for describing a set of phylogenetically coherent V.
cholerae strains, in light of the frequency of serogroup conversion.
Therefore, we propose a new terminology based on genome
sequence; namely the phylocore genome (PG) clade, PG-1 and
PG-2 subclades, and seventh pandemic (7P) clade, to describe
homologous intraspecific groups of V. cholerae (Fig. 1).
Virulence-Associated Prophage and Genomic Islands Within the Con-
text of the Genome. V. cholerae possesses several known virulence
factors, of which the cholera toxin (CT) and TCP are considered the
most significant. Genes coding for CT (ctxAB) are part of a
temperate filamentous bacteriophage CTX
(8) that can be incor-
porated into both chromosomes of V. cholerae at specific positions.
The CTX
genes were found to be present in members of the PG
clade, except for V. cholerae NCTC 8457 and 2740–80. Among
non-PG strains, only V. cholerae serogroup O141 (V51) contains
this prophage.
The CTX
found in classical and El Tor biotypes differs in the
sequence of their repressor gene, rstR, and are classified as
CTX
Class
and CTX
El Tor
, according to the biotype of the original
hosts in which they were described (18). From the genome se-
quences, we found that CTX
Class
is not restricted to the classical
biotype, but is also widely distributed in V. cholerae O1 El Tor and
O141 strains (Fig. 2). Given that V. cholerae O1 El Tor MAK 757,
a clinical strain isolated in 1937, has this type of prophage, corre-
lation of host biotype and prophage type is not considered signif-
icant. A recent study(19) showing the infection of CTX
Class
to V.
cholerae non-O1 supports this finding.
Chromosomal attachment sites for CTX
are known to harbor
Large chromosome Small chromosome
O1 El Tor
MJ-1236/B33 CTXClass CTXClass
O1 El Tor
RC9 TLC RS1 RS1 CTXEl Tor CTXEl Tor RS1
O1 El Tor
N16961 RS1
CTXEl Tor
TLC
Classical type CT B
O1 El Tor
NCTC 8457 GI-19
O39 AM-19226
O12 1587 GI-19 VSK
623-39 GI-33
GI-19
O39
MZO-3 GI-48
GI-19
TMA2 1
O135 RC385 GI-43
GI-33
O1 El Tor
12129(1) GI-19
O14
MZO-2 GI-19
O1 El Tor
TM11079-80 GI-33
VL426 GI-19
Truncated prophage
O1 El Tor
CIRS101 RS1
TLC CTXEl Tor
O139
MO10 CTXEl Tor
TLC VSK
O1 El Tor
BX330286 RS1Env
CTXClass CTXClass
TLC
O1 El Tor
MAK757 GI-22
TLC CTXClass
O1 El Tor
2740-80 TLC
O1 classical
O395 TLC CTXClass CTXClass CTXClass
O37 V52 *TLC CTXEl Tor RS1
O141 V51 GI-33 CTXClass RS1
Fig. 2. Schematicrepresentation of various prophages and genetic elements
present in the target regions of CTX
insetion. *, TLC, El Tor type CTX
, RS1
element are found, but no positional information can be obtained from
genome assemblies. †, classical type CTX
and RS1 are present, but no posi-
tional information can be obtained.
15444
www.pnas.orgcgidoi10.1073pnas.0907787106 Chun et al.
other genetic elements, including toxin-linked cryptic (TLC), RS1
elements, and VSK(pre-CTX) prophages (20, 21). We have
discovered five genomic islands (GI-19, GI-22, GI-33, GI-43, GI-48;
for details see Table S1) in the region of the CTX
attachment sites
on both chromosomes. In total, nine distinct genetic elements were
found in these regions, where they appear in different combinations
(Fig. 2). Seven strains possess GI-19 in either chromosome, which
is similar but not identical to KSF-1
, previously discovered in an
environmental V. cholerae strain (22). It is evident that more
bacteriophages/genetic elements are located in the CTX
attach-
ment regions of PG strains than non-PG strains. The ability to
harbor more, especially toxigenic, bacteriophage-like elements in
these regions of the PG strains might explain why only PG strains
have been the agents of the pandemics. We found no two toxigenic
(CTX
-harboring) strains with identical GI organization and com-
bination, with the exception of two hybrid strains (the only 7P
members harboring CTX
Class
). It is evident from Fig. 2 that the
two CTX
attachment sites serve as an engine of genetic diversity
for the V. cholerae PG clade.
Genes coding for TCP are part of a genomic island, VPI-1
present in all PG strains. Among the non-PG strains, only V.
cholerae O141 V51 contained VPI-1 but with less sequence simi-
larity. Because TCP serves as a receptor for CTX
(8), it explains
why only this strain, of all non-PG strains, possesses CTX
. Results
of phylogenetic analysis using the 24 genes of VPI-1 indicate that
the original GI of V. cholerae NCTC 8457 was replaced by VPI-1 of
a non-PG strain (Fig. S4). Interestingly, GI-47, but not VPI-1, was
found in strains MZO-3, 1587, MZO-2, and VL426 in the same
genomic region. This cassette-like property of GI mobility was also
observed for the other known pathogenicity islands, including
VPI-2, VSP-1, and VSP-2 (Table S2).
Extensive Lateral Gene Transfer in
V. cholerae
.Because it is generally
accepted that lateral gene transfer plays an important role in the
evolution of many pathogenic bacteria, V. cholerae serves as a useful
paradigm. For purposes of this study, a GI is defined as a genomic
region containing five or more ORFs, where transfer, but not
deletion, is obvious from comparison of genome phylogeny and its
presence/absence among test strains. A total of 73 GIs were
identified (Table S1) and their chromosomal locations are shown in
Fig. 3. As discussed above, with respect to GIs associated with O
antigen biosynthesis, CTX
, VPI-1,2 and VSP-2, a total of 13
genomic regions (eight in the large and five in the small chromo-
some) were found to have a cassette-like property, whereby differ-
ent GIs occupy the same or a similar region (Table S2). Most GIs
were singletons in a given genome, although two (GI-12, GI-21)
were present as four and two copies, respectively. Thus, we con-
clude that genetic diversity of V. cholerae derives most significantly
from lateral gene transfer, of which several transfers are cassettes.
Genomic Definition of the
V. cholerae
Phylocore Genome (PG) Clade
and Pandemic Strains. The V. cholerae PG clade, with both sixth and
seventh pandemic strains, is defined by gene content. Twenty-seven
genes are present exclusively in the genomes of the PG strains, but
only five genes are unique to the PG-1 subclade. Four of these
(VCA0198 –VCA0201) comprise a genomic island (GI-5) on the
small chromosome, including genes coding for cytosine-specific
DNA methyltransferase (23) and hypothetical proteins, adjacently
located to the IS1004 transposase gene. The 7P strains are differ-
entiated in harboring two unique GIs, the Vibrio seventh pandemic
island-1 (VSP-1) and VSP-2, first discovered by microarray analysis
(24). In addition to the 7P strains, a variant of VSP-1 was found in
V. cholerae biovar albensis VL426 (Fig. S5). Similarly, VSP-2 like
GIs were detected in three non-PG strains (TMA21, O39 MZO-3,
O135 RC385). Interestingly, a similar GI was also detected in Vibrio
vulnificus YJ016 and Vibr io splendidus 12B01, suggesting that VSP-2
may be widespread among vibrios (Fig. S6). It should be noted that
the stability of these well known pathogenicity islands among 7P
members is questionable, because most of VPI-2 and VSP-2 were
deleted in MO10 and CIRS 101, respectively.
V. cholerae contains a superintegron, a large integron island
(gene capture system), in the small chromosome (120 Kbp),
comprising predominantly hypothetical genes and proposed as a
source of genetic variation (25). All V. cholerae strains examined in
this study have this integron, a source of much of the variation in
gene content (Fig. S7). Interestingly, if this region is excluded, all six
members of the 7P clade have an identical gene content, with the
exception of a few genomic islands, including those found in the
CTX
attachment region. An SXT element belonging to a family
of conjugative transposon-like mobile genetic elements encodes
multiple antibiotic resistance genes and is present only in V. cholerae
MO10, CIRS 101, MJ-1236, and B33, but not in the other V.
cholerae strains. V. cholerae O139 MO10 differs from other mem-
bers of the 7P clade in having an O139 antigen specific genomic
island, a finding strongly supporting the conclusion of several
previous studies that V. cholerae O139 derives from a seventh
Fig. 3. Genomic representation of genomic islands of both V. cholerae chromosomes. The two circles in the middle represent the genes in V.cholerae O1 El
Tor N16961. The inner circle indicates genomic islands found in strain N16961, whereas the outer circles are those absent in strain N16961.
Chun et al. PNAS
September 8, 2009
vol. 106
no. 36
15445
MICROBIOLOGY
pandemic V. cholerae O1 El Tor strain (26). No other V. cholerae
O139-specific genes were found in V. cholerae MO10.
The hybrid strains, possessing an El Tor biotype phenotype, but
classical biotype CTX
, were isolated during current cholera
epidemics in Asia and Africa (6, 7). Two hybrid strains (B33 and
MJ-1236) share a virtually identical genome backbone. Among
3,587,239 bp of orthologous protein-coding regions, only 106 nu-
cleotide positions are different and the only significant difference
is the presence of a V. cholerae MJ-1236 specific 19,729 bp genomic
island (GI-12). This GI occurs four times as an almost identical
sequence in the large chromosome, with 14 genes including those
coding for the putative phage integrase and type I restriction-
modification system, probably a recently introduced temperate
bacteriophage. It is not clear why the hybrid strains outcompete V.
cholerae O1 El Tor/O139 in the clinical setting, but a key to this
puzzle surely lies in differences among closely related strains, i.e.,
tandem copies of CTX
Class
, GI-14 and single nucleotide polymor-
phisms. In addition to these hybrid clones, V. cholerae O1 El Tor
strains producing the classical type of cholera toxin B repeatedly
have been isolated from patients in Asia and Africa (6). The
genome sequence of a representative of this newly emerged group,
i.e., V. cholerae CIRS 101, reveals that these strains also have a
typical 7P gene content, but with CTX
El Tor
, not CTX
Class
, albeit
expressing the classical type subunit B protein (Fig. 2).
The comparative genomics of phylogenetically diverse strains has
permitted analysis of the mechanism by which current seventh
pandemic clones may have arisen. An highly conserved gene
content, synteny, and significant similarity among the six strains of
the 7P clade indicate that these V. cholerae strains share an almost
identical genome ‘‘backbone,’’ having evolved very recently from a
common ancestral strain. An hypothetical evolutionary pathway
proposed for V. cholerae (Fig. 4), with GI migration matched to a
genome-based phylogenetic tree, allows the conclusion that the
ancestor for the 7P clade was a V. cholerae O1 El Tor strain
containing several GIs (VPI-1,2, GI-1 to GI-10), receiving VSP-1,
VSP-2 and GI-11 by lateral gene transfer, and finally giving rise to
the contemporary V. cholerae O1 El Tor and O139 strains. Inter-
estingly, such an hypothetical ancestral strain shows a gene content
similar to V. cholerae O1 El Tor BX330286, isolated from a water
sample collected in Australia in 1986, a geographic location near
Indonesia where the first seventh pandemic V. cholerae O1 El Tor
was reported in 1961.
Mechanism of
V. cholerae
Evolution. There are only a few human
pathogens for which the complete sequences of many isolates are
available (27, 28, 29). Because V. cholerae is both highly pathogenic
for humans and an autochthonous inhabitant of estuaries world-
wide, it provides a unique opportunity to elucidate evolutionary
mechanisms. Furthermore, it is the natural inhabitant of the
estuarine environment of both cholera epidemic and nonepidemic
countries (3).
Unlike Salmonella enterica serovar Typhi and Bacillus anthracis,
bacterial species showing clonal properties, V. cholerae,withStrep-
tococcus agalactiae and Escherichia coli, offers a prime example of
the important role of lateral gene transfer in the evolution of a
bacterial species. The transition from sixth to seventh cholera
pandemic genome type is concluded to result from a change from
V. cholerae O1 classical to O1 El Tor biotype. We propose the term
shift for the event occurring between two distinct phyletic lineages
(Fig. 1B). It should be noted that only one genome of O1 classical
biotype was included in this study, therefore more isolates of this
biotype should be examined to determine its population structure.
In contrast, the present cholera global pandemic is ascribed to a
change among 7P strains, e.g., emergence of V. cholerae O139, V.
cholerae O1 El Tor hybrid, and V. cholerae O1 El Tor with altered
cholera toxin subunit B. These represent transitions among genet-
ically nearly identical clones, with a few different GIs, for which we
propose the term drift. Much as in the case of inf luenza viruses,
cholera bacteria undergo a shift/drift cycle over time, although the
drift in V. cholerae is derived mainly from lateral gene transfer, most
likely occurring in the natural environment in association with its
plankton hosts (3, 30).
Fig. 4. Proposed hypothetical evolutionary pathway of the V. cholerae species. Probable insertions and deletions of genomic islands (Table S1) found in 23 V.
cholerae strains are indicated by black and red arrows, respectively, along the phylogenetic tree based on genome sequence data. Hypothetical ancestral strains
are indicated by open circles.
15446
www.pnas.orgcgidoi10.1073pnas.0907787106 Chun et al.
The present cholera global pandemic is concluded to have been
initiated by multiple descendants of a V. cholerae O1 El Tor
ancestor, diversified and continuously rapidly evolving, mainly via
lateral gene transfer and most likely driven by environmental
factors. Most importantly, the common genome backbone and
variable genomic islands of the 7P clade of V. cholerae require that
a reevaluation be done of the epidemiological practice that employs
serogroups as the primary marker for V. cholerae. The so-called
pandemic clones, identified by serogroup, instead, should be de-
fined by gene content, the description of which offers significantly
greater potential for development of reliable and useful diagnostics,
vaccines, and therapeutics for cholera. Without doubt, more vari-
ants of the 7P clade, as a result of drift, will be encountered in the
future, yielding new serogroups (other than O1 and O139) and
phenotypic combinations. Public health workers will be unprepared
if the evolution of this species remains unappreciated as an ongoing
process in the natural environment, where V. cholerae is autoch-
thonous and plays an important role in the nutrient cycles of the
natural aquatic ecosystem.
Materials and Methods
Genome Sequencing. Draft sequences were obtained from a blend of Sanger and
454 sequences and involved paired end Sanger sequencing on 8-kb plasmid
libraries to 5coverage, 20coverage of 454 data, and optional paired end
Sanger sequencing on 35-kb fosmid libraries to 1–2coverage (depending on
repeat complexity). To finish the genomes, a collection of custom software and
targeted reaction types were used. In addition to targeted sequencing strategies,
Solexa/Illumina data in an untargeted strategy were used to improve low quality
regions and to assist gap closure. Repeat resolution was performed using in-
house custom software. Targeted finishing reactions included transposon bombs
(31), primer walks on clones, primer walks on PCR products, and adapter PCR
reactions. Gene-finding and annotation were achieved using the RAST server (32)
and details are given in Table S3.
Comparative Genomics. Genome to genome comparison was performed using
three approaches, because completeness and quality of nucleotide sequences
varied from strain to strain (Table 1). First, ORFs of a given pair of genomes were
reciprocally compared each other, using the BLASTN, BLASTP and TBLASTX
programs (ORF-dependent comparison). Second, a bioinformatic pipeline was
constructed to identify homologous regions of a given query ORF. Initially, a
segment on target contig, which is homologous to a query ORF, was identified
using the BLASTN program. This potentially homologous region was expanded in
both directions by 2,000 bp. Nucleotide sequences of the query ORF and selected
target homologous region were then aligned using a pairwise global alignment
algorithm (33), and the resultant matched region in the subject contig was
extracted and saved as a homolog (ORF-independent comparison). Orthologs
and paralogs were differentiated by reciprocal comparison. In most cases, both
ORF-dependent and -independent comparisons yielded the same orthologs,
although ORF-independent method performed better for draft sequences of low
quality, in which sequencing errors, albeit rare, hampered identification of
correct ORFs.
Identification and Annotation of Genomic Islands. In this study, we defined
genomic islands (GIs) as a continuous array of five or more ORFs that were found
to be discontinuously distributed among genomes of test strains. Correct transfer
or insertion of GIs was readily differentiated from deletion event by comparing
genome-based phylogenetic tree and full matrices showing pairwise detection of
orthologous genes between test strains. Identified GIs were designated, and
annotated using the BLASTP search of its member ORFs against GenBank NR
database.
Phylogenetic Analyses Based on Genome Sequences. A set of orthologues for
each ORF of V. cholerae N16961 was obtained for different sets of strains, and
then aligned using the CLUSTALW2 (34) program. The resultant multiple align-
ments were concatenated to generate genome scale alignments, which were
subsequently used to reconstruct the neighbor-joining phylogenetic tree (35).
The evolutionary model of Kimura (36) was used to generate the distance matrix.
The program MEGA (37) was used for phylogenetic analysis.
ACKNOWLEDGMENTS. This work was supported in part by Korea Science and
Engineering Foundation National Research Laboratory Program Grant R0A-
2005-000-10110-0 (to J.C.); National Institutes of Health Grant 1RO1A139129-01
(to R.R.C.); National Oceanic and Atmospheric Administration, Oceans and Hu-
man Health Initiative Grant S0660009 (to R.R.C.); Department of Homeland
Security Grant NBCH2070002 (to R.R.C.); Intelligence Community Post-Doctoral
Fellowship Program (to C.J.G.); and the Korean and Swedish governments (to
I.V.I.). Funding for genome sequencing was provided by the Office of the Chief
Scientist and National Institute of Allergy and Infectious Diseases Microbial
Sequencing Centers Grants N01-AI-30001 and N01-AI-40001.
1. Chatterjee SN, Chaudhuri K (2003) Lipopolysaccharides of Vibrio cholerae. I. Physical
and chemical characterization. Biochim Biophys Acta 1639:65–79.
2. Kaper JB, Morris JG, Jr, Levine MM (1995) Cholera Clin Microbiol Rev 8:48– 86.
3. Colwell RR (1996) Global climate and infectious disease: The cholera paradigm. Science
274:2025–2031.
4. Ramamurthy T, et al. (1993) Emergence of novel strain of Vibrio cholerae with epidemic
potential in southern and eastern India. Lancet 341:703–704.
5. Nair GB, et al. (2002) New variants of Vibrio cholerae O1 biotype El Tor with attributes
of the classical biotype from hospitalized patients with acute diarrhea in Bangladesh.
J Clin Microbiol 40:3296–3299.
6. Nair GB, et al. (2006) Cholera due to altered El Tor strains of Vibrio cholerae O1 in
Bangladesh. J Clin Microbiol 44:4211–4213.
7. Ansaruzzaman M, et al. (2004) Cholera in Mozambique, variant of Vibrio cholerae.
Emerg Infect Dis 10:2057–2059.
8. Waldor MK, Mekalanos JJ (1996) Lysogenic conversion by a filamentous phage encod-
ing cholera toxin. Science 272:1910–1914.
9. Blokesch M, Schoolnik GK (2007) Serogroup conversion of Vibrio cholerae in aquatic
reservoirs. PLoS Pathog 3:e81.
10. Nesper J, et al. (2002) Comparative and genetic analyses of the putative Vibrio cholerae
lipopolysaccharide core oligosaccharide biosynthesis (wav) gene cluster. Infect Immun
70:2419–2433.
11. Li M, Shimada T, Morris JG, Jr, Sulakvelidze A, Sozhamannan S (2002) Evidence for the
emergence of non-O1 and non-O139 Vibrio cholerae strains with pathogenic potential
by exchange of O-antigen biosynthesis regions. Infect Immun 70:2441–2453.
12. Yamasaki S, et al. (1999) The genes responsible for O-antigen synthesis of Vibrio
cholerae O139 are closely related to those of Vibrio cholerae O22. Gene 237:321–332.
13. Safa A, et al. (2009) Multilocus genetic analysis reveals that the Australian strains of
Vibrio cholerae O1 are similar to the pre-seventh pandemic strains of the El Tor biotype.
J Med Microbiol 58:105–111.
14. Farfan M, Minana D, Fuste MC, Loren JG (2000) Genetic relationships between clinical
and environmental Vibrio cholerae isolates based on multilocus enzyme electrophore-
sis. Microbiology 146 (Pt 10):2613–2626.
15. Bik EM, Gouw RD, Mooi FR (1996) DNA fingerprinting of Vibrio cholerae strains with
a novel insertion sequence element: A tool to identify epidemic strains. J Clin Microbiol
34:1453–1461.
16. Meibom KL, Blokesch M, Dolganov NA, Wu CY, Schoolnik GK (2005) Chitin induces
natural competence in Vibrio cholerae. Science 310:1824–1827.
17. Colwell RR, Huq A, Chowdhury MA, Brayton PR, Xu B (1995) Serogroup conversion of
Vibrio cholerae. Can J Microbiol 41:946–950.
18. Davis BM, Moyer KE, Boyd EF, Waldor MK (2000) CTX prophages in classical biotype
Vibrio cholerae: Functional phage genes but dysfunctional phage genomes. J Bacteriol
182:6992–6998.
19. Udden SM, et al. (2008) Acquisition of classical CTX prophage from Vibrio cholerae
O141 by El Tor strains aided by lytic phages and chitin-induced competence. Proc Natl
Acad Sci USA 105:11951–11956.
20. Rubin EJ, Lin W, Mekalanos JJ, Waldor MK (1998) Replication and integration of a Vibrio
cholerae cryptic plasmid linked to the CTX prophage. Mol Microbiol 28:1247–1254.
21. Faruque SM, et al. (2007) Genomic analysis of the Mozambique strain of Vibrio
cholerae O1 reveals the origin of El Tor strains carrying classical CTX prophage. Proc
Natl Acad Sci USA 104:5151–5156.
22. Faruque SM, et al. (2003) CTXphi-independent production of the RS1 satellite phage by
Vibrio cholerae. Proc Natl Acad Sci USA 100:1280–1285.
23. Banerjee S, Chowdhury R (2006) An orphan DNA (cytosine-5-)-methyltransferase in
Vibrio cholerae. Microbiology 152(Pt 4):1055–1062.
24. Dziejman M, et al. (2002) Comparative genomic analysis of Vibrio cholerae: Genes that
correlate with cholera endemic and pandemic disease. Proc Natl Acad Sci USA 99:1556–1961.
25. Heidelberg JF, et al. (2000) DNA sequence of both chromosomes of the cholera
pathogen Vibrio cholerae. Nature 406:477–483.
26. Karaolis DK, Lan R, Reeves PR (1994) Molecular evolution of the seventh-pandemic
clone of Vibrio cholerae and its relationship to other pandemic and epidemic V.
cholerae isolates. J Bacteriol 176:6199– 6206.
27. Rasko DA, et al. (2008) The pangenome structure of Escherichia coli: Comparative genomic
analysis of E. coli commensal and pathogenic isolates. J Bacteriol 190:6881–6893.
28. Tettelin H, et al. (2005) Genome analysis of multiple pathogenic isolates of Strepto-
coccus agalactiae: Implications for the microbial ‘‘pan-genome.’’ Proc Natl Acad Sci
USA 102:13950–13955.
29. Holt KE, et al. (2008) High-throughput sequencing provides insights into genome
variation and evolution in Salmonella Typhi. Nat Genet 40:987–993.
30. Constantin de Magny G, et al. (2008) Environmental signatures associated with cholera
epidemics. Proc Natl Acad Sci USA 105:19676–19681.
31. Goryshin IY, Reznikoff WS (1998) Tn5 in vitro transposition. J Biol Chem 273:7367–7374.
32. Aziz RK, et al. (2008) The RAST Server: Rapid annotations using subsystems technology.
BMC Genomics 9:75.
33. Myers EW, Miller W (1988) Optimal alignments in linear space. Comput Appl Biosci
4:11–17.
34. Larkin MA, et al. (2007) Clustal W and Clustal X version 2.0. Bioinformatics 23:2947–2948.
35. Saitou N, Nei M (1987) The neighbor-joining method: A new method for reconstructing
phylogenetic trees. Mol Biol Evol 4:406– 425.
36. Kimura M (1980) A simple method for estimating evolutionary rate of base substitu-
tions through comparative studies of nucleotide sequences. J Mol Evol 16:111–120.
37. Kumar S, Nei M, Dudley J, Tamura K (2008) MEGA: A biologist-centric software for
evolutionary analysis of DNA and protein sequences. Brief Bioinform 9:299–306.
Chun et al. PNAS
September 8, 2009
vol. 106
no. 36
15447
MICROBIOLOGY
... The VPI was present in all L3 and two L2 isolates in this study. The VPI was present in all epidemic and pandemic V. cholerae strains and sporadically in non-O1/non-O139 environmental isolates (43,44). Since the VPI encodes the TCP as a key colonization factor and the receptor for the CTXφ (45), it is widely accepted that by acquiring the VPI and the CTXφ, an environmental strain can become pathogenic (46). ...
... VSP-1 and VSP-2 were markers associated with the seventh pandemic clone although their roles were unclear (47). VSP-1 was found only in the seventh pandemic V. cholerae isolates, but VSP-2 was also sporadically present in isolates of other Vibrio species (43). We found that the VSP-1 and VSP-2 gene clusters were present in a small number of isolates in this study. ...
Article
Full-text available
Vibrio cholerae O1 has caused cholera pandemics. Non-pandemic V. cholerae O1 strains, which are genetically distinctive from the pandemic clones, have been isolated from both human infections and the environment. We aimed to better understand the non-pandemic O1 strains and their pandemic potential. We sequenced 109 non-pandemic O1 isolates from Zhejiang, China (from 1963 to 1996) and compared them with 62 publicly available non-pandemic O1 genomes. The isolates from Zhejiang can be classified into three lineages (L1–L3). All grouped together with L3 sharing the most recent common ancestor with the pandemic clones. L2 and L3 emerged in the 1960s while L1 emerged in the 1970s. L1 and L2 disappeared after the 1990s, but L3 persisted until recently. All isolates contained the type VI secretion system. The Vibrio pathogenicity island was present in all L3 isolates, whereas the type III secretion system was present in all L1 isolates. L2 did not carry any unique virulence genes. An intact CTXφ was present in only two L3 isolates. An intact Vibrio seventh pandemic island 1 was present in only three L3 isolates. The bla CARB-7 gene was identified in 96.3% of L2 isolates. Each of the non-pandemic O1 lineages has unique properties contributing to their capacity to cause disease. Our findings offer new insight into the evolution of O1 V. cholerae for cholera prevention and control. IMPORTANCE It is well recognized that only Vibrio cholerae O1 causes cholera pandemics. However, not all O1 strains cause pandemic-level disease. In this study, we analyzed non-pandemic O1 V. cholerae isolates from the 1960s to the 1990s from China and found that they fell into three lineages, one of which shared the most recent common ancestor with pandemic O1 strains. Each of these non-pandemic O1 lineages has unique properties that contribute to their capacity to cause cholera. The findings of this study enhanced our understanding of the emergence and evolution of both pandemic and non-pandemic O1 V. cholerae .
... Following the reports from numerous investigators, 20,31-36 the detection of SXT/R391 ICE families amongst V. cholerae and other bacterial strains is associated with a tendency for incorporation/recombination of new genes into conserved regions either from the aquatic and/or clinical environment. 20,34,35 Bacterial strains that harbor such incorporated genes are also adjudged as recombinant strains and/or evolving/emerging strains. Based on evolutionary fitness, it is inferred from the study that the environmental strains of non-agglutinating O1/O139 V. cholerae with such positive mobilome genes/dynamics contribute to evolution and emergence hence their detection may be adjudged as emerging strains. ...
... 10,16,17 It is noteworthy that these mobilome genotypes act collectively to facilitate horizontal genetic exchange, change in gene-based characters, and transfer/promote the acquisition of numerous genes. This was the presentation/submission in the reports of Chun et al., 34 Tang et al., 37 During the study, a prototypical ICE (ICEVchInd5Hotspot IV gene) was not detected and/or observed which is similar to Boyd and Waldor, 41 Faruque et al. 42 reports and other investigators. 31,[43][44][45][46] However, following previous studies, such occurrence shows that the environmental strains may be inclined to potential outbreak but no outbreak was reported during the study period. ...
Article
Full-text available
Acute diarrhea and cholera (AWD/C) result in more than 21000 to 143000 global mortality annually and are associated with Vibrio cholerae. The pathogen has shown increasing evolutionary/emerging dynamics linked with mobilome or ubiquitous nature of mobile integrative genetic and conjugative elements (MIGCE), however, such dynamics are rarely reported amongst somatic-antigen non-agglutinating Type-1/-139 V. cholerae (SA-NAG-T-1/139Vc). The study reports the genetic detection of mobilome-associated indices in SA-NAG-T-1/139Vc as a potential strategy for differentiating/discriminating emerging environmental bacteria. Presumptive V. cholerae isolates were retrieved from five water sources, while strains were characterized/serogrouped and confirmed using simplex and comparative-genomic-multiplex Polymerase Chain Reaction (PCR). Genomic island (GI-12det, GI-14det, GI-15det); Phages (TLC-phagedet, Kappa-phagedet) and ICEs of the SXT/R391 family genes (SXT/R391-ICEs integrase, SXT-Hotspot-IV, ICEVchInd5Hotspot-IV, ICEVchMoz10Hotspot-IV) were detected. Other rare ICE members such as the ICEVcBan8att gene and Vibrio Seventh Pandemic island detection (VSP-II Integrase, Prototypical VSP-II) were also detected. Results revealed that the 8.22% (61/742) SA-NAG-T-1/139Vc serogroup observed harbors the Vibrio Seventh Pandemic island integrase (34/61; 55.7%) and other rare genetic traits including; attB/attP (29/61; 47.5%, 14/61; 23%), integrative genetic elements (4/61; 6.56%), phage types (TLC-phagedet: 2/61; 3.28% and Kappa-phagedet: 7/61; 11.48%) as well as the integrase genes (INT1, Sul1, Sul2) (29/61: 47.5%; 21/61: 34.4%; 25/61: 41%). Such genetic detection of mobilome determinants/MIGCE suggests potential discriminatory tendencies amongst SA-NAG-T-1/139Vcwhich may be applied in mobilome typing of evolving/emerging environmental bacteria. The need to encourage the application of such mobilome typing indices and continuous study of these strains is suggestive of interest in controlling future potential emerging environmental strains.
... Dziejman et al. 50 first described VSP1 and VSP2 in seventh pandemic V. cholerae isolates and later the acquisition of these islands is explained by the phenomenon of lateral gene transfer event 51,52 . It was reported that the GC content of the classical and El Tor strains is 40%. ...
Article
Full-text available
Seventh pandemic Vibrio choleare O1 El Tor strain is responsible for the on-going pandemic outbreak of cholera globally. This strain evolved from non-pathogenic V. cholerae by acquiring seventh pandemic gene (VC 2346), pandemic Islands (VSP1 and VSP2), pathogenicity islands (VP1 and VP2) and CTX prophage region. The cholera toxin production is mainly attributed to the presence of ctx gene in these strains. However, several variants of this strain emerged as hybrid strains or atypical strains. The present study aimed to assess the aquatic environment of Cochin, India, over a period of 5 years for the emergence of multidrug resistant V. cholerae and its similarity with seventh pandemic strain. The continuous surveillance and monitoring resulted in the isolation of ctx negative, O1 positive V. cholerae isolate (VC6) from coastal water, Cochin, Kerala. The isolate possessed the biotype specific O1 El Tor tcpA gene and lacked other biotype specific ctx, zot, ace and rst genes. Whole genome analysis revealed the isolate belongs to pandemic sequence type (ST) 69 with the possession of pandemic VC2346 gene, pathogenic island VPI1, VPI2, and pandemic island VSP1 and VSP2. The isolate possessed several insertion sequences and the SXT/R391 family related Integrative Conjugative Elements (ICEs). In addition to this, the isolate genome carried virulence genes such as VgrG, mshA, ompT, toxR, ompU, rtxA, als, VasX, makA, and hlyA and antimicrobial resistance genes such as gyrA, dfrA1, strB, parE, sul2, parC, strA, VC1786ICE9-floR, and catB9. Moreover, the phylogenetic analysis suggests that the isolate genome is more closely related to seventh pandemic V.cholerae O1 N16961 strain. This study reports the first incidence of environmental ctx negative seventh pandemic V. choleare O1 El Tor isolate, globally and its presence in the aquatic system likely to induce toxicity in terms of public health point of view. The presence of this isolate in the aquatic environment warns the strict implementation of the epidemiological surveillance on the occurrence of emerging strains and the execution of flagship program for the judicious use of antibiotics in the aquatic ecosystem.
... Sequencing data were assembled using SPAdes 3.13.0 and protein-coding sequences (CDSs) were predicted by Prodigal 2.6.2 (Hyatt et al. 2010). Pan-genome orthologous groups (POGs), which contain at least one CDS, were determined by a combined reciprocal best hit method using uBLAST with an e-value threshold of 1 × 10 −6 (Ward and Moreno-Hagelsieb 2014) and an open reading frame-independent method using nucleotide sequences with cutoff values of at least 70% of gene coverage (Chun et al. 2009). Differentially present POGs were analyzed by separating the genomes into two different groups based on vancomycin susceptibility and the POGs that were uniquely present in one of two groups were extracted. ...
Article
Full-text available
This study aimed to analyze the genetic characteristics of Staphylococcus aureus with reduced vancomycin susceptibility (RVS-SA). Whole-genome sequencing was performed on 27 RVS-SA clinical isolates, and comparative genomic analysis was performed using S. aureus reference strains. Pan-genome orthologous groups (POGs) were identified that were present in RVS-SA but absent in the reference strains, but further analysis showed that the presence of these POGs was influenced by tetracycline resistance rather than vancomycin resistance. Therefore, we restricted our analysis to tetracycline-resistant (tetR) RVS-SA and tetR vancomycin-susceptible S. aureus (VSSA). Phylogenomic analysis showed them to be closely related, and further analysis revealed the presence of an uncharacterized protein SAB0394 and the absence of lytA in tetR RVS-SA, which are involved in cell wall thickening. In summary, using whole-genome sequencing we identified gain or loss of genes in tetR RVS-SA strains. These findings provide insights into the investigation of mechanisms associated with reduced vancomycin susceptibility and have the potential to contribute to the development of molecular biomarkers for the rapid and efficient detection of RVS-SA.
... However irrespective of the concentration of PB applied for susceptibility testing, there had been reports of some sensitive El tor V. cholerae strains creating a potential indeterminate situation for the biotyping scheme. Some further studies have revealed the emergence of PB sensitive El tor V. cholerae strains in addition to diverse biotyping dynamics and dual and/or atypical phenotype [3,4,[6][7][8][9][10][11][12][13][14][15]. Suffice it to say that the Lipopeptides antibiotics (polymyxin B, daptomycin, surotomycin, and colistin) have been routinely used in the management of enteric potential pathogens following the Clinical Laboratory Standard Institute (CLSI) guidelines [16]. ...
Article
Resistance/sensitivity to polymyxin-B (PB) antibiotic has been employed as one among other epidemiologically relevant biotyping-scheme for Vibrio cholerae into Classical/El Tor biotypes. However, recent studies have revealed some pitfalls bordering on PB-sensitivity/resistance (PBR/S) necessitating study. Current study assesses the PBR/S cosmopolitan prevalence, epidemiology/distribution among O1/O139 and nonO1/nonO139 V. cholerae strains. Relevant databases (Web of Science, Scopus and PubMed) were searched to retrieve data from environmental and clinical samples employing the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA). Random-effect-model (REM) and common-effect-model (CEM) of meta-analysis was performed to determine prevalence of PBR/S V. cholerae strains, describe the cosmopolitan epidemiological potentials and biotype relevance. Heterogeneity was determined by meta-regression and subgroup analyses. The pooled analyzed isolates from articles (7290), with sensitive and resistance are 2219 (30.44%) and 5028 (69.56%). Among these PB-sensitive strains, more than 1944 (26.67%) were O1 strains, 132 (1.81%) were nonO1 strains while mis-reported Classical biotype were 2080 (28.53) respectively indicating potential spread of variant/dual biotype. A significant PB-resistance was observed in the models (CEM = 0.66, 95% CI [0.65; 0.68], p-value = 0.001; REM = 0.83 [0.74; 0.90], p = 0.001) as both models had a high level of heterogeneity (I² = 98.0%; df=332=1755.09,Qp=2.4932). Egger test (z = 5.4017, p < 0.0001) reveal publication bias by funnel plot asymmetry. The subgroup analysis for continents (Asia, Africa) and sources (acute diarrhea) revealed (98% CI (0.73; 0.93); 55% CI (0.20; 0.86)), and 92% CI (0.67; 0.98). The Epidemiological prevalence for El tor/variant/dual biotype showed 88% CI (0.78; 0.94) with O1 strains at 88% CI (0.78; 0.94). Such global prevalence, distribution/spread of phenotypes/genotypes necessitates updating the decades-long biotype classification scheme. An antibiotic stewardship in the post antibiotic era is suggestive/recommended. Also, there is need for holistic monitoring/evaluation of clinical/epidemiological relevance of the disseminating strains in endemic localities.
Article
Vibrio species pose significant threats worldwide, causing mortalities in aquaculture and infections in humans. Global warming and the emergence of worldwide strains of Vibrio diseases are increasing day by day. Control of Vibrio species requires effective monitoring, diagnosis, and treatment strategies at the global scale. Despite current efforts based on chemical, biological, and mechanical means, Vibrio control management faces limitations due to complicated implementation processes. This review explores the intricacies and challenges of Vibrio-related diseases, including accurate and cost-effective diagnosis and effective control. The global burden due to emerging Vibrio species further complicates management strategies. We propose an innovative integrated technology model that harnesses cutting-edge technologies to address these obstacles. The proposed model incorporates advanced tools, such as biosensing technologies, the Internet of Things (IoT), remote sensing devices, cloud computing, and machine learning. This model offers invaluable insights and supports better decision-making by integrating real-time ecological data and biological phenotype signatures. A major advantage of our approach lies in leveraging cloud-based analytics programs, efficiently extracting meaningful information from vast and complex datasets. Collaborating with data and clinical professionals ensures logical and customized solutions tailored to each unique situation. Aquaculture biotechnology that prioritizes sustainability may have a large impact on human health and the seafood industry. Our review underscores the importance of adopting this model, revolutionizing the prognosis and management of Vibrio-related infections, even under complex circumstances. Furthermore, this model has promising implications for aquaculture and public health, addressing the United Nations Sustainable Development Goals and their development agenda.
Article
Full-text available
Horizontal gene transfer, facilitated by mobile genetic elements (MGEs), is an adaptive evolutionary process that contributes to the evolution of bacterial populations and infectious diseases. A variety of MGEs not only can integrate into the bacterial genome but also can survive or even replicate like plasmids in the cytoplasm, thus requiring precise and complete removal for studying their strategies in benefiting host cells. Existing methods for MGE removal, such as homologous recombination-based deletion and excisionase-based methods, have limitations in effectively eliminating certain MGEs. To overcome these limitations, we developed the Cas9-NE method, which combines the CRISPR/Cas9 system with the natural excision of MGEs. In this approach, a specialized single guide RNA (sgRNA) element is designed with a 20-nucleotide region that pairs with the MGE sequence. This sgRNA is expressed from a plasmid that also carries the Cas9 gene. By utilizing the Cas9-NE method, both the integrative and circular forms of MGEs can be precisely and completely eliminated through Cas9 cleavage, generating MGE-removed cells. We have successfully applied the Cas9-NE method to remove four representative MGEs, including plasmids, prophages, and genomic islands, from Vibrio strains. This new approach not only enables various investigations on MGEs but also has significant implications for the rapid generation of strains for commercial purposes. IMPORTANCE Mobile genetic elements (MGEs) are of utmost importance for bacterial adaptation and pathogenicity, existing in various forms and multiple copies within bacterial cells. Integrated MGEs play dual roles in bacterial hosts, enhancing the fitness of the host by delivering cargo genes and potentially modifying the bacterial genome through the integration/excision process. This process can lead to alterations in promoters or coding sequences or even gene disruptions at integration sites, influencing the physiological functions of host bacteria. Here, we developed a new approach called Cas9-NE, allowing them to maintain the natural sequence changes associated with MGE excision. Cas9-NE allows the one-step removal of integrated and circular MGEs, addressing the challenge of eliminating various MGE forms efficiently. This approach simplifies MGE elimination in bacteria, expediting research on MGEs.
Article
Full-text available
Cholera is an important, recurrent source of morbidity and mortality in many developing countries. Illness is caused by infection with toxigenic Vibrio cholerae O1 or O139 bacteria, most often acquired through ingestion of fecally contaminated water or food. Symptoms include nausea, vomiting, and profuse watery diarrhea. Severe disease causes rapid dehydration, is marked by loss of skin turgor and sunken eyes, and can result in death within hours if untreated. The first aim of this study was to review the prevalence of cholera infection in different governorates of Iraq during the period from 1/1 to 9/12 / 2015. Secondly, to clarify the governorates with the highest cholera incidence, and try to explain the factors behind this incidence if found. In this prospective cohort study, this was comprised of 2866 subjects out of 3547 examined cases. They were sent from different parts of Iraq, who were diagnosed with cholera infection. These cholera patients were collected during the period from January to December 2015. Bacteriology, serology and all other lab investigations were worked out in the central health laboratory in Baghdad. There were high cholera casualties' proportions in a number of Iraq governorates as Baghdad Al-Rusfa (n=627, 21.9 %%), Baghdad Al-Karkh (n=357, 12.5%), Al-Hilla (n=657, 23.6%), and Al-Diwanyia (n=445, 15.5%) compared to other parts of Iraq in the same period, and the disease seems to localize in middle and to lesser degree in southern parts of Iraq. We recommend repeating the study, in a larger frame, using more sophisticated tools, especially molecular diagnostics which have proven their value as extremely sensitive and specific techniques, that can improve the diagnosis of cholera and also help in putting a more accurate epidemiological characters of this disease.
Article
Full-text available
Isolates of Salmonella enterica serovar Typhi (Typhi), a human-restricted bacterial pathogen that causes typhoid, show limited genetic variation. We generated whole-genome sequences for 19 Typhi isolates using 454 (Roche) and Solexa (Illumina) technologies. Isolates, including the previously sequenced CT18 and Ty2 isolates, were selected to represent major nodes in the phylogenetic tree. Comparative analysis showed little evidence of purifying selection, antigenic variation or recombination between isolates. Rather, evolution in the Typhi population seems to be characterized by ongoing loss of gene function, consistent with a small effective population size. The lack of evidence for antigenic variation driven by immune selection is in contrast to strong adaptive selection for mutations conferring antibiotic resistance in Typhi. The observed patterns of genetic isolation and drift are consistent with the proposed key role of asymptomatic carriers of Typhi as the main reservoir of this pathogen, highlighting the need for identification and treatment of carriers.
Article
Full-text available
This communication reports the development of an efficient in vitro transposition system for Tn5. A key component of this system was the use of hyperactive mutant transposase. The inactivity of wild type transposase is likely to be related to the low frequency of in vivotransposition. The in vitro experiments demonstrate the following: the only required macromolecules for most of the steps in Tn5 transposition are the transposase, the specific 19-bp Tn5 end sequences, and target DNA; transposase may not be able to self-dissociate from product DNAs; Tn5 transposes by a conservative “cut and paste” mechanism; and Tn5release from the donor backbone involves precise cleavage of both 3′ and 5′ strands at the ends of the specific end sequences.
Article
Full-text available
Episodes of cholera stemming from indigenous Vibrio cholerae strains in Australia are mainly associated with environmental sources. In the present study, 10 V. cholerae O1 strains of Australian origin were characterized. All of the strains were serogroup O1 and their conventional phenotypic traits categorized them as belonging to the El Tor biotype. Genetic screening of 12 genomic regions that are associated with virulence in V. cholerae showed variable results. Analysis of the ctxAB gene showed that the Australian environmental reservoir contains both toxigenic and non-toxigenic V. cholerae strains. DNA sequencing revealed that all of the toxigenic V. cholerae strains examined were of ctxB genotype 2. Whole genome PFGE analysis revealed that the environmental toxigenic V. cholerae O1 strains were more diverse than the non-toxigenic environmental O1 strains, and the absence of genes that make up the Vibrio seventh pandemic island-I and -II in all of the strains indicates their pre-seventh pandemic ancestry.
Article
Full-text available
The causative agent of cholera, Vibrio cholerae, has been shown to be autochthonous to riverine, estuarine, and coastal waters along with its host, the copepod, a significant member of the zooplankton community. Temperature, salinity, rainfall and plankton have proven to be important factors in the ecology of V. cholerae, influencing the transmission of the disease in those regions of the world where the human population relies on untreated water as a source of drinking water. In this study, the pattern of cholera outbreaks during 1998-2006 in Kolkata, India, and Matlab, Bangladesh, and the earth observation data were analyzed with the objective of developing a prediction model for cholera. Satellite sensors were used to measure chlorophyll a concentration (CHL) and sea surface temperature (SST). In addition, rainfall data were obtained from both satellite and in situ gauge measurements. From the analyses, a statistically significant relationship between the time series for cholera in Kolkata, India, and CHL and rainfall anomalies was determined. A statistically significant one month lag was observed between CHL anomaly and number of cholera cases in Matlab, Bangladesh. From the results of the study, it is concluded that ocean and climate patterns are useful predictors of cholera epidemics, with the dynamics of endemic cholera being related to climate and/or changes in the aquatic ecosystem. When the ecology of V. cholerae is considered in predictive models, a robust early warning system for cholera in endemic regions of the world can be developed for public health planning and decision making.
Article
Full-text available
The El Tor biotype of Vibrio cholerae O1, causing the current seventh pandemic of cholera, has replaced the classical biotype, which caused the sixth pandemic. The CTX prophages encoding cholera toxin in the two biotypes have distinct repressor (rstR) genes. Recently, new variants of El Tor strains that carry the classical type (CTXclass) prophage have emerged. These “hybrid” strains apparently originate through lateral gene transfer and recombination events. To explore possible donors of the CTXclass prophage and its mode of transfer, we tested environmental V. cholerae isolates for the presence of CTXclass prophage and mobility of the phage genome. Of the 272 environmental V. cholerae isolates tested, 6 were found to carry the CTXclass prophage; all of these belonged to the O141 serogroup. These O141 strains were unable to produce infectious CTXclass phage or to transmit the prophage to recipient strains in the mouse model of infection; however, the CTXclass prophage was acquired by El Tor strains when cultured with the O141 strains in microcosms composed of filtered environmental water, a chitin substrate, and a V. cholerae O141-specific bacteriophage. The CTXclass prophage either coexisted with or replaced the resident CTXET prophage, resulting in El Tor strains with CTX genotypes similar to those of the naturally occurring hybrid strains. Our results support a model involving phages and natural chitin substrate in the emergence of new variants of pathogenic V. cholerae. Furthermore, the O141 strains apparently represent an alternative reservoir of the CTXclass phage genome, because the classical V. cholerae O1 strains are possibly extinct. • hybrid Vibrio cholerae strain • toxigenic Vibrio cholerae
Article
Full-text available
Whole-genome sequencing has been skewed toward bacterial pathogens as a consequence of the prioritization of medical and veterinary diseases. However, it is becoming clear that in order to accurately measure genetic variation within and between pathogenic groups, multiple isolates, as well as commensal species, must be sequenced. This study examined the pangenomic content of Escherichia coli. Six distinct E. coli pathovars can be distinguished using molecular or phenotypic markers, but only two of the six pathovars have been subjected to any genome sequencing previously. Thus, this report provides a seminal description of the genomic contents and unique features of three unsequenced pathovars, enterotoxigenic E. coli, enteropathogenic E. coli, and enteroaggregative E. coli. We also determined the first genome sequence of a human commensal E. coli isolate, E. coli HS, which will undoubtedly provide a new baseline from which workers can examine the evolution of pathogenic E. coli. Comparison of 17 E. coli genomes, 8 of which are new, resulted in identification of ∼2,200 genes conserved in all isolates. We were also able to identify genes that were isolate and pathovar specific. Fewer pathovar-specific genes were identified than anticipated, suggesting that each isolate may have independently developed virulence capabilities. Pangenome calculations indicate that E. coli genomic diversity represents an open pangenome model containing a reservoir of more than 13,000 genes, many of which may be uncharacterized but important virulence factors. This comparative study of the species E. coli, while descriptive, should provide the basis for future functional work on this important group of pathogens.
Article
A new method called the neighbor-joining method is proposed for reconstructing phylogenetic trees from evolutionary distance data. The principle of this method is to find pairs of operational taxonomic units (OTUs [= neighbors]) that minimize the total branch length at each stage of clustering of OTUs starting with a starlike tree. The branch lengths as well as the topology of a parsimonious tree can quickly be obtained by using this method. Using computer simulation, we studied the efficiency of this method in obtaining the correct unrooted tree in comparison with that of five other tree-making methods: the unweighted pair group method of analysis, Farris's method, Sattath and Tversky's method, Li's method, and Tateno et al.'s modified Farris method. The new, neighbor-joining method and Sattath and Tversky's method are shown to be generally better than the other methods.
Article
Several studies have shown that the emergence of the O139 serogroup of Vibrio cholerae is a result of horizontal gene transfer of a fragment of DNA from a serogroup other than O1 into the region responsible for O-antigen biosynthesis of the seventh pandemic V. cholerae O1 biotype El Tor strain. In this study, we show that the gene cluster responsible for O-antigen biosynthesis of the O139 serogroup of V. cholerae is closely related to those of O22. When DNA fragments derived from O139 O-antigen biosynthesis gene region were used as probes, the entire O139 O-antigen biosynthesis gene region could be divided into five classes, designated as I–V based on the reactivity pattern of the probes against reference strains of V. cholerae representing serogroups O1–O193. Class IV was specific to O139 serogroup, while classes I–III and class V were homologous to varying extents to some of the non-O1, non-O139 serogroups. Interestingly, the regions other than class IV were also conserved in the O22 serogroup. Long and accurate PCR was employed to determine if a simple deletion or substitution was involved to account for the difference in class IV between O139 and O22. A product of approx. 15 kb was amplified when O139 DNA was used as the template, while a product of approx. 12.5 kb was amplified when O22 DNA was used as the template, indicating that substitution but not deletion could account for the difference in the region between O22 and O139 serogroups. In order to precisely compare between the genes responsible for O-antigen biosynthesis of O139 and O22, the region responsible for O-antigen biosynthesis of O22 serogroup was cloned and analyzed. In concurrence with the results of the hybridization test, all regions were well conserved in O22 and O139 serogroups, although wbfA and the five or six genes comprising class IV in O22 and O139 serogroups, respectively, were exceptions. Again the genes in class IV in O22 were confirmed to be specific to O22 among the 155 ‘O’ serogroups of V. cholerae. These data suggest that the gene clusters responsible for O139 O-antigen biosynthesis are most similar to those of O22 and genes within class IV of O139, and O22 defines the unique O antigen of O139 or O22.