ArticlePDF Available

The Bacilus subtilis genome from gerBC (311°) to licR (334°)

Authors:

Abstract and Figures

As part of the international project to sequence the Bacillus subtilis genome, the DNA region located between gerBC (311°) and licR (334°) was assigned to the Institut Pasteur. In this paper, the cloning and sequencing of 176 kb of DNA and the analysis of the sequence of the entire 271 kb region (6.5% of the B. subtilis chromosome) is described; 273 putative coding sequences were identified. Although the complete genome sequences of seven other organisms (five bacteria, one archaeon and the yeast Saccharomyces cerevisiae) are available in public databases, 65 genes from this region of the B. subtilis chromosome encode proteins without significant similarities to other known protein sequences. Among the 208 other genes, 115 have paralogues in the currently known B. subtilis DNA sequences and the products of 178 genes were found to display similarities to protein sequences from public databases for which a function is known. Classification of these genes shows a high proportion of them to be involved in the adaptation to various growth conditions (non-essential cell wall constituents, catabolic and bioenergetic pathways); a small number of the genes are essential or encode anabolic enzymes.
Content may be subject to copyright.
Downloaded from www.microbiologyresearch.org by
IP: 45.45.132.90
On: Thu, 25 Aug 2016 08:45:06
Microbiology
(1997), 143,3313-3328
Printed in Great Britain
The
Bacillus
subtilis
genome from
gerBC
(31
1
")
to
lid?
(334")
E.
Presecan,'
I.
Moszer,'
L.
Boursier,l
H.
Cruz Ramos,'
V.
de
la
Fuente,2
M.-F. Hullo,'n2
C.
Lelong,'
5.
Schleich,'
A.
Sekowska,'
B.
H.
Song,'t
G.
Villani,'
F.
Kunst,*
A.
Danchin' and
P.
Glaser'
Author for correspondence: Philippe Glaser. Tel:
+33
1
45
68
84
41.
Fax:
+33
1
45
68
89
48.
e-mail
:
pglaser@pasteur.fr
~~
1~
Unite de Regulation de
"Expression
Genetiquel,
and Unite de Biochirnie
Microbienne2.
lnstitut
As
part
of
the international project
to
sequence the
Bacillus
subtilis
genome,
the DNA region located between
gerBC
(31
1
")
and
licR
(334")
was assigned
to
the lnstitut Pasteur.
In
this paper, the cloning and sequencing
of
176
kb
Of
DNA
Pasteur,
2s'
rue
Docteur Roux, 75724
Cedex 15, France
du
Paris
and the analysis
of
the sequence
of
the entire
271
kb
region
(65%
of
the
B.
subtilis
chromosome) is described;
273
putative coding sequences were
identified. Although the complete genome sequences
of
seven other
organisms (five bacteria, one archaeon and the yeast
Saccharomyces
cerevisiae)
are available
in
public databases,
65
genes
from
this region
of
the
B.
subtilis
chromosome encode proteins
without
significant similarities
to
other
known
protein sequences. Among the
208
other genes,
115
have
paralogues
in
the currently
known
B.
subtilis
DNA sequences and the products
of
178
genes were
found
to
display similarities
to
protein sequences
from
public databases
for
which a
function
is
known.
Classification
of
these genes
shows a
high proportion
of
them
to
be involved
in
the adaptation
to
various
growth
conditions (non-essential cell wall constituents, catabolic and
bioenergetic pathways); a small
number
of
the genes are essential
or
encode
anabolic enzymes.
Keywords
:
genome sequencing,
Bacillus subtilis,
intergenic repeated element,
orthologues/paralogues, spore coat
INTRODUCTION
The systematic sequencing of the chromosome of the
Gram-positive bacterium
Bacillus subtilis
is being con-
ducted by an international consortium of European and
Japanese laboratories (Kunst
et al.,
1995;
Ogasawara
&
Yoshikawa,
1996).
By April
1997,95
%
of the sequence
was determined, and the analysis of several long regions
has been published (e.g. Ogasawara
et al.,
1994;
Albertini
et al.,
1995;
Mizuno
et al.,
1996;
Soldo
et al.,
1996;
Sorokin
et al.,
1996;
Wipat
etal.,
1996;
Yoshida
et
al.,
1996).
The completion of this genome sequence will
be the first complete genetic determination of a sporulat-
ing
Bacillus
sp. with a high level
of
competence and
recombination potential. As for the other completely
t
Present address:
Department
of
Biology, Teachers College, Kyung-
Pook
University, TAEGU,
702-701,
South Korea.
Abbreviation
:
PTS,
phosphotransferase system.
The EMBUGenBanWDDBJ accession numbers for the sequence reported in
this paper are
249782, 249792, 249884, 280355, 282987, 283337, 292952,
292953,292954
and
293767.
known genomes, the Gram-negative bacteria
Haemo-
philus influenzae
(Fleischmann
et al.,
1995)
and
Escheri-
chia coli
(O'Brien,
1997),
the mycoplasmas
Mycoplasma
genitalium
(Fraser
et al.,
1995)
and
Mycoplasma pneu-
moniae
(Himmelreich
et al.,
1996),
the cyanobacterium
Synechocystis
PCC
6803
(Kaneko
et al.,
1996),
the
archaeon
Methanococcus jannaschii
(Bult
et al.,
1996)
and the yeast
Saccharomyces cereuisiae
(Goffeau
et al.,
1996),
about one-third
of
the genes in the
B. subtilis
chromosome are 'unknown' genes (Moszer
et al.,
1996),
i.e. they encode proteins without similarities to proteins
with a known function. To elucidate their role, an
original international research project
of
systematic
functional analysis was set up, using the approach of
reverse genetics for which
B.
subtilis
is particularly
suited (Harwood
&
Wipat,
1996).
Accessibility of the
whole genetic information
of
an organism associated
with the knowledge
of
other genome sequences opens
new possibilities for sequence analysis, for example
classification
of
genes according to their codon usage
(Moszer
et al.,
1995).
It becomes possible
to
study the
distribution
of
nucleotide motifs along the chromosome
0002-1 790
0
1997
SGM
3313
Downloaded from www.microbiologyresearch.org by
IP: 45.45.132.90
On: Thu, 25 Aug 2016 08:45:06
E. PRESECAN
and
OTHERS
333" 330" 326" 324"
and thereby get insights into regulatory processes
(HCnaut
et
al.,
1996), and to compare chromosomal
gene clusters conserved during evolution in different
organisms. These analyses may reveal functions for
unknown genes and new functions for already known
genes (Danchin, 1996). For this kind
of
analysis ap-
propriate computer tools will be required.
Several lines
of
investigation have been opened or
speeded up as a direct consequence of the recent
availability of
B.
subtilis
sequence data. An interesting
area
of
research in
B.
subtilis
is the sporulation pathway
(for a recent review see Stragier
&
Losick, 1996). The
knowledge
of
the whole genome sequence will con-
tribute to the discovery
of
new genes involved in this
bacterial cell differentiation process. In particular, sev-
eral important sporulation loci were discovered in the
described region (Perego
et
al.,
1994, 1996; Londoiio-
Vallejo
&
Stragier, 1995; Karow
et
al.,
1995).
This paper describes the features of the
B.
subtilis
chromosomal region initially assigned
to
our group
(Kunst
et
al.,
1995). This 271 kb region between
gerBC
(311") and
licR
(334") covers
6.5
YO
of the chromosome.
It includes a region
of
97 kb which has been sequenced
and published (Glaser
et
al.,
1993).
METHODS
Bacterial strains.The same isolate
of
B.
subtilis
strain 168
provided by
C.
Anagnostopoulos is used by all the participant
groups in the European
B.
subtilis
genome sequencing project.
This strain was the source for the chromosomal DNA used to
construct a lambda library (Kunst
&
Devine, 1991) and for all
the other cloning methods. Direct cloning by plasmid rescue
was performed in the
E.
coli
pcnB
strain TP611 (Glaser
et
af.,
1993). Strain P2392 was used for propagation
of
the lambda
library (Silhavy
et al.,
1984).
E.
coli
strains used for DNA
sequencing were XL-1 Blue (Bullock
et al.,
1987) for dsDNA
preparations and TG1 for ssDNA preparations (Gibson,
1984).
Media. Luria-Bertani (LB) medium was used for standard
cultures of
B.
subtilis
and
E.
coli
(Miller, 1972). 2YT medium
was used for M13 recombinant phage preparation. Lambda
stocks were prepared from LB plates
(15
g agarose
1-')
using
LB overlays containing
7
g agarose
1-l.
Antibiotics were added
to the following final concentrations, when necessary: for
E.
coli,
100
mg ampicillin
1-1
;
for
B.
subtilis,
5
mg chlorampheni-
col l-', 10 mg erythromycin
1-l.
Phages and plasmids. Gene libraries of
B.
subtilis
DNA were
constructed in Lambda FIX I1 (Stratagene) (Kunst
&
Devine,
1991). DNA sequences were determined from subclones in the
phage M13mp8 (Messing
&
Vieira, 1982) or in pUC18
(Yanisch-Perron
et
af.,
1985). The plasmid rescue method was
performed after an initial cloning step in pDIA5304 (Glaser
et
al.,
1993) or pDIA5305. pDIAS305 was obtained by insertion
of the pC194
cat
gene (Horinouchi
&
Weisblum, 1982) into
the unique NaeI restriction site
of
pMTL22 (Chambers
et
al.,
1988). The high-copy-number
B.
subtilis
replicon pMTL5OOE
(Swinfield
et
al.,
1990) was used for marker rescue in
B.
subtilis.
DNA
manipulations.
E.
coli
transformation was generally
performed as described by Chung
&
Miller (1988). Shotgun
libraries in M13mp8 or in pUC18 were used
to
transform
E.
coli
XL-1 Blue
as
described by Hanahan (1983). Recombinant
plasmids were transferred to
E.
coli
TP611 by the calcium
chloride transformation method (Sambrook
et
al.,
1989).
B.
subtilis
cells were transformed as described by Kunst
&
Rapoport (1995). Southern blots, and plaque and colony
transfers were performed as described by Sambrook
et
af.
(1989). Membranes were further hybridized with non-radio-
actively labelled probes (Boehringer DIG-dUTP labelling).
-
-
-
(Glaser
et
ul.,
1993)
111111111111111111111111111-111-1111-
5348
5363 5364 5365
narA glyC
spoIID
gerB
spoIZZD
316" 314" 312
140 160 180 200 220 240 260 280
NotI
,
NotI NotI
I
sp1
I
Sfi'
pfi1
I
I
I
5353 5354 5329
hnarA
pPP41 5355 5366 5367 5368 5369 5370 5371 5372
..................
*
...........
......
.....................................................
...
..........
.
............
.
...............
.
.......................
.
.....................
...
................
.
.......
.
.......
.
...................
.
....................................
.
................................
...
Fig.
1.
Cloning
of
the
licR-gerBC
region. The thin upper line represents the genetic map with divisions representing
chromosomal degrees. Relevant genetic markers are indicated. Underneath this line, the physical map is represented by a
thicker line. Restriction site positions are deduced from the sequence and are in good agreement with the physical
mapping
of
ltaya
&
Tanaka
(1991).
The thick dotted line represents the
97
kb region previously sequenced (Glaser
et
a/.,
1993).
The short dotted line represents the single
B.
subtilis
plasmid derivative. Numbers refer
to
the
f.
coli
pDlA plasmids
used
for
sequencing the newly sequenced region.
3314
Downloaded from www.microbiologyresearch.org by
IP: 45.45.132.90
On: Thu, 25 Aug 2016 08:45:06
The
311"-334"
B.
subtilis
chromosomal region
Table
I.
Boundaries of the
17
cloned fragments
Plasmid/phage Beginning' End'
pDIA
5363
97 kb"
pDIA
5364
pDIA 5365
pDIA
5348
pDIA
5353
pDIA 5354
pDIA 5355
pDIA
5329
harA
pDIA 5366
pDIA 5367
pDIA 5368
pDIA 5369
pDIA 5370
pDIA 5371
pDIA
5372
pPP4lb
1
(XbaI)
6 543
101
448
(EcoRV)
107788 (NotI)
126 849 (XbaI)
142069 (KpnI)
151 210 (BglII)
156207 (PstI)
162280 (PstI)
167912
(NotI)
177261 (Sau3A)
194576 (HincII)
204509 (PstI)
214619 (SzuaI)
227 320 (SacII)
240 641 (BglII)
253 248 (KpnI)
263 330 (SacII)
7851 (EcoRV)
103 557
108016 (HindIII)
128 777 (EcoRV)
142330 (SspI)
152 949 (EcoRI)
157 5
18
(BglII)
166327
(SalI)
168 828 (EcoRV)
178 399 (HindIII)
195 141 (Sau3A)
205 685 (SacI)
216491 (KpnI)
228 235 (XbaI)
241 670 (ApaI)
255 853 (SacI)
263 953 (EcoRI)
271 513 (PstI)
*First
base
of
the
restriction site.
References: "Glaser
et
al.
(1993); bTrach
et
al.
(1988).
Chromosome walking experiments involved either a plasmid
rescue method, using integrative plasmids (Niaudet
et
al.,
1982), or a marker rescue method after chromosomal in-
tegration of a selective marker (Cm") (Glaser
et
al., 1993).
Fifteen walking steps (Fig.
1
and Table
1)
were performed as
follows.
A
single walking step was performed from the
ywaA
end of the 97 kb contig. The resulting plasmid (pDIA5363)
contains the whole lic operon (Tobisch
et
al., 1997). From
plasmid pPP41 containing the SPOOF gene (Trach
et
al., 1988),
four successive walking steps were performed in the direction
of the pta gene, leading to plasmids pDIA5353, pDIA5348,
pDIA5365 and pDIA5364. pDIA5364 has a 2.1 kb overlap with
the 97 kb contig. In the other direction a single walking step
was possible (pDIA5354). From InarA, only a single step
could be performed in the direction of the pPP41 insert
(pDIA5329). The region (only 1.6 kb long) between the
pDIA5354 and pDIA5329 inserts could not be cloned into
E.
coli
by
this method. It was therefore cloned in
B.
subtilis using
the pMTL5OOE plasmid (pDIA.5355). From the other end of
IZnarA,
five successive walking steps were performed, giving
plasmids pDIA5366-5370. Finally, two walking steps were
performed from the cloned
gerBC
gene, giving plasmids
pDIA5372 and pDIA5371. Plasmids pDIA5370 and pDIA5371
overlap by 2.6 kb.
Shotgun cloning and
DNA
sequencing.
Fifteen shotgun
sequencing experiments were performed independently. The
corresponding strategy has already been described (Moszer
et
al., 1991
;
Glaser
et
al., 1993).
Data handling and computer analysis.
DNA sequences were
compiled using the
XBAP
program of R. Staden (Dear
&
Staden, 1991). Sequences were analysed using the DNA Strider
1.1
software (Marck, 1988). The
BLAST
program (Altschul
et
al.,
1990) was used to search for similar sequences in the non-
redundant database
(5
April 1997) at the Institut Pasteur
computer centre. The
CLUSTAL
w
program was used for the
multiple alignment of the repeated DNA element (Higgins
et
al., 1992).
B.
subtilis sequences were handled using the
SubtiList
database (Moszer
et
al., 1995).
RESULTS
AND
DISCUSSION
Cloning and sequencing
of
the
gerBC-licR
region
We have previously described the cloning
of
a
97
kb
region from
ywaA
to
ywf0
as four lambda clones, three
E.
coli
plasmids and two
B.
subtilis
plasmids (Glaser
et
al.,
1993).
For the remaining part
of
the assigned region,
one fragment was cloned as a lambda insert isolated
using the cloned
narA
gene as a probe
(AnarA)
(Santana
et al.,
1994).
To
obtain the rest
of
the region, a plasmid
rescue walking strategy was used (Glaser
et al.,
1993).
Successive walking steps were performed, starting from
ends
of
the cloned fragments
of
the region as described
in Methods. The new regions were obtained as
15
E.
coli
plasmids, one lambda clone and one
B.
subtilis
plasmid
(Fig.
1
and Table
1).
The sequences
of
15
fragments were
determined after shotgun subcloning. Only the
1.6
kb
region
of
plasmid pDIA53.55 was sequenced
by
directed
subcloning and primer walking. The extensively studied
SPOOF
region (Trach
et al.,
1988)
was only partially
resequenced. The chromosomal region from
licR
to
gerBC
is
271518
bp long and overlaps the regions
sequenced in Dr
Y.
Fujita's laboratory in Hiroshima
(Japan) (Yoshida
et al.,
1996)
and in Dr
D.
Karamata's
laboratory in Lausanne (Switzerland)
(Soldo
et al.,
1996).
Identification of coding sequences and general
analysis
The putative coding sequences in the six reading frames
were identified according
to
previously described rules
(Glaser
et al.,
1993),
and
273
coding sequences were
predicted. They were
39
codons
(phrF)
to
1228
codons
(narG)
long with a mean length
of
875
bp (Fig.
2).
Compared
to
other systematic sequencing projects, a
relatively large number
of
small genes was predicted
:
19
genes were shorter than
100
codons.
Of
these,
10
encode
proteins with known function
(dltC, rpmE, atpE,
spoZZZD,
sbo)
or
similar
to
known
proteins
(ywbE,
ywhB, phrF, ywmG, ywql),
and for nine
(ywcE, ywdA,
ywhR, ywjc,
ywkF,
ywmE, ywq0, ywsA, ywtC)
no
counterpart could be identified in the databases.
A
majority
of
the genes are transcribed in the direction
of
the replication fork movement
(189
of
273, 69%),
but
their distribution is not uniform. Indeed, three regions
with a majority
of
genes in the opposite direction could
be defined (from
ywaA
to
ywbH,
from
ywhB
to
sbo,
and
from
ywqM
to
rsbR;
see Fig.
2).
Because
of
the limited
size
of
the region, the significance
of
this observation
could not be assessed and analysis awaits the complete
genome sequence.
Operon organization and non-coding regions
The
273
coding sequences were grouped into
147
putative transcription units by analysis
of
the intergenic
regions, mainly by the identification
of
the putative
3315
Downloaded from www.microbiologyresearch.org by
IP: 45.45.132.90
On: Thu, 25 Aug 2016 08:45:06
E. PRESECAN
and
OTHERS
a
WE4
Fig.
2.
Coding sequence organization in the
B.
subtilis
licR-gerBC
chromosomal region.
A
number was assigned to each
gene corresponding to
its
rank in the sequence. Genes are represented by arrows with different fill-in motifs, to
distinguish them according to their classification (see Table
2):
.,
cell
envelope and cellular processes; M,
intermediary meta bolism;
,
information pathways;
m,
other functions;
m,
miscellaneous;
m,
unknown genes similar to
other genes of unknown function;
0,
unknown genes without similarities. The direction of the replication complex
is
from left to right. Putative transcription terminators are indicated by
9.
3316
Downloaded from www.microbiologyresearch.org by
IP: 45.45.132.90
On: Thu, 25 Aug 2016 08:45:06
The
311'-334'
B.
subtilis
chromosomal region
transcription terminators and by taking into account
their length. Eighty-two of these transcription units
were predicted to consist of a single gene. Fifty-seven
may be involved in translational coupling as assessed
from the relative positions of the ribosome-binding site
and the upstream gene stop codon. The longest operon
is the
sps
operon, which is comprised of
11
genes. In the
region, several operons of known function, including
the
atp
(Santana
et al.,
1994),
the
narGHJZ
(Cruz Ramos
et al.,
1995)
and the
qox
(Santana
et al.,
1992)
operons,
have a conserved organization in a broad range
of
bacteria. However, in the case of the urease operon, only
the structural genes
ureABC
were found, and genes
encoding nickel accessory proteins, required for nickel
incorporation into the enzyme, are missing (Cruz Ramos
et al.,
1997).
In the thermophilic
Bacillus
sp. strain TB-
90,
these genes
(ureEFGDH)
map downstream from the
ureC
gene in the
ure
operon (Maeda
et al.,
1994).
In
some cases, such as
ywbHG
in
B.
subtilis
and
yohJK
in
E.
coli,
the transcriptional organization of unknown
genes is also conserved. Three genes with unknown
function,
ywqH, ywqI
and
ywqJ,
are similar to three
other
B.
subtilis
genes,
yxiB,
yxiC
and
yxiD,
respectively,
and in both cases the three genes seem to be organized in
an operon.
As for other long segments of the
B.
subtilis
chromo-
some, the overall coding capacity of the region is high
(88
%
)
. Only
10
intergenic regions are longer than
500
bp. In seven cases they are located between divergent
coding sequences:
dltA
and
ywaB
(613
bp),
ywcB
and
ywcC
(957
bp),
upr
and
ywcZ
(530
bp),
ywfN
and
ywf0
(601
bp),
ywmA
and
ywmB
(529
bp),
ywqB
and
ywqC
(581
bp), rbsR and
ywsC
(784
bp), and in three cases
between genes in the same orientation
:
galT
and
qoxA
(835
bp),
mmr
and
thrZ
(595
bp),
ywmG
and
ureA
(1000
bp). Most of these regions may be regulatory.
Alternatively, they may contain short genes devoid of
typical ribosome-binding site sequences, or may be
transcribed as non-translated
RNA.
Systematic comparison with known nucleotide se-
quences using the intergenic regions as query sequences
revealed that two intergenic regions
(ywmE-narQ
and
ywbN-ywbO)
contained a repeated element. This se-
quence was found six times in the publicly available
B.
subtilis
sequences (Fig.
3).
This element, named Bs-rep
was described in
1994
as being repeated three times in
the sequences available at that time (Popham
&
Setlow,
1994).
The six repeats are located in less than one-third
of the chromosome (from position
3230
kb
to position
513
kb) and are all in the same orientation with respect
to the replication fork movement. Comparisons with
sequence databases show that this element is extremely
well conserved in two sequences from the phylogen-
etically closely related bacterium
:
Bacillus licheni-
formis
(Fig.
3).
Furthermore, two classes could be
distinguished on the basis
of
sequence similarities
:
ydbT-ydcA
and
ywmE-narQ
in one group and the
remaining six sequences including the two
B.
Zicheni-
formis
sequences in the other. The nature (IS element
or scar of a mobile element) and the possible role of this
repeated sequence are unknown.
Gene function analysis
At the beginning
of
the sequencing project, only eight
genes were known in this region of the
B.
subtilis
chromosome. Since then, independent of our sequencing
project,
25
other genes have been characterized. Most
are involved in two aspects
of
bacterial life
:
carbon and
nitrogen source utilization and sporulation.
sacXY
(Zukowski
et al.,
1990)
and
sacTPA
(Fouet
et al.,
1986,
1987)
encode proteins involved in saccharose metab-
olism. The
rbs
operon is involved in ribose utilization
(Woodson
&
Devine,
1994).
Two exocellular proteases
are encoded by the
epr
(Bruckner
et al.,
1990)
and
upr
(Sloma
et al.,
1991)
genes. The
als
operon
(alsRSD;
Renna
et
al.,
1993)
encodes proteins required for the
synthesis
of
acetoin as a by-product, and the
nrgAB
operon (Wray
et al.,
1994)
encodes an ammonium
permease (NrgA). Seven sporulation genes have been
analysed:
SPOOF
(Yoshikawa
et
al.,
1986),
spoZZQ
Fig.
3.
Alignment of Bs-rep repeats. Gene
names indicate the genes bracketing each of
the repeated elements.
BI,
B.
licheniformis.
8.
subtilis sequences were extracted from
the SubtiList database (Moszer et
a/.,
1995)
and those of
B.
lichenifomis
from EMBL
(accession nos
280222
and
D31955).
Strictly
conserved nucleotides found in at least four
sequences are represented as black boxes.
3317
Downloaded from www.microbiologyresearch.org by
IP: 45.45.132.90
On: Thu, 25 Aug 2016 08:45:06
E.
PRESECAN
and
OTHERS
(a) Cots
(381
aa) CotG
(195
aa)
100 200 300
1
1
0
-1
-2 -2
-2
-2
-3
-3 -3
-3
-4
-4-4
-4
100 200 300 100
100
(b)
A A A A
B
B
B B
100 200
300
100
(c)
253
SSKSYSSSKSSKR SLKSSDDQSSKSGR
SSRSKSSSKSSKR SLKSSDYQSSKSGR
SSRSKSSSKSSKR SLKSSDYQSSK
SSKR SPRSSDYQSSR345
36
HKKSHRTHKKSRS HKKSYCSHKKSRS
HKKSFCSHKKSRS HKKSYCSHKKSRS
HKKSYRSHKKSRS YKKSYRSYKKSRS
YKKSCRSYKKSRS YKKSYCSHKKKSRS
YKKSCRTHKKSYRS
54
Consensus sequence: Consensus sequence:
KK
LK
D
SSRSySSSKSSKR
S
pRSSDyQSSKSGR
Fig.
4.
Sequence comparisons between the
cotB
and the
cotG
genes. (a) Kyte-Doolittle plots
of
the amino acid
hydropathy (Kyte
&
Doolittle,
1982).
(b) Distribution
of
the acidic (A) and basic
(B)
amino acids. (c) Alignments
of
the
13
or
14
amino acid repeated sequences
of
each protein and the corresponding consensus sequences.
(Londoiio-Vallejo
et
al.,
1997),
spollD (Lopez-Diaz
et
al.,
1986),
spolIlD (Kunkel
et
al.,
1989),
cotG
(Sacco
et
al.,
1995),
cotH
(Naclerio
et
al.,
1996)
and
cotB
(Donovan
et
al.,
1987).
Only the first
72
codons
of
cot&
which encodes a
protein located in the outer coat of the spore, were
known. The analysis
of
its complete sequence (Fig.
4)
revealed that this protein may have two domains. The
150
residues C-terminal domain is hydrophilic and has a
low complexity organization (two types of repeated unit
of
13
and
14
amino acids, respectively), similar
to
that
of
CotG (Sacco
et
al.
,
1995
;
Fig.
4).
Although the sequence
identity is low, both types
of
repeated unit are serine-
rich and highly charged. The similar organization may
reflect a conserved structural domain in both coat
proteins. It has been reported that the CotB protein is
absent from a
cotG
mutant strain (Sacco
et
al.,
1995).
It
is therefore possible that both proteins are part
of
a
complex or molecular assembly involving these repeated
structures.
Eight other genes with various activities from the
ZicR-gerBC
region have been described by other groups.
They determine or encode methylenomycin A resistance
(mmr;
Putzer
et
al.,
1992),
one
of
the two fructose-
bisphosphate aldolases (fbaA
;
Trach
et
al.,
1988
;
Mitchell
et
al.,
1992),
the minor threonine-tRNA syn-
thetase
(thrZ;
Putzer
et
al.,
1990),
the non-essential
RNA polymerase
6
subunit (rpoE; Lampe
et
a!.,
1988),
the transcription termination factor Rho
(rho
;
Quirk
et
al.,
1993),
the CTP synthase (ctrA; Trach
et
al.,
1988),
the thymidine kinase
(tdk;
Trach
et
al.,
1988)
and a
morphogene gene
(mbl;
Abhayawardhane
&
Stewart,
1995).
Fifty-four genes discovered during the systematic se-
quencing of this part
of
the chromosome were analysed
either in our laboratory, or in collaboration with other
groups. Several loci are involved in energy conversion
:
the atp operon (encoding the nine subunits
of
the F,F,
ATPase
;
Santana
et
al.,
1994),
the
qox
operon (encoding
the four subunits
of
the major aa,-type terminal oxidase
;
Santana et al.,
1992),
the narGHJI, narK and narA genes
(involved in the utilization
of
nitrate as an alternative
terminal electron acceptor; Cruz Ramos
et
al.,
1995;
Glaser
et
al.,
1995),
and
fnr
(a regulatory gene for
anaerobic respiration; Cruz Ramos
et
al.,
1995).
Five
loci are involved in carbon and nitrogen source utiliza-
tion: lichenan degradation (the
lic
operon; Tobisch
et
al.,
1997),
fatty acid utilization (acyl-CoA dehydro-
genase, acdA; V. de la Fuente and others, unpublished),
acetate production as a by-product (phosphotrans-
acetylase, pta
;
E.
Presecan and others, unpublished)
,
urea utilization (the
ure
operon; Cruz Ramos
et
al.,
1997),
and arginine, ornithine and proline degradation
(the
roc
operon
;
Calogero
et
al.,
1994).
The
dlt
and the
sps
operons are involved in the synthesis
of
cell wall
components
:
D-alanylation
of
cell wall lipoteichoic and
3318
Downloaded from www.microbiologyresearch.org by
IP: 45.45.132.90
On: Thu, 25 Aug 2016 08:45:06
The
311"-334"
23.
subtilis
chromosomal region
Table
2.
Classification of the genes of the
B.
subtilis
licR-gerBC
region
Gene Product Total no.
BLAST
Genes encoding similar protein/similar gene Function/activity
of
the gene product
or
Gene
name size paralogues score products* similar protein
n0.t
(a4
1.
Cell envelope and cellular processes
1.1.
Cell wall components
dltE
252 9 24e-21
7.5e
-
18
dltD
392
0
1.6e
-
42
dltC
78
0
1.6e
-
3
1
dltB
39s
0
3.Se
-
159
dltA
503 10 2.3e
-
153
ywhK
45
1
1
4*4e
-
20
1.3e
-
137
ywhL
458
1
6.9e
-
20
7.4e- 138
murA
436
1
54e
-
139
mbl
313 2 94e- 182
YWqC
248
1
1.0e
-
56
YW9D
237
1
20e
-
57
YW9E
254
0
26e
-
62
ywsc
247
0
2.4e
-
184
ywtA
149
0
45e
-
84
ywtB
380
0
1.6e
-
124
ywtD
390
5
29e
-
21
9.8e -20
1.2.
Transpodbinding proteins and lipoproteins
licB
102
1
28e
-
37
licC
452 2 7.8e
-
2.53
60e
-
221
licA
110
1
49e-38
ywbF
399
0
2Se
-
108
ywcA
516 2 75e
-
174
YWCI
256
1
4Se -42
sacP
460
4
1.8e-
171
rocC
470 9 3.9e -215
YWfA
412 13 1.2e
-
26
ywhE
647 3 2Se
-
143
YWhQ
239 23 68e
-
33
narK
39s
1
7.4e
-
43
1.2e
-
66
ywiA
575 26 3-7e
-
136
ywkB
319
0
3.9e
-
37
ywoA
193
0
1.4e
-
25
nrgA
404
0
l.le
-
128
ywoD
452 7 24e
-
28
1.7e -36
ywoE
490
1
1.4e
-
79
93e
-
135
ywoG
396 9 27e
-
40
ywrA
178
1
6Se
-
16
ywrB
197
1
70e
-
16
ywrK
442
1
28e-79
rbsB
305
0
21e
-
96
rbsC
322
0
2.8e
-
117
rbsA
493 9 1.0e
-
149
rbsD
131
0
52e
-
39
ywtG
457 S.3e
-
143
ydfG, Escherichia coli
(sw
:
P39831)
trn2, Hyoscyamus niger
(sw
:
P50164)
dltD, Staphylococcus aureus
(embl
:
D86240)
dltC, Staphylococcus aureus
(embl
:
D86240)
dltB, Staphylococcus aureus
(embl
:
D86240)
dltA, Lactobacillus casei
(sw
:
P35854)
orf492, Methanosarcina mazeii
(embl
:
X84710)
ywhL, Bacillus subtilis
(embl
:
280360)
orf375, Methanosarcina mazeii
(embl
:
X84710)
ywhK, Bacillus subtilis
(embl
:
280360)
murZ, Synechocystis
sp. PCC 6803 (embl
:
D6400)
mbl, Bacillus cereus
(sw
:
P32444)
capA, Staphylococcus aureus
(sw
:
P39850)
capB, Staphylococcus aureus (sw
:
P39851)
capC, Staphylococcus
aureus
(sw
:
P39852)
capB, Bacillus anthracis
(sw
:
P19580)
capC, Bacillus anthracis
(sw
:
P19581)
capA, Bacillus anthracis
(sw
:
P19579)
spy, Escherichia coli
(embl
:
D86610)
nlpC, Haemophihs influenzae
(sw
:
P45296)
cdA, Bacillus stearothermophilus
(pir
:
B49898)
ywbA, Bacillus subtilis
(sw
:
P39584)
celB, Bacillus stearothermophilus
(pir
:
C49898)
celD, Bacillus stearothermophilus
(pir
:
E49898)
malA, Bacillus stearothermophilus
(pir
:
S43915)
dhlc, Xanthobacter autotrophicus
(pir
:
S52839)
nirC, Escherichia coli
(sw
:
P11097)
pts3, Klebsiella pneumoniae
(sw
:
P27219)
rocE, Bacillus subtilis
(sw
:
P39137)
PID
:
g1742740,
Escherichia coli
(embl
:
D90810)
mrcA, Synechocystis
sp. PCC 6803 (embl
:
D90913)
ecsA, Bacillus subtilis
(sw
:
P55339)
nasA, Bacillus subtilis
(sw
:
P42432)
MTCY04C12.22c,
Mycobacterium tuberculosis
ygaD, Bacillus subtilis
(embl: 282044)
mdcE, Klebsiella pneumoniae
(embl
:
U56096)
bcrC, Bacillus licheniformis
(sw
:
P42334)
amt, Mycobacterium tuberculosis
(sw
:
410968)
qacA, Staphylococcus aureus
(sw
:
P23215)
MTCY3G12.01,
Mycobacterium tuberculosis
fur4, Schizosaccharomyces pombe
(embl
:
X98696)
PID
:
gl773191,
Escherichia coli
(embl: U82664)
norA, Synechocystis
sp. PCC 6803 (embl
:
D90909)
chrA, Pseudomonas aeruginosa (sw
:
P14285)
(embl
:
281360)
(embl
:
279702)
chrA, Pseudomonas aeruginosa
(sw
:
P14285)
arsB, Staphylococcus xylosus
(sw
:
QOl2.55)
rbsB, Escherichia coli
(sw
:
P0292.5)
rbsC, Haemophilus influenzae (sw
:
P44736)
xylC, Escherichia coli (sw
:
P37388)
rbsD, Haemophilus
influenme
(sw
:
P44734)
yxbc, Bacillus subtilis
(sw
:
P46333)
D-Alanine esterification of lipoteichoic acid
D-Alanine esterification of lipoteichoic acid
D-Alanine esterification
of
lipoteichoic acid
D-Alanine esterification of lipoteichoic acid
D-Alanine esterification of lipoteichoic acid
Surface antigen
(carrier)
Surface antigen
UDP-N-acetylglucosamine 1-carboxyvinyl
Rod-shape-determining protein
Required for the biosynthesis of type
1
capsular
Required for the biosynthesis of type
1
capsular
Required for the biosynthesis of type
1
capsular
Synthesis of the polyglutamate capsule
Synthesis of the polyglutamate capsule
Synthesis of the polyglutamate capsule
Lipoprotein precursor
transferase
polysaccharide
polysaccharide
polysaccharide
B-Glucoside permease IIB component (PTS)
B-Glucoside permease IIC component (PTS)
8-Glucoside permease IIA component (PTS)
Maltose permease
Na+-dependent symport
Nitrite transporter
Sucrose permease IIBC component (PTS)
Arginine utilization
;
permease
Arabinose polymer transport
Penicillin-binding protein
ABC-type transporter
Nitrite extrusion protein
ABC-type transporter
Unknown transporter
Bacitracin transport permease protein
Ammonium transporter
Drug-resistance protein
Uracil permease
Quinolone-resistance protein
Essential
for
chromate resistance chromate
Essential for chromate resistance chromate
Arsenical pump membrane protein
Ribose transport operon
;
D-ribose-binding protein
Ribose transport operon
;
D-ribose-binding protein
Ribose transport operon
;
ATP-binding protein
Ribose transport operon
;
D-ribose-binding protein
Metabolite transporter
transport
transport
7
8
9
10
11
114
115
181
215
230
23
1
232
266
267
268
270
2
3
4
27
37
54
55
84
85
108
120
126
135
154
204
206
209
210
212
243
244
2.53
260
261
262
263
273
3319
Downloaded from www.microbiologyresearch.org by
IP: 45.45.132.90
On: Thu, 25 Aug 2016 08:45:06
E. PRESECAN
and
OTHERS
Table
2
(cont.)
Gene Product Total no.
BLAST
Genes encoding similar protein/similar gene Function/activity
of
the gene product
or
Gene
name size paralogues score products" similar protein n0.t
(4
1.4.
Membrane bioenergetics
;
electron-transport chain
;
ATP
synthase
qoxA
qoxB
qoxc
qoxD
ywcc
narc
narH
nar]
322
649
204
124
249
1228
487
184
narl 223 0
atpl 126
0
atpB 244 0
atpE 70 0
atpF 170
0
atpH 181
0
atpA 502
0
atpG 287 0
atpD 473 0
atpC 132 0
narQ 262
0
ywrO 175 2
1.5.
Mobility and chemotaxis
flh0 270 3
flhP 268 2
ywcF 393 2
1.8.
Sporulation
spsA 2.56
1
spsB
472
0
spsc
389 2
spsD 289
0
spsE 373 0
spsF
239
0
spsc 222
0
spsH 117
0
spsl 246 3
1.7.
Cell division
SPSl
315 2
spsK 432
0
phrF
rapF
spoOF
spoIlR
spollD
rapB
spollQ
spolllD
rapD
cotc
cotH
cotB
39
381
124
224
343
377
283
93
354
195
362
380
1
8
8
0
1
8
0
0
8
0
0
0
2.2e
-
40
75e
-
247
91e-70
3.0e
-
09
1.4e
-
67
00
2.9e
-
229
3.3e
-
15
4.3e
-
11
1.8e
-
31
1.3e
-
62
27e
-
38
1.2e
-
128
20e
-
30
3.4e
-
67
1.5e
-
59
1.3e
-
300
1.4e
-
143
l5e
-
288
3.7e
-
60
1.2e-36
6.3e
-
71
7.8e
-
20
8.4e
-
45
4.5e
-
39
8-1e-69
3.4e
-
155
3.le
-
82
1.2e
-
46
7.9e
-
14
3.5e -05
48e
-
65
75e-115
25e
-
66
2.3e
-
08
61e--118
6.1e-62
5.3e
-
180
1.4e
-
137
1-3e
-
37
qox-2, Acetobacter aceti (sw
:
P50653)
cyoB, Escherichia coli (sw
:
P18401)
cox-3, Thermophilic bacterium PS-3 (sw
:
403439)
cyoD, Escberichia coli (sw
:
P18403)
frp,
Vibrio harueyi (embl
:
U08996)
narc, Escherichia coli (sw
:
P09152)
narH, Escherichia coli (sw
:
P11349)
narJ, Thiosphaera pantotropha (pir
:
S61308)
narJ, Escherichia coli (sw
:
P11351)
narV, Escherichia coli (sw
:
P19316)
MTCY04C12.21c, Mycobacterium tuberculosis
atpl, Bacillus megaterium (sw
:
P20598)
atpB, Bacillus megaterium (sw
:
P20600)
atpE,
Bacillus megaterium (sw
:
P20603)
atpF, Bacillus megaterium (sw
:
P20601)
atpH, Bacillus stearothermophilus (sw
:
P42008)
atpA, Bacillus megaterium (sw
:
P17674)
atpC, Bacillus megaterium (sw
:
P20602)
atpD, Bacillus caldotenax (sw
:
P41009)
atpC, Bacillus megaterium (sw
:
P12699)
fdhD, Wolinella succinogenes (sw
:
P28181)
yrkL, Bacillus subtilis (sw
:
P54439)
(embl: 281360)
flgG, Caulobacter crescentus (sw
:
406172)
flgG, Bacillus subtilis (sw
:
P23446)
rodA, Haemophilus influenzae (sw
:
P44468)
cgeD, Bacillus subtilis (sw
:
P42092)
MJ1066, Methanococcus jannaschii (pir
:
A64433)
MJ1065, Methanococcus jannaschii (pir
:
H64432)
MJ1063, Methanococcus jannaschii (pir
:
F64432)
MJ1062, Methanococcus jannaschii (pir
:
E64432)
flaR,
Caulobacter crescentus (embl
:
U27302)
rhsA, Sphingomonas
S88
(embl: U51197)
No significant similarities
No significant similarities
rfbB,
Leptospira interrogans (embl
:
U61226)
rfbD2, Synechocystis sp. PCC 6803
phr, Bacillus
subtilis
(embl: D50453)
rapB, Bacillus subtilis (embl
:
281356)
SPOOF,
Bacillus thuringiensis (sw
:
P52942)
No significant similarities
SPZD, Bacillus amyloliquefaciens (sw
:
P132.51)
rapA, Bacillus subtilis
(sw
:
400828)
No significant similarities
No significant similarities
rapC, Bacillus subtilis (embl
:
D50453)
No significant similarities
No significant similarities
No significant similarities
(embl
:
D90899)
2.
Intermediary metabolism
2.1.
Metabolism
of
carbohydrates, sugar and amino sugars
licH 442 2 1.4e
-
147 celF, Escherichia cola (sw
:
Pi741
1)
@PA 286
0
3.0e-20 lgtA, Rhizobium leguminosarum (embl
:
X94963)
ywbA
444
2 8.9e-2.54 licB, Bacillus subtilis (sw :P46317)
ywbC 126
1
28e
-
10 lguL, Lycopersicon esculentum (tomato)
(sw
:
442891)
Quinol oxidase polypeptide
I1
precursor
Quinol oxidase polypeptide
I
Quinol oxidase polypeptide
111
Quinol oxidase polypeptide
IV
NADPH
:
flavin oxidoreductase
Nitrate reductase
a
subunit
Nitrate reductase
/3
subunit
Nitrate reductase protein J
Nitrate reductase subunit
y
ATP synthase
I
subunit
ATP synthase a subunit
ATP synthase c subunit
ATP synthase b subunit
ATP synthase
S
subunit
ATP synthase
a
subunit
ATP synthase
y
subunit
ATP synthase
B
subunit
ATP synthase
E
subunit
Unknown
Hypothetical NAD(P)H oxidoreductase
Unknown
;
similar to flagellar hook-basal body
Unknown
;
similar to flagellar hook-basal body
Rod-shape-determining protein
Spore-coat polysaccharide
Spore-coat polysaccharide
Spore-coat polysaccharide
Spore-coat polysaccharide
Spore-coat polysaccharide
Spore-coat polysaccharide
Spore-coat polysaccharide
Spore-coat polysaccharide
Spore-coat polysaccharide (glucose-1-phosphate
Spore-coat polysaccharide (UDP-N-
Spore-coat polysaccharide (dTDP-C-deoxy-~-
Phosphatase regulator peptide
Response regulator aspartate phosphatase
Sporulation
Sporulation
Sporulation
Response regulator aspartate phosphatase
Sporulation
Sporulation
Response regulator aspartate phosphatase
Spore-coat protein
G
Spore-coat protein
H
Spore-coat protein
B
thy midyly ltransferase)
acetylglucosamine epimerase)
mannose-deh ydrogenase)
6-Phospho-B-glucosidase
Galactosyltransferase
Cellobiose permease IIC component (PTS)
Lactoylglutathione lyase
43
44
45
46
49
130
131
132
133
170
171
172
173
174
175
176
177
178
186
257
216
217
48
69
70
71
72
73
74
75
76
77
78
79
112
113
145
161
182
188
202
214
218
249
250
25
1
5
18
22
24
3320
Downloaded from www.microbiologyresearch.org by
IP: 45.45.132.90
On: Thu, 25 Aug 2016 08:45:06
The
31 lo-334'
B.
subtilis
chromosomal
region
Table
2
(cont.)
Gene Product Total no.
BLAST
Genes encoding similar protein/similar gene Function/activity
of
the gene product
or
Gene
name sue paralogues score products* similar protein
n0.t
(4
390
513
480
255
259
323
62
705
285
186
429
321
582
149
189
440
571
255
293
0
0
2
19
0
0
0
0
1
0
1
0
1
0
0
0
3
0
0
6.0e
-
122
20e
-
122
65e
-
177
1.6e
-
123
3.7e
-
69
73e
-
36
1.le-28
2.7e
-
97
3.6e
-
12
5.le
-
30
2.7e
-
111
998e
-
72
3.9e
-
128
73e-112
5.8e
-
183
2.0e
-
192
74e-44
1.0e
-
12
93e -52
49e
-
137
6.0e
-
203
2.0e
-
66
3.7e
-
69
gal-1, Lactobacillus helveticus
(sw
:
Q00052)
gal-7, Butyrivibrio fibrisolvens
(sw
:
P45981)
surA, Bacillus stearothermophilus
(embl
:
U34872)
scrB, Salmonella typhimurium
(sw
:
P37075)
dhg-2, Bacillus megaterium
(sw
:
P39483)
MTCY09F9.36,
Mycobacterium tuberculosis
pbbB,
Alcaligenes eutropbus
(sw
:
P14697)
pta, Clostridium acetobutylicum
(embl
:
U38234)
dmpl, Pseudomonas putida
(pir
:
S24422)
MJ0092,
Methanococcus jannaschii
(pir
:
D64311)
f.aB,
Bacillus subtilis
(sw
:
P42420)
M 50960,
Methanococcus jannaschii
murZ, Synechocystis
sp. PCC 6803 (embl
:
D6400)
(embl
:
284498)
(embl: H64419)
Galactokinase
Galactose-1-phosphate uridyltransferase
Sucrose-6-phosphate hydrolase
41
42
56
Glucose dehydrogenase
88
Short-chain alcohol dehydrogenase family 92
Acetoacetyl-CoA reductase
Phosphate acetyltransferase 94
4-Oxalocrotonate tautomerase 105
Fumarate reductase 140
Fructose-l,6-diphosphate
aldolase 146
Transaldolase 147
signature
UDP-N-acetylglucosamine
1-
Glycerol metabolism
Malate oxidoreductase
carboxyvinyltransferase
148
149
153
glpX,
Escherichia coli
(sw
:
P28860)
maoX, Saccharomyces cerevisiae
(sw
:
P36013)
MTCY3G 12.02c,
Mycobacterium tuberculosis
rpiB, Escherichia coli
(sw
:
P37351)
dhbB, Bacillus subtilis
(sw
:
P45747)
yecD, Escherichia coli
(sw
:
P37347)
slr1299,
Synechocystis
sp. PCC 6803
als,
Lactococcus lactis
(embl
:
L16975)
aldC, Enterobacter
aerogenes
(sw
:
P05361)
rbsK, Haemophilus
influenme
(sw
:
P44331)
(embl
:
279702)
(embl
:
D90916)
Ribose 5-phosphate isomerase B
Isochorismatase
166
208
UDPglucose dehydrogenase 233
Acetolactate synthase (pH 6)
Acetolactate decarboxylase
Ribose transport operon
;
ribokinase
255
256
264
2.2.
Metabolism
of
amino acids and related molecules
ywaA
ywaD
epr
qr
yweB
rocA
rocB
YWfG
ywbF
ywhG
glYA
ureA
ureB
ureC
nrgB
ywrD
363
455
645
806
425
515
566
399
276
290
415
105
124
569
116
525
0
0
3
6
1
8
1
4
0
2
0
0
0
0
0
1
1.5e-111
53e- 15
1.7e
-
91
26e
-
57
25e
-
148
7-3e-231
1.9e
-
166
5.2e
-
259
6.1e-66
3.9e
-
97
3.0e
-
96
2.4e
-
78
1.5e
-
220
8.0e
-
46
24e-33
8.0e
-
272
3.8e -28
1.2e
-
87
7.5e
-
114
ilvE, Mycobacterium tuberculosis
(sw
:
P10399)
ape3, Saccharomyces cerevisiae
(sw
:
P37302)
aprM, Bacillus
sp. strain B18' (pir
:
A48373)
subT,
Bacillus amyloliquefaciens
(sw
:
P00782)
dhe-2, Clostridium dificile
(sw
:
P27346)
ypcA, Bacillus subtilis
(sw
:
PSO73S)
putA, Synechocystis
sp. PCC 6803 (embl
:
D90913)
ycgN, Bacillus subtilis
(embl
:
D50453)
yqjN,
Bacillus subtilis
(sw
:
P54551)
aspC, Synechocystis
sp. PCC 6803 (embl
:
D90913)
MJ0313,
Methanococcus jannaschii
(pir
:
B64339)
MJ0309,
Methanococcus jannaschii
(pir
:
F64338)
glyA, Hyphomicrobium methylovorum
ureA, Bacillus pasteurii
(sw
:
P41022)
ureB,
Bacillus pasteurii
(sw
:
P41021)
ureC,
Bacillus pasteurii
(sw
:
P41020)
glnB, Klebsiella pneumoniae
(sw
:
P11671)
ggt,
Escherichia coli
(sw
:
P18956)
MTCY369.18,
M ycobacterium tuberculosis
(pir
:
S30382)
(embl
:
280226)
Branched-chain amino acid aminotransferase 6
Proteinase precursor 14
Minor extracellular serine protease 21
Minor extracellular serine protease 51
Glutamate dehydrogenase (NAD) 81
1-Pyrroline-5-carboxylate
dehydrogenase 82
Unknown
Aspartate transaminase
Spermidine synthase
Agmatinase
Serine
hydroxymethyltransferase
83
91
109
110
168
Urease
y
subunit
Urease subunit
Xrease
CI
subunit
Nitrogen-regulated PI1 protein
y-Glutamyltranspeptidase
precursor
191
192
193
205
246
2.3.
Metabolism
of
nucleotides and nucleic acids
ctrA
535
0
41e-243
tdk
195
0
1.3e-54
UPP
209
0
45e
-
125
2.4.
Metabolism
of
lipids
ywdH
457
8
1.4e
-
144
MJ1174,
Methanococcus jannaschii
(pir
:
E64446)
kitH, Mycoplasma genitalium
(sw
:
P47280)
upp, Bacillus caldolyticus
(embl
:
X99545)
CTP synthase
Thymidine kinase
Uracil
phosphoribosyltransferase
143
152
169
sIr0091,
Synechocystis
sp.
PCC 6803
cis, Escherichia coli
(sw
:
P31071)
ywnE, Bacillus subtilis
(embl
:
Y08559)
ywnE, Bacillus subtilis
(embl
:
YOSSSS)
cls, Escherichia coli
(sw
:
P31071)
acdS, Clostridium acetobutylicum
(sw
:
P52042)
cls, Escherichia coli
(sw
:
P31071)
ywiE, Bacillus subtilis
(sw
:
P45860)
(embl
:
D64004)
Aldehyde dehydrogenase 64
ywiE
500
2
48e
-
120
WE
398 2 26e
-
55
25e
-
78
8.3e
-
45
acdA
379 3 2.3e
-
157
ywnE
482 2 1.2e-61
2.9e
-
120
Cardiolipin synthase
Cardiolipin synthase
134
139
Acyl-CoA dehydrogenase
Cardiolipin synthetase
141
198
3321
Downloaded from www.microbiologyresearch.org by
IP: 45.45.132.90
On: Thu, 25 Aug 2016 08:45:06
E.
PRESECAN
and
OTHERS
Table
2
(cont.)
Gene Product Total no.
BLAST
Genes encoding similar proteinhimilar gene Function/activity
of
the gene product
or
Gene
name size paralogues score products* similar protein n0.t
(a4
YWPB
132 0 1.3e -43 fabZ, Neisseria meningitidis (embl
:
U79481)
2.5.
Metabolism of coenzymes and prosthetic groups
ywaB 311
0
l.le -21 menA, Haemophilus influenzae
(sw
:
P44739)
YWbl 272
0
20e
-
27
41e-74
ywbK 222
1
1.2e-31
l.le -53
ywiB 174
1
5.Oe
-
09
1.2e
-
34
ywkE 288
0
1.2e
-
26
narA
341
0
46e
-
67
thi6,
Saccharomyces cerevisiae
(sw
:
P41835)
HI0417, Haemophilus influenzae (pir
:
A64151)
thi4, Schizosaccharomyces pombe
(sw
:
P40386)
HI0415, Haemophilus
influenzae
(pir
:
A64152)
dyrA, Thermotoga maritima (pir
:
JC4116)
yyaP, Bacillus subtilis
(sw
:
P37508)
hemK, Escherichia coli (pir
:
183570)
moaA, Arthrobacter nicotinovorans (pir
:
139637)
3. Information pathways
3.1.
DNA replication
YWPH
113
1
3-8e-45 ssb, Bacillus subtilis (sw
:
P37455)
3.2.
DNA
restriction/modification
and repair
ung 225
0
2.4e -84 ung, Escherichia coli
(sw
:
P12295)
YwiD 320
0
8.4e
-
35
uvl
,
Neurospora crassa (pir
:
S55262)
3.5.
RNA synthesis
;
initiation
;
regulation
;
elongation
;
termination
IicR
ywaE
sacY
sacX
ywbl
ywcc
sacT
YWfK
ywhA
ywhH
fnr
rpoE
rho
ywnD
ywoH
YWPl
YwqA
YwqM
ywrc
alsR
rbsR
ywtF
641
171
280
459
301
223
276
299
139
157
238
173
427
257
137
258
922
293
158
302
326
322
2
13
2
4
15
0
2
15
11
1
0
0
0
8
9
3
1
15
4
15
11
2
23e
-
48
1-le-05
9.2e
-
107
1-0e
-
98
4.2e
-
172
1.2e-47
3.3e
-
07
6.5e
-
120
2.2e
-
97
62e
-
44
1.8e
-
09
3.9e
-
07
22e
-
16
1.k
-
170
1.7e
-
60
65e
-
07
1.2e-42
1-6e
-
66
1.4e
-
156
34e-38
8-3e
-
23
1-3e
-
56
3-le-58
61e-50
mtlR, Bacillus stearothermophilus (embl
:
U18943)
slyA, Salmonella typhimurium
(sw
:
P40676)
surT,
Bacillus stearothermophilus (embl
:
U34873)
sacT, Bacillus subtilis
(sw
:
P26212)
sacP, Bacillus subtilis
(sw
:
P05306)
cynR, Escherichia coli
(sw
:
P27111)
arpA, Streptomyces griseus (embl: D49782)
SUTT,
Bacillus stearothermophilus (embl
:
U34873)
sacY, Bacillus subtilis
(sw
:
P15401)
rbcR, Synechocystis sp. PCC 6803 (embl
:
D64004)
ysmB, Bacillus subtilis (embl: 275208)
orfl,
Pseudomonas aeruginosa (embl
:
U49666)
ntcA, Synechococcus sp. PCC 7942
(sw
:
P29283)
No significant similarities
rho,
Escherichia coli
(sw
:
P03002)
tipA, Streptornyces lividans
(sw
:
P32184)
slyA, Escherichia coli
(sw
:
P55740)
agaR, Escherichia coli
(sw
:
P42902)
motl, Saccharomyces cerevisiae
(sw
:
P32333)
~111366, Synechocystis sp. PCC 6803
gltR, Bacillus subtilis (embl: U79494)
MJO151, Methanococcus jannaschii (pir
:
H64318)
xapR, Escherichia coli
(sw
:
P23841)
scrR, Pediococcus pentosaceus
(sw
:
P43472)
lytR, Bacillus subtilis
(sw
:
402115)
(embl
:
D90916)
(3R)-Hydroxymyristoyl-(acyl
carrier protein)
dehydratase saturated fatty acid biosynthesis
Menaquinone biosynthesis 1,4-dihydroxy-2-
Thiamin biosynthetic bifunctional enzyme
naphthoate octaprenyltransferase
Thiamin-phosphate pyrophosphorylase
Dihydrofolate reductase
Protoporphyrinogen oxidase
Nitrate assimilation and anaerobic respiration
ssDNA-binding protein
Uracil-DNA glycosylase
UV-endonuclease
Transcriptional regulator
of
the lic operon
Transcriptional regulator MarR family
Transcriptional regulator BglG family
Transcriptional regulator of sacY
Transcriptional activator LysR family
Transcriptional regulator (A factor receptor)
Transcriptional regulator BglG family
Transcriptional regulator LysR family
Transcriptional regulator MarR family
Transcriptional regulator
Anaerobic regulatory protein
RNA polymerase
6
subunit
Transcription termination factor
Transcriptional activator MerR family
Transcriptional regulator MarR family
Transcriptional regulator DeoR family
Probable helicase of the Snf2/Rad54 family
Transcriptional regulator LysR family
Transcriptional regulator AsnC family
Transcriptional regulator LysR family
Transcription regulator Lac1 family
Attenuator for itself and lytABC operon
3.7.
Protein synthesis
;
ribosomal proteins
;
aminoacyl-tRNA synthetases
;
initiation
;
elongation
;
termination
tyrz 413
1
3.9e
-
135 tyrS, Haemophilus influenzae
(sw
:
P43836) Minor tyrosyl-tRNA synthetase
thrZ
638
1
1.8e-241 thrS, Bacillus subtilis (sw
:
P18255) Minor threonyl-tRNA synthetase
argS 556
0
6.7e
-
161 argS, Brevibacterium lactofermentum (sw
:
P41253) Arginyl-tRNA synthetase
rpmE 66 1 54e-26 rpmE, Mycobacterium leprae
(sw
:
P45834) Ribosomal protein L31
PrfA 356
1
3.7e
-
130 prf, Mycoplasma capricolum (embl
:
50100) Peptide-chain-releasing factor
4.
Other functions
4.1.
Adaptation to atypical conditions
YWPC 130
0
1.4e-48 mscL, Escherichia coli
(sw
:
P23867) Large conductance mechanosensitivity channel
4.3.
Antibiotic production
sbo 43
0
No significant similarities Antibiotic peptide, subtilosin A
4.4.
Drug/analogue sensitivity
mmr
466 3 28e- 151 mmr, Streptomyces coelicolor
(sw
:
P11545) Methylenomycin A resistance protein
219
12
31
32
136
158
187
225
63
138
1
16
19
20
30
39
53
95
104
111
127
142
150
197
213
226
228
240
245
254
265
272
15
103
125
151
157
220
123
102
3322
Downloaded from www.microbiologyresearch.org by
IP: 45.45.132.90
On: Thu, 25 Aug 2016 08:45:06
The
311'-334'
B.
subtilis
chromosomal region
Table
2
(cont.)
~~ ~ ~
Gene Product Total no.
BLAST
Genes encoding similar proteinhimilar gene Function/activity
of
the gene product
or
Gene
name size paralogues score products,
similar
protein n0.t
(a4
ywnH
163
0
5.
Miscellaneous
ywcH
333 3
ywdL
181
0
ywlE
150
0
Genes without known function
ywaC
ywbD
ywbG
ywbH
ywbL
ywbM
ywbN
ywcD
ywdB
ywdF
ywdK
yweA
YW~J
YWFi
YWfO
YwgB
ywhC
ywhN
ywiA
ywiC
YWkD
ywlC
ywlD
ywmC
ywmD
ywmH
ywnG
ywoF
YWPD
YWPJ
ywnc
YWH
YWl
YW~J
YWqL
YWN
ywtE
210
396
241
128
481
385
416
127
271
268
460
113
154
296
433
156
219
426
448
239
128
346
185
227
224
62
127
172
468
278
285
140
86
602
238
181
286
1
0
3
3
0
0
0
0
1
0
0
0
0
2
0
0
0
0
0
0
2
0
0
1
1
0
1
1
0
0
5
1
1
3
0
0
7
l.5e
-
15
94e
-
56
1*4e
-
09
63e-61
1.2e
-
05
1.2e
-
12
9.7e
-
15
1.8e-35
77e-51
24e
-
16
46e
-
09
3.9e
-
67
5Oe
-
40
65e
-
06
1.0e
-
70
1.0e
-
14
7.5e
-
41
3.le
-
19
79e
-
17
1.4e
-
14
1.7e
-
39
2.0e
-
43
24e
-
09
7.5e
-
12
8.8e
-
14
23e-47
3.3e
-
53
1.4e
-
88
6.8e
-
22
1.6e
-
64
1.4e
-
64
41e-07
23e
-
30
43e
-
29
53e
-
06
7.Se
-
29
9.8e
-
15
3.7e
-
34
3.6e
-
30
3.3e
-
26
21e-22
00019
94e
-
10
6.2e
-
143
5.8e
-
09
8.4e
-
19
61e-10
22e
-
66
7*3e
-
31
97e
-
33
2.6e
-
27
9.8e
-
22
bar, Streptomyces coelicolor
(pir
:
JH0246)
~111647,
Synechocystis
sp. PCC 6803
(embl
:
D90903)
lxa-I, Photobacterium leiognati
(sw
:
F99140)
yhbW, Escherichia coli
(sw
:
P45529)
bncl
,
Brassica
napus
(sw
:
P33523)
wafE, Escherichia coli
(embl: U38473)
Gene in
mleR
region
of
Lactococcus factis
PID
:
g1787201,
Escherichia coli
(embl
:
AE000198)
yohK, Escherichia coli
(sw
:
P33373)
yoh
J,
Escherichia coli
(sw
:
P33372)
slr0964,
Synechocystis
sp. PCC 6803
PID
:
g17872.54,
Escherichia coli
(embl
:
AE000203)
PID
:
g1787255,
Escherichia coli
(embl
:
AE000203)
rfbl,
Escherichia coli
(embl
:
D90865)
Y416,
Haemophilus
influenzae
(sw
:
P44697)
HI1696,
Haemophilus influenzae
(pir
:
D64175)
PID
:
g1786723
Escherichia coli
(embl
:
AE000157)
ygdD, Haemophilus influenzae
(sw
:
P45019)
orfl,
upstream
gbsAB
operon
Bacillus subtilis
yicL, Escherichia coli
(sw
:
P31437)
MG461,
Mycoplasma genitahm
(sw
:
P39651)
orfl56, clpA
region
Lactococcus factis
MJ0611,
Methanococcus jannaschii
(pir
:
C64376)
cpn-60, Synechocystis
sp. PCC 6803
MTCY210.10
Mycobacterium tuberculosis
HI1626,
Haemophilus influenzae
(sw
:
P44278)
yaeR, Escherichia coli
(sw
:
P52096)
sua5, Saccharomyces cerevisiae
(sw
:
P32.579)
ywlD, Escherichia coli
(embl
:
D90826)
ywmD, Bacillus subtilis
(embl
:
281356)
ywmC, Bacillus subtilis
(embl
:
281356)
yjbJ, Escherichia coli
(sw
:
P32691)
ywnG, Bacillus subtilis
(embl: Y08559)
ywnC, Bacillus subtilis
(embl
:
Y08559)
MJ1396,
Methanococcus jannaschii
(pir
:
C64474)
yehU, Escherichia coli
(embl: D90868)
lyts, Staphylococcus
aureus
(embl
:
L42945)
ycsE, Bacillus subtilis
(sw
:
P42962)
yxeH, Bacillus subtilis
(sw
:
P54947)
yidA, Escherichia coli
(sw
:
P09997)
ybhA, Escherichia coli
(sw
:
P21829)
yxiB, Bacillus subtilis
(sw
:
P42294)
yxiC, Bacillus subtilis
(sw
:
P42295)
yqcG, Bacillus subtilis
(sw
:
P45942)
yxiD, Bacillus subtilis
(sw
:
P42296)
yjaF, Escherichia coli
(sw
:
P32679)
MJ1083,
Methanococcus jannaschii
(pir
:
B64435)
ywpJ, Bacillus subtilis
(embl
:
283337)
ycsE, Bacillus subtilis
(sw
:
P42962)
yxeH, Bacillus subtilis
(sw
:
P54947)
yidA, Escherichia coli
(sw
:
P09997)
ybhA, Escherichia coli
(sw
:
P21829)
(embl
:
M26949)
(embl
:
U47861)
(em bl
:
L36907)
(embl
:
D64001)
(embl
:
284395)
Phosphinotricine N-acetyltransferase
Alkanal monooxygenase a-chain, bacterial
luciferase
Seed storage protein
Acid phosphatase
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
Unknown
201
50
68
165
13
25
28
29
33
34
35
40
58
62
66
67
80
97
99
101
106
117
122
128
156
163
164
183
184
190
196
200
211
221
227
235
236
237
2.39
241
271
*Accession numbers are for SWISS-PROT
(sw),
Protein Information Resource (pir) or
EMBL
(embl).
tNumbers refer to Fig.
2.
3323
Downloaded from www.microbiologyresearch.org by
IP: 45.45.132.90
On: Thu, 25 Aug 2016 08:45:06
E. PRESECAN
and
OTHERS
teichoic acids
(dlt
operon; Perego
et
al.,
1995)
and
synthesis of a spore-coat polysaccharide
(sps
operon
;
M. F. Hullo and others, unpublished). Two sporulation
loci have also been characterized:
spollR
(Karow
et
al.,
1995
;
Londoiio-Vallejo
&
Stragier,
1995)
and
rapB
(Perego
et
al.,
1996).
Finally, the uracil phosphoribosyl-
transferase involved in the pyrimidine salvage pathway
(upp;
Martinussen
et
al.,
1995)
and a minor alternative
tyrosine-tRNA synthetase
(tyrZ
;
Glaser
et
a[.,
1991)
of
this
B.
subtilis
chromosomal region have been studied.
In the latest version
of
the
B.
subtilis
genetic map, there
are
96
markers located between
gerBC
and
licR (celR)
(Biaudet
et
al.,
1996).
The relative positions of the
markers determined by genetic methods are in good
agreement with the position of the corresponding genes
on the chromosome. Only eight genetic markers could
not be unambiguously associated to a gene.
furC
and
furE
(resistance to 5-fluorouracil) may both correspond
to
upp.
Alternatively, one of these loci and
axpB
(resistance to azopyrimidines) may lie in
ywoE,
which
encodes a protein similar
to
uracil permeases. For the
other five loci,
flaC,
outF,
divll, bac
and
gneA,
no
obvious gene could be allotted.
The predicted gene products of the
licR-gerBC
region
were compared with all known protein sequences
available in public sequence libraries
(5
April
1997).
The
273
genes of the region were thereby classified into five
groups: (i) genes with known function (non-y gene)
encoding proteins similar to known proteins
(94
genes)
;
(ii) orphan genes (for which no similar sequence has
been found) with known function
(9
genes); (iii) genes
with unknown function but encoding proteins similar to
proteins with known function
(75
genes)
;
(iv) unknown
genes encoding proteins similar
to
proteins with un-
known function
(39
genes) and
(v)
orphan genes with
unknown function
(56
genes). Although this classifica-
tion is subjective, the proportion of genes in classes (iv)
and
(v)
is similar to that found for other eubacterial
genomes
(35
‘7’0)
using the
BLAST
program. Although the
complete genomes of five bacteria, one archaeon and the
yeast
S.
cerevisiae
are known, no phylogenetically
related genes could be detected
by
this method for more
than
20%
of the genes in this region of the
B.
subtilis
chromosome. In
10
cases
(spsA,
rocB,
phrF, ywmC,
ywmD, ywnC, ywnG, ywqH, ywql, ywq]),
counterparts
were detected only in
B.
subtilis
sequences (paralogue
orphan genes). These genes, together with the class (ii)
and
(v)
genes, may constitute a kind of signature for
B.
subtilis
and related species. It is however not possible
to
know whether the absence of detectable similarities
reflects a recent origin
of
these genes or whether the
relatedness to other proteins has been obscured by
evolution.
The
217
genes in classes (i), (ii), (iii) and (iv) were further
classified according to Moszer
et
al.
(1996)
and are
presented in Table
2
(see also the arrow fill-in motifs in
Fig.
2).
No member was found for
10
categories
[numbers in parentheses represent classes and subclasses
according to Moszer
et
al.
(1996)l:
sensors
(1.3),
Table
3.
Orthologues
of
the
B.
subtilis
genes
from
the
/id?-gerBC
reg ion
Genome Total number Number
of
of
genes orthologues*
Escherichia coli
4285 158
Haemophilus influenzae
1743 92
Synechocystis
PCC
6803 3168 94
Methanococcus jannaschii
1682 59
Saccharomyces cerevisiae
6200 54
My coplasma genitalium
470 27
*
Number
of
genes in the
licR-gerBC
region having an orthologue
in each genome.
secretion
(1.6),
germination
(1.9),
competence
(1. lo),
metabolism
of
phosphate
(2.6)
and of sulphur
(2.7),
DNA recombination
(3.3),
RNA modification
(3.6),
protein modification
(3.8)
and folding
(3.9).
The dis-
tribution of genes into the remaining classes is consistent
with what has been described for the whole genome
(Moszer
et
al.,
1996).
However, there are several striking
features. About half of the genes in the
licR-gerBC
region belong to the ‘cell envelope and cellular pro-
cesses
class
of
genes, with a large proportion in the
cell
wall components’ subclass (the
sps
operon in the
sporulation subclass is also involved in the synthesis of
wall components). The region contains very few genes
encoding anabolic enzymes (in the
intermediary metab-
olism’ class)
:
glyA, narA
and the
thiC
operon. The
information pathways
class contains mainly regu-
latory genes and very few essential genes. Three amino-
acyl-tRNA synthetase genes are found in this region, but
only
argS
is essential (A. Sekowska and others, un-
published),
thrZ
and
tyrZ
encode minor synthetases of
unknown physiological function (Glaser
et
al.,
1991
;
Putzer
et
al.,
1990).
Other than
argS,
only
rpmE
and
rho,
encoding the ribosomal protein
L31
and a transcription
terminator factor, respectively, are important genes of
this class.
Orthologues and paralogues in the
gerBC-licR
region
We have searched for
B.
subtilis
sequences similar
to
sequences present in six known genomes (orthologues).
The number
of
genes
of
the
B.
subtilis licR-gerBC
region
having an orthologue in each
of
these genomes is
presented in Table
3.
The largest number
of
orthologues
(58%) was with
E.
coli
sequences. Note however that
the
E.
coli
genome contains more than twice as many
genes as the
H.
influenzae
genome. A striking result of
this analysis is the small number
(27)
of
genes in the
licR-gerBC
region having an orthologue in
Mycoplasma
genitalium.
This may be due
to
the information content
of this part of the
B.
subtilis
chromosome (see above).
Indeed mycoplasmas have minimal genomes which
appear to have evolved from Gram-positive bacteria by
elimination of non-essential genes (Maniloff,
1983).
The
3324
Downloaded from www.microbiologyresearch.org by
IP: 45.45.132.90
On: Thu, 25 Aug 2016 08:45:06
The
311"-334"
B.
subtilis
chromosomal region
Conclusion
1
0.12
0
500
1000 1500 2000 2500
3000
3500
4000
kb
Fig.
5.
Chromosomal distribution of the paralogues of the
genes in the
/icR-gerBC
region. Bars represent the frequency of
paralogues (number of paraloguedcorresponding number of
genes) found in
100
kb segments of the chromosome. The
darker bars are
in
the
/id?-gerK
region.
number of orthologues with the archaeon
(Methano-
coccus
jannaschii)
and with the eukaryote
(S.
cerevisiae)
genomes are similar
(49
and
54)
but correspond
to
different genes (data not shown).
We systematically compared the
273
putative gene
products with protein sequences deduced from known
B.
subtilis
DNA
sequences.
Of
these,
115
genes
(42
'/o
)
are similar to at least one other
B.
subtilis
gene
(paralogues). Forty-five genes have only one counter-
part. In contrast,
17
genes have more than eight
paralogues (Table
2).
These genes encode proteins which
belong to large families of regulatory or transporter
proteins. For example, three genes in this region
of
the
chromosome encode regulatory proteins of the LysR
family and there are
13
other members of this class in the
remaining known sequences.
We
performed a prelimi-
nary analysis of the distribution of the paralogues along
the genome map, considering only genes which are not
part
of
large families of proteins (Fig.
5).
The dis-
tribution is not uniform, presenting a higher density of
paralogues in or near our
271
kb region (between
3600
and
4100
kb). In several cases paralogues are in tandem
repeat, in the same transcriptional unit
(ywbH
and
yw6G
or
ywhK
and
ywhL),
or in two transcriptional
units
(ywmC
and
ywmD).
Some protein families are
particularly well represented in this part of the chromo-
some. The three known genes putatively encoding
cardiolipin synthase are found in a
70
kb region:
ywiE
and
ywnE
are transcribed in the same orientation and
ywjE
is transcribed in the opposite direction. Three
of
the eight known members of the newly discovered
regulatory aspartyl phosphatase family (Perego
et al.,
1996)
are encoded in this part of the chromosome
(rapA,
rapD, rapF).
The systematic sequencing approach has considerably
accelerated the study
of
the genetic content and the gene
function analysis
of
the
licR-gerBC
region of the
B.
subtilis
chromosome. This region appears to contain
genes mainly involved in adaptation
to
growth condi-
tions. In particular, it carries a high density
of
loci
responsible for the utilization
of
various carbon and
nitrogen sources and alternative terminal electron ac-
ceptors. The synthesis of non-essential cell wall com-
ponents and their modification may reflect adaptation to
physical and chemical variations in
B.
subtilis'
ecological
niche: the soil and the surface of plant leaves (Priest,
1993).
The small number
of
genes having an orthologue
in the
Mycoplasma genitalium
genome, considered as a
minimal genome with very little adaptive capacity,
together with the large number
of
paralogues in this part
of
the chromosome are consistent with this observation.
Our analysis also supports the hypothesis
of
a functional
organization
of
the
B.
subtilis
chromosome. The sys-
tematic analysis
of
the unknown genes will confirm or
invalidate this view.
ACKNOWLEDGEMENTS
We thank Dr
J.
Hoch
for
providing plasmid pPP41. This work
was
supported by
the
Institut
Pasteur, the
European
Com-
mission (Biotechnology contract B102-CT-930272), the Mini-
stkre
de 1'Education Nationale, de 1'Enseignement SupCrieur
et
de
la
Recherche,
the
Groupement de
Recherche
et
d'Etude
sur
les Gknomes (GREG) (grant 1993-no.
20)
and the Centre
National de la Recherche Scientifique (GDR 1029 and URA
1300). H.C.R.
was
supported by
a
fellowship
from
the
Universidad Nacional Autonoma de Mkxico.
REFERENCES
Abhayawardhane,
Y.
&
Stewart, G.
C.
(1 995).
Bacillus subtilis
possesses
a
second determinant
with
extensive sequence
similarity
to
the
Escherichia coli mreB
morphogene.
J
Bacteriol 177,
Albertini, A.
M.,
Caramori,
T.,
Scoffone,
F.,
Scotti,
C.
&
Galizzi, A.
(1995).
Sequence
around
the
159'
region
of
the
Bacillus subtilis
genome:
the
pksX
locus
spans
33.6
kb.
Microbiology
141,
Altschul,
5.
F.,
Gish, W., Miller, W., Myers,
E.
W.
&
Lipman, D.
J.
(1990).
Basic
local
alignment
search
tool.
J
Mol
Biol215,403-410.
Biaudet,
V.,
Samson,
F.,
Anagnostopoulos,
C.,
Ehrlich,
5.
D.
&
Bessihres,
P.
(1 996).
Computerized
genetic
map
of
Bacillus
subtilis. Microbiology
142, 2669-2729.
Bruckner,
R.,
Shoseyov,
0.
&
Doi,
R.
H.
(1990).
Multiple
active
forms
of
a
novel
serine
protease
from
Bacillus subtilis. Mol Gen
Genet
221,486490.
Bullock,
W.
0..
Fernandez,
1.
M.
&
Short,
J.
M.
(1987).
A
high
efficiency
plasmid
transforming
recA Escherichia coli
strain
with
p-galactosidase
selection.
BioTechniques
5,
376.
Bult,
C.
J.,
White,
O.,
Olsen, G.
1.
&
37
other authors
(1996).
Complete
genome
sequence
of
the
methanogenic
archaeon,
Methanococcus jannaschii. Science
273,
1058-1073.
Calogero,
S.,
Gardan,
R.,
Glaser,
P.,
Schweizer,
J.,
Rapoport, G.
&
765-773.
299-309.
3325
Downloaded from www.microbiologyresearch.org by
IP: 45.45.132.90
On: Thu, 25 Aug 2016 08:45:06
E.
PRESECAN
and
OTHERS
DBbarbouillB, M.
(1994).
RocR, a novel regulatory protein
controlling arginine utilization in
Bacillus subtilis,
belongs to the
NtrC/NifA family of transcriptional activators.
J
Bacteriol 176,
1234-1241.
Chambers,
5.
P.,
Prior,
5.
E.,
Barstow, D. A.
&
Minton, N.
P.
(1988).
The pMTL
nic-
cloning vectors.
1.
Improved pUC polylinker
regions to facilitate the use of sonicated DNA for nucleotide
sequencing.
Gene 68, 139-149.
Chung,
C.
T.
&
Miller,
R.
H.
(1988).
A rapid and convenient
method for the preparation and storage of competent bacterial
cells.
Nucleic Acids Res 16, 3580.
Cruz Ramos, H., Boursier, L., Moszer,
I.,
Kunst,
F.,
Danchin, A.
&
Glaser,
P.
(1 995).
Anaerobic transcription activation in
Bacillus
subtilis
:
identification of distinct FNR-dependent and -inde-
pendent regulatory mechanisms.
EMBO
J
14,5984-5994.
Cruz Ramos, H., Glaser,
P.,
Wray, L. V.
&
Fisher,
5.
H.
(1997).
The
Bacillus subtilis ureABC
operon.
J
Bacteriol 179, 3371-3373.
Danchin, A.
(1996).
mRNA turnover and DNA synthesis: a lesson
from bacterial genome comparisons.
Mol Microbiol20,895-897.
Dear,
S.
&
Staden,
R.
(1991).
A sequence assembly and editing
program for efficient management of large projects.
Nucleic Acids
Res 19,3907-3911.
Donovan, W., Zheng,
L.
B., Sandman, K.
&
Losick,
R.
(1987).
Genes
encoding spore coat polypeptides from
Bacillus subtilis.
J
Mol
Biol 196,
1-10.
Fleischmann,
R.
D., Adams, M. D., White,
0.
&
37
other authors
(1 995).
Whole-genome random sequencing of
Haemophilus
influenzae
Rd.
Science 269,496-512.
Fouet, A., Klier, A.
&
Rapoport, G.
(1986).
Nucleotide sequence of
the sucrase gene of
Bacillus subtilis. Gene 45,221-225.
Fouet, A., Arnaud, M., Klier, A.
&
Rapoport, G.
(1987).
Bacillus
subtilis
sucrose-specific enzyme I1 of the phosphotransferase
system
:
expression in
Escherichia coli
and homology to enzymes
I1 from enteric bacteria.
Proc Nut1 Acad Sci USA 84, 8773-8777.
Fraser,
C.
M., Gocayne,
J.
D., White,
0.
&
26
other authors
(1995).
The minimal gene complement of
Mycoplasma genitalium.
Science 270,397-403.
Gibson,
T.
J.
(1984).
Studies on the Epstein-Burr virus genome.
PhD thesis. University of Cambridge.
Glaser,
P.,
Kunst,
F.,
DBbarbouillB, M., VertBs, A., Danchin, A.
&
Dedonder,
R.
(1 991).
A gene encoding a tyrosine-tRNA synthetase
is located near
sacs
in
Bacillus subtilis. DNA Seq 1, 251-261.
Glaser,
P.,
Kunst,
F.,
Arnaud, M.
&
14
other authors
(1993).
Bacillus subtilis
genome project
:
cloning and sequencing of the
97
kb region from
325"
to
333". Mol Microbiol 10, 371-384.
Glaser,
P.,
Danchin, A., Kunst,
F.,
Zuber,
P.
&
Nakano, M. M.
(1995).
Identification and isolation of a gene required for nitrate
assimilation and anaerobic growth of
Bacillus subtilis. Bacteriol
Goffeau, A., Barrell,
B.
G., Bussey, H.
&
13
other authors
(1996).
Life with
6000
genes.
Science 274,563-567.
Hanahan, D.
(1983).
Studies on transformation of
Escherichia coli
with plasmids.
J
Mol Biol 16,557-580.
Harwood,
C.
R.
&
Wipat, A.
(1996).
Sequencing and functional
analysis of the genome of
Bacillus subtilis
strain
168.
FEBS
Lett
389, 84-87.
HBnaut, A., Rouxel, T., Gleizes, A., Moszer,
1.
&
Danchin, A.
(1996).
Uneven distribution
of
GATC motifs in the
Escherichia
coli
chromosome, its plasmids and its phages.
J
Mol Biol 257,
574-585.
Higgins, D. G., Bleasby, A.
J.
&
Fuchs,
R.
(1992).
CLUSTAL
v-
177, 1112-1115.
Improved software for multiple sequence alignment.
Comput
Appl Biosci 8, 189-191.
Himmelreich,
R.,
Hilbert, H., Plagens, H., Pirkl,
E.,
Li,
B.
&
Herrmann,
R.
(1996).
Complete sequence analysis of the genome
of the bacterium
Mycoplasma pneumoniae. Nucleic Acids Res 24,
4420-4449.
Horinouchi,
5.
&
Weisblum, B.
(1982).
Nucleotide sequence and
functional map of
pC194,
a plasmid that specifies inducible
chloramphenicol resistance.
J
Bacteriol 150, 815-825.
Itaya, M.
&
Tanaka,
T.
(1991).
Complete physical map
of
the
Bacillus subtilis 168
chromosome constructed by a gene-directed
mutagenesis method.
J
Mol Biol220,631-648.
Kaneko,
T.,
Sato,
S.,
Kotani, H.
&
21
other authors
(1996).
Sequence analysis of the genome of the unicellular cyanobac-
terium
Synechocystis
sp. strain PCC
6803. 2.
Sequence deter-
mination of the entire genome and assignment of potential
protein-coding regions.
DNA Res
3,
109-136.
Karow, M.
L.,
Glaser,
P.
&
Piggot,
P.
J.
(1995).
Identification of a
gene,
spoZlR,
that links the activation of
oE
to the transcriptional
activity of
crF
during sporulation in
Bacillus subtilis. Proc Nut1
Acad Sci USA 92,2012-2016.
Kunkel, B., Kroos, L., Poth, H., Youngman,
P.
&
Losick,
R.
(1989).
Temporal and spatial control
of
the mother-cell regulatory gene
spoZllD
of
Bacillus subtilis. Genes Dev
3,
1735-1744.
Kunst,
F.
&
Devine, K.
(1991).
The project of sequencing the entire
Bacillus subtilis
genome.
Res Microbiol 142, 905-912.
Kunst,
F.
&
Rapoport,
G.
(1995).
Salt stress is an environmental
signal affecting degradative enzyme synthesis in
Bacillus subtilis.
J
Bacteriol 177, 2403-2407.
Kunst,
F.,
Vassarotti, A.
&
Danchin, A.
(1995).
Organization of the
European
Bacillus subtilis
genome sequencing project.
Micro-
biology 141, 249-255.
Kyte,
J.
&
Doolittle,
R.
F.
(1982).
A simple method for displaying
the hydropathic character of a protein.
J
Mol Biol 157, 105-132.
Lampe, M., Binnie,
C.,
Schmidt,
R.
&
Losick,
R.
(1988).
Cloned gene
encoding the
6
subunit of
Bacillus subtilis
RNA polymerase.
Gene
67, 13-19.
Londofio-Vallejo, J.-A.
&
Stragier,
P.
(1 995).
Cell-cell signaling
pathway activating a developmental transcription factor in
Bacillus subtilis. Genes Dev 9, 503-508.
Londofio-Vallejo, J.-A., FrBhel,
C.
&
Stragier,
P.
(1997).
spoZZQ,
a
forespore-expressed gene required for engulfment in
Bacillus
subtilis. Mol Microbiol24, 29-39.
Lopez-Diaz,
I.,
Clarke,
5.
&
Mandelstam,
J.
(1986).
spoZlD
operon
of
Bacillus subtilis:
cloning and sequence.
J
Gen Microbiol 132,
Maeda,
M.,
Hidaka, M., Nakamura, A., Masaki, H.
&
Uozumi,
T.
(1 994).
Cloning, sequencing and expression
of
thermophilic
Bacillus
sp. strain
TP-90
urease gene complex in
Escherichia coli.
J
Bacteriol 176, 43242.
Maniloff,
1.
(1983).
Evolution of wall-less prokaryotes.
Annu Rev
Microbiol37,477-499.
Marck,
C.
(1988).
'DNA Strider': a 'C' program for the fast
analysis of DNA and protein sequences on the Apple Macintosh
family
of
computers.
Nucleic Acids Res 16, 1829-1836.
Martinussen,
J.,
Glaser,
P.,
Andersen,
P.
5.
&
Saxild, H. H.
(1995).
Two genes encoding uracil
phosphoribosyltransferase
are present
in
Bacillus subtilis.
J
Bacteriol 177, 271-274.
Messing,
1.
&
Vieira,
J.
(1982).
A new pair
of
M13
vectors for
selecting either DNA strand of double-digest restriction frag-
ments.
Gene 19, 269-276.
341-354.
3326
Downloaded from www.microbiologyresearch.org by
IP: 45.45.132.90
On: Thu, 25 Aug 2016 08:45:06
The
311"-334"
B.
subtilis
chromosomal
region
Miller,
1.
H.
(1972).
Experiments in Molecular Genetics.
Cold
Spring Harbor, NY: Cold Spring Harbor Laboratory.
Mitchell,
C.,
Morris,
P.
W., Lum, L., Spiegelman, G.
&
Vary,
J.
C.
(1992). The amino acid sequence of a
Bacillus subtilis
phospho-
protein that matches an
orp-tsr
coding sequence.
Mol Microbiol
Mizuno, M., Masuda,
S.,
Takemaru, K., Hosono,
S.,
Sato, T.,
Takeuchi, M.
&
Kobayashi, Y.
(1996). Systematic sequencing of the
283
kb
210'-232'
region of the
Bacillus subtilis
genome con-
taining the
skin
element and many sporulation genes.
Micro-
biology 142, 3103-3111.
Moszer,
I.,
Glaser,
P.
&
Danchin, A.
(1991). Multiple IS insertion
sequences near the replication terminus in
Escherichia coli K-12.
Biochimie 73, 1361-1374.
Moszer,
I.,
Glaser,
P.
&
Danchin, A.
(1995).
SubtiList:
a relational
database for the
Bacillus subtilis
genome.
Microbiology 141,
Moszer,
I.,
Kunst,
F.
&
Danchin, A.
(1996). The European
Bacillus
subtilis
genome sequencing project
:
current status and accessi-
bility of the data from a new World Wide Web site.
Microbiology
Naclerio, G., Baccigalupi,
L.,
Zilhao,
R.,
De Felice, M.
&
Ricca,
E.
(1996).
Bacillus subtilis
spore coat assembly requires
cotH
gene
expression.
J
Bacteriol 178, 43754380.
Niaudet, B., Goze, A.
&
Ehrlich,
5.
D.
(1982). Insertional muta-
genesis in
Bacillus subtilis:
mechanism and use in gene cloning.
Gene 19,277-284.
O'Brien,
C.
(1997). Entire
E.
coli
genome sequenced-at last.
Nature 385, 472.
Ogasawara, N.
&
Yoshikawa, H.
(1996). The systematic sequenc-
ing of the
Bacillus subtilis
genome in Japan.
Microbiology 142,
Ogasawara, N., Nakai,
5.
&
Yoshikawa, H.
(1994). Systematic
sequencing of the 180 kilobase region of the
Bacillus subtilis
chromosome containing the replication origin.
DNA Res 1,l-14.
Perego, M., Hanstein,
C.,
Welsh, K.
M.,
Djavakhishvili, T., Glaser,
P.
&
Hoch,
J.
A.
(1994). Multiple protein-aspartate phosphatases
provide a mechanism for the integration of diverse signals in the
control of development in
Bacillus subtilis. Cell 79, 1047-1055.
Perego, M., Glaser,
P.,
Minutello, A., Strauch, M. A,, Leopold, K.
&
Fischer, W.
(1995). Incorporation
of
D-alanine into lipoteichoic
acid and wall teichoic acid in
Bacillus subtilis
:
identification of
genes and regulation.
]
Biol Chem 270, 15598-15606.
Perego, M., Glaser,
P.
&
Hoch,
J.
A.
(1996). Aspartyl-phosphate
phosphatases deactivate the response regulator components of
the sporulation signal transduction system in
Bacillus subtilis.
Mol Microbioll9, 1151-1157.
Popham, D. L.
&
Setlow,
P.
(1994). Cloning, nucleotide sequence,
mutagenesis, and mapping of the
Bacillus subtilis pbpD
gene,
which codes for penicilin-binding protein
4.
J
Bacteriol 176,
Priest,
F.
G.
(1993). Systematics and ecology of
Bacillus.
In
Bacillus
subtilis and Other Gram-positive Bacteria
:
Biochemistry, Physi-
ology, and Molecular Genetics.
Edited by A.
L.
Sonenshein,
J.
A.
Hoch
&
R. Losick. Washington, DC: American Society for
Microbiology.
Putzer, H., Brakhage, A.
&
Grunberg-Manago, M.
(1990). Inde-
pendent genes for two threonyl-tRNA synthetases in
Bacillus
subtilis.
J
Bacteriol 172, 45934602.
Putzer, H., Gendron, N.
&
Grunberg-Manago, M.
(1992). Co-
ordinate expression
of
the two threonyl-tRNA synthetase genes
in
Bacillus subtilis
:
control by transcriptional antitermination
6, 1345-1349.
26 1-26
8.
142,2987-2991.
2993-2994.
7197-7205.
involving a conserved regulatory sequence.
EMBO
J
11, 3117-
3127.
Quirk,
P.,
Dunkley,
E.,
Lee,
P.
&
Krulwich, T.
(1993). Identification
of a putative
Bacillus subtilis rho
gene.
J
Bacteriol 175, 647-654.
Renna,
M.
C.,
Najimudin, N., Winik, L.
R.
&
Zahler, S.A.
(1993).
Regulation of the
Bacillus subtilis alsS, alsD,
and
alsR
genes
involved in post-exponential-phase production of acetoin.
]
Bacterioll75,3863-3875.
Sacco, M., Ricca,
E.,
Losick,
R.
&
Cutting,
5.
(1995). An additional
GerE-controlled gene encoding an abundant spore coat protein
from
Bacillus subtilis.
]
Bacteriol 177, 372-377.
Sambrook, J., Fritsch,
E.
F.
&
Maniatis, T.
(1989).
Molecular
Cloning: a Laboratory Manual,
2nd edn. Cold Spring Harbor,
NY: Cold Spring Harbor Laboratory.
Santana,
M.,
Kunst, F., Hullo,
M.
F., Rapoport, G., Danchin, A.
&
Glaser, P.
(1992). Molecular cloning, sequencing, and physio-
logical characterization of the
gox
operon from
Bacillus subtilis
encoding the
aa,-600
quinol oxidase.
J
Biol Chem 267, 10225-
1023 1.
Santana,
M.,
lonexu,
M.
S.,
Vertes, A., Longin,
R.,
Kunst,
F.,
Danchin, A.
&
Glaser, P.
(1994).
Bacillus subtilis
F,F,
ATPase:
DNA sequence of the
atp
operon and characterization of
atp
mutants.
J
Bacteriol 176, 68024811.
Silhavy, T.
J.,
Berman,
M.
L.
&
Enquist,
L.
W.
(1984).
Experiments
with Gene Fusions.
Cold Spring Harbor, NY: Cold Spring
Harbor Laboratory.
Sloma, A.,
Rufo,
G. A,, Jr, Theriault, K. A., Dwyer, M., Wilson,
5.
H.
&
Pero,
1.
(1991). Cloning and characterization of the gene
for an additional extracellular serine protease
of
Bacillus subtilis.
]
Bacteriol 173, 68894895.
Soldo, B., Lazarevic, V., MauBI,
C.
&
Karamata, D.
(1996). Sequence
of
the
305O-307O
region of the
Bacillus subtilis
chromosome.
Microbiology 142, 3079-3088.
Sorokin, A., Azevedo, V., Zumstein,
E.,
Galleron, N., Ehrlich,
5.
D.
&
Senor, P.
(1996). Sequence analysis of the
Bacillus subtilis
chromosome region between the
serA
and
kdg
loci cloned in a
yeast artificial chromosome.
Microbiology 142,2005-2016.
Stragier, P.
&
Losick,
R.
(1996). Molecular genetics of sporulation
in
Bacillus subtilis. Annu Rev Genetics 30, 297-341.
Swinfield, T. J., Oultram,
J.
D., Thompson, D.
E.,
Brehm,
J.
K.
&
Minton, N. P.
(1990). Physical characterisation of the replication
region
of
the
Streptococcus faecalis
plasmid pAMB1.
Gene 87,
Tobisch,
S.,
Glaser,
P.,
KrUger,
5.
&
Hecker,
M.
(1997). Identifi-
cation and characterization of a new B-glucoside utilization
system in
Bacillus subtilis.
J
Bacteriol 179, 496-506.
Trach, K., Chapman,
J.,
Piggot,
P.,
Lecoq, D.
&
Hoch,
J.
(1988).
Complete sequence and transcriptional analysis of the
SPOOF
region of the
Bacillus subtilis
chromosome.
J
Bacteriol 170,
4 194-4208.
Wipat, A., Carter, N., Brignell,
5.
C.,
Guy, B.
J.,
Piper, K., Sanders,
J.,
Emmerson, P.T.
&
Hatwood,
C.
R.
(1996). The
dnaB-pheA
(256O-24Oo)
region
of
the
Bacillus subtilis
chromosome containing
genes responsible for stress responses, the utilization
of
plant cell
walls and primary metabolism.
Microbiology 142, 3067-3078.
Woodson, K.
&
Devine, K. M.
(1994). Analysis of a ribose
transport operon from
Bacillus subtilis. Microbiology 140,
Wray,
L.
V.,
Atkinson,
M.
R.
&
Fisher,
5.
H.
(1994). The nitrogen-
regulated
Bacillus subtilis nrgAB
operon encodes a membrane
protein and a protein highly similar to the
Escherishia coli glnB-
encoded
PI1
protein.
J
Bacteriol 176, 108-114.
79-90.
1829-1838.
3327
Downloaded from www.microbiologyresearch.org by
IP: 45.45.132.90
On: Thu, 25 Aug 2016 08:45:06
E.
PRESECAN
and
OTHERS
Yanisch-Perron,
C.,
Vieira,
J.
&
Messing,
J.
(1985).
Improved M13
phage cloning vectors and host strains
:
nucleotide sequences
of
the M13mp18 and pUC19 vectors.
Gene
33,103-119.
Yoshida,
K.,
Shindo,
K.,
Sano,
H.,
Seki,
S.,
Fujimura,
M.,
Yanai,
N.,
Miwa, Y.
&
Fujita, Y.
(1996).
Sequencing
of
a 65
kb
region
of
the
Bacillus subtilis
genome containing the
lic
and
cel
loci, and
creation
of
a 177
kb
contig covering the
gnt-sacXY
region.
Microbiology
142,
3113-3123.
Yoshikawa,
H.,
Kazami,
J.,
Yamashita,
S.,
Chibazakura,
T.,
Sone,
H.,
Kawamura,
F.,
Oda, M., Isaka, M., Kobayashi, Y.
&
Saito,
H.
(1986).
Revised assignment for the
Bacillus subtilis
spoOF
gene
and its homology with
spoOA
and with two
Escherichia
coli
genes.
Nucleic Acids
Res
14,
1063-1072.
Zukowski,
M.,
Miller,
L.,
Cogswell,
P.,
Chen, K., Aymerich,
S.
&
Steinmetz, M.
(1990).
Nucleotide sequence
of
the
sacs
locus
of
Bacillus subtilis
reveals the presence
of
two regulatory genes.
Gene
90,
153-155.
Received 16 April 1997; revised
2
June 1997; accepted 6 June 1997.
3328
... The vesicle-producing cell leads to cell autolysis through the enzymatic action of releasing autolysin, which induces the formation of MV in neighboring cells [66]. The study of prophage sequences has been studied since as early as 1997; Presecan [67] identified many potentially non-essential chromosomal motifs in the whole genome nucleotide sequence of B. subtilis, particularly 10 relatively large AT-rich regions representing known prophages and prophage-like regions scattered across the genome. Analog sequences (pro1-7) are obtained by horizontal gene transfer [68][69][70]. ...
Article
Full-text available
Bacillus subtilis is a gram-positive bacterium, a promising microorganism due to its strong extracellular protein secretion ability, non-toxic, and relatively mature industrial fermentation technology. However, cell autolysis during fermentation restricts the industrial application of B. subtilis. With the fast advancement of molecular biology and genetic engineering technology, various advanced procedures and gene editing tools have been used to successfully construct autolysis-resistant B. subtilis chassis cells to manufacture various biological products. This paper first analyses the causes of autolysis in B. subtilis from a mechanistic perspective and outlines various strategies to address autolysis in B. subtilis. Finally, potential strategies for solving the autolysis problem of B. subtilis are foreseen.
... Putative ¹35, ¹22 and ¹10 sequences of Pho regulon genes (Eder et al., 1996) are labelled. and is 51% and 40% identical to the B. subtilis YwqF (Presecan et al., 1997) and YtaC (Lapidus et al., 1997 ) respectively . The exact enzymic activity and expression of these TuaD paralogues not involved in TUA synthesis remain uncharacterized (B. ...
Article
Full-text available
Sequence analysis reveals that the Bacillus subtilis 168 tuaABCDEFGH operon encodes enzymes required for the polymerization of teichuronic acid as well as for the synthesis of one of its precursors, the UDP-glucuronate. Mutants deficient in any of the tua genes, grown in batch cultures under conditions of phosphate limitation, were characterized by reduced amounts of uronate in their cell walls. The teichuronic acid operon belongs to the Pho regulon, as phosphate limitation induces its transcription. Placing the tuaABCDEFGH operon under the control of the inducible Pspac promoter allowed its constitutive expression independently of the phosphate concentration in the medium; the level of uronic acid in cell walls was dependent on the concentration of the inducer. Apparently, owing to an interdependence between teichoic and teichuronic acid incorporation into the cell wall, in examined growth conditions, the balance between the two polymers is maintained in order to insure a constant level of the wall negative charge.
... The CPS biosynthesis loci of S. pneumoniae encode four genes (cpsA-D also known as wzg, wzh, wzd & wze) found at the 59 end of the loci, which are involved in the regulation of CPS biosynthesis in the pneumococcus. Genes similar to these are found in the CPS loci of many other Grampositive bacteria [24,25,26,27]. While cpsA mutants produce significantly less CPS, the cpsA gene product is not essential for CPS production and is thought to function as a translational activator [28,29] . ...
Article
Full-text available
Capsule polysaccharide is a major virulence factor for a wide range of bacterial pathogens, including Streptococcus pneumoniae. The biosynthesis of Wzy-dependent capsules in both gram-negative and -positive bacteria is regulated by a system involving a protein tyrosine phosphatase (PTP) and a protein tyrosine kinase. However, how the system functions is still controversial. In Streptococcus pneumoniae, a major human pathogen, the system is present in all but 2 of the 93 serotypes found to date. In order to study this regulation further, we performed a screen to find inhibitors of the phosphatase, CpsB. This led to the observation that a recently discovered marine sponge metabolite, fascioquinol E, inhibited CpsB phosphatase activity both in vitro and in vivo at concentrations that did not affect the growth of the bacteria. This inhibition resulted in decreased capsule synthesis in D39 and Type 1 S. pneumoniae. Furthermore, concentrations of Fascioquinol E that inhibited capsule also lead to increased attachment of pneumococci to a macrophage cell line, suggesting that this compound would inhibit the virulence of the pathogen. Interestingly, this compound also inhibited the phosphatase activity of the structurally unrelated gram-negative PTP, Wzb, which belongs to separate family of protein tyrosine phosphatases. Furthermore, incubation with Klebsiella pneumoniae, which contains a homologous phosphatase, resulted in decreased capsule synthesis. Taken together, these data provide evidence that PTPs are critical for Wzy-dependent capsule production across a spectrum of bacteria, and as such represents a valuable new molecular target for the development of anti-virulence antibacterials.
Article
Full-text available
RNA degradation is one of several ways for organisms to regulate gene expression. In bacteria, the removal of two terminal phosphate moieties as orthophosphate (Bacillus subtilis) or pyrophosphate (Escherichia coli) triggers ribonucleolytic decay of primary transcripts by 5'-monophosphate-dependent ribonucleases. In the soil-dwelling firmicute species B. subtilis, the RNA pyrophosphohydrolase BsRppH, a member of the Nudix family, triggers RNA turnover by converting primary transcripts to 5'-monophospate RNA. Besides BsRppH, a source of redundant activity in B. subtilis has been proposed. Here, using recombinant protein expression and in vitro enzyme assays, we provide evidence for several additional RNA pyrophosphohydrolases, among them MutT, NudF, YmaB, and YvcI in B. subtilis We found that in vitro, YvcI converts RNA 5'-di- and triphosphates into monophosphates in the presence of manganese at neutral to slightly acidic pH. It preferred G-initiating RNAs and required at least one unpaired nucleotide at the 5'-end of its substrates, with the 5'-terminal nucleotide determining whether primarily ortho- or pyrophosphate is released. Exchanges of catalytically important glutamate residues in the Nudix motif impaired or abolished the enzymatic activity of YvcI. In summary, the results of our extensive in vitro biochemical characterization raise the possibility that YvcI is an additional RNA pyrophosphohydrolase in B. subtilis.
Thesis
Full-text available
Bacillus subtilis takes up mannitol by a phosphoenolpyruvate-dependent phosphotransferase system (PTS). The mannitol utilization system is encoded by the mtlAFD operon consisting of mtlA (encoding membrane-bound EIICBMtl), mtlF (encoding phosphocarrier EIIAMtl), and mtlD (encoding mannitol 1-phosphate dehydrogenase). This operon is activated by MtlR whose coding gene is located approx. 14.4 kb downstream of the operon. The regulation of the mannitol utilization genes in B. subtilis was studied by fusion of the promoters of mtlAFD (PmtlA) and mtlR (PmtlR) to lacZ as a reporter gene. Both the PmtlA and PmtlR were inducible by mannitol and glucitol, while glucose reduced their activities. The promoter strength of PmtlA was about 4.5-fold higher than that of PmtlR. Identification of the transcription start sites of PmtlA and PmtlR revealed that both of these promoters contain a sigma A-type promoter structure. The promoter -35 and -10 boxes in PmtlA were TTGTAT and TAACAT and in PmtlR TTGATT and TATATT, respectively. Catabolite responsive elements (cre) were detected in the sequences of PmtlA and PmtlR overlapping the -10 boxes. Shortening the mRNA 5’untranslated region (5’UTR) increased the PmtlA activity, whereas PmtlR activity was decreased by shortening of its mRNA 5’UTR. Alignment of the -35 upstream sequences of PmtlA and PmtlR revealed the putative MtlR binding site. This sequence comprised a similar incomplete inverted repeat in both the PmtlA and PmtlR sequences (TTGNCACAN4TGTGNCAA). This sequence was encompassed by two 11 bp distal and proximal flanking sequences. Construction of PmtlA-PlicB hybrid promoters and shortening of the 5’-end of PmtlA indicated the probable boundaries of putative MtlR binding site in PmtlA. Increasing the distance between the putative MtlR binding site and -35 box lowered the PmtlA maximal activity, although PmtlA remained inducible by mannitol. PmtlA became inactive by disruption of the TTGNCACAN4TGTGNCAA sequence. In contrast, manipulation of the distal and proximal flanking sequences only reduced the maximal activity of PmtlA, whereas PmtlA remained highly inducible. These flanking sequences contained AT-rich repeats similar to the consensus sequence of alpha CTD binding sites. Regulation of PmtlA and PmtlR was investigated by deletion of mtlAF, mtlF, mtlD, and mtlR. Deletion of the mtlAF genes rendered PmtlA and PmtlR constitutive showing the inhibitory effect of EIICBMtl and EIIAMtl (PTS transporter components) on MtlR in the absence of mannitol. The constitutive activity of PmtlA was increased by the deletion of mtlF. In contrast, the deletion of mtlAFD showed a significant reduction in the PmtlA constitutive activity. Disruption of mtlD made B. subtilis sensitive to mannitol in a way that addition of mannitol or glucitol to the bacterial culture ended in cell lysis. Besides, PmtlA and PmtlR were similarly induced by glucitol and mannitol in a mtlD::erm mutant. Also, deletion of mtlR rendered PmtlA and PmtlR uninducible by mannitol or glucitol. In contrast, deletion of the glucitol utilization genes had no influence on the inducibility of PmtlA or PmtlR by glucitol. The PmtlA activity was drastically reduced in ptsH-H15A (HPr-H15A) mutant similar to the delta mtlR mutant. The mutation of histidine 289 in the PRDI domain of MtlR to alanine reduced the activity of PmtlA, whereas the PmtlA activity in the mtlR H230A mutant was almost similar to wild type. In contrast, mutation of the PRDII domain of MtlR to H342D mainly relieved PmtlA from glucose repression. Moreover, MtlR double mutant H342D C419A which was produced in E. coli was shown to be active in vitro. These results represent the positive regulation of MtlR via phosphorylation of the PRDII domain by HPr(H15~P). Also, dephosphorylation of the domains EIIBGat- and EIIAMtl-like of MtlR by EIIAMtl and EIICBMtl transporter components causes activation. The PmtlA activity was repressed in the presence of glucose and fructose, while sucrose and mannose had no influence on the PmtlA activity. Therefore, catabolite repression of PmtlA and PmtlR were studied by CcpA-dependent carbon catabolite repression mutants, such as ptsH-S46A, delta crh, delta hprK, and delta ccpA. Induction of PmtlA and PmtlR in these mutants did not result in a complete loss of catabolite repression. Therefore, the catabolite responsive elements (cre sites) of PmtlA and PmtlR were investigated. Using a constitutive promoter, PgroE, it was shown that the cre sites of PmtlA and PmtlR were weakly functional. In contrast, deletion of the glucose PTS transporter, encoded by ptsG, resulted in a complete loss of glucose repression in PmtlA and PmtlR. Thus, the main glucose repression of mannitol PTS function at the posttranslational level in a HPr-mediated manner via MtlR-H342 and at transcriptional level by CcpA-dependent carbon catabolite repression.
Article
Full-text available
Bacillus methanolicus MGA3 is a thermophilic, facultative ribulose monophosphate (RuMP) cycle methylotroph. Together with its ability to produce high yields of amino acids, the relevance of this microorganism as a promising candidate for biotechnological applications is evident. The B. methanolicus MGA3 genome consists of a 3,337,035 nucleotides (nt) circular chromosome, the 19,174 nt plasmid pBM19 and the 68,999 nt plasmid pBM69. 3,218 protein-coding regions were annotated on the chromosome, 22 on pBM19 and 82 on pBM69. In the present study, the RNA-seq approach was used to comprehensively investigate the transcriptome of B. methanolicus MGA3 in order to improve the genome annotation, identify novel transcripts, analyze conserved sequence motifs involved in gene expression and reveal operon structures. For this aim, two different cDNA library preparation methods were applied: one which allows characterization of the whole transcriptome and another which includes enrichment of primary transcript 5'-ends. Analysis of the primary transcriptome data enabled the detection of 2,167 putative transcription start sites (TSSs) which were categorized into 1,642 TSSs located in the upstream region (5'-UTR) of known protein-coding genes and 525 TSSs of novel antisense, intragenic, or intergenic transcripts. Firstly, 14 wrongly annotated translation start sites (TLSs) were corrected based on primary transcriptome data. Further investigation of the identified 5'-UTRs resulted in the detailed characterization of their length distribution and the detection of 75 hitherto unknown cis-regulatory RNA elements. Moreover, the exact TSSs positions were utilized to define conserved sequence motifs for translation start sites, ribosome binding sites and promoters in B. methanolicus MGA3. Based on the whole transcriptome data set, novel transcripts, operon structures and mRNA abundances were determined. The analysis of the operon structures revealed that almost half of the genes are transcribed monocistronically (940), whereas 1,164 genes are organized in 381 operons. Several of the genes related to methylotrophy had highly abundant transcripts. The extensive insights into the transcriptional landscape of B. methanolicus MGA3, gained in this study, represent a valuable foundation for further comparative quantitative transcriptome analyses and possibly also for the development of molecular biology tools which at present are very limited for this organism.
Article
A 565 bp DNA stretch coding for the Bacillus caldolyticus DSM405 cdd gene was cloned by PCR amplification and sequenced. Nucleotide sequence analysis to the cloned DNA stretch revealed an open reading frame of 396 bp encoding a deduced polypeptide of 132 amino acids with a molecular mass of 14,235 Da. Comparison of the deduced polypeptide of the B. caldolyticus cdd gene to that of the Bacillus subtilis cdd showed a 71% sequence homology. Thermostable cytidine deaminase expressed in Escherichia coli JF611/pCJH53 was purified through rapid procedures consisting of anionic-exchange column chromatography and FPLC. Finally purified enzyme showed 167.5 u/mg of specific activity, corresponding to about 94-fold purification with respect to the crude extract. Molecular mass of the monomer was estimated to be about 14-kDa by SDS-polyacrylamide gel electrophoresis (PAGE), and tetrameric configuration of the native enzyme was supposed by gel filteration. Optimal deamination occurred at 70∼75°C and pH 7.5–8.0, and the activity was inhibited by Fe2+, Cu2+, Hg2+, and p-chloromercuribenzoic acid (pCMB). The enzyme was thermostable and the apparent Km and Vmax values for the cytidine were 2.1 × 10−3M and 36.4 mM min−1mg−1, respectively.
Article
Mechanosensation in bacteria involves transducing membrane stress into an electrochemical response. In Escherichia coli and other bacteria, this function is carried out by a number of proteins including MscL, the mechanosensitive channel of large conductance. MscL is the best characterized of all mechanosensitive channels. It has been the subject of numerous structural and functional investigations. The explosion in experimental data on MscL recently culminated in the solution of the three-dimensional structure of the MscL homologue from Mycobacterium tuberculosis. In this review, much of these data are united and interpreted in terms of the newly published M. tuberculosis MscL crystal structure.
Article
To decrease the costs of producing the anti-HIV drug, lamivudine, an enzymatic conversion process was developed instead of the traditional chemical method. Thermostable cytidine deaminase was over-produced by cloning the cdd gene into E.coli JF611/pCJH53 from Bacillus caldolyticus. The purified cytidine deaminase was recovered from the lysate of the recombinant E.coli JF611/pCJH53 by removing heat-denatured proteins and eluting sequential chromatography. When the enzyme was used to deaminate (–)--l-(2R, 5S)- and (+)--d-(2S, 5R)-1, 3-oxathiolanyl-cytosine, about 68% of the (+)--d-(2S, 5R)-1, 3-oxathiolanyl-cytosine was deaminated into the corresponding (+)-thiauridine maximally.
Article
Full-text available
Tandem paired genes encoding putative short-chain monodomain protein members of the chromate ion transporter (CHR) superfamily (ywrB and ywrA) were cloned from genomic DNA of Bacillus subtilis strain 168. The transcription of the paired genes, renamed chr3N and chr3C, respectively, was shown to occur via a bicistronic mRNA generated from a promoter upstream of the chr3N gene. The chr3N and chr3C genes conferred chromate resistance when expressed in Escherichia coli strain W3110. The cloned chr3N gene alone did not confer chromate resistance on E. coli, suggesting that both chr3N and chr3C genes are required for function. E. coli cells expressing paired chr3N and chr3C genes demonstrated diminished uptake of chromate compared to that by a vector-only control strain. These results suggest that short-chain CHR proteins form heterodimer transporters which efflux chromate ions from the cytoplasm.
Article
Transposon Tn917 mutagenesis of Bacillus subtilis BD99 followed by selection for protonophore resistance led to the isolation of strain MS119, which contained a single Tn917 insertion in an open reading frame whose deduced amino acid sequence was 56.6% identical to that of the Escherichia coli rho gene product. The insertional site was near the beginning of the open reading frame, which was located in a region of the B. subtilis chromosome near the spoOF gene; new sequence data for several open reading frames surrounding the putative rho gene are presented. The predicted B. subtilis Rho protein would have 427 amino acids and a molecular weight of 48,628. The growth of the mutant strain was less than that of the wild type on defined medium at 30 degrees C. On yeast extract-supplemented medium, the growth of MS119 was comparable to that of the wild type on defined medium at 30 degrees C. On yeast extract-supplemented medium, the growth of MS119 was comparable to that of the wild type at 30 degrees C but was much slower at lower temperatures; sporulation occurred and competence was developed in cells of the mutant grown at 30 degrees C. To determine whether the protonophore resistance and sensitivity to low growth temperature resulted from the insertion, a chloramphenicol resistance cassette was inserted into the wild-type B. subtilis rho gene of strain BD170; the resulting derivative displayed the same phenotype as MS119.
Article
Factors that affect the probability of genetic transformation of Escherichia coli by plasmids have been evaluated. A set of conditions is described under which about one in every 400 plasmid molecules produces a transformed cell. These conditions include cell growth in medium containing elevated levels of Mg2+, and incubation of the cells at 0 degrees C in a solution of Mn2+, Ca2+, Rb+ or K+, dimethyl sulfoxide, dithiothreitol, and hexamine cobalt (III). Transformation efficiency declines linearly with increasing plasmid size. Relaxed and supercoiled plasmids transform with similar probabilities. Non-transforming DNAs compete consistent with mass. No significant variation is observed between competing DNAs of different source, complexity, length or form. Competition with both transforming and non-transforming plasmids indicates that each cell is capable of taking up many DNA molecules, and that the establishment of a transformation event is neither helped nor hindered significantly by the presence of multiple plasmids.
Article
The nucleotide sequence of the Bacillus subtilis 168 chromosomal segment located between yvhJ (307°) and secA (305°) was determined. This 20.3 kb region encompasses 23 ORFs, 17 of which have been sequenced previously. Comparison of sequences obtained here with the previously obtained ones revealed seven discrepancies. The products of the sequenced genes are involved in the regulation of degradative enzymes, competence, flagellar motility and protein secretion. Putative functions of newly identified genes are based on sequence homologies.
Article
We have cloned and sequenced a gene (epr) encoding a novel serine protease from Bacillus subtilis. Several active forms of the enzyme with molecular masses between 40 and 34 kDa were found in the medium of B. subtilis cultures containing the epr gene cloned on a plasmid. Deletions at the 3 end of the gene, removing up to 240 amino acids of the reading frame, abolished the expression of the larger species but did not affect the expression of the 34 kDa enzyme. The C-terminal third of the protein is therefore not required for protease activity. The size variation of the active forms expressed by the complete epr gene appears to be the result of partial removal of the C-terminus either by processing or degradation. Thus, the epr gene consists of two domains, one encoding a serine protease homologous to subtilisin and the other a C-terminus of unknown function.