ArticlePDF AvailableLiterature Review

Abstract

The term 'exoproteome' describes the protein content that can be found in the extracellular proximity of a given biological system. These proteins arise from cellular secretion, other protein export mechanisms or cell lysis, but only the most stable proteins in this environment will remain in abundance. It has been shown that these proteins reflect the physiological state of the cells in a given condition and are indicators of how living systems interact with their environments. High-throughput proteomic approaches based on a shotgun strategy, and high-resolution mass spectrometers, have modified the authors' view of exoproteomes. In the present review, the authors describe how these new approaches should be exploited to obtain the maximum useful information from a sample, whatever its origin. The methodologies used for studying secretion from model cell lines derived from eukaryotic, multicellular organisms, virulence determinants of pathogens and environmental bacteria and their relationships with their habitats are illustrated with several examples. The implication of such data, in terms of proteogenomics and the discovery of novel protein functions, is discussed.
10.1586/EPR.12.52
561
ISSN 1478-9450
© 2012 Expert Reviews Ltd
www.expert-reviews.com
Review
Secreted proteins represent a specifi c fraction
of the proteome in the sense that they have to
be transported across membranes to ensure for
the biogenesis of membranes and cell walls,
nutrient uptake, motility and interactions. A
signi cant part of them are active outside the
cell. Proteolysis in the extracellular environment
may be important in some cases, and only the
most stable proteins will remain abundant in
the extracellular environment. Most secreted
proteins are synthesized as precursors with a
cleavable N-terminal signal sequence, but a
signi cant fraction is secreted by nonclassical
pathways, that is, without signaling peptides.
This signal peptide is required for targeting of
the protein to the membrane-embedded export
machinery [1] . Upon translocation across the
membrane, the signal peptide is either cleaved
from the precursor via a membrane-bound
signal peptidase or served as a signal anchor
into the membrane. The most predominant
signal peptide-dependent pathways are the
Sec- and Tat-secretory pathways [24] . The
Sec pathway is ubiquitous and essential for
viability in bacteria, archaea and eukaryotes [5] .
This export system translocates unfolded
proteins across the endoplasmic reticulum (ER)
membrane in eukaryotes and the cytoplasmic
membrane of bacteria and archaea, through a
protein-conducting channel. The twin-arginine
translocation (Tat) machinery exports folded
proteins across the cytoplasmic membrane
in bacteria and archaea, as well as across the
chloroplast thylakoid membrane in plants [4] .
Tat recognizes longer and less hydrophobic,
signal peptides than those of the Sec system,
comprising two contiguous arginine residues
[4] . In Gram-positive bacteria, protein substrates
of sortases are translocated via Sec-dependent
secretory pathway but are covalently anchored
to the cell wall. Lipoproteins are also secreted (in
Gram-positive bacteria) or exported (in Gram-
negative bacteria) via the Sec pathway (or Tat-
dependent pathways for some of them) but are
anchored to the cytoplasmic membrane (in both
types of bacteria) or the outer membrane (on
the periplasmic side in Gram-negative bacteria).
Integral membrane proteins are inserted via
YidC in a Sec-dependent or independent
manner. The Sec- and Tat-dependent secretion
Jean Armengaud*
1
,
Joseph A
Christie-Oleza
2
,
Gérémy Clair
1
,
3
,
4
,
ronique Malard
1
and
Catherine Duport
3 ,
4
1
CEA , DSV, IBEB, Lab Biochim System
Perturb, Bagnols-sur-Cèze, F-30207,
France
2
School of Life Sciences, University of
Warwick, Coventry CV4 7AL, UK
3
Université d’Avignon et des Pays de
Vaucluse, UMR408, Sécurité et Qualité
des Produits d’Origine Végétale,
F-84000 Avignon, France
4
INRA, UMR408, Sécurité et Qualité des
Produits d’Origine Végétale, F-84914
Avignon, France
*Author for correspondence:
Tel.: +33 046 679 6802
Fax: +33 046 679 1905
jean.armengaud@cea.fr
The term ‘exoproteome’ describes the protein content that can be found in the extracellular
proximity of a given biological system. These proteins arise from cellular secretion, other protein
export mechanisms or cell lysis, but only the most stable proteins in this environment will remain
in abundance. It has been shown that these proteins refl ect the physiological state of the cells
in a given condition and are indicators of how living systems interact with their environments.
High-throughput proteomic approaches based on a shotgun strategy, and high-resolution mass
spectrometers, have modifi ed the authors’ view of exoproteomes. In the present review, the
authors describe how these new approaches should be exploited to obtain the maximum useful
information from a sample, whatever its origin. The methodologies used for studying secretion
from model cell lines derived from eukaryotic, multicellular organisms, virulence determinants of
pathogens and environmental bacteria and their relationships with their habitats are illustrated
with several examples. The implication of such data, in terms of proteogenomics and the
discovery of novel protein functions, is discussed.
Exoproteomics: exploring
the world around biological
systems
Expert Rev. Proteomics 9(5), 561–575 (2012)
KEYWORDS: exoproteomics hypothetical proteins metaproteomics peptide signal proteogenomics proteomic
surfaceome secretion systems secretome shotgun proteomics subcellular localization predictor toxins
virulence factors
For reprint orders, please contact reprints@expert-reviews.com
Expert Rev. Proteomics 9(5), (2012)
562
Review
pathways also lead to the release of some proteins into the
extracellular medium
(FIGURE 1) . In Gram-positive bacteria, some
exported proteins share a determinant at their carboxyl termini
for covalent attachment to the cell wall. Such a signal consists of
a signature motif (such as LPXTG or PEP amino acid sequences),
a hydrophobic transmembrane α-helix, and a cluster of basic
amino acids (primarily arginine residues) and is processed by
sortases [6] . In eukaryotic cells, Sec-dependent extracellular
proteins are transported from the ER to the Golgi apparatus and
subsequently to the cell surface, where they are delivered to the
microenvironment by fusion of the Golgi-derived vesicles with the
plasma membrane [7] . This pathway of protein export is referred
to as the classical secretory pathway. Other secretory pathways
also exist in eukaryotic cells, allowing for protein secretion
independently of the presence of an identi able signal peptide
and the ER–Golgi. Diverse mechanisms have been proposed,
including lysosomal secretion, endosomal recycling and release
of membrane vesicle exosomes into the extracellular fl uid [8, 9] . In
bacteria, the mechanisms responsible for nonclassical secretion are
still largely undocumented. Nevertheless, cytoplasmic proteins
found in the extracellular milieu are thought to be released
during septum formation, and furthermore, membrane vesicles
from the envelope of growing Gram-negative bacteria have been
shown to be secretory vehicles for proteins [10 12] . Furthermore,
extracellular cytoplasmic proteins have been shown to participate
in a range of biological processes including virulence [13] and as
nonactive (enzymatically) binding substrates [14] . Such proteins,
which display two unrelated functions, have been referred to as
moonlighting’ proteins and could be selectively secreted [13] .
Among the proteins secreted by bacteria and archaea, S-layer
proteins are worth mentioning as they are abundantly assembled
on the surface of the cells as paracrystalline sheets and associated
with many other secreted proteins [15] .
The term ‘secretome’ refers to the pool of proteins which are
actively secreted via classical or nonclassical mechanisms or via
the release of exosomes, as well as the secretion machinery itself
[16] . The secreted proteins comprise membrane-linked proteins
as well as other translocated proteins. Proteins released into the
extracellular milieu belongs to the latter group of protein. A
proportion of the exported proteins are inserted in the membrane
via the presence of transmembrane hydrophobic segments or can
also be associated in some cases with other transmembrane-
inserted proteins, as exempli ed by some components of the
agellum apparatus of motile bacteria [17] . The speci c fraction
of secreted proteins bound to the external surface of the cell, also
known as the ‘surface proteome’ or ‘proteomic surfaceome’, can be
evaluated by means of shaving-based approaches [18 20] . For this,
intact cells are exposed to limited proteolysis, to cleave surface
proteins present on the outer membrane, cytoplasmic membrane
and/or cell wall. The peptides derived from shaved proteins can be
identifi ed by mass spectrometric analysis but are restricted to the
exposed, nonhydrophobic regions, sometimes hindering protein
identifi cation because of low sequence coverage. Numerous studies
have been reported regarding bacterial [21, 22] or human cell [23, 24]
surfaceomes and plant cell-wall proteomes
[25] . An interesting recent innovation is
the specifi c detection of secreted proteins
on the bacterial cell surface directly by
MALDI-TOF mass spectrometry using
colonies of Mycobacterium marinum [26] .
This and other recent studies have a large
implication on microbial diagnostics, a
eld of growing interest in terms of human
health and microbial ecology [27, 28] . A major
fraction of the secreted proteins will nally
end up outside the cell. These exported
proteins comprise soluble components of
transporters, chelators, proteases, toxins,
growth factors, cytokines, sensors and
numerous proteins, which are poorly
characterized, that is, hitherto annotated
as hypothetical proteins or assigned
with a putative function. The fraction
that corresponds to the protein content
present outside of the cells, either in the
conditioned medium for laboratory cell
cultures or in the extracellular matrix of
environmental samples, is, strictly, known
as the ‘exoproteome’. The exoproteome
includes both actively secreted proteins and
extracellular, but nonsecreted, proteins,
resulting from cell lysis, cell friction and
Periplasmic space
Outer membrane
Extracellular content
Cytoplasm
Exoproteome
Inner membrane
Surface
proteome
Secretome
Nonsecreted
exoproteins
Secreted
exoproteins
Expert Rev. Proteomics © Future Science Group (2012)
Figure 1. Secretome, surfaceome and exoproteome concepts illustrated with a
Gram-negative bacterium model. The exoproteome (black dotted line) regroups the
so-called ‘secreted exoproteins’ and ‘nonsecreted exoproteins’ (among them, proteins
from cellular lysis or fragments from proteomic surfaceome due to abrasion). The
secretome (red dotted line) regroups the so-called ‘secreted exoproteins’, most
membrane proteins here shown in orange, and the diverse secretion systems here drawn
in purple. The surface proteome (blue dotted line) regroups the accessible fraction of the
proteins inserted in the membrane in contact to the extracellular environment.
Armengaud, Christie-Oleza, Clair, Malard & Duport
563
www.expert-reviews.com
Review
protein degradation [16] . Thus, ‘secretome’, ‘proteomic surfaceome’
and ‘exoproteome’ are three different concepts, which represent
different and partially overlapping protein fractions, as clari ed
in
FIGURE 1 . The term ‘exoproteome’ should be used in preference
to ‘secretome’ when describing the subset of proteins present in
the extracellular milieu, as previously recommended [16] .
Comparative genomics have revealed that the fraction of puta-
tive exported/secreted proteins encoded by microbial genomes is
rather large. A fi rst study indicated that this ratio ranged from 8
to 37% for Methanococcus jannaschii and Mycoplasma pneumonia ,
respectively [29] . Another report estimated that between 25–35%
and 15–25% of proteins could be exported/secreted for Gram-
negative and Gram-positive bacteria, respectively [30] . In this
review, the authors highlight the importance of exoproteome
analysis in different biological systems. This is illustrated with
examples from cultured human cell lines, microbial pathogens
and environmental samples. It should be mentioned that other
studies on unicellular eukaryotes, plants, animals and archaea
would be also worth citing in such a review if more space was
available. The authors also describe the most recent exoproteomic
methodologies, with emphasis on shotgun proteomic approaches
and bioinformatic tools for predicting the presence of peptide
signal and other determinants, as well as the protein localiza-
tion. Finally, the authors draw the perspectives of exoproteomics,
a fi eld with an enormous potential in terms of novel-function
discovery, as highlighted by the large amount of exoproteins pre-
dicted at the genomic level and detected at the proteomic level,
which remain poorly characterized, as well as for biomedical and
biotechnological applications.
Tools & strategies for exoproteome analysis
A new generation of mass spectrometers for
high-throughput proteomics
Over the last decade, substantial improvements in mass spectrom-
etry for analyzing proteins and peptides have led to fundamental
changes in the fi eld of proteomics. Pioneering studies on exopro-
teomes have been carried out by fractionating complex samples
on 2D-PAGE gels, followed by the identi cation of isolated pro-
tein spots using peptide mass fi ngerprinting with MALDI-TOF
mass spectrometry records. Today, complex peptide mixtures are
in most cases resolved by reversed-phase chromatography (nano-
HPLC) prior to being analyzed by high-throughput hybrid mass
spectrometers. Alternatively, they can be resolved fi rst on strong
cation exchange chromatography in combination with reversed-
phase chromatography as in MudPIT experiments, or by OFFGEL
electrophoresis [31] . These novel shotgun approaches facilitate the
identifi cation of several hundreds of proteins from a single sam-
ple. Scan rates ranging from ve to 20 high-quality tandem mass
spectrometry (MS/MS) spectra acquired per second are achieved
by the most recent mass spectrometers, with attomolar sensitiv-
ity [32] . Bottom-up strategies, in which proteins in the sample are
proteolyzed with trypsin and the masses and sequences of the
resulting tryptic peptides are determined by MS/MS, are the most
frequently used approaches. With the advent of shotgun strate-
gies and high-throughput mass spectrometers, such as Q-TOF,
Q-trap and Orbitrap confi gurations, more exhaustive catalogs of
proteins can be achieved without the need for tedious fractionation
[33] . However, exoproteome samples exist in extracellular, aqueous-
phase solutions containing large quantities of other chemical enti-
ties which may sometimes be incompatible with mass spectrometry.
For this reason, protein extraction and purifi cation is a step that
must be addressed cautiously. The most straightforward method to
obtain a clean peptide solution for mass spectrometry is to separate
the exoproteome sample through a denaturing SDS–PAGE gel.
After a short electrophoretic migration, the puri ed exoproteins
within a single, small polyacrylamide band can be proteolyzed with
trypsin. The resulting, clean peptide mix can then be analyzed
by nano-LC-MS/MS. The majority of recent studies on exopro-
teomes have taken advantage of this high-throughput approach, but
2D-PAGE still has signi cant interest in the resolution of isoforms
that may result from posttranslational modi cation of proteins and
their quantitation [34] .
Shotgun proteomics & quantitation methods
In shotgun proteomics, a whole pool of peptides, which may com-
prise thousands of different entities, is being analyzed with the
objective of identifying most of the components. The presence
of proteins in the sample may be ascertained after tally of the
multiple peptide identi cations. As recently reviewed [35] , specifi c
precautions should be taken into consideration for processing the
data. Filtering the MS/MS spectra assignments with a stringent
cutoff and evaluating false-positive identifi cations by searching
the MS/MS spectra against decoy databases should be done to
evaluate the robustness of the approach. Over the last decade,
protein quantitation of a large number of the components became
an important dimension of shotgun proteomics. Work ows
developed over the past decade for protein quantitation by mass
spectrometry fall into two categories: labeling approaches, which
include a step in which different isotopes are introduced into
the samples to be compared, either by metabolic labeling or by
protein modi cation with chemical reagents, and nonlabeling
approaches [36] . Labeling approaches are appealing technically.
For example, multiplex analyses have been made possible with
the use of isobaric reagents, which give different, unique reporter
groups during the MS/MS fragmentation. Stable isotope labeling
by amino acids in cell culture, abbreviated as SILAC, consists in
an in vivo metabolic labeling of proteins aimed at introducing a
mass difference in the same polypeptides between different sam-
ples, enabling their direct comparison when mixed and analyzed
[37] . It has been used to quantify the type III secreted proteins of
enteropathogenic Escherichia coli [38] , the exoproteome of renal
cells [39] and a mouse skeletal muscle cell line [40, 41] . Nonlabeling
approaches are also quite popular, due to their simplicity and
relatively good reproducibility. High-throughput comparison of
samples has become a routine procedure with the use of multiple-
reaction monitoring. In this case, the intensities of a given set of
fragments from the collision-induced fragmentation of parent
peptides are measured precisely by tandem mass spectrometry
[42, 43] . More recently, MS/MS data-independent acquisition has
been shown to perform well, with an increase in the number of
Exoproteomics: exploring the world around biological systems
Expert Rev. Proteomics 9(5), (2012)
564
Review
quanti ed proteins [44] . However, spectral counting is nowadays
widely used and easy to perform in the common data dependent
MS/MS acquisition mode. In this quantitation procedure, the
number of MS/MS spectra of a given protein during different
sample runs, under strictly similar nano-LC-MS/MS conditions,
is counted
[45] . The comparison of protein quantities between
samples is direct, although tending to be restricted to the most
abundant proteins. The normalized spectral abundance factor can
be calculated for each protein in order to compare the abundance
of all the proteins. As the ionization properties of peptides are
sequence dependent, this comparison is only valid to compare
protein groups at a global scale and cannot be precise. Typically,
spectral counts assigned to each polypeptide are normalized by
the number of residues present [46] or the molecular weight [47 , 48] .
Values are then expressed as a ratio, by dividing each normalized
spectral abundance factor value by the total sum corresponding
to all the polypeptides detected with two or more nonredundant
peptides. As an alternative to spectral count quantitation, the
intensity of the corresponding parent ion can be extracted from
the chromatography record. A peptide elutes from the reverse-
phase column generally over several tens of seconds. This inten-
sity is repeatedly measured during this elution and the global
signal extracted. Summing the three most important intensities
recorded among the detected peptides corresponding to a given
protein has been shown to be highly representative. Extracted ion-
chromatogram, commonly abbreviated as XIC, has been reported
to increase the number of quanti able proteins compared with
spectral count, and also to be more rigorous [32] . Moreover, abso-
lute quantitation of proteins may be achieved by spiking known
quantities of isotope-labeled peptides or proteins-of-interest into
the sample and comparing both forms in the same analysis [49] .
Label-free proteomic methods are, arguably, the most signi cant
contribution to proteomic analyses in recent times [35] . They have
been shown to be very robust and comparable with labeling meth-
ods [32] . These approaches will be further developed as can be
foreseen from recent studies [50, 51] .
Bioinformatic tools for predicting protein localization
During cell culture, some cells may die and be lysed resulting
in the release of large amounts of intracellular proteins into the
conditioning medium. The amount of cellular debris is depend-
ent on the physiological state of the cells, the age of the culture,
the amount of biomass produced and the design of the experi-
ment itself. The comparison between the exoproteome and
the whole-cell proteome may be informative in distinguishing
between secreted proteins and spilled cytoplasmic proteins. Other
experimental approaches such as western blotting are needed to
confi rm MS observations. Nevertheless, bioinformatic tools may
also be helpful for such discrimination. Prediction software has-
been conceived to predict the presence of possible peptide signals
at the N-termini of proteins, discriminate transmembrane heli-
ces containing proteins that will be inserted into the membrane
and postulate amino acid composition, secondary structures and
disordered regions necessary for nonclassical secretion methods.
These software programs are also able to predict the cleavage
site of the signal peptides. The most recent software programs
have been further re ned on the basis of experimental datasets
and new knowledge of secretion pathways. Prediction tools have
been reviewed in part
[7, 52] . TABLE 1 lists the most widely used and
recent tools for predicting the presence of peptide signal and by
extension, protein subcellular localization. As noted in TABLE 1 , the
most popular tools are available via web servers. Most software are
based on machine-learning techniques, such as hidden Markov
models, Bayesian networks, arti cial neural networks or support
vector machine classifi ers. Interestingly, meta-analytical methods
which have been recently proposed merge the results of several
predictors and propose a meta scoring for the consensus prediction
[53, 54] . Prediction for the whole theoretical secreted proteins of any
new organism may be achieved relatively quickly using such tools.
Moreover, specialized databases for protein localization are under
development. These databases list predicted secreted proteins for
the whole proteome of several hundreds of organisms, as well as
the localization of the cleavage position of their signal peptide.
Exoproteomes from human cell lines
The concepts and applications of proteomics on human biologi-
cal fl uids have already been presented in a number of reviews
[55 57] , and the authors will focus here on proteomic analysis of
human cell line exoproteomes. Proteins secreted by human cell
lines represent a valuable source of therapeutic targets and candi-
date biomarkers. These polypeptides include numerous hormones,
growth factors, cytokines and enzymes. The authors believe that
there are two major drawbacks encountered during exoproteome
analysis. First, exoproteins are secreted in biological fl uids or cul-
ture media in low amounts, resulting in high dilution. Moreover,
in vitro culture of cell lines implies the presence of nutriments in
the medium, which may sometimes be incompatible with mass
spectrometry. This can make exoproteome purifi cation and analy-
sis quite challenging. The second drawback is the elevated rate
of cell death observed in vitro . Even if cell death is minimized by
maintaining optimal cell culture conditions, this is an issue that
cannot be totally avoided, resulting in the spill of intracellular
proteins into the extracellular milieu. These dif culties contribute
to a heterogeneity of published data, which could probably be
addressed, or at least minimized, by an optimization of sample
preparation. It is worth noting that the rapid growth of cell lines
may affect protein secretion when compared with cells in tissues
where growth is much slower. Shotgun proteomic approaches have
shown an enormous potential for defi ning the exoproteomes from
human cell lines but separate con rmatory testing is required to
validate observations made by mass spectrometry.
Conditioning media
Fetal serum is widely used as a nutrient source for human cell
cultures, usually at 10% in a standard culture medium, result-
ing in a total protein charge of 6 mg/ml. Such a high protein
concentration masks proteins secreted by the cells, which are
usually found in the tens of nanograms per mililitre range. The
use of culture media without serum, that is, basal medium, has
been proposed to overcome this problem [5860] . However, such
Armengaud, Christie-Oleza, Clair, Malard & Duport
565
www.expert-reviews.com
Review
nonoptimal culture conditions may disturb cells, even with short
cell-incubation times. Consequently, secretion mechanisms, and
hence catalogs of secreted proteins, may be modi ed. Moreover,
a change of culture medium may induce a strong metabolic stress
due to the sudden deprivation of serum, which usually translates
to a drastic reduction in cell proliferation rate [61, 62] , activation
of apoptotic pathways [63] and the induction of signal transduc-
tion and expression of genes involved in stress-response mecha-
nisms [64] . Even a previous washing step performed to remove
contaminant proteins from the culture medium may also impact
the cells [65] . Most studies to date have shown a relatively high
number of intracellular proteins (around 50%) identifi ed in exo-
proteome samples [48, 66–68] . A large, comparative study was con-
ducted recently, using 17 different culture media compositions for
the human bronchial epithelial BEAS-2B cell line. Cell viability,
proliferation rate, effect on the cell cycle and initial protein charge
were compared systematically. Of interest, while cell viability
was preserved in most media, proliferation rates were in some
cases drastically reduced. DMEM, a high-glucose-containing,
protein-free medium, was shown to preserve a good viability
and proliferation of cells and, thus, was recommended for such
exoproteomic studies [48] .
Analytical methods for conditioning & analysis of samples
The analysis of secretomes using current proteomic procedures
requires large volumes (from 8 to 100 ml) of conditioned medium
[65, 67] . Extraction of all the proteins from conditioned media is
required, as well as their concentration before mass spectrometry
analysis. Methods include precipitation, ultra ltration and lyo-
philization. Improved techniques based on carrier-assisted TCA
precipitation [59, 69, 70] and fi ltration on low cutoff membranes with
centrifugation devices [48, 65, 71, 72] have been documented for exo-
proteome analysis. Applying separation methods or puri cation
prior to nano-LC-MS/MS has allowed an increase in the number
of identi ed proteins in secretome analyses. Resolution of pro-
tein mixtures into less complex fractions can be performed easily
Table 1. Main predictors and databases for peptide signal and protein subcellular localization.
Predictor name Organism target Short description Ref.
BPBAac Gram- Specifi c type III secretion signals (SVM) [14 3]
CELLO Gram- N-terminal peptide analysis (SVMs)
[14 4]
EffectPred Gram- Meta-analytic approach to predict type III secreted proteins
[54]
ESLpred Eukaryotes Based on dipeptide composition and PSI-BLAST analysis
[145]
ExProt Bacteria Artifi cial neural network
[29]
HMMTOP All Transmembrane topology prediction
[146]
LipoP Bacteria Lipoprotein, other signal peptides, membrane helices (HMM)
[147]
LocateP Gram+ Database (seven protein subcellular location for 427 genomes)
[148]
MetaLocGramN Gram- Metaserver (14 predictors at once), fi ve subcellular localization
[53]
PA-SUB All Five machine-learning classifi ers specialized for specifi c taxons
[149]
Pheobius All Transmembrane protein topology and signal peptide (HMM)
[150]
Pred-lipo Gram+ Lipoprotein signal peptides (HMM)
[151]
PRED-SIGNAL Archaea Based on 69 proteins with signal peptide (HMM)
[152]
PRED-TAT Bacteria Sec and Tat signal peptide predictor (HMM)
[153]
PSLT Human InterPro motifs and speci c membrane domains
[15 4]
PSORTb Prokaryotes Homology, motif identifi cation and machine learning methods
[155, 156]
SecretomeP Bacteria Nonclassically secreted proteins (neural networks)
[10]
SecretP All Combination of predictors
[157]
Sherloc Eukaryotes Sequence and text-based features (SVM)
[158]
SIEVE Gram- Specifi c type III and IV secretion signals detection (SVM)
[159]
Signal-3L All Amino acid composition
[160]
SignalP 4.0 Bacteria Neural network-based method with composite scoring
[161]
SOSUI All Transmembrane helices, secondary and tertiary structure
[162]
SubLoc All Amino acid composition (SVM)
[163]
PrediSi All Position weight matrix
[16 4]
TMHMM All Transmembrane helices (HMM)
[165]
HMM: Hidden Markov model; Gram-: Gram-negative bacteria; Gram+: Gram-positive bacteria; SVM: Support vector machine; TAT: Twin-arginine translocation.
Exoproteomics: exploring the world around biological systems
Expert Rev. Proteomics 9(5), (2012)
566
Review
by SDSPAGE [48, 65, 73 , 74] , although liquid separation methods,
such as strong cation exchange chromatography
[67] or OFFGEL
electro phoresis [75] , have also proven to be successful. Other,
more-speci c methods have also been developed, as exemplifi ed
by af nity-based lectin chromatography for enriching glycopro-
teins from three human breast cell lines [74] . Such an approach
is highly relevant in order to enrich the secreted glycoprotein
fraction of the exoproteome sample.
Applications
The identifi cation of specifi c cancer biomarkers is the objective
of many exoproteome studies, with more than 28 reports on
cancer cell lines available to date [60] . These studies cover a wide
range of cancers, such as prostate, lung, breast, colorectal and
pancreatic [72, 76, 77] . Secreted proteins are indeed excellent sero-
logical tumor marker candidates as they can enter the circulatory
system and, thus, be detected during routine blood analysis.
In some cases, these candidate biomarkers may be measured
directly in patient samples, either from serum or tissues [78] .
Proteins secreted by cancer cells are a major component of the
tumor microenvironment. However, little is known about the
impact of single oncogenic lesions on the regulation of secreted
proteins at the early stages of tumor development. As c-Myc
overproduction is one of the most frequent alterations in cancer,
the effect of sustained c-Myc production in a nontransformed
human epithelial cell line was studied [79] . A substantial down-
regulation of 50 proteins was observed in the exoproteome.
These ndings helped to uncover an autocrine/paracrine com-
ponent in the ability of c-Myc to stimulate proliferation, sustain
tumor maintenance and modulate cell migration. Resistance to
cancer treatment has also been studied by means of exoproteome
analysis. A total of 57 proteins were found to be secreted dif-
ferentially when comparing the exoproteomes of MCF-7 and
doxorubicin (an anticancer drug)-resistant MCF-7 cell lines.
IL-18 was further validated to contribute to doxorubicin resist-
ance and may, thus, represent a useful drug target for breast
cancer therapy [73] .
Other applications of human cell line exoproteomes relate
to mechanistic information, the treatment of other patholo-
gies and to drug development. Neuronal differentiation and
skeletal myogenesis have been addressed through the exopro-
teomic lens. The identifi cation of proteins released from neu-
rons, astrocytes and neural precursor cells, using 2D-PAGE
and nano-LC-MS/MS, revealed that the secreted, extracellu-
lar pool includes proteins involved in cell–cell interactions,
energy metabolism, redox regulations and several molecular
chaperones, each one specifi c to a given cell population [80] . A
panel of proteins secreted during cell differentiation has been
established to improve knowledge of regulatory pathways
[59] . New potential mediators of the stem cell microenviron-
ment, and cardiac and neuronal differentiation processes, have
been identi ed [81] . Among these, some proteins (e.g., protein
14-3-3), which were found to be secreted in atherosclerotic
lesions but not in normal vasculature, represent potential thera-
peutic targets. The exoproteome of adipose tissue may be useful
for the development of treatments for type 2 diabetes, as adi-
pose tissue has proven to be a central driver of the progression
of this condition
[82] .
The eld of toxicology is of particular interest because many
new compounds, chemicals or nanoparticles have to be assessed
in terms of safety. The cellular response to stress when cells are
put in contact with such compounds may be evaluated through
changes in their exoproteome. For example, the effect of cobalt
on human lung cells has recently been described. Upon cobalt
exposure, protein secretion was generally reduced, the ER
amino peptidase 1 being the only signifi cantly induced protein.
This implies a possible link with angiogenesis, the process by
which novel blood vessels are formed [48] . Nephrotoxicity is an
adverse event which strongly limits the use of the immuno-
suppressant, cyclosporine, in solid organ transplantation. An
exoproteome study highlighted the upregulation of the secreted
cyclophilins A and B, a macrophage inhibition factor and the
phosphatidyl-ethanolamine-binding protein 1, following expo-
sure of HEK-293 human cells to cyclosporine [39] . The pro-
teins identi ed in such studies represent potential biomarkers
of toxicity.
Revisiting microbial virulence determinants &
pathogenesis
Microbial pathogens are involved in a wide range of severe, and
sometimes fatal, human diseases, including nosocomial infec-
tions, food-borne infections and toxic shock syndrome. The
pathogenesis of the majority of bacterial diseases is a multifacto-
rial process. Five common, crucial steps may be listed: adhesion
to the host cell, invasion, damage to host tissues, resistance to
environmental stresses during infection and subversion of the host
immune response [83] . Completion of each stage is dependent on
orchestrated activities of specifi c exoproteins. These are, in turn,
tightly controlled by a speci c set of regulators which, for most
pathogens, are still being studied. Exoproteins which play pivotal
role in the adaptability of the pathogen to the specifi cities of the
host’s intracellular environment and promote effi cient infection
are recognized as virulence factors [84] .
Pathogenesis through the lens of exoproteomes
During the last decade, proteomic approaches based on 2D-PAGE
have been widely used to explore the various components of the
pathogenicity of different bacteria [85, 86] and their capacities to
adapt to environmental conditions such as those encountered in
the human host [87] . Among numerous examples, the authors
may cite the 2D-PAGE analysis of the Pseudomonas aeruginosa
exoproteome which led to the identi cation of proteins involved
in adaptation to suboptimal temperature facilitating nosocomial
infection [88] , and the analysis of Corynebacterium diphteriae and
Mycobacterium tuberculosis exoproteomes which allowed the con-
struction of reference maps for these two human pathogens and
the identi cation of novel proteins involved in host cell adhesion
and invasion [89–92] . In order to take into account the biodiver-
sity, and to obtain a more objective overview of the exoproteome,
Dumas et al. analyzed the exoproteome of 12 representative
Armengaud, Christie-Oleza, Clair, Malard & Duport
567
www.expert-reviews.com
Review
Listeria monocytogenes strains of different serotype origins and
virulence levels
[93] . As a result of these studies, the authors pro-
posed the fi rst defi nition of the core and variant exoproteome of
L. monocytogenes species. Similarly, the exoproteomes of 25 clinical
Staphylococcus aureus isolates were analyzed, highlighting the
importance of proteomic studies to complement genomic infor-
mation [94] . Proteomic approaches were more recently focused on
a deeper understanding of the molecular mechanisms responsible
for pathogenicity. Quantifying the rates of protein production and
turnover and evaluating their changes among strains differing in
terms of virulence and/or under different cellular conditions are
important [51, 85] . Regulatory processes occurring after transcrip-
tion may be assessed by integrating proteomic, transcriptomic
and genomic data [95] .
New perspectives in pathogenesis & toxinogenesis
Like other human pathogens, Bacillus cereus , a Gram-positive,
spore-forming bacterium, has been the focus of several prot-
eomic studies. An initial report noted the presence of col-
lagenase, phospholipases, proteases and enterotoxins in the
exo proteome of B. cereus at the onset of the stationary phase,
following a 2D-PAGE separation [96] . A comparative study of
the exoproteomes of three closely related species, namely, B.
cereus , Bacillus thuringiensis and Bacillus anthracis , implied the
presence of the same cytosolic proteins in the exoproteome, but
different secreted proteins [97] . This result highlighted the inter-
est of the exoproteome fraction in the speci city of infection.
A total of 22 proteins were identifi ed in the culture superna-
tant of cells grown in the nutritional rich lysogeny broth (LB)
medium under aerobiosis [98] . The majority of these (14 of 22)
belonged to the well-characterized PlcR virulence regulon, and
the kinetics of their concentrations peaked at the early station-
ary growth phase, concomitantly with the culture supernatant
cytotoxicity. The authors concluded that the pathogenic poten-
tial of B. cereus is maximal during the transition state, that is,
at high cell density. A comprehensive analysis of the B. cereus
exoproteome was performed recently, taking advantage of the
modern shotgun proteomic approaches [99] . B. cereus ATCC
14579 was grown in pH-regulated batch cultures on chemi-
cally de ned medium under oxido-reduction potential (ORP)
conditions considered as mimicking those encountered in the
human intestine, such as low- and high ORP anaerobiosis. Since
B. cereus does not secrete a large amount of protein at early-
exponential growth phase, culture fi ltrates were concentrated
using the deoxy cholate/trichloro acetic acid method to obtain
suf cient amounts of exoproteins. From these samples, 133
different proteins were con dently listed, resulting in a much
higher exoproteome coverage than previous studies. A total of
57 putative virulence factors (14 toxins, eight motility-related
proteins and 35 adhesins and degradative enzymes) were identi-
ed with high confi dence (false-positive evaluation of less than
1%), 31 of them being detected for the fi rst time. Among these
newly detected virulence factors, the putative fourth compo-
nent of hemolysin BL (Hbl), enterotoxin FM, hemolysin II and
three new putative conserved enterotoxins were uncovered. Two
additive factors may explain why these proteins were not previ-
ously detected: the slightly different growth conditions and the
high-resolution mass spectrometer used and the shotgun strategy
that was developed, which most probably both favored the detec-
tion of poorly accumulating proteins. In addition, the exopro-
teomes from the three chemically defi ned conditions (namely,
low- and high-oxidoreduction-potential anoxic conditions, and
fully-oxic conditions) were compared based on their spectral-
count quanti cation
[99] . The contributions of the detected tox-
ins to the cytotoxicity of B. cereus in response to different redox
environments were discussed and derived in further experiments
focusing on the regulatory network of the cellular proteome [100] .
More importantly, shotgun proteomic strategy used in this
study brings evidence that B. cereus , like other pathogens, can
deploy an arsenal of virulence factors at low density popula-
tion to promote infection. Comprehensive analysis based on a
combination of complementary enrichment strategies such as
recently reported for Bacillus subtilis [101] could become stand-
ard in the near future. Like in B. cereus , shotgun proteomics
has enabled the detection of novel secreted virulence factors
from Salmonella enterica enterica , serovar Typhimurium [102] , a
novel secreted antigen of Streptococcus pneumoniae [103] and the
identifi cation of new virulence factors from the catalogue of exo-
proteins generated from Corynebacterium pseudotuberculosis [10 4] .
More recently, exoproteome analysis of Campylobacter ureolyticus
revealed three putative virulence and colonization factors, which
may contribute to its pathogenic potential and its involvement in
the Crohns disease [105] . Finally, all these studies revealed that
the exoproteome of a pathogen is more complex than previously
reported from 2D-PAGE approaches [10 6 , 107] . In this regard, the
in-depth study of the exoproteome of a pathogen using shotgun
experiments represents an open door for further exploration
[108 114] : the identi cation of novel targets for rational drug and/
or a source of antigens for vaccine design (as an example, a recent
comparative exoproteome analysis of M. tuberculosis identifi ed
Lipoprotein Diacylglyceryl Transferase as a valuable target for
generation of antituberculosis drugs) [115] ; the identifi cation of
mechanisms that contribute to the various virulence pheno-
types of pathogens; the comprehensive study of secretion mecha-
nisms used to export proteins (as a pioneer work, the authors
cite the recent study on the enteropathogenic E. coli [38] where
its type III secretome was characterized, the repertoire of type
III secreted effectors for the attaching and effacing pathogens
was expanded, and new insights into the mode of function for
LifA/Efa1/ToxB/Z4332, an important family of virulence fac-
tors, were provided); the existence of a selection procedure in the
excretion of cytoplasmic proteins as suggested [12] ; the dynamic
of exoproteome within the different environments encountered
by the pathogen throughout its life cycle; the function of a
large number of proteins annotated as uncharacterized; and the
unique opportunity to address the biology of genetically intrac-
table pathogens, such as the obligate intracellular bacterium
Chlamydia trachomatis [116] . In conclusion, shotgun proteomics
greatly enhances the toolbox to understand the pathogenicity
of micro-organisms.
Exoproteomics: exploring the world around biological systems
Expert Rev. Proteomics 9(5), (2012)
568
Review
Exploring the exoproteomes of environmental
organisms
Novel functions to be characterized & biotechnological
applications
While proteomics has been restricted for years to just a few
microbial models because of the need for a sequenced genome
to interpret MS/MS spectra, next-generation sequencing tools
have opened up a new era for microbial proteomics [35] . The huge
diversity of environmental organisms can now be investigated
with modern proteomic tools, as a result of the huge increase in
sequenced organisms. A wealth of discoveries is to hand, as exem-
pli ed by the most recent reports on microbial exo proteomes
[117–121] . Exoproteomics has highlighted the importance, at
least in terms of quantitation, of specifi c protein families. The
Ruegeria pomeroyi DSS-3 marine bacterium belonging to the
Roseobacter clade was shown to accumulate extremely high quan-
tities of a protein in its extracellular environment. This protein,
namely, PaxA, accounted for 61% of the total MS/MS spectra
recorded for the sample in a global shotgun analysis [122] , despite
the fact that it would certainly not have been noticed in the list
of predicted secreted proteins if only the genome was inspected.
PaxA showed strong sequence identities with RTX-like toxins,
which are virulence factors found in several human and ani-
mal pathogens [123] . In fact, the perception of an organism can
change when the theoretical or experimental exo proteomes are
taken into consideration, as illustrated in FIGURE 2 . The Roseobacter
clade is known for its high capacity to scavenge nutrients from
its environment, as indicated by its large number of catabolic
pathways and nutrient transporters [124] . As shown in FIGURE 2 ,
76.5% of the proteins encoded onto the genomes of 12 repre-
sentative members of the Roseobacter clade and predicted to be
secreted are involved in transport. In terms of quantity, the mass
spectrometry detected proteins corresponds only to 51.7% of the
exoproteome. Shotgun proteomics revealed that, in terms of the
secreted proteins found in its milieu, other functions such as
toxicity, motility or adhesion are more important than initially
expected by the theoretical analysis of the genomes (FIGURE 2) .
The secretion of RTX-like proteins seems to be a recurrent fac-
tor for Roseobacter members, as shown by the analysis of the
exoproteomes of 12 strains [117] . Toxin-like proteins represented
from 6 to 45% of the total exoproteome detected. While these
proteins are commonly found in the exoproteomes of marine bac-
teria, their functions and targets remain unknown and warrant
further investigation.
The functions of many secreted proteins are unknown.
Estimates based on comparative genomics have indicated one
third of the genes in most genomes to be annotated as either
hypothetical protein’ or ‘conserved protein with unknown
function’. The authors have noted that usually over 40% of pro-
teins fall into this category when considering only the predicted
exoproteome [117, 125 –127] . Indeed, environmental organisms rep-
resent a huge reservoir of enzymatic elements which could be
useful bricks for the fi eld of green chemistry, aiming to produce
novel biotechnological products of interest or to improve chemi-
cal means-of-production. Exoenzymes, targeted bactericides,
antifouling compounds, organism attractants/repellants or
proteins with potential uses in bioremediation, industry or in
the production of alternative energies, are illustrations of the
many possible applications. Many exoenzymes are already being
exploited in industry, for example proteases and lipases used as
additives in detergents or for food processing. In recent years, the
exoproteomes of fungi, the main producers of lignocellulolytic
enzymes, have been documented
[108 , 109, 118, 120] . The hydrolytic
pathway of the complex polymer, lignocellulose, producing more-
easily fermentable sugars, could be useful to obtain alternative
fuels from plant biomasses. The repertoire of useful biotechno-
logical proteins should increase in the near future with a more in-
depth characterization of exoproteomes coupled with systematic
functional screening.
Microbial ecology
Life has defi ned the chemistry of the Earth that we know today.
Because of its origin, life is based, fundamentally, on exchanges
and interactions between the enormous diversity of existing
prokaryotic and eukaryotic organisms and their environments.
This continuous exchange of compounds with the environment
is vital for their persistence and evolution. It is not restricted
to the uptake of nutrients and secretion of cell waste, as many
compounds that cells secrete are directed to modifying or in u-
encing the environment, for example antibiotics, toxins, viru-
lence factors, catabolic enzymes, pheromones, alarmones and
quorum-sensing molecules. Most cellular processes are carried
out by proteins. Thus, clues about the ecological strategies of an
organism and, as a result, its functions within its environment,
may be inferred by analyzing its exoproteome and confi rmed
by further experimentation. As an example, the symbiotic ele-
ments found in the exoproteome of the ecto mycorrhizal fun-
gus, Laccaria bicolor , detected by different shotgun proteomic
approaches, show the ecological association between the fun-
gus and the plant [121] . Four adaptive strategies may be defi ned
within the microbial world: competition for existing resources
by the use of effective and diverse nutrient uptake systems and
exoenzymes; sensing and moving towards or away from stim-
uli; elimination of competitors or avoidance of predation; and
direct interaction with other community members, in a parasitic
or symbiotic manner. These different lifestyles were shown to
exist within different members of the same group of marine
bacteria, the Roseobacter clade, by means of exoproteomic data
[117] . Nevertheless, while some strains showed a clear preference
for one of these four strategies, others presented a combination,
as attested by the abundance of their secreted proteins. A single
organism may adopt different trophic strategies depending on
the context or its physiological state. Bacillus licheniformis , a
bacterium commonly found in the soil, secreted a higher amount
of proteins when grown in rich media, although speci c pro-
teins to counteract starvation were found in limited media, as
shown by a 2D-PAGE approach [128] . Such exoproteomic result
is expected as starved cells induce a stringent global response
characterized by a decrease in protein synthesis upon shifting
from growth and proliferation to survival only. In another study,
Armengaud, Christie-Oleza, Clair, Malard & Duport
569
www.expert-reviews.com
Review
the actinobacteria Frankia spp., an actino-
rhizal plant symbiont, showed completely
different exoproteome profi les when it
was grown in culture or in symbiosis, as
determined by nano-LC-MS/MS, with 38
secreted proteins identifi ed in the former
condition while up to 73 proteins could
be detected in fi eld-collected root nodules
of Alnus incana and Elaeagnus angustifolia
[129] . The exoproteome of another marine
organism, Pseudoalteromonas tunicata , was
also screened [130] . This study revealed
how the secreted profi le of the same organ-
ism can vary due to a mutation in a type-II
secretion pathway, for example, nutrient
transport mechanisms or non-protein pig-
mented toxin compounds. Multicellular
organisms also have an enormous range of
proteins that can, potentially, be secreted
into their environment. In this sense,
the rice plant showed a large variety of
proteins in its rhizosphere exoproteome,
with functions such as defense or protec-
tion against environmental stress [131] . The
mucous subproteome from the freshwa-
ter planarian Schmidtea mediterranea , a
atworm, was recently documented with
119 detected proteins homologous to
proteins found in human secretions [132] .
This demonstrates how exoproteomics on
such organisms could be informative for
understanding parasitic worm infections.
In fact, all living organisms interact with
their biotic and abiotic surroundings by
means of secreted compounds, such as
proteins. Due to the high energy and
nutrient cost, evolutionary pressure will select for only those
secretions which will bring a fi nal bene t to the organism, and
this will slowly develop its ecological strategy. Furthermore,
costly functions can be shared within a community reducing
the genomic content even in free-living organisms [133] .
Expert commentary
Proteomic tools are now able to generate comprehensive list of
spare-parts for numerous organisms’ exoproteomes. Nevertheless,
most studies still rely on the prediction of the subcellular locali-
zation of proteins and a wealth of predictors has been proposed.
N-terminomics is a speci c proteogenomic approach aimed at
confi dently establishing the N-terminus of proteins, among them
the mature secreted proteins and, thus, N-terminomics helps to
characterize their peptide signal and maturation specifi cities [31] .
These experimental data should be helpful for the further devel-
opment of prediction software [31] . In addition to the rather lim-
ited con dence of the current prediction algorithms, annotation
errors in genomes should also be considered. It has been noted
in recently conducted, large N-terminome studies that for some
genomes more than 15% of the genes are wrongly annotated in
terms of translation initiation sites [134 , 135] . Interestingly, large
efforts to curate the annotations of Shewanella strains manually
have shown that incorrect gene prediction is a major issue for
lipoproteins belonging to the Tat export system [52] . Combining
predictive tools and proteomic data has proven to be effective for
the Shewanella genus [52] , E. coli [136] , M. tuberculosis [137] and
natural microbial communities [138] . Further improvement of
the reliability of predictive programs is expected in the coming
years, taking advantage of the high-throughput proteomic data
currently being collated from many different organisms. The
HUPO initiative on the Human Proteome Project is worth citing
because specifi c emphasis on secreted proteins could be fruitful
[139] . Exoproteomics is expected to expand in many disciplines of
biology, with the objective of further characterizing the fraction
of this speci c pool of proteins and the molecular mechanisms
associated. Any organism can now be rapidly sequenced and
investigated with modern proteomic tools. Secreted proteins
Theoretical exoproteome
Transport (76.5%)
Toxicity (9.2%)
Motility (11.6%)
Adhesion (2.7%)
Experimental exoproteome
Transport (51.7%)
Toxicity (19.8%)
Motility (19.9%)
Adhesion (8.6%)
Time
Intensity
m/z
Figure 2. Theoretical versus experimental exoproteomes from 12 marine
Roseobacter strains. The ratio of predicted secreted proteins annotated with functions
related to transport, toxicity, motility or adhesion are indicated in terms of numbers of
occurrence in the 12 genomes (theoretical secreted proteins; 14,845 theoretical proteins
predicted to be secreted among 51,686 proteins encoded by the genomes) and in terms
of cumulated quantities when assessed by mass spectrometry (experimental
exoproteome: 874 polypeptides detected). The experimental abundance of exoproteins
found in the extracellular milieu was obtained by shotgun proteomics of each of the
Roseobacter exoproteomes grown in a rich marine broth and quantifi ed by the
normalized spectral abundance factor (data from
[117] ). The 12 Roseobacter clade strains
are: Ruegeria pomeroyi DSS-3, Oceanicola batsensis HTCC2597 , Pelagibaca bermudensis
HTCC2601 , Roseobacter denitrifi cans OCh114 , Oceanicola granulosus HTCC2516 ,
Oceanibulbus indolifex HEL45 , Ruegeria lacuscaerulensis ITI1157 , Roseobacter litoralis
OCh149 , Roseobacter spp. MED193 , Roseovarius nubinhibens ISM , Dinoroseobacter
shibae DFL12, and Sagittula stellata E-37.
Exoproteomics: exploring the world around biological systems
Expert Rev. Proteomics 9(5), (2012)
570
Review
can be identifi ed in a larger dynamic range and their quanti-
ties compared in different experimental conditions. In the near
future, such strategies will highlight with more refi nement the
processes of life outside the cell and, of particular interest, the
interactions between cells.
Five-year view
The current generation of mass spectrometers for the analysis of
proteins and peptides present considerable potential. Today, thou-
sands of proteins may be identifi ed and quantifi ed from a single
complex sample. This is exempli ed by the rather simple fractiona-
tion process needed for a total mammalian proteome prior to analy-
sis with a high-fi eld Orbitrap mass spectrometer. This has resulted
in the identi cation of 11,731 proteins from a single organism [140] .
The next generation of mass spectrometers, which could be mar-
keted within the next ve years, will probably be complex hybrid
confi gurations of different analyzers, as exemplifi ed by the current
trend of confi guration including Q-TOF, Q-trap and Q-Orbitrap
coupling, as well as new, high- eld Orbitrap and ion-mobility ana-
lyzers. Technical developments will certainly allow improvements
by orders of magnitude in dynamic range, accuracy and speed.
Proteomics will become very comprehensive in the near future. In
the eld of human health, exoproteome studies have demonstrated
their usefulness in defi ning candidate cancer biomarkers. It appears
to be unlikely that a single biomarker could diagnose a multistage
cancer or monitor the ef cacy of a given treatment, but rather a
pool of biomarkers could be decisive. Validating these candidates
for translational medicine using more-accurate quantitative pro-
teomic approaches is another eld of development for the near
future. During the next fi ve years, we may extrapolate new, impor-
tant developments in the systematic analysis of exoproteomes for
a large number of cellular models, as well as for studying complex
exoproteomes found in diverse ecosystems (human microbiome,
environmental samples) that could be developed into the ‘meta-exo-
proteomic concept. In this latter case, the possible universe of exo-
proteins is so huge that novel methodologies should be developed,
as well as advanced bioinformatics and statistics data treatment, in
order to avoid high levels of false-positive identifi cations. The devel-
opment of speci c public data repositories for exoproteomic data,
as exemplifi ed by pioneering databases for fungi
[141, 142] , should
be fostered. Ideally, such information should be incorporated into
proteogenomic programs dedicated to better annotating genomes.
Moreover, integration of multi-omic data is an important issue as
a number of recent studies have shown poor correlation between
transcriptomic and proteomic analyses on one hand, and difference
in terms of complexity of several orders of magnitude between
proteome and genome data due to posttranslational modi cations
on the other [51, 95] . Taking advantage of improved predictive tools,
correlation between theoretical prediction, literature data mining,
experimental data and well de ned experimental conditions should
lead to new insights into secretion mechanisms, and functional
characterization of most of the exoproteome components that have
been, until now, poorly annotated.
Financial & competing interests disclosure
This study was supported by the Commissariat à l’Energie Atomique et
aux Energies Alternatives, the EDF company and the Agence Nationale de
la Recherche. JA Christie-Oleza gratefully acknowledges the Marie Curie
Actions of the EC FP7. The authors have no other relevant af liations or
nancial involvement with any organization or entity with a fi nancial
interest in or fi nancial confl ict with the subject matter or materials discussed
in the manuscript apart from those disclosed.
Kate Vassaux provided writing assistance, and this was funded by the
Commissariat à lEnergie Atomique et aux Energies Alternatives.
Key issues
The fraction of putative secreted proteins encoded by bacterial genomes could be as much as a quarter of the theoretical proteome.
The functions of most secreted proteins are still unknown, with more than 40% of experimentally detected bacterial exoproteins
hitherto annotated as either ‘hypothetical protein’ or ‘conserved protein with unknown function’.
The ‘secretome’ includes the pool of secreted proteins and also the secretion machinery itself. The ‘proteomic surfaceome’ or
‘surface proteome’ comprises the accessible fraction of exported proteins which are inserted into the membrane via the presence of
hydrophobic transmembrane segments or associated with such proteins. The ‘exoproteome’ corresponds to the proteins found in the
extracellular milieu.
Prediction software has been conceived for predicting the subcellular localization of proteins, the presence of a signal peptide at the
protein N-terminus and its corresponding cleavage site. Further re nement of these tools is expected thanks to mass spectrometrically
validated experimental exoprotein datasets.
Proteins secreted by human cell lines represent a valuable source of therapeutic targets and candidate biomarkers. The development of
exoproteome studies should lead to numerous applications ranging from the development of new cancer biomarkers to toxicological
evaluations.
Methods to characterize the modi cations that are frequent in secreted proteins are important, but limited and challenging.
Glycosylation, in particular, is important for the activity and stability of many secreted proteins, but is heterogeneous and can be
diffi cult to characterize by mass spectrometry. Many other post-translational modi cations are also relevant and deserve speci c
methodological developments.
Shotgun proteomic strategies, and the latest generation of hybrid mass spectrometers, are producing a comprehensive view of
the arsenal of virulence factors present in microbial pathogens. Exoproteomics should contribute to the elucidation of the novel
functions of the numerous hypothetical proteins exported out of the cells, as well as to an exploration of the relationships between
environmental organisms and their habitats.
Armengaud, Christie-Oleza, Clair, Malard & Duport
571
www.expert-reviews.com
Review
References
Papers of special note have been highlighted as:
• of interest
•• of considerable interest
1 D r i e s s e n A J , N o u w e n N . P r o t e i n
translocation across the bacterial
cytoplasmic membrane. Annu. Rev.
Biochem. 7 7 , 6 4 3 6 6 7 ( 2 0 0 8 ) .
2 F a c e y S J , K u h n A . B i o g e n e s i s o f b a c t e r i a l
inner-membrane proteins. Cell. Mol. Life
Sci. 6 7 ( 1 4 ) , 2 3 4 3 2 3 6 2 ( 2 0 1 0 ) .
3 P a p a n i k o u E , K a r a m a n o u S , E c o n o m o u A .
Bacterial protein secretion through the
translocase nanomachine. Nat. Rev.
Microbiol. 5 ( 1 1 ) , 8 3 9 8 5 1 ( 2 0 0 7 ) .
4 R o b i n s o n C , M a t o s C F , B e c k D et al .
Transport and proofreading of proteins by
the twin-arginine translocation (Tat)
system in bacteria. Biochim. Biophys. Acta
1 8 0 8 ( 3 ) , 8 7 6 8 8 4 ( 2 0 1 1 ) .
5 d u P l e s s i s D J , N o u w e n N , D r i e s s e n A J . T h e
Sec translocase. Biochim. Biophys. Acta
1 8 0 8 ( 3 ) , 8 5 1 8 6 5 ( 2 0 1 1 ) .
6 H a f t D H , P a y n e S H , S e l e n g u t J D .
Archaeosortases and exosortases are widely
distributed systems linking membrane
transit with posttranslational modi cation.
J. Bacteriol. 1 9 4 ( 1 ) , 3 6 4 8 ( 2 0 1 2 ) .
7 K a r a g i a n n i s G S , P a v l o u M P , D i a m a n d i s
E P . C a n c e r s e c r e t o m i c s r e v e a l
pathophysiological pathways in cancer
molecular oncology. Mol. Oncol. 4 ( 6 ) ,
4 9 6 5 1 0 ( 2 0 1 0 ) .
8 N i c k e l W , S e e d o r f M . U n c o n v e n t i o n a l
mechanisms of protein transport to the cell
surface of eukaryotic cells. Annu. Rev. Cell
Dev. Biol. 2 4 , 2 8 7 3 0 8 ( 2 0 0 8 ) .
9 S i m p s o n R J , L i m J W , M o r i t z R L ,
M a t h i v a n a n S . E x o s o m e s : p r o t e o m i c
insights and diagnostic potential. Expert
Rev. Proteomics 6 ( 3 ) , 2 6 7 2 8 3 ( 2 0 0 9 ) .
1 0 B e n d t s e n J D , K i e m e r L , F a u s b ø l l A ,
B r u n a k S . N o n - c l a s s i c a l p r o t e i n s e c r e t i o n
in bacteria. BMC Microbiol. 5 , 5 8 ( 2 0 0 5 ) .
1 1 K u e h n M J , K e s t y N C . B a c t e r i a l o u t e r
membrane vesicles and the host-pathogen
i n t e r a c t i o n . Genes Dev. 1 9 ( 2 2 ) , 2 6 4 5 2 6 5 5
( 2 0 0 5 ) .
1 2 P a s z t o r L , Z i e b a n d t A K , N e g a M et al .
Staphylococcal major autolysin (Atl) is
involved in excretion of cytoplasmic proteins.
J. Biol. Chem. 2 8 5 ( 4 7 ) , 3 6 7 9 4 3 6 8 0 3
( 2 0 1 0 ) .
1 3 H e n d e r s o n B , M a r t i n A . B a c t e r i a l
virulence in the moonlight: multitasking
bacterial moonlighting proteins are
virulence determinants in infectious
d i s e a s e . Infect. Immun. 7 9 ( 9 ) , 3 4 7 6 3 4 9 1
( 2 0 1 1 ) .
1 4 A n t i k a i n e n J , A n t o n L , S i l l a n p ä ä J ,
K o r h o n e n T K . D o m a i n s i n t h e S - l a y e r
protein CbsA of Lactobacillus crispatus
involved in adherence to collagens, laminin
and lipoteichoic acids and in self-assembly.
Mol. Microbiol. 4 6 ( 2 ) , 3 8 1 3 9 4 ( 2 0 0 2 ) .
1 5 S c h n e e w i n d O , M i s s i a k a s D M . P r o t e i n
secretion and surface display in Gram-
positive bacteria. Philos. Trans. R. Soc.
Lond., B, Biol. Sci. 3 6 7 ( 1 5 9 2 ) , 1 1 2 3 1 1 3 9
( 2 0 1 2 ) .
1 6 D e s v a u x M , H é b r a u d M , T a l o n R ,
H e n d e r s o n I R . S e c r e t i o n a n d s u b c e l l u l a r
localizations of bacterial proteins: a
semantic awareness issue. Trends Microbiol.
1 7 ( 4 ) , 1 3 9 1 4 5 ( 2 0 0 9 ) .
1 7 B ü t t n e r D . P r o t e i n e x p o r t a c c o r d i n g t o
schedule: architecture, assembly, and
regulation of type III secretion systems
from plant- and animal-pathogenic
bacteria. Microbiol. Mol. Biol. Rev. 7 6 ( 2 ) ,
2 6 2 3 1 0 ( 2 0 1 2 ) .
1 8 B a y e r E M , B o t t r i l l A R , W a l s h a w J et al .
Arabidopsis cell wall proteome de ned
using multidimensional protein
identi cation technology. Proteomics 6 ( 1 ) ,
3 0 1 3 1 1 ( 2 0 0 6 ) .
1 9 C u l l e n P A , X u X , M a t s u n a g a J et al .
S u r f a c e o m e o f L e p t o s p i r a s p p . Infect.
Immun. 7 3 ( 8 ) , 4 8 5 3 4 8 6 3 ( 2 0 0 5 ) .
2 0 D e s v a u x M , D u m a s E , C h a f s e y I , H é b r a u d
M . Protein cell surface display in
Gram-positive bacteria: from single protein
to macromolecular protein structure.
FEMS Microbiol. Lett. 2 5 6 ( 1 ) , 1 1 5
( 2 0 0 6 ) .
2 1 K a r l s e n O A , L a r s e n O , J e n s e n H B . T h e
copper responding surfaceome of
Methylococcus capsulatus Bath. FEMS
Microbiol. Lett. 3 2 3 ( 2 ) , 9 7 1 0 4 ( 2 0 1 1 ) .
2 2 V o i g t B , H i e u C X , H e m p e l K et al . C e l l
surface proteome of the marine
planctomycete Rhodopirellula baltica .
Proteomics 1 2 ( 1 1 ) , 1 7 8 1 1 7 9 1 ( 2 0 1 2 ) .
2 3 G u n d r y R L , R i o r d o n D R , T a r a s o v a Y et al .
A cell surfaceome map for
immunophenotyping and sorting
pluripotent stem cells. Mol. Cell Proteomics
1 1 ( 8 ) , 3 0 3 3 1 6 ( 2 0 1 2 ) .
2 4 Z i e g l e r A , C e r c i e l l o F , B i g o s c h C et al .
Proteomic surfaceome analysis of
mesothelioma. Lung Cancer 7 5 ( 2 ) ,
1 8 9 1 9 6 ( 2 0 1 2 ) .
2 5 J a m e t E , A l b e n n e C , B o u d a r t G , I r s h a d M ,
C a n u t H , P o n t - L e z i c a R . R e c e n t a d v a n c e s
in plant cell wall proteomics. Proteomics
8 ( 4 ) , 8 9 3 9 0 8 ( 2 0 0 8 ) .
2 6 C h a m p i o n M M , W i l l i a m s E A , K e n n e d y
G M , C h a m p i o n P A . D i r e c t d e t e c t i o n o f
bacterial protein secretion using whole
colony proteomics. Mol. Cell. Proteomics
1 1 ( 9 ) , 5 9 6 6 0 4 ( 2 0 1 2 ) .
2 7 C h r i s t i e - O l e z a J A , P i ñ a - V i l l a l o n g a J M ,
G u e r i n P et al . S h o t g u n n a n o L C - M S / M S
proteogenomics to document MALDI-
TOF biomarkers for screening new
members of the Ruegeria g e n u s . Environ.
Microbiol. doi:10.1111/j.1462-2920.
2012.02812.x (2012) (Epub ahead of
print).
2 8 W e l k e r M , M o o r e E R . A p p l i c a t i o n s o f
whole-cell matrix-assisted laser-desorption/
ionization time-of-fl ight mass spectrometry
in systematic microbiology. Syst. Appl.
Microbiol. 3 4 ( 1 ) , 2 1 1 ( 2 0 1 1 ) .
2 9 S a l e h M T , F i l l o n M , B r e n n a n P J , B e l i s l e
J T . I d e n t i cation of putative exported/
secreted proteins in prokaryotic proteomes.
Gene 2 6 9 ( 1 2 ) , 1 9 5 2 0 4 ( 2 0 0 1 ) .
3 0 S o n g C , K u m a r A , S a l e h M . B i o i n f o r m a t i c
comparison of bacterial secretomes .
Genomics Proteomics Bioinformatics 7 ( 1 2 ) ,
3 7 4 6 ( 2 0 0 9 ) .
3 1 A r m e n g a u d J . A p e r f e c t g e n o m e a n n o t a t i o n
is within reach with the proteomics and
genomics alliance. Curr. Opin. Microbiol.
1 2 ( 3 ) , 2 9 2 3 0 0 ( 2 0 0 9 ) .
3 2 T h e l e n J J , M i e r n y k J A . T h e p r o t e o m i c
future: where mass spectrometry should be
taking us. Biochem. J. 4 4 4 ( 2 ) , 1 6 9 1 8 1
( 2 0 1 2 ) .
3 3 A r m e n g a u d J . P r o t e o g e n o m i c s a n d s y s t e m s
biology: quest for the ultimate missing
p a r t s . Expert Rev. Proteomics 7 ( 1 ) , 6 5 7 7
( 2 0 1 0 ) .
3 4 R a b i l l o u d T . T h e w h e r e a b o u t s o f 2 D g e l s
in quantitative proteomics. Methods Mol.
Biol. 8 9 3 , 2 5 3 5 ( 2 0 1 2 ) .
3 5 A r m e n g a u d J . M i c r o b i a l p r o t e o m i c s :
getting the best of both worlds! Env.
Microbiol. ( 2 0 1 2 ) ( I n P r e s s ) .
3 6 O t t o A , B e r n h a r d t J , H e c k e r M , B e c h e r D .
Global relative and absolute quantitation in
microbial proteomics. Curr. Opin.
Microbiol. 1 5 ( 3 ) , 3 6 4 3 7 2 ( 2 0 1 2 ) .
3 7 M u e l l e r L N , B r u s n i a k M Y , M a n i D R ,
A e b e r s o l d R . A n a s s e s s m e n t o f s o f t w a r e
solutions for the analysis of mass
spectrometry based quantitative proteomics
data. J. Proteome Res. 7 ( 1 ) , 5 1 6 1 ( 2 0 0 8 ) .
3 8 D e n g W , Y u H B , d e H o o g C L et al .
Quantitative proteomic analysis of type III
secretome of enteropathogenic Escherichia
coli reveals an expanded effector repertoire
for attaching/effacing bacterial pathogens.
Mol. Cell. Proteomics 11 ( 9 ), 692 – 709
( 2 0 1 2 ) .
Exoproteomics: exploring the world around biological systems
Expert Rev. Proteomics 9(5), (2012)
572
Review
3 9 L a m o u r e u x F , G a s t i n e l L N , M e s t r e E ,
M a r q u e t P , E s s i g M . M a p p i n g
cyclosporine-induced changes in protein
secretion by renal cells using stable isotope
labeling with amino acids in cell culture
( S I L A C ) . J. Proteomics 7 5 ( 1 2 ) , 3 6 7 4 3 6 8 7
( 2 0 1 2 ) .
4 0 C h a n C Y , M a s u i O , K r a k o v s k a O et al .
I d e n t i cation of differentially regulated
secretome components during skeletal
myogenesis. Mol. Cell Proteomics 1 0 ( 5 ) ,
M 1 1 0 . 0 0 4 8 0 4 ( 2 0 1 1 ) .
4 1 C h a n C Y , M c D e r m o t t J C , S i u K W .
Secretome analysis of skeletal myogenesis
using SILAC and shotgun proteomics. Int.
J. Proteomics 2 0 1 1 , 3 2 9 4 6 7 ( 2 0 1 1 ) .
4 2 G a l l i e n S , D u r i e z E , D o m o n B . S e l e c t e d
reaction monitoring applied to
proteomics. J. Mass Spectrom. 4 6 ( 3 ) ,
2 9 8 3 1 2 ( 2 0 1 1 ) .
4 3 M a i o l i c a A , J ü n g e r M A , E z k u r d i a I ,
A e b e r s o l d R . T a r g e t e d p r o t e o m e
investigation via selected reaction
monitoring mass spectrometry. J. Proteomics
7 5 ( 1 2 ) , 3 4 9 5 3 5 1 3 ( 2 0 1 2 ) .
4 4 G i l l e t L C , N a v a r r o P , T a t e S et al . T a r g e t e d
data extraction of the MS/MS spectra
generated by data-independent acquisition:
a new concept for consistent and accurate
proteome analysis. Mol. Cell Proteomics
1 1 ( 6 ) , O 1 1 1 . 0 1 6 7 1 7 ( 2 0 1 2 ) .
4 5 L i u H , S a d y g o v R G , Y a t e s J R 3 r d . A m o d e l
for random sampling and estimation of
relative protein abundance in shotgun
proteomics. Anal. Chem. 7 6 ( 1 4 ) , 4 1 9 3 4 2 0 1
( 2 0 0 4 ) .
4 6 P a o l e t t i A C , P a r m e l y T J , T o m o m o r i - S a t o C
et al . Quantitative proteomic analysis of
distinct mammalian Mediator complexes
using normalized spectral abundance
f a c t o r s . Proc. Natl Acad. Sci. USA 1 0 3 ( 5 0 ) ,
1 8 9 2 8 1 8 9 3 3 ( 2 0 0 6 ) .
4 7 C h r i s t i e - O l e z a J A , F e r n a n d e z B , N o g a l e s
B , B o s c h R , A r m e n g a u d J . P r o t e o m i c
insights into the lifestyle of an
environmentally relevant marine
b a c t e r i u m . ISME J. 6 ( 1 ) , 1 2 4 1 3 5 ( 2 0 1 2 ) .
4 8 M a l a r d V , C h a r d a n L , R o u s s i S et al .
Analytical constraints for the analysis of
human cell line secretomes by shotgun
proteomics. J. Proteomics 7 5 ( 3 ) , 1 0 4 3 1 0 5 4
( 2 0 1 2 ) .
A large comparative study of conditioning
media for the analysis of exoproteomes
from cultivated human cell lines.
4 9 B r u n V , M a s s e l o n C , G a r i n J , D u p u i s A .
Isotope dilution strategies for absolute
quantitative proteomics. J. Proteomics
7 2 ( 5 ) , 7 4 0 7 4 9 ( 2 0 0 9 ) .
5 0 G r i f n N M , Y u J , L o n g F et al . L a b e l - f r e e ,
normalized quantifi cation of complex mass
spectrometry data for proteomic analysis.
Nat. Biotechnol. 2 8 ( 1 ) , 8 3 8 9 ( 2 0 1 0 ) .
5 1 K o c h a r u n c h i t t C , K i n g T , G o b i u s K ,
B o w m a n J P , R o s s T . I n t e g r a t e d
transcriptomic and proteomic analysis of
the physiological response of Escherichia
coli O157:H7 Sakai to steady-state
conditions of cold and water activity stress.
Mol. Cell Proteomics 1 1 ( 1 ) , M 1 1 1 . 0 0 9 0 1 9
( 2 0 1 2 ) .
5 2 R o m i n e M F . G e n o m e - w i d e p r o t e i n
localization prediction strategies for gram
negative bacteria. BMC Genomics
1 2 ( S u p p l . 1 ) , S 1 ( 2 0 1 1 ) .
•• An insightful presentation on manual
curation of the genome annotations of
several Shewanella strains. Lipoprotein
substrates of the twin-arginine
translocation system are shown to be
badly predicted for this genus. How
proteomic data could help for improving
predictors is discussed.
5 3 M a g n u s M , P a w l o w s k i M , B u j n i c k i J M .
MetaLocGramN: a meta-predictor of
protein subcellular localization for
Gram-negative bacteria. Biochim. Biophys.
Acta 1 8 2 4 ( 1 2 ) , 1 4 2 5 1 4 3 3 ( 2 0 1 2 ) .
A recent metaserver for predicting the
subcellular localization of proteins. This
tool displays the detailed results from
14 predictors launched automatically.
5 4 S a t o Y , T a k a y a A , Y a m a m o t o T .
Meta-analytic approach to the accurate
prediction of secreted virulence effectors in
Gram-negative bacteria. BMC
Bioinformatics 1 2 , 4 4 2 ( 2 0 1 1 ) .
5 5 A l b a l a t A , M i s c h a k H , M u l l e n W . C l i n i c a l
application of urinary proteomics/
p e p t i d o m i c s . Expert Rev. Proteomics 8 ( 5 ) ,
6 1 5 6 2 9 ( 2 0 1 1 ) .
5 6 P e r n e m a l m M , L e w e n s o h n R , L e h t i ö J .
Affi nity prefractionation for MS-based
plasma proteomics. Proteomics 9 ( 6 ) ,
1 4 2 0 1 4 2 7 ( 2 0 0 9 ) .
5 7 T e n g P N , B a t e m a n N W , H o o d B L ,
C o n r a d s T P . A d v a n c e s i n p r o x i m a l uid
proteomics for disease biomarker
d i s c o v e r y . J. Proteome Res. 9 ( 1 2 ) ,
6 0 9 1 6 1 0 0 ( 2 0 1 0 ) .
5 8 C h e n a u J , M i c h e l l a n d S , S e v e M .
[Secretome: de nitions and biomedical
i n t e r e s t ] . Rev. Med. Interne 2 9 ( 7 ) , 6 0 6 6 0 8
( 2 0 0 8 ) .
5 9 F a r i n a A , D A n i e l l o C , S e v e r i n o V et al .
Temporal proteomic profi ling of embryonic
stem cell secretome during cardiac and
n e u r a l d i f f e r e n t i a t i o n . Proteomics 1 1 ( 2 0 ) ,
3 9 7 2 3 9 8 2 ( 2 0 1 1 ) .
6 0 M a k r i d a k i s M , V l a h o u A . S e c r e t o m e
proteomics for discovery of cancer
biomarkers. J. Proteomics 7 3 ( 1 2 ) , 2 2 9 1 2 3 0 5
( 2 0 1 0 ) .
An overview of the main fi ndings from
the analysis of cancer cell exoproteomes.
6 1 C o o p e r S . R e a p p r a i s a l o f s e r u m s t a r v a t i o n ,
the restriction point, G0, and G1 phase
arrest points. FASEB J. 1 7 ( 3 ) , 3 3 3 3 4 0
( 2 0 0 3 ) .
6 2 S h i n J S , H o n g S W , L e e S L et al . S e r u m
starvation induces G1 arrest through
suppression of Skp2-CDK2 and CDK4 in
SK-OV-3 cells. Int. J. Oncol. 3 2 ( 2 ) ,
4 3 5 4 3 9 ( 2 0 0 8 ) .
6 3 H a s a n N M , A d a m s G E , J o i n e r M C . E f f e c t
of serum starvation on expression and
phosphorylation of PKC-alpha and p53 in
V79 cells: implications for cell death. Int. J.
Cancer 8 0 ( 3 ) , 4 0 0 4 0 5 ( 1 9 9 9 ) .
6 4 L e v i n V A , P a n c h a b h a i S C , S h e n L ,
K o r n b l a u S M , Q i u Y , B a g g e r l y K A .
Different changes in protein and
phosphoprotein levels result from serum
starvation of high-grade glioma and
adenocarcinoma cell lines. J. Proteome Res.
9 ( 1 ) , 1 7 9 1 9 1 ( 2 0 1 0 ) .
6 5 L a w l o r K , N a z a r i a n A , L a c o m i s L , T e m p s t
P , V i l l a n u e v a J . P a t h w a y - b a s e d b i o m a r k e r
search by high-throughput proteomics
profi ling of secretomes. J. Proteome Res.
8 ( 3 ) , 1 4 8 9 1 5 0 3 ( 2 0 0 9 ) .
6 6 B u r g h o f f S , S c h r a d e r J . S e c r e t o m e o f
human endothelial cells under shear stress.
J. Proteome Res. 1 0 ( 3 ) , 1 1 6 0 1 1 6 9 ( 2 0 1 1 ) .
6 7 P l a n q u e C , K u l a s i n g a m V , S m i t h C R ,
R e c k a m p K , G o o d g l i c k L , D i a m a n d i s E P .
I d e n t i cation of ve candidate lung cancer
biomarkers by proteomics analysis of
conditioned media of four lung cancer cell
l i n e s . Mol. Cell Proteomics 8 ( 1 2 ) , 2 7 4 6 2 7 5 8
( 2 0 0 9 ) .
•• An interesting study on exoproteomes
from lung cancer cell lines carried out by
two-dimensional liquid chromatography–
tandem mass spectrometry, in which 1830
different proteins were identi ed.
6 8 X i a o T , Y i n g W , L i L et al . A n a p p r o a c h t o
studying lung cancer-related proteins in
human blood. Mol. Cell Proteomics 4 ( 1 0 ) ,
1 4 8 0 1 4 8 6 ( 2 0 0 5 ) .
6 9 C h e v a l l e t M , D i e m e r H , V a n D o r s s e a l e r A ,
V i l l i e r s C , R a b i l l o u d T . T o w a r d a b e t t e r
analysis of secreted proteins: the example of
the myeloid cells secretome. Proteomics
7 ( 1 1 ) , 1 7 5 7 1 7 7 0 ( 2 0 0 7 ) .
Armengaud, Christie-Oleza, Clair, Malard & Duport
573
www.expert-reviews.com
Review
7 0 V i l l i e r s C , C h e v a l l e t M , D i e m e r H et al .
From secretome analysis to immunology:
chitosan induces major alterations in the
activation of dendritic cells via a TLR4-
d e p e n d e n t m e c h a n i s m . Mol. Cell Proteomics
8 ( 6 ) , 1 2 5 2 1 2 6 4 ( 2 0 0 9 ) .
7 1 S r i r a j a s k a n t h a n R , C a p l i n M E , W a u g h
M G et al . I d e n t i cation of Mac-2-binding
protein as a putative marker of
neuroendocrine tumors from the analysis of
cell line secretomes. Mol. Cell Proteomics
9 ( 4 ) , 6 5 6 6 6 6 ( 2 0 1 0 ) .
7 2 W u C C , H s u C W , C h e n C D et al .
Candidate serological biomarkers for cancer
identi ed from the secretomes of 23 cancer
cell lines and the human protein atlas. Mol.
Cell Proteomics 9 ( 6 ) , 1 1 0 0 1 1 1 7 ( 2 0 1 0 ) .
7 3 Y a o L , L a o W , Z h a n g Y et al . I d e n t i cation
of EFEMP2 as a serum biomarker for the
early detection of colorectal cancer with
lectin af nity capture assisted secretome
analysis of cultured fresh tissues. J. Proteome
Res. doi: 10.1021/pr300020p (2012)
(Epub ahead of print) .
7 4 Z h a n g Y , T a n g X , Y a o L et al . L e c t i n
capture strategy for effective analysis of cell
s e c r e t o m e . Proteomics 1 2 ( 1 ) , 3 2 3 6 ( 2 0 1 2 ) .
7 5 C a c c i a D , Z a n e t t i D o m i n g u e s L , M i c c i c h è
F et al . Secretome compartment is a
valuable source of biomarkers for cancer-
relevant pathways. J. Proteome Res. 1 0 ( 9 ) ,
4 1 9 6 4 2 0 7 ( 2 0 1 1 ) .
7 6 G o o Y A , G o o d l e t t D R . A d v a n c e s i n
proteomic prostate cancer biomarker
d i s c o v e r y . J. Proteomics 7 3 ( 1 0 ) , 1 8 3 9 1 8 5 0
( 2 0 1 0 ) .
7 7 P a v l o u M P , D i a m a n d i s E P . T h e c a n c e r c e l l
secretome: a good source for discovering
biomarkers? J. Proteomics 7 3 ( 1 0 ) , 1 8 9 6 1 9 0 6
( 2 0 1 0 ) .
7 8 D o w l i n g P , C l y n e s M . C o n d i t i o n e d m e d i a
from cell lines: a complementary model to
clinical specimens for the discovery of
disease-specifi c biomarkers. Proteomics
1 1 ( 4 ) , 7 9 4 8 0 4 ( 2 0 1 1 ) .
7 9 P o c s f a l v i G , V o t t a G , D e V i n c e n z o A et al .
Analysis of secretome changes uncovers an
autocrine/paracrine component in the
modulation of cell proliferation and
motility by c-Myc. J. Proteome Res. 1 0 ( 1 2 ) ,
5 3 2 6 5 3 3 7 ( 2 0 1 1 ) .
8 0 C o l u c c i - D A m a t o L , F a r i n a A , V i s s e r s J P ,
C h a m b e r y A . Q u a n t i t a t i v e
neuroproteomics: classical and novel tools
for studying neural differentiation and
function. Stem Cell Rev. 7 ( 1 ) , 7 7 9 3 ( 2 0 1 1 ) .
8 1 S e n z e l L , G n a t e n k o D V , B a h o u W F . T h e
platelet proteome. Curr. Opin. Hematol.
1 6 ( 5 ) , 3 2 9 3 3 3 ( 2 0 0 9 ) .
8 2 B r e i t l i n g R . R o b u s t s i g n a l i n g n e t w o r k s o f
the adipose secretome. Trends Endocrinol.
Metab. 2 0 ( 1 ) , 1 7 ( 2 0 0 9 ) .
8 3 F i n l a y B B , F a l k o w S . C o m m o n t h e m e s i n
microbial pathogenicity revisited.
Microbiol. Mol. Biol. Rev. 6 1 ( 2 ) , 1 3 6 1 6 9
( 1 9 9 7 ) .
8 4 W i l s o n J W , S c h u r r M J , L e B l a n c C L ,
R a m a m u r t h y R , B u c h a n a n K L , N i c k e r s o n
C A . M e c h a n i s m s o f b a c t e r i a l p a t h o g e n i c i t y .
Postgrad. Med. J. 7 8 ( 9 1 8 ) , 2 1 6 2 2 4 ( 2 0 0 2 ) .
8 5 K o n e c n a K , H e r n y c h o v a L , R e i c h e l o v a M
et al . Comparative proteomic pro ling of
culture fi ltrate proteins of less and highly
virulent Francisella tularensis s t r a i n s .
Proteomics 1 0 ( 2 4 ) , 4 5 0 1 4 5 1 1 ( 2 0 1 0 ) .
8 6 T r o s t M , W e h m h ö n e r D , K ä r s t U ,
D i e t e r i c h G , W e h l a n d J , J ä n s c h L .
Comparative proteome analysis of secretory
proteins from pathogenic and
nonpathogenic Listeria s p e c i e s . Proteomics
5 ( 6 ) , 1 5 4 4 1 5 5 7 ( 2 0 0 5 ) .
8 7 W a l z A , M u j e r C V , C o n n o l l y J P et al .
Bacillus anthracis secretome time course
under host-simulated conditions and
identi cation of immunogenic proteins.
Proteome Sci. 5 , 1 1 ( 2 0 0 7 ) .
8 8 T e r m i n e E , M i c h e l G P . T r a n s c r i p t o m e a n d
secretome analyses of the adaptive response
of Pseudomonas aeruginosa to suboptimal
g r o w t h t e m p e r a t u r e . Int. Microbiol. 1 2 ( 1 ) ,
7 1 2 ( 2 0 0 9 ) .
8 9 A r a u j o L S , M a c i e l R M , M o r a e s R M ,
T r a j m a n A , S a a d M H . A s s e s s m e n t o f t h e
IgA immunoassay diagnostic potential of the
Mycobacterium tuberculosis MT10.3-MPT64
fusion protein in tuberculous pleural fl uid.
Clin. Vaccine Immunol. 1 7 ( 1 2 ) , 1 9 6 3 1 9 6 9
( 2 0 1 0 ) .
9 0 H a n s m e i e r N , C h a o T C , K a l i n o w s k i J ,
P ü h l e r A , T a u c h A . M a p p i n g a n d
comprehensive analysis of the extracellular
and cell surface proteome of the human
pathogen Corynebacterium diphtheriae .
Proteomics 6 ( 8 ) , 2 4 6 5 2 4 7 6 ( 2 0 0 6 ) .
9 1 M å l e n H , B e r v e n F S , F l a d m a r k K E , W i k e r
H G . C o m p r e h e n s i v e a n a l y s i s o f e x p o r t e d
proteins from Mycobacterium tuberculosis
H 3 7 R v . Proteomics 7 ( 1 0 ) , 1 7 0 2 1 7 1 8
( 2 0 0 7 ) .
9 2 O t t L , H ö l l e r M , G e r l a c h R G et al .
Corynebacterium diphtheriae invasion-
associated protein (DIP1281) is involved in
cell surface organization, adhesion and
internalization in epithelial cells. BMC
Microbiol. 1 0 , 2 ( 2 0 1 0 ) .
9 3 D u m a s E , D e s v a u x M , C h a m b o n C ,
Hébraud M . Insight into the core and
variant exoproteomes of Listeria
monocytogenes species by comparative
subproteomic analysis. Proteomics 9 ( 1 1 ) ,
3 1 3 6 3 1 5 5 ( 2 0 0 9 ) .
9 4 Z i e b a n d t A K , K u s c h H , D e g n e r M et al .
Proteomics uncovers extreme heterogeneity
in the Staphylococcus aureus exoproteome
due to genomic plasticity and variant gene
regulation. Proteomics 1 0 ( 8 ) , 1 6 3 4 1 6 4 4
( 2 0 1 0 ) .
9 5 V o g e l C , M a r c o t t e E M . I n s i g h t s i n t o t h e
regulation of protein abundance from
proteomic and transcriptomic analyses.
Nat. Rev. Genet. 1 3 ( 4 ) , 2 2 7 2 3 2 ( 2 0 1 2 ) .
9 6 G o h a r M , Ø k s t a d O A , G i l o i s N , S a n c h i s
V , K o l s t ø A B , L e r e c l u s D . T w o -
dimensional electrophoresis analysis of the
extracellular proteome of Bacillus cereus
reveals the importance of the PlcR regulon.
Proteomics 2 ( 6 ) , 7 8 4 7 9 1 ( 2 0 0 2 ) .
9 7 G o h a r M , G i l o i s N , G r a v e l i n e R , G a r r e a u
C , S a n c h i s V , L e r e c l u s D . A c o m p a r a t i v e
study of Bacillus cereus , Bacillus
thuringiensis and Bacillus anthracis
extracellular proteomes. Proteomics 5 ( 1 4 ) ,
3 6 9 6 3 7 1 1 ( 2 0 0 5 ) .
9 8 G i l o i s N , R a m a r a o N , B o u i l l a u t L et al .
Growth-related variations in the Bacillus
cereus s e c r e t o m e . Proteomics 7 ( 1 0 ) ,
1 7 1 9 1 7 2 8 ( 2 0 0 7 ) .
9 9 C l a i r G , R o u s s i S , A r m e n g a u d J , D u p o r t
C . Expanding the known repertoire of
virulence factors produced by Bacillus
cereus through early secretome profi ling in
three redox conditions. Mol. Cell Proteomics
9 ( 7 ) , 1 4 8 6 1 4 9 8 ( 2 0 1 0 ) .
New growth conditions and high-
throughput shotgun proteomics led to
the description of a large number of new
virulence factors.
1 0 0 C l a i r G , A r m e n g a u d J , D u p o r t C .
Restricting fermentative potential by
proteome remodeling: an adaptive strategy
evidenced in Bacillus cereus . Mol. Cell
Proteomics 1 1 ( 6 ) , M 1 1 1 . 0 1 3 1 0 2 ( 2 0 1 2 ) .
1 0 1 O t t o A , B e r n h a r d t J , M e y e r H et al .
Systems-wide temporal proteomic profi ling
in glucose-starved Bacillus subtilis . Nat.
Commun. 1 , 1 3 7 ( 2 0 1 0 ) .
1 0 2 N i e m a n n G S , B r o w n R N , G u s t i n J K et al .
Discovery of novel secreted virulence
factors from Salmonella enterica serovar
Typhimurium by proteomic analysis of
culture supernatants. Infect. Immun. 7 9 ( 1 ) ,
3 3 4 3 ( 2 0 1 1 ) .
1 0 3 C h o i C W , L e e Y G , K w o n S O et al .
Analysis of Streptococcus pneumoniae
secreted antigens by immuno-proteomic
approach. Diagn. Microbiol. Infect. Dis.
7 2 ( 4 ) , 3 1 8 3 2 7 ( 2 0 1 2 ) .
Exoproteomics: exploring the world around biological systems
Expert Rev. Proteomics 9(5), (2012)
574
Review
1 0 4 P a c h e c o L G , S l a d e S E , S e y f f e r t N et al .
A combined approach for comparative
exoproteome analysis of Corynebacterium
pseudotuberculosis . BMC Microbiol. 1 1 ( 1 ) ,
1 2 ( 2 0 1 1 ) .
1 0 5 B u r g o s - P o r t u g a l J A , K a a k o u s h N O ,
R a f t e r y M J , M i t c h e l l H M . P a t h o g e n i c
potential of Campylobacter ureolyticus .
Infect. Immun. 8 0 ( 2 ) , 8 8 3 8 9 0 ( 2 0 1 2 ) .
1 0 6 C a s h P . I n v e s t i g a t i n g p a t h o g e n b i o l o g y a t
the level of the proteome. Proteomics 1 1 ( 1 5 ) ,
3 1 9 0 3 2 0 2 ( 2 0 1 1 ) .
A review of proteomic-based
methodologies to study pathogens.
1 0 7 G u p t a M K , S u b r a m a n i a n V , Y a d a v J S .
Immunoproteomic identi cation of
secretory and subcellular protein antigens
and functional evaluation of the secretome
fraction of Mycobacterium immunogenum , a
newly recognized species of the
Mycobacterium chelonaeMycobacterium
abscessus g r o u p . J. Proteome Res. 8 ( 5 ) ,
2 3 1 9 2 3 3 0 ( 2 0 0 9 ) .
1 0 8 A d a v S S , C h a o L T , S z e S K . Q u a n t i t a t i v e
secretomic analysis of Trichoderma reesei
strains reveals enzymatic composition for
lignocellulosic biomass degradation. Mol.
Cell Proteomics 1 1 ( 7 ) , M 1 1 1 . 0 1 2 4 1 9 ( 2 0 1 2 ) .
1 0 9 A d a v S S , C h e o w E S , R a v i n d r a n A , D u t t a
B , S z e S K . L a b e l f r e e q u a n t i t a t i v e
proteomic analysis of secretome by
Thermobifi da fusca on different
lignocellulosic biomass. J. Proteomics
7 5 ( 1 2 ) , 3 6 9 4 3 7 0 6 ( 2 0 1 2 ) .
1 1 0 B u m a n n D . P a t h o g e n p r o t e o m e s d u r i n g
infection: a basis for infection research and
novel control strategies. J. Proteomics
7 3 ( 1 1 ) , 2 2 6 7 2 2 7 6 ( 2 0 1 0 ) .
111 F r a n ç o i s P , S c h e r l A , H o c h s t r a s s e r D ,
S c h r e n z e l J . P r o t e o m i c a p p r o a c h e s t o s t u d y
Staphylococcus aureus pathogenesis.
J. Proteomics 7 3 ( 4 ) , 7 0 1 7 0 8 ( 2 0 1 0 ) .
1 1 2 K r u h N A , T r o u d t J , I z z o A , P r e n n i J ,
D o b o s K M . P o r t r a i t o f a p a t h o g e n : t h e
Mycobacterium tuberculosis proteome in
vivo . PLoS ONE 5 ( 1 1 ) , e 1 3 9 3 8 ( 2 0 1 0 ) .
1 1 3 L a h n e r E , B e r n a r d i n i G , S a n t u c c i A ,
A n n i b a l e B . Helicobacter pylori
immunoproteomics in gastric cancer and
gastritis of the carcinoma phenotype.
Expert Rev. Proteomics 7 ( 2 ) , 2 3 9 2 4 8
( 2 0 1 0 ) .
1 1 4 S c o t t N E , C o r d w e l l S J . Campylobacter
proteomics: guidelines, challenges and
future perspectives. Expert Rev. Proteomics
6 ( 1 ) , 6 1 7 4 ( 2 0 0 9 ) .
1 1 5 T s c h u m i A , G r a u T , A l b r e c h t D , R e z w a n
M , A n t e l m a n n H , S a n d e r P . F u n c t i o n a l
analyses of mycobacterial lipoprotein
diacylglyceryl transferase and comparative
secretome analysis of a mycobacterial lgt
mutant. J. Bacteriol. 1 9 4 ( 1 5 ) , 3 9 3 8 3 9 4 9
( 2 0 1 2 ) .
1 1 6 S a k a H A , T h o m p s o n J W , C h e n Y S et al .
Quantitative proteomics reveals metabolic
and pathogenic properties of Chlamydia
trachomatis developmental forms. Mol.
Microbiol. 8 2 ( 5 ) , 1 1 8 5 1 2 0 3 ( 2 0 1 1 ) .
1 1 7 C h r i s t i e - O l e z a J A , P i ñ a - V i l l a l o n g a J M ,
B o s c h R , N o g a l e s B , A r m e n g a u d J .
Comparative proteogenomics of twelve
Roseobacter exoproteomes reveals different
adaptive strategies among these marine
bacteria. Mol. Cell Proteomics 1 1 ( 2 ) ,
M 1 1 1 . 0 1 3 1 1 0 ( 2 0 1 2 ) .
One of the largest comparative
studies of exoproteomes ever reported
for environmental bacteria. High-
throughput shotgun proteomics led to
the description of an unusual number of
RTX toxins.
1 1 8 P h a l i p V , D e l a l a n d e F , C a r a p i t o C et al .
Diversity of the exoproteome of Fusarium
graminearum grown on plant cell wall.
Curr. Genet. 4 8 ( 6 ) , 3 6 6 3 7 9 ( 2 0 0 5 ) .
1 1 9 S a n t o s E d e O , A l v e s N J r , D i a s G M et al .
Genomic and proteomic analyses of the
coral pathogen Vibrio coralliilyticus reveal a
diverse virulence repertoire. ISME J. 5 ( 9 ) ,
1 4 7 1 1 4 8 3 ( 2 0 1 1 ) .
1 2 0 T i a n C , B e e s o n W T , I a v a r o n e A T et al .
Systems analysis of plant cell wall
degradation by the model fi lamentous
fungus Neurospora crassa . Proc. Natl Acad.
Sci. USA 1 0 6 ( 5 2 ) , 2 2 1 5 7 2 2 1 6 2 ( 2 0 0 9 ) .
1 2 1 V i n c e n t D , K o h l e r A , C l a v e r o l S et al .
Secretome of the free-living mycelium from
the ectomycorrhizal basidiomycete Laccaria
bicolor . J. Proteome Res. 1 1 ( 1 ) , 1 5 7 1 7 1
( 2 0 1 2 ) .
1 2 2 C h r i s t i e - O l e z a J A , A r m e n g a u d J . I n - d e p t h
analysis of exoproteomes from marine
bacteria by shotgun liquid chromatography-
tandem mass spectrometry: the Ruegeria
pomeroyi DSS-3 case-study. Mar. Drugs
8 ( 8 ) , 2 2 2 3 2 2 3 9 ( 2 0 1 0 ) .
1 2 3 L i n h a r t o v á I , B u m b a L , M a š í n J et al . R T X
proteins: a highly diverse family secreted by
a common mechanism. FEMS Microbiol.
Rev. 3 4 ( 6 ) , 1 0 7 6 1 1 1 2 ( 2 0 1 0 ) .
1 2 4 M o r a n M A , B e l a s R , S c h e l l M A et al .
Ecological genomics of marine Roseobacters .
Appl. Environ. Microbiol. 7 3 ( 1 4 ) , 4 5 5 9 4 5 6 9
( 2 0 0 7 ) .
1 2 5 B o e k h o r s t J , W e l s M , K l e e r e b e z e m M ,
S i e z e n R J . T h e p r e d i c t e d s e c r e t o m e o f
Lactobacillus plantarum WCFS1 sheds light
on interactions with its environment.
Microbiology (Reading, Engl.) 1 5 2 ( P t 1 1 ) ,
3 1 7 5 3 1 8 3 ( 2 0 0 6 ) .
1 2 6 B r o w n N A , A n t o n i w J , H a m m o n d - K o s a c k
KE . The predicted secretome of the plant
pathogenic fungus Fusarium graminearum :
a refi ned comparative analysis. PLoS ONE
7 ( 4 ) , e 3 3 7 3 1 ( 2 0 1 2 ) .
1 2 7 S i v a s h a n k a r i S , S h a n m u g h a v e l P .
Functional annotation of hypothetical
proteins – a review. Bioinformation 1 ( 8 ) ,
3 3 5 3 3 8 ( 2 0 0 6 ) .
1 2 8 V o i g t B , S c h w e d e r T , S i b b a l d M J et al . T h e
extracellular proteome of Bacillus
licheniformis grown in different media and
under different nutrient starvation
conditions. Proteomics 6 ( 1 ) , 2 6 8 2 8 1
( 2 0 0 6 ) .
1 2 9 M a s t r o n u n z i o J E , H u a n g Y , B e n s o n D R .
Diminished exoproteome of Frankia spp. in
culture and symbiosis. Appl. Environ.
Microbiol. 7 5 ( 2 1 ) , 6 7 2 1 6 7 2 8 ( 2 0 0 9 ) .
1 3 0 E v a n s F F , R a f t e r y M J , E g a n S , K j e l l e b e r g
S . Profi ling the secretome of the marine
bacterium Pseudoalteromonas tunicata using
amine-speci c isobaric tagging (iTRAQ).
J. Proteome Res. 6 ( 3 ) , 9 6 7 9 7 5 ( 2 0 0 7 ) .
1 3 1 S h i n a n o T , K o m a t s u S , Y o s h i m u r a T et al .
Proteomic analysis of secreted proteins
from aseptically grown rice. Phytochemistry
7 2 ( 4 - 5 ) , 3 1 2 3 2 0 ( 2 0 1 1 ) .
1 3 2 B o c c h i n f u s o D G , T a y l o r P , R o s s E et al .
Proteomic profi ling of the planarian
Schmidtea mediterranea and its mucous
reveals similarities with human secretions
and those predicted for parasitic fl atworms.
Mol. Cell. Proteomics 1 1 ( 9 ) , 6 8 1 6 9 1
( 2 0 1 2 ) .
1 3 3 M o r r i s J J , L e n s k i R E , Z i n s e r E R . T h e
Black Queen Hypothesis: evolution of
dependencies through adaptive gene loss.
MBio 3 ( 2 ) , e 0 0 0 3 6 - 1 2 ( 2 0 1 2 ) .
1 3 4 A r m e n g a u d J , B l a n d C , C h r i s t i e - O l e z a J A ,
M i o t e l l o G . M i c r o b i a l p r o t e o g e n o m i c s ,
gaining ground with the avalanche of
genomic sequences . J. Bacteriol. Parasitol.
S 3 ( 2 0 1 1 ) .
1 3 5 B a u d e t M , O r t e t P , G a i l l a r d J C et al .
Proteomics-based refi nement of Deinococcus
deserti genome annotation reveals an
unwonted use of non-canonical translation
initiation codons. Mol. Cell Proteomics 9 ( 2 ) ,
4 1 5 4 2 6 ( 2 0 1 0 ) .
1 3 6 H a n M J , Y u n H , L e e J W et al . G e n o m e -
wide identifi cation of the subcellular
localization of the Escherichia coli B
proteome using experimental and
computational methods. Proteomics 1 1 ( 7 ) ,
1 2 1 3 1 2 2 7 ( 2 0 1 1 ) .
Armengaud, Christie-Oleza, Clair, Malard & Duport
575
www.expert-reviews.com
Review
1 3 7 L e v e r s e n N A , d e S o u z a G A , M å l e n H ,
P r a s a d S , J o n a s s e n I , W i k e r H G .
Evaluation of signal peptide prediction
algorithms for identi cation of
mycobacterial signal peptides using
sequence data from proteomic methods.
Microbiology (Reading, Engl.) 1 5 5 ( P t 7 ) ,
2 3 7 5 2 3 8 3 ( 2 0 0 9 ) .
1 3 8 E r i c k s o n B K , M u e l l e r R S , V e r B e r k m o e s
N C et al . Computational prediction and
experimental validation of signal peptide
cleavages in the extracellular proteome of a
natural microbial community. J. Proteome
Res. 9 ( 5 ) , 2 1 4 8 2 1 5 9 ( 2 0 1 0 ) .
A good illustration of the input of
accurate protein data, determined by
mass spectrometry, on the subcellular
localization of proteins and their
maturation.
1 3 9 L e g r a i n P , A e b e r s o l d R , A r c h a k o v A et al .
The human proteome project: current state
and future direction. Mol. Cell Proteomics
10 ( 7 ) , M111.009993 (2011) .
1 4 0 G e i g e r T , W e h n e r A , S c h a a b C , C o x J ,
M a n n M . C o m p a r a t i v e p r o t e o m i c a n a l y s i s
of eleven common cell lines reveals
ubiquitous but varying expression of most
proteins. Mol. Cell Proteomics 1 1 ( 3 ) ,
M 1 1 1 . 0 1 4 0 5 0 ( 2 0 1 2 ) .
1 4 1 C h o i J , P a r k J , K i m D , J u n g K , K a n g S ,
L e e Y H . F u n g a l s e c r e t o m e d a t a b a s e :
integrated platform for annotation of
fungal secretomes . BMC Genomics 1 1 , 1 0 5
( 2 0 1 0 ) .
1 4 2 L u m G , M i n X J . F u n S e c K B : t h e F u n g a l
Secretome KnowledgeBase. Database
(Oxford) 2 0 1 1 , b a r 0 0 1 ( 2 0 1 1 ) .
1 4 3 W a n g Y , Z h a n g Q , S u n M A , G u o D .
High-accuracy prediction of bacterial type
III secreted effectors based on position-
speci c amino acid composition profi les.
Bioinformatics 2 7 ( 6 ) , 7 7 7 7 8 4 ( 2 0 1 1 ) .
1 4 4 Y u C S , C h e n Y C , L u C H , H w a n g J K .
Prediction of protein subcellular
localization. Proteins 6 4 ( 3 ) , 6 4 3 6 5 1
( 2 0 0 6 ) .
1 4 5 B h a s i n M , R a g h a v a G P . E S L p r e d :
SVM-based method for subcellular
localization of eukaryotic proteins using
dipeptide composition and PSI-BLAST.
Nucleic Acids Res. 3 2 , W 4 1 4 W 4 1 9 ( 2 0 0 4 ) .
1 4 6 T u s n á d y G E , S i m o n I . T h e H M M T O P
transmembrane topology prediction server.
Bioinformatics 1 7 ( 9 ) , 8 4 9 8 5 0 ( 2 0 0 1 ) .
1 4 7 J u n c k e r A S , W i l l e n b r o c k H , V o n H e i j n e G ,
B r u n a k S , N i e l s e n H , K r o g h A . P r e d i c t i o n
of lipoprotein signal peptides in Gram-
negative bacteria. Protein Sci. 1 2 ( 8 ) ,
1 6 5 2 1 6 6 2 ( 2 0 0 3 ) .
1 4 8 Z h o u M , B o e k h o r s t J , F r a n c k e C , S i e z e n
R J . L o c a t e P : g e n o m e - s c a l e s u b c e l l u l a r -
location predictor for bacterial proteins.
BMC Bioinformatics 9 , 1 7 3 ( 2 0 0 8 ) .
1 4 9 L u Z , S z a f r o n D , G r e i n e r R et al .
Predicting subcellular localization of
proteins using machine-learned classi ers.
Bioinformatics 2 0 ( 4 ) , 5 4 7 5 5 6 ( 2 0 0 4 ) .
1 5 0 K ä l l L , K r o g h A , S o n n h a m m e r E L . A
combined transmembrane topology and
signal peptide prediction method. J. Mol.
Biol. 3 3 8 ( 5 ) , 1 0 2 7 1 0 3 6 ( 2 0 0 4 ) .
1 5 1 B a g o s P G , T s i r i g o s K D , L i a k o p o u l o s T D ,
H a m o d r a k a s S J . P r e d i c t i o n o f l i p o p r o t e i n
signal peptides in Gram-positive bacteria
with a Hidden Markov model. J. Proteome
Res. 7 ( 1 2 ) , 5 0 8 2 5 0 9 3 ( 2 0 0 8 ) .
1 5 2 B a g o s P G , T s i r i g o s K D , P l e s s a s S K ,
L i a k o p o u l o s T D , H a m o d r a k a s S J .
Prediction of signal peptides in archaea.
Protein Eng. Des. Sel. 2 2 ( 1 ) , 2 7 3 5 ( 2 0 0 9 ) .
1 5 3 B a g o s P G , N i k o l a o u E P , L i a k o p o u l o s T D ,
T s i r i g o s K D . C o m b i n e d p r e d i c t i o n o f T a t
and Sec signal peptides with hidden
Markov models. Bioinformatics 2 6 ( 2 2 ) ,
2 8 1 1 2 8 1 7 ( 2 0 1 0 ) .
1 5 4 S c o t t M S , T h o m a s D Y , H a l l e t t M T .
Predicting subcellular localization via
protein motif co-occurrence. Genome Res.
1 4 ( 1 0 A ) , 1 9 5 7 1 9 6 6 ( 2 0 0 4 ) .
1 5 5 Y u N Y , W a g n e r J R , L a i r d M R et al .
PSORTb 3.0: improved protein subcellular
localization prediction with re ned
localization subcategories and predictive
capabilities for all prokaryotes.
Bioinformatics 2 6 ( 1 3 ) , 1 6 0 8 1 6 1 5 ( 2 0 1 0 ) .
1 5 6 Y u N Y , L a i r d M R , S p e n c e r C , B r i n k m a n
F S . P S O R T d b a n e x p a n d e d , a u t o - u p d a t e d ,
user-friendly protein subcellular localization
database for Bacteria and Archaea. Nucleic
Acids Res. 3 9 , D 2 4 1 D 2 4 4 ( 2 0 1 1 ) .
1 5 7 Y u L , G u o Y , L i Y e t a l . S e c r e t P : i d e n t i f y i n g
bacterial secreted proteins by fusing new
features into Chou’s pseudo-amino acid
composition. J. Theor. Biol. 267 ( 1 ), 1 6
( 2010 ).
1 5 8 S h a t k a y H , H ö g l u n d A , B r a d y S , B l u m T ,
D ö n n e s P , K o h l b a c h e r O . S h e r L o c :
high-accuracy prediction of protein
subcellular localization by integrating text
and protein sequence data. Bioinformatics
2 3 ( 1 1 ) , 1 4 1 0 1 4 1 7 ( 2 0 0 7 ) .
1 5 9 M c D e r m o t t J E , C o r r i g a n A , P e t e r s o n E
et al . Computational prediction of type III
and IV secreted effectors in Gram-negative
bacteria. Infect. Immun. 7 9 ( 1 ) , 2 3 3 2
( 2 0 1 1 ) .
1 6 0 S h e n H B , C h o u K C . S i g n a l - 3 L : A 3 - l a y e r
approach for predicting signal peptides.
Biochem. Biophys. Res. Commun. 3 6 3 ( 2 ) ,
2 9 7 3 0 3 ( 2 0 0 7 ) .
1 6 1 P e t e r s e n T N , B r u n a k S , v o n H e i j n e G ,
N i e l s e n H . S i g n a l P 4 . 0 : d i s c r i m i n a t i n g
signal peptides from transmembrane
r e g i o n s . Nat. Methods 8 ( 1 0 ) , 7 8 5 7 8 6
( 2 0 1 1 ) .
1 6 2 H i r o k a w a T , B o o n - C h i e n g S , M i t a k u S .
S O S U I : c l a s s i cation and secondary
structure prediction system for membrane
proteins. Bioinformatics 1 4 ( 4 ) , 3 7 8 3 7 9
( 1 9 9 8 ) .
1 6 3 H u a S , S u n Z . S u p p o r t v e c t o r m a c h i n e
approach for protein subcellular
localization prediction. Bioinformatics
1 7 ( 8 ) , 7 2 1 7 2 8 ( 2 0 0 1 ) .
1 6 4 H i l l e r K , G r o t e A , S c h e e r M , M ü n c h R ,
J a h n D . P r e d i S i : p r e d i c t i o n o f s i g n a l
peptides and their cleavage positions.
Nucleic Acids Res. 3 2 , W 3 7 5 W 3 7 9
( 2 0 0 4 ) .
1 6 5 S o n n h a m m e r E L , v o n H e i j n e G , K r o g h A .
A hidden Markov model for predicting
transmembrane helices in protein
s e q u e n c e s . Proc. Int. Conf. Intell. Syst. Mol.
Biol. 6 , 1 7 5 1 8 2 ( 1 9 9 8 ) .
Exoproteomics: exploring the world around biological systems
... Due to the need to interact with the environment, these effector molecules will be either membrane-bound or released to the extracellular milieu. The protein content outside a cell in either conditioned medium or the extracellular matrix from environmental samples is generically known as the exoproteome [6]. The exoproteome includes actively secreted proteins and non-secreted proteins that can result from surface shedding or cell lysis. ...
... The term 'exoproteome' is used to describe the extracellular protein content in th cinity of a biological system [6]. The exoproteome includes not only the conventional tome but also all the most stable proteins present in the extracellular space, including teins released from the surface or originating from cell lysis or EVs rupture (Figure 1). ...
... The term 'exoproteome' is used to describe the extracellular protein content in the vicinity of a biological system [6]. The exoproteome includes not only the conventional secretome but also all the most stable proteins present in the extracellular space, including proteins released from the surface or originating from cell lysis or EVs rupture (Figure 1). ...
Article
Full-text available
Leishmaniasis, a vector-borne parasitic protozoan disease, is among the most important neglected tropical diseases. In the absence of vaccines, disease management is challenging. The available chemotherapy is suboptimal, and there are growing concerns about the emergence of drug resistance. Thus, a better understanding of parasite biology is essential to generate new strategies for disease control. In this context, in vitro parasite exoproteome characterization enabled the identification of proteins involved in parasite survival, pathogenesis, and other biologically relevant processes. After 2005, with the availability of genomic information, these studies became increasingly feasible and revealed the true complexity of the parasite exoproteome. After the discovery of Leishmania extracellular vesicles (EVs), most exoproteome studies shifted to the characterization of EVs. The non-EV portion of the exoproteome, named the vesicle-depleted exoproteome (VDE), has been mostly ignored even if it accounts for a significant portion of the total exoproteome proteins. Herein, we summarize the importance of total exoproteome studies followed by a special emphasis on the available information and the biological relevance of the VDE. Finally, we report on how VDE can be studied and disclose how it might contribute to providing biologically relevant targets for diagnosis, drug, and vaccine development.
... The bacterial exoproteome is the entirety of proteins that a bacterial cell releases into the environment through secretion systems, outer membrane vesicles, or lysis [35; 36]. The proteins within the bacterial exoproteome have diverse functions in bacterial physiology [36][37][38][39]. This can include proteins that aid in nutrient acquisition, such as transporters and enzymes that break down complex molecules [40]. ...
Article
Full-text available
Root exudates from host plant species are known to play a critical role in the establishment and maintenance of symbiotic relationships with soil bacteria. In this study, we investigated the impact of root exudates from compatible host plant species; Elaeagnus angustifolia on the exoproteome of Parafrankia soli strain NRRL B-16219. A total of 565 proteins were evidenced as differentially abundant, with 32 upregulated and 533 downregulated in presence of the plant exudates. Analysis of the function of these proteins suggests that the bacterial strain is undergoing a complex metabolic reprogramming towards a new developmental phase elicited in presence of host plant root exudates. The upregulation of Type II/IV secretion system proteins among the differentially expressed proteins indicates their possible role in infecting the host plant, as shown for some rhizobia. Additionally, EF-Tu, proteins upregulated in this study, may function as an effector for the T4SSs and trigger plant defense responses. These findings suggest that Parafrankia soli may use EF-Tu to infect the actinorhizal host plant and pave the way for further investigations of the molecular mechanisms underlying the establishment of symbiotic relationships.
... The microbiota can modulate intestinal immunity by direct contact with the intestinal mucosa, by short-chain fatty acids derived from fibre metabolism or by peptides originating from the exoproteome of microbiota bacteria. The exoproteome is made up of proteins from the cellular secretion, export phenomena and cell lysis whose relative abundance depends on their stability [34]. Two peptides from microbiota bacteria, the 17-mer peptide B12 from Bacteroides fragilis YCH46 and the 19-mer peptide B7, with tolerogenic potential, from Bifidobacterium longum subsp. ...
Article
Full-text available
In addition to traditional use in fermented dairy products, S. thermophilus also exhibits anti-inflammatory properties both in live and heat-inactivated form. Recent studies have highlighted that some hydrolysates from surface proteins of S. thermophilus could be responsible partially for overall anti-inflammatory activity of this bacterium. It was hypothesized that anti-inflammatory activity could also be attributed to peptides resulting from the digestion of intracellular proteins of S. thermophilus. Therefore, total intracellular proteins (TIP) from two phenotypically different strains, LMD-9 and CNRZ-21N, were recovered by sonication followed by ammonium sulphate precipitation. The molecular masses of the TIP of both strains were very close to each other as observed by SDS-PAGE. The TIP were fractionated by size exclusion fast protein liquid chromatography to obtain a 3–10 kDa intracellular protein (IP) fraction, which was then hydrolysed with pancreatic enzyme preparation, Corolase PP. The hydrolysed IP fraction from each strain exhibited anti-inflammatory activity by modulating pro-inflammatory mediators, particularly IL-1β in LPS-stimulated THP-1 macrophages. However, a decrease in IL-8 secretion was only observed with hydrolysed IP fraction from CNRZ-21N, indicating that strain could be an important parameter in obtaining active hydrolysates. Results showed that peptides from the 3–10 kDa IP fraction of S. thermophilus could therefore be considered as postbiotics with potential beneficial effects on human health. Thus, it can be used as a promising bioactive ingredient for the development of functional foods to prevent low-grade inflammation.
... To predict the secreted proteins and their subcellular localization, there are several programs available, some perform simpler methods such as SECRETOOL [25] and others more refined, for example, Loctree [26], BUSCO [24] and Deeploc [27]. The programs that perform prediction refined methods can predict the subcellular location that is important in proteomics because proteins can perform a wide diversity of functions inside different cell compartments, the protein function is related to the compartment or organelle where it is located, providing a physiological context for a certain function [28]. The Loctree is a hierarchical tree system combining Support Vector Machines (SVMs) and other methods such as gene Ortology Consortium (GO) for predict the subcellular localization of proteins. ...
... Taking into account the wide range of enzymatic values in the secretomes, we decided to analyse in more depth the secretomes of 16 isolates that presented at least one exceptionally very high activity, for which we analysed the peptides in these secretomes (see Table 2). For this analysis, we performed nextgeneration shotgun exoproteomics (Armengaud et al., 2012) using a high-resolution tandem mass spectrometer equipped with a high-field Orbitrap analyser able to give a precise partial sequence of the polypeptides. As expected, we found that most of the identified peptides (about 50% of total) in the 16 secretomes matched with sequences of hydrolases deposited in databases (Table 2). ...
Article
Full-text available
Lignocellulosic residues are amongst the most abundant waste products on Earth. Therefore, there is an increasing interest in the utilization of these residues for bioethanol production and for biorefineries to produce compounds of industrial interest. Enzymes that breakdown cellulose and hemicellulose into oligomers and monosaccharides are required in these processes and cellulolytic enzymes with optimum activity at a low pH area are desirable for industrial processes. Here, we explore the fungal biodiversity of Rıo Tinto, the largest acidic ecosystem on Earth, as far as the secretion of cellulolytic enzymes is concerned. Using colorimetric and industrial substrates, we show that a high proportion of the fungi present in this extremophilic environment secrete a wide range of enzymes that are able to hydrolyze cellulose and hemicellulose at acidic pH (4.5–5). Shotgun proteomic analysis of the secretomes of some of these fungi has identified different cellulases and hemicellulolytic enzymes as well as a number of auxiliary enzymes. Supplementation of pre‐industrial cocktails from Myceliophtora with Rio Tinto secretomes increased the amount of monosaccharides released from corn stover or sugar cane straw. We conclude that the Rio Tinto fungi display a good variety of hydrolytic enzymes with high industrial potential.
... Frontiers in Microbiology 31 frontiersin.org proteome or the exoproteome, to determine the responses of surface proteins or exoproteins (i.e., exported proteins and proteins resulting from cell lysis or leakage), which are in direct contact with the environment surrounding the cells, will be important for understanding site-specific responses (Armengaud et al., 2012;Wolden et al., 2020). In any case, our study illustrates the potential of quantitative proteomic approaches to reveal global and specific changes in the proteomes of carbapenem-resistant P. aeruginosa strains with different mechanisms of resistance, in response to antibiotic stress. ...
Article
Full-text available
Pseudomonas aeruginosa is an opportunistic pathogen with increasing incidence of multidrug-resistant strains, including resistance to last-resort antibiotics, such as carbapenems. Resistances are often due to complex interplays of naturally and acquired resistance mechanisms that are enhanced by its large regulatory network. This study describes the proteomic responses of two carbapenem-resistant P. aeruginosa strains of high-risk clones ST235 and ST395 to subminimal inhibitory concentrations (sub-MICs) of meropenem by identifying differentially regulated proteins and pathways. Strain CCUG 51971 carries a VIM-4 metallo-β-lactamase or ‘classical’ carbapenemase; strain CCUG 70744 carries no known acquired carbapenem-resistance genes and exhibits ‘non-classical’ carbapenem-resistance. Strains were cultivated with different sub-MICs of meropenem and analyzed, using quantitative shotgun proteomics based on tandem mass tag (TMT) isobaric labeling, nano-liquid chromatography tandem-mass spectrometry and complete genome sequences. Exposure of strains to sub-MICs meropenem resulted in hundreds of differentially regulated proteins, including β-lactamases, proteins associated with transport, peptidoglycan metabolism, cell wall organization, and regulatory proteins. Strain CCUG 51971 showed upregulation of intrinsic β-lactamases and VIM-4 carbapenemase, while CCUG 70744 exhibited a combination of upregulated intrinsic β-lactamases, efflux pumps, penicillinbinding proteins and downregulation of porins. All components of the H1type-VI secretion system were upregulated in strain CCUG 51971. Multiple metabolic pathways were affected in both strains. Sub-MICs of meropenem cause marked changes in the proteomes of carbapenem-resistant strains of P. aeruginosa exhibiting different resistance mechanisms, involving a wide range of proteins, many uncharacterized, which might play a role in the susceptibility of P. aeruginosa to meropenem.
Preprint
Full-text available
Microbes are the engines driving the elemental cycles. In order to interact with their environment and the community, microbes secrete proteins into the environment (known collectively as the secretome), where they remain active for prolonged periods of time. Despite the environmental relevance of microbes and the vast quantities of marine prokaryotic sequences available, our knowledge of the marine secretome remains limited due to the available in silico methods for the study of secreted proteins, which are not suited to their use in large datasets due to their low throughput. An alternative and potentially better approach to characterise the secretome is to combine modern machine learning tools with the evolutionary adaptation changes of the proteome to the marine environment. In this study, we found and described adaptations of marine extracellular proteins, which vary between phyla, resulting in differences in ATP costs, amino acid composition and nitrogen and sulphur content. With these adaptations in mind, we developed ‘Ayu’, a machine prediction tool that does not employ homology-based predictors (as current methods do) and achieves better performance than current state-of-the-art software at a fraction of the time. When applied to oceanic samples (Tara Oceans dataset), our method was able to recover more than double the proteins compared to the most widely used method to identify secreted proteins, indicating that more than half of secreted proteins are presently unconsidered in current secrtome analysis. The application of this powerful novel to open ocean samples allowed to better characterise the real composition of the marine secretome, allowing for a better recognition and understanding of the secretome in marine ecosystems.
Article
Full-text available
Salt stress can affect survival, multiplication and ability of plant growth promoting microorganisms to enhance plant growth. Changes in a microbe’s proteome profile is one of the mechanisms employed by PGPM to enhance tolerance of salt stress. This study was focused on understanding changes in the exoproteome profile of Bacillus amyloliquefaciens EB2003A and Lactobacillus helveticus EL2006H when exposed to salt stress. The strains were cultured in 100 mL M13 ( B. amyloliquefaciens ) and 100 mL De man, Rogosa and Sharpe (MRS) ( L. helveticus ) media, supplemented with 200 and 0 mM NaCl (control), at pH 7.0. The strains were then incubated for 48 h (late exponential growth phase), at 120 rpm and 30 ( B. amyloliquefaciens ) and 37 ( L. helveticus ) °C. The microbial cultures were then centrifuged and filtered sterilized, to obtain cell free supernatants whose proteome profiles were studied using LC–MS/MS analysis and quantified using scaffold. Results of the study revealed that treatment with 200 mM NaCl negatively affected the quantity of identified proteins in comparison to the control, for both strains. There was upregulation and downregulation of some proteins, even up to 100%, which resulted in identification of proteins significantly unique between the control or 200 mM NaCl ( p ≤ 0.05), for both microbial species. Proteins unique to 200 mM NaCl were mostly those involved in cell wall metabolism, substrate transport, oxidative stress tolerance, gene expression and DNA replication and repair. Some of the identified unique proteins have also been reported to enhance plant growth. In conclusion, based on the results of the work described here, PGPM alter their exoproteome profile when exposed to salt stress, potentially upregulating proteins that enhance their tolerance to this stress.
Article
Full-text available
The unsustainable manufacturing, utilization and inadequate handling of plastics have led to a surge in global plastic pollution. In recent times, there has been increasing concern about the plausible hazards associated with exposure to micro/nanoplastics (M/NPs). As aquatic systems are considered to be the likely sink for M/NPs, it is crucial to comprehend their environmental behavior. The bioavailability, toxicity and fate of M/NPs in the environment are predominantly dictated by their surface characteristics. In the aquatic environment, M/NPs are prone to be internalized by aquatic organisms. This may facilitate their interaction with a diverse array of biomolecules within the organism, resulting in the formation of a biocorona (BC). The development of BC causes modifications in the physicochemical attributes of the M/NPs including changes to their size, stability, surface charge and other properties. This review details the concept of BC formation and its underlying mechanism. It provides insight on the analytical techniques employed for characterizing BC formation and addresses the associated challenges. Further, the eco-toxicological implications of M/NPs and the role of BC in modifying their potential toxicity on aquatic organisms is specified. The impact of BC formation on the fate and transport of M/NPs is discussed. A concise outlook on the future perspectives is also presented.
Article
Full-text available
Many virulence factors are secreted by the gram-positive, spore forming bacterium Bacillus cereus. Most of them are regulated by the transcriptional activator, PlcR, which is maximally expressed at the beginning of the stationary phase. We used a proteomic approach to study the impact of the PlcR regulon on the secreted proteins of B. cereus, by comparing the extracellular proteomes of strains ATCC 14579 and ATCC 14579 Δ plcR, in which plcR has been disrupted. Our study indicated that, quantitatively, most of the proteins secreted at the onset of the stationary phase are putative virulence factors, all of which are regulated, directly or indirectly, by PlcR. The inactivation of plcR abolished the secretion of some of these virulence factors, and strongly decreased that of others. The genes encoding proteins that are not secreted in the ΔplcR mutant possessed a regulatory sequence, the PlcR box, upstream from their coding sequence. These proteins include collagenase, phospholipases, haemolysins, proteases and enterotoxins. Proteins for which the secretion was strongly decreased, but not abolished, in the ΔplcR mutant did not display the PlcR box upstream from their genes. These proteins include flagellins and InhA2. InhA2 is a homologue of InhA, a Bacillus thuringiensis metalloprotease that specifically degrades antibacterial peptides. The mechanism by which PlcR affects the production of flagellins and InhA2 is not known.
Article
Full-text available
Vibrio coralliilyticus has been implicated as an important pathogen of coral species worldwide. In this study, the nearly complete genome of Vibrio coralliilyticus strain P1 (LMG23696) was sequenced and proteases implicated in virulence of the strain were specifically investigated. The genome sequence of P1 (5 513 256 bp in size) consisted of 5222 coding sequences and 58 RNA genes (53 tRNAs and at least 5 rRNAs). Seventeen metalloprotease and effector (vgrG, hlyA and hcp) genes were identified in the genome and expressed proteases were also detected in the secretome of P1. As the VcpA zinc-metalloprotease has been considered an important virulence factor of V. coralliilyticus, a vcpA deletion mutant was constructed to evaluate the effect of this gene in animal pathogenesis. Both wild-type and mutant (ΔvcpA) strains exhibited similar virulence characteristics that resulted in high mortality in Artemia and Drosophila pathogenicity bioassays and strong photosystem II inactivation of the coral dinoflagellate endosymbiont (Symbiodinium). In contrast, the ΔvcpA mutant demonstrated higher hemolytic activity and secreted 18 proteins not secreted by the wild type. These proteins included four types of metalloproteases, a chitinase, a hemolysin-related protein RbmC, the Hcp protein and 12 hypothetical proteins. Overall, the results of this study indicate that V. coralliilyticus strain P1 has a diverse virulence repertoire that possibly enables this bacterium to be an efficient animal pathogen.Keywords: V. coralliilyticus; coral disease; zinc-metalloprotease; genome; secretome
Article
Full-text available
An integrated computational/experimental approach was used to predict and identify signal peptide cleavages among microbial proteins of environmental biofilm communities growing in acid mine drainage (AMD). SignalP-3.0 was employed to computationally query the AMD protein database of >16,000 proteins, which resulted in 1,480 predicted signal peptide cleaved proteins. LC−MS/MS analyses of extracellular (secretome) microbial preparations from different locations and developmental states empirically confirmed 531 of these signal peptide cleaved proteins. The majority of signal-cleavage proteins (58.4%) are annotated to have unknown functions; however, Pfam domain analysis revealed that many may be involved in extracellular functions expected within the AMD system. Examination of the abundances of signal-cleaved proteins across 28 proteomes from biofilms collected over a 4-year period demonstrated a strong correlation with the developmental state of the biofilm. For example, class I cytochromes are abundant in early growth states, whereas cytochrome oxidases from the same organism increase in abundance later in development. These results likely reflect shifts in metabolism that occur as biofilms thicken and communities diversify. In total, these results provide experimental confirmation of proteins that are designed to function in the extreme acidic extracellular environment and will serve as targets for future biochemical analysis.Keywords: Mass spectrometry; multidimensional liquid chromatography; shotgun proteomics; signal peptides; microbial communities
Article
Full-text available
Preprolipopoprotein diacylglyceryl transferase (Lgt) is the gating enzyme of lipoprotein biosynthesis, and it attaches a lipid structure to the N-terminal part of preprolipoproteins. Using Lgt from Escherichia coli in a BLASTp search, we identified the corresponding Lgt homologue in Mycobacterium tuberculosis and two homologous (MSMEG_3222 and MSMEG_5408) Lgt in Mycobacterium smegmatis. M. tuberculosis lgt was shown to be essential, but an M. smegmatis ΔMSMEG_3222 mutant could be generated. Using Triton X-114 phase separation and [14C]palmitic acid incorporation, we demonstrate that MSMEG_3222 is the major Lgt in M. smegmatis. Recombinant M. tuberculosis lipoproteins Mpt83 and LppX are shown to be localized in the cell envelope of parental M. smegmatis but were absent from the cell membrane and cell wall in the M. smegmatis ΔMSMEG_3222 strain. In a proteomic study, 106 proteins were identified and quantified in the secretome of wild-type M. smegmatis, including 20 lipoproteins. All lipoproteins were secreted at higher levels in the ΔMSMEG_3222 mutant. We identify the major Lgt in M. smegmatis, show that lipoproteins lacking the lipid anchor are secreted into the culture filtrate, and demonstrate that M. tuberculosis lgt is essential and thus a validated drug target.
Article
Bacteria export proteins to the periplasmic space and outer membrane via a mechanism that involves a multisubunit integral membrane protein complex termed translocase. Proteins to be secreted are synthesized in the cytosol as cleavable signal sequence bearing precursor proteins. The signal sequence acts as a targeting and recognition signal for the translocase. The translocase mediates the energy-dependent translocation of the precursor protein across the cytoplasmic membrane. The mechanism of protein translocation has rapidly advanced due to the ability to use powerful genetic and biochemical techniques in Escherichia coli. Extensive genetic screening methods have identified the genes, termed “sec(retion)-genes,” which code for the components involved in this process. Many of the genes are essential for viability, and conditional lethal mutants are selected that cause pleiotropic protein export defects. The products of these genes are overproduced and purified to homogeneity. Authentic energy-dependent protein translocation as it occurs in vivo is reconstituted into proteoliposomes with purified components. This chapter discusses the biochemical role of the individual components that catalyze the actual translocation reaction in bacteria. The enzymology of signal peptidases, and alternative and dedicated protein translocation pathways are also discussed.
Article
Proteins secreted by Mycobacterium tuberculosis play an essential role in the pathogenesis of tuberculosis. The culture filtrates of M. tuberculosis H37Rv made by Sadamu Nagai (Japan), are considerably enriched for secreted proteins compared to other culture filtrates. Complementary approaches were used to identify the secreted proteins in these culture filtrates: (i) 2-DE combined with MALDI-TOF MS and (ii) LC coupled MS/MS. Peptides derived from a total of 257 proteins were identified of which 144 were identified by more than one peptide. Several members of the immunologically important early secretory antigenic target-6 (ESAT-6) family of proteins were found to be major components. The majority of the identified proteins, 159 (62%), were predicted to be exported through the general secretory pathway. We experimentally verified that the signal peptides, which mediate translocation through the cell membrane, had been removed in 41 of the identified proteins, and in 35 of those, there was an AXA motif N-terminally to the cleavage site, showing that this motif is important for the recognition and cleavage of signal peptides in mycobacteria. A large fraction of the secreted proteins were unknown, suggesting that we have mapped an unexplored part of the exported proteome of M. tuberculosis.∁
Article
The identification of bacteria by means of matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) mass spectrometry directly using whole cells has become a standard method in clinical diagnosis due to its rapidity and simplicity. Nevertheless, the analysis of environmental samples with this approach still represents a challenge due to the enormous microbial diversity existing on earth and the lack of a comprehensive database. Most of the environmentally relevant species comprise only one unique strain, while pathogens such as Escherichia coli, with 667 described strains, are well documented. In such case, identification of the proteins responsible for the peak signals within MALDI-TOF spectra can give crucial information for species discrimination. To give higher confidence in MALDI-TOF biomarker description we exploited information from proteins identified by shotgun nanoLC-MS/MS, consisting of the identification and quantification of low-molecular-weight proteins after SDS-PAGE, in-gel trypsin proteolysis and analysis of tryptic peptides. We also proposed the standardization of the inclusion of internal calibrants in the bacterial sample to improve the accuracy of the MALDI-TOF measurements. In this way, nine candidate biomarkers were tentatively proposed for Ruegeria lacuscaerulensis ITI-1157. The conserved biomarkers were theoretically deduced for all other Ruegeria strains whose genomes have been sequenced and their corresponding m/z MALDI-TOF signals were estimated. Among these, DNA-binding protein, HU, and ribosomal proteins, L29, L30, L32 and S17, were shown experimentally to be also the most prominent and conserved signals in the other strain tested, Ruegeria pomeroyi DSS-3. Thus, we suggested that these five biomarkers, which give rise to 10 m/z peak signals derived from the mono- and doubly protonated proteins, are the best candidates for identifying bacteria belonging to the Ruegeria genus, and quickly assessed their phylogenetic proximity to described species. As an application of these biomarkers, we quickly screened 30 seawater bacterial isolates by MALDI-TOF and found one belonging to the Ruegeria genus, as further confirmed by 16S RNA sequencing. Due to its simplicity and effectiveness, this technique could be of immense value in monitoring bacteria in the environment in the near future.
Article
High-throughput identification of proteins with the latest generation of hybrid high-resolution mass spectrometers is opening new perspectives in microbiology. I present, here, an overview of tandem mass spectrometry technology and bioinformatics for shotgun proteomics that make 2D-PAGE approaches obsolete. Non-labelling quantitative approaches have become more popular than labelling techniques on most proteomic platforms because they are easier to carry out while their quantitative outcome is rather robust. Parameters for recording mass spectrometry data, however, need to be chosen carefully and statistics to assess the confidence of the results should not be neglected. Interestingly, next-generation sequencing methodologies make any microbial model quickly amenable to proteomics, leading to the documentation of a wide range of organisms from diverse environments. Some recent discoveries made using microbial proteomics have challenged some biological dogma, such as: (i) initiation of the translation does not occur predominantly from ATG codons in some microorganisms, (ii) non-canonical initiation codons are used to regulate the production of specific but important proteins and (iii) a gene may code for multiple polypeptide species, heterogeneous in terms of sequences. Microbial diversity and microbial physiology can now be revisited by means of exhaustive comparative proteomic surveys where thousands of proteins are detected and quantified. Proteogenomics, consisting of better annotating of genomes with the help of proteomic evidence, is paving the way for integrated multi-omic approaches in microbiology. Finally, meta-proteomic tools and approaches are emerging for tackling the high complexity of the microbial world as a whole, opening new perspectives for assessing how microbial communities function.
Article
Subcellular localization is a key functional characteristic of proteins. It is determined by signals encoded in the protein sequence. The experimental determination of subcellular localization is laborious. Thus, a number of computational methods have been developed to predict the protein location from sequence. However predictions made by different methods often disagree with each other and it is not always clear which algorithm performs best for the given cellular compartment. We benchmarked primary subcellular localization predictors for proteins from Gram-negative bacteria, PSORTb3, PSLpred, CELLO, and SOSUI-GramN, on a common dataset that included 1056 proteins. We found that PSORTb3 performs best on the average, but is outperformed by other methods in predictions of extracellular proteins. This motivated us to develop a meta-predictor, which combines the primary methods by using the logistic regression models, to take advantage of their combined strengths, and to eliminate their individual weaknesses. MetaLocGramN runs the primary methods, and based on their output classifies protein sequences into one of five major localizations of the Gram-negative bacterial cell: cytoplasm, plasma membrane, periplasm, outer membrane, and extracellular space. MetaLocGramN achieves the average Matthews correlation coefficient of 0.806, i.e. 12% better than the best individual primary method. MetaLocGramN is a meta-predictor specialized in predicting subcellular localization for proteins from Gram-negative bacteria. According to our benchmark, it performs better than all other tools run independently. MetaLocGramN is a web and SOAP server available for free use by all academic users at the URL http://iimcb.genesilico.pl/MetaLocGramN. This article is part of a Special Issue entitled: Computational Methods for Protein Interaction and Structural Prediction.