ArticlePDF Available

Concanavalin A-captured Glycoproteins in Healthy Human Urine

Authors:

Abstract

Both the urinary proteome and its glycoproteome can reflect human health status, and more directly, functions of kidney and urinary tracts. Because the high abundance protein albumin is not N-glycosylated, the urine N-glycoprotein enrichment procedure could deplete it, and urine proteome could thus provide a more detailed protein profile in addition to glycosylation information especially when albuminuria occurs in some kidney diseases. In terms of describing the details of urinary proteins, the urine glycoproteome is even a better choice than the proteome itself. Pooled urine samples from healthy volunteers were collected and acetone-precipitated for proteins. N-Linked glycoproteins enriched with concanavalin A affinity purification were separated and analyzed by SDS-PAGE-reverse phase LC/MS/MS or two-dimensional LC/MS/MS. A total of 225 urinary proteins were identified based on two-hit criteria with reliability over 97% for each peptide. Among these proteins, 94 were identified in previous urine proteome works, 150 were annotated as glycoproteins in Swiss-Prot, and 43 were predicted as glycoproteins by NetNGlyc 1.0. A number of known biomarkers and disease-related glycoproteins were identified. Because changes in protein quantity or the glycosylation status can lead to changes in the concanavalin A-captured glycoprotein profile, specific urine glycoproteome patterns might be observed for specific pathological conditions as multiplex urinary biomarkers. Knowledge of the urine glycoproteome is important in understanding kidney and body function.
Concanavalin A-captured Glycoproteins in
Healthy Human Urine*
S
Linjie Wang‡, Fuxin Li§, Wei Sun‡, Shuzhen Wu‡, Xiaorong Wang‡, Ling Zhang‡,
Dexian Zheng‡, Jue Wang§, and Youhe Gao‡¶
Both the urinary proteome and its glycoproteome can
reflect human health status, and more directly, functions
of kidney and urinary tracts. Because the high abundance
protein albumin is not N-glycosylated, the urine N-glyco-
protein enrichment procedure could deplete it, and urine
proteome could thus provide a more detailed protein pro-
file in addition to glycosylation information especially
when albuminuria occurs in some kidney diseases. In
terms of describing the details of urinary proteins, the
urine glycoproteome is even a better choice than the
proteome itself. Pooled urine samples from healthy vol-
unteers were collected and acetone-precipitated for pro-
teins. N-Linked glycoproteins enriched with concanavalin
A affinity purification were separated and analyzed by
SDS-PAGE-reverse phase LC/MS/MS or two-dimensional
LC/MS/MS. A total of 225 urinary proteins were identified
based on two-hit criteria with reliability over 97% for each
peptide. Among these proteins, 94 were identified in previ-
ous urine proteome works, 150 were annotated as glyco-
proteins in Swiss-Prot, and 43 were predicted as glycopro-
teins by NetNGlyc 1.0. A number of known biomarkers and
disease-related glycoproteins were identified. Because
changes in protein quantity or the glycosylation status can
lead to changes in the concanavalin A-captured glycopro-
tein profile, specific urine glycoproteome patterns might
be observed for specific pathological conditions as mul-
tiplex urinary biomarkers. Knowledge of the urine glyco-
proteome is important in understanding kidney and body
function. Molecular & Cellular Proteomics 5:560 –562,
2006.
The urinary proteome has received more and more atten-
tion in the proteomic field for its simplicity compared with
serum as well as its potential in biomarker discovery. Several
research teams have worked on profiling the healthy human
urine proteome using electrophoresis and/or liquid chroma-
tography followed by mass spectrometry identification (1–5).
The fact that some proteins were observed at higher molec-
ular weight than their theoretical ones on SDS-PAGE (5) con-
firmed that the post-translational modifications including gly-
cosylation exist extensively in urinary proteins. Glycosylation,
a common and important protein post-translational modifica-
tion, is involved in many biological processes such as cell
adhesion, signal transduction, immune response, and inflam-
matory reaction (6). More than half of all proteins are thought
to be glycoproteins, and they will undergo changes in quality
and quantity along with the changes in different physiological
and pathological states. Both the urine proteome and its
glycoproteome can reflect human health status, especially
functions of kidney and urinary tracts. In some kidney dis-
eases, albuminuria usually occurs. Because high abundance
albumin is not N-glycosylated, urine N-glycoprotein enrich-
ment procedure could deplete it, and the urine glycoproteome
could provide a more detailed protein profile in addition to the
glycosylation information. In terms of describing details of
urinary proteins, the urine glycoproteome is even a better
choice than the proteome itself. In this study our goal was to
profile the N-linked glycoproteome in normal human urine.
In the present study, pooled urine samples from healthy
males and females were collected, and urine proteins were
acetone-precipitated as described previously (5). Lectin con-
canavalin A (Con A)
1
was chosen to enrich N-linked glycopro-
teins for its broader specificity and higher affinity. Con A
affinity chromatography was performed according to previous
protocols (7). The eluted proteins (100
g loaded each time)
from Con A-agarose were separated and analyzed by two
approaches: 1) SDS-PAGE-RPLC/MS/MS (SDS-PAGE, in-gel
digestion, and peptide extraction followed by RPLC/MS/MS)
and 2) two-dimensional LC/MS/MS (protein mixture digestion
followed by strong cation exchange-RPLC/MS/MS). Proteins
were reduced with dithiothreitol and alkylated with iodoacet-
amide before tryptic digestion. Glycosidase PNGase F was
also added to remove N-glycans from glycoproteins during
the digestion. All peptides were analyzed by an LCQ-DECA
XP
plus
electrospray ion trap mass spectrometer (ThermoFinni-
gan, San Jose, CA). Ions were detected in a survey scan from
400 to 1500 amu (three microscans) followed by five data-de-
pendent MS/MS scans (five microscans each; isolation width,
3 amu; 35% normalized collision energy; dynamic exclusion
From the ‡Proteomics Research Center, National Key Laboratory of
Medical Molecular Biology, Institute of Basic Medical Sciences, Chi-
nese Academy of Medical Sciences/Peking Union Medical College,
Beijing 100005 and §Laboratory of Complex Systems and Intelligence
Science, Institute of Automation, Chinese Academy of Sciences,
Beijing 100080, China
Received, November 16, 2005
Published, MCP Papers in Press, November 29, 2005, DOI
10.1074/mcp.D500013-MCP200
1
The abbreviations used are: Con A, concanavalin A; AMASS,
advanced mass spectrum scanner; RP, reverse phase.
Dataset
© 2006 by The American Society for Biochemistry and Molecular Biology, Inc.560 Molecular & Cellular Proteomics 5.3
This paper is available on line at http://www.mcponline.org
for 3 min) in a completely automated fashion. Both ap-
proaches were run twice in parallel. All MS/MS spectra were
searched using Bioworks 3.1 against the database ipi.hu-
man.v3.05 (8) with enzyme constraints and with a static mod-
ification of 57 Da on cysteine residue and a differential
modification of 1 Da on asparagine residue. The precursor
ion mass tolerance was 1.40 Da, and the fragment ion mass
tolerance was 1.50 Da. We used SEQUEST criteria as follows:
Cn 0.1; Rsp 1; Xcorr 1.9 for 1 charged peptides;
Xcorr 2.2 for 2 with fully or partially tryptic end; Xcorr
3.0 for 2 without regard to the end residues; Xcorr 3.75 for
3. Then AMASS version 1.13 (available at www.proteomics-
cams.com) was used to filter the SEQUEST results with three
parameters: MatchPct 60, Cont 40, and Rscore 2.6 (9,
10). Proteins with two or more spectra approved by AMASS
were accepted as positive identifications. Reverse database
searching was used to estimate the false positive rate. The
false positive rate peptide number in reverse database/
peptide number in forward database 100%, and the final
average false positive rate was 2.76% for SEQUEST/AMASS-
filtered positive peptides.
In total, 225 proteins were identified (excluding keratins)
based on two or more positive peptides with a reliability of
more than 97% for each (Supplemental Table 1). For 142
proteins recognized with at least two independent peptides,
the reliability of protein identification can reach more than
99.9%. Even for the other 83 proteins identified by single
peptide with multiple hits, the reliability was still more than
97%. 94 proteins were identified in previous urine proteome
studies (1–5). 43 proteins were also identified in serum N-
glycoproteome (11–13). 150 were annotated as glycoproteins
or subunits of glycoproteins in Swiss-Prot, and 43 were an-
notated as potential glycoproteins predicted by NetNGlyc
1.0.
2
22 proteins had potential N-linked glycan binding sites
but no signal peptides, which means they are unlikely to be
N-glycosylated. 10 proteins had no N-glycosylation sites.
Those 32 identified non-N-glycosylated proteins might either
be associated with the captured N-glycoproteins or nonspe-
cifically bind to Con A. For example, albumin can be associ-
ated with many proteins in the list as one of its significant
functions is protein transportation. The proteins were catego-
rized based on their subcellular localizations. For the known
proteins, their subcellular localizations were determined by
the Swiss-Prot or Gene Ontology annotations. For all the
others, localizations were predicted based on those of similar
known proteins by BLAST or PSORT II for proteins that had no
similar known proteins in BLAST search. There were 101
extracellular, 67 membrane, 32 lysosomal, 22 cytoplasmic,
one cytoskeletal, and two nuclear proteins. They are mainly
composed of enzymes, enzyme inhibitors, receptors, immu-
noglobulin/complements, and apolipoproteins that participate
in many biological processes such as immune response and
inflammation, blood coagulation, cell adhesion, signal trans-
duction, and cleansing of the aged or abnormal proteins in
lysosome. A number of known biomarkers and disease-re-
lated glycoproteins were identified, such as prostate-specific
antigen, cadherins, and cathepsins. To enrich the details of
the urinary Con A-captured glycoprotein profile, 119 proteins
identified by one hit were listed in Supplemental Table 2. In
total there were 334 proteins identified with more than 97%
reliability. The peptide hit numbers of all the proteins were
also included in Supplemental Tables 1 and 2 to serve as a
rough estimate of the protein quantity in the sample.
Both the changes in protein quantity and the glycosylation
status of glycoproteins can be reflected in the profiled
changes of Con A-captured glycoproteins. Specific patterns
might be observed in the Con A-captured glycoproteins for
specific pathological conditions as “multiplex urinary biomar-
kers.” Hence we believe profiling Con A-captured glycopro-
teins in healthy human urine will be a very useful reference for
future applications of the urine glycoproteome. We expect
that with the development of more efficient enrichment tech-
niques and sophisticated mass spectrometers more glyco-
proteins will be in the detectable range.
Acknowledgments—We thank Dr. Mark A. Knepper (Laboratory of
Kidney and Electrolyte Metabolism, NHLBI, National Institutes of
Health, Bethesda, MD) for helpful discussion. We also thank Dr.
Weizhi Chen and Peter Marshall for critical reading of the manuscript.
* This work was supported in part by National Basic Research
Program Grant 2004CB520804 and by National Natural Science
Foundation Grants 30270657, 30230150, and 3037030. The costs of
publication of this article were defrayed in part by the payment of
page charges. This article must therefore be hereby marked adver-
tisement in accordance with 18 U.S.C. Section 1734 solely to indi-
cate this fact.
SThe on-line version of this article (available at http://www.
mcponline.org) contains supplemental material.
To whom correspondence should be addressed: Inst. of Basic
Medical Sciences, Chinese Academy of Medical Sciences/Peking
Union Medical College, 5 Dong Dan San Tiao, Beijing 100005, China.
Tel.: 86-010-6521-2284; Fax: 86-010-6521-2284; E-mail: gaoyouhe@
pumc.edu.cn.
REFERENCES
1. Spahr, C. S., Davis, M. T., McGinley, M. D., Robinson, J. H., Bures, E. J.,
Beierle, J., Mort, J., Courchesne, P. L., Chen. K., Wahl, R. C., Yu, W.,
Luethy, R., and Patterson, S. D. (2001) Towards defining the urinary
proteome using liquid chromatography-tandem mass spectrometry: I.
Profiling an unfractionated tryptic digest. Proteomics 1, 93–107
2. Davis, M. T., Spahr, C. S., McGinley, M. D., Robinson, J. H., Bures, E. J.,
Beierle, J., Mort, J., Yu, W., Luethy, R., and Patterson, S. D. (2001)
Towards defining the urinary proteome using liquid chromatography-
tandem mass spectrometry: II. Limitations of complex mixture analyses.
Proteomics 1, 108 –1171
3. Thongboonkerd, V., McLeish, K. R., Arthur, J. M., and Klein, J. B. (2002)
Proteomic analysis of normal human urinary proteins isolated by acetone
precipitation or ultracentrifugation. Kidney Int. 62, 1461–1469
4. Pieper, R., Gatlin, C. L., McGrath, A. M., Makusky, A. J., Mondal, M.,
Seonarain M., Field, E., Schatz, C. R., Estock, M. A., Ahmed, N., Ander-
son, N. G., and Steiner, S. (2004) Characterization of the human urinary
proteome: a method for high-resolution display of urinary proteins on
two-dimensional electrophoresis gels with a yield of nearly 1400 distinct
2
R. Gupta, E. Jung, and S. Brunak, manuscript in preparation.
Urinary N-Glycoproteins
Molecular & Cellular Proteomics 5.3 561
protein spots. Proteomics 4, 1159 –1174
5. Sun, W., Li, F., Wu, S., Wang, X., Zheng, D., Wang, J., and Gao, Y. (2005)
Human urine proteome analysis by three separation approaches. Pro-
teomics 5, 4994 –5001
6. Haltiwanger, R. S., and Lowe, J. B. (2004) Role of glycosylation in devel-
opment. Annu. Rev. Biochem. 73, 491–537
7. Kristiansen, T. Z., Bunkenborg, J., Gronborg, M., Molina, H., Thuluvath,
P. J., Argani, P., Goggins, M. G., Maitra, A., and Pandey, A. (2004) A
proteomic analysis of human bile. Mol. Cell. Proteomics 3, 715–728
8. Kersey, P. J., Duarte, J., Williams, A., Karavidopoulou, Y., Birney, E., and
Apweiler, R. (2004) The International Protein Index: an integrated data-
base for proteomics experiments. Proteomics 4, 1985–1988
9. Sun, W., Li, F., Wang, J., Zheng, D., and Gao, Y. (2004) AMASS: software
for automatically validating the quality of MS/MS spectrum from SE-
QUEST results. Mol. Cell. Proteomics 3, 1194 –1199
10. Li, F., Sun, W., Gao, Y., and Wang, J. (2004) Rscore: a peptide randomicity
score for evaluating tandem mass spectra. Rapid Commun. Mass Spec-
trom. 18, 1–5
11. Yang, Z., and Hancock, W. S. (2004) Approach to the comprehensive
analysis of glycoproteins isolated from human serum using a multi-lectin
affinity column. J. Chromatogr. A 1053, 79–88
12. Bunkenborg, J., Pilch, B. J., Podtelejnikov, A. V., and Wisniewski, J. R.
(2004) Screening for N-glycosylated proteins by liquid chromatography
mass spectrometry. Proteomics 4, 454 465
13. Yang, Z., Hancock, W. S., Chew, T. R., and Bonilla, L. (2005) A study of
glycoproteins in human serum and plasma reference standards (HUPO)
using multilectin affinity chromatography coupled with RPLC-MS/MS.
Proteomics 5, 3353–3356
Urinary N-Glycoproteins
562 Molecular & Cellular Proteomics 5.3
... The 125 AAs of the NTD and 20 AAs of the CTD were sent to an online program for predicting the potential post-translational modification (PTM) sites [22][23][24][25] (Table S5). The thresholds were set at 50%. ...
... It can be seen that the four fusion proteins were different, and all proteins containing CTD (NGC and GC) bound to Fib-L. The actual mass was slightly different from the theoretical value (Table S8), indicating that the protein had a post-processing modification such as glycosylation or phosphorylation [22][23][24][25] (Fig. S2 and Table S5). ...
Article
Fibroin of the silkworm consists of fibroin heavy chain (Fib-H) with hydrophobic intermediate repeats flanked by hydrophilic N and C terminal domains (NTD and CTD, respectively), fibroin light chain (Fib-L), and P25. However, the respective roles of each polypeptide in silk processing remain largely unknown. Here, a series of transgenic silkworms with different fusion gene expression cassettes were created in order to selectively express different fluorescent fusion proteins in silk glands. The roles of different components in silk processing were investigated via observing and analyzing the movement and distribution of these proteins in the silk gland and in cocoon silk. The data showed that hydrophilic NTDs were distributed on the surface of micelles, providing sufficient electrostatic repulsion to prevent premature crystallization of silk proteins. Hydrophilic CTD==Ls (“==” represents the disulfide bond) were located on the inner layer of micelles to control the solubility of large micelles. The results presented here elucidated the underlying mechanisms of silkworm silk processing in vivo. This is significant for the development of artificial spinning technology, novel silk biomaterials, and silk gland expression systems.
... This can be attributed mainly to the complex and heterogenous nature of glycan structures linked to either Asparagine (N-linked) or Serine/Threonine (O-linked) amino acid residues [15]. The importance of different types of glycosylation (N-linked and O-linked) has already been highlighted in numerous physiological and pathological mechanisms related to inflammation, angiogenesis and cancer [16][17][18]. Along these lines, this study was carried out under the hypothesis that characterization of glycopeptides in the urine can reveal pathological mechanisms in CKD. ...
Article
Full-text available
Chronic kidney disease (CKD) is prevalent in 10% of world’s adult population. The role of protein glycosylation in causal mechanisms of CKD progression is largely unknown. The aim of this study was to identify urinary O-linked glycopeptides in association to CKD for better characterization of CKD molecular manifestations. Urine samples from eight CKD and two healthy subjects were analyzed by CE-MS/MS and glycopeptides were identified by a specific software followed by manual inspection of the spectra. Distribution of the identified glycopeptides and their correlation with Age, eGFR and Albuminuria were evaluated in 3810 existing datasets. In total, 17 O-linked glycopeptides from 7 different proteins were identified, derived primarily from Insulin-like growth factor-II (IGF2). Glycosylation occurred at the surface exposed IGF2 Threonine 96 position. Three glycopeptides (DVStPPTVLPDNFPRYPVGKF, DVStPPTVLPDNFPRYPVG and DVStPPTVLPDNFPRYP) exhibited positive correlation with Age. The IGF2 glycopeptide (tPPTVLPDNFPRYP) showed a strong negative association with eGFR. These results suggest that with aging and deteriorating kidney function, alterations in IGF2 proteoforms take place, which may reflect changes in mature IGF2 protein. Further experiments corroborated this hypothesis as IGF2 increased plasma levels were observed in CKD patients. Protease predictions, considering also available transcriptomics data, suggest activation of cathepsin S with CKD, meriting further investigation.
... Amino acid glycosylation sites of ZJ49 isolate were predicted by NetNGlyc-1.0 prediction server 1 (Wang et al., 2006). The tertiary structure model (The template was 4wss.1.A) of ZJ49 HA protein was constructed through the SWISS-MODEL online server 2 (Waterhouse et al., 2018), and PyMOL (Version 2.0.7) was used for visual analysis to map the spatial location of receptor binding sites and glycosylation sites (Lu, 2020). ...
Article
Full-text available
H6-subtype avian influenza virus (AIV) was prevalent in the world and could sporadically infect humans. Here, a new chicken-derived H6N6-subtype AIV strain A/chicken/Zhejiang/49/2021 (ZJ49) was isolated in Zhejiang Province, China in 2021. Phylogenetic analysis by Maximum likelihood methods showed that H6-subtype AIVs were classed into 13 groups according to HA gene. The ZJ49 strain belonged to the G12 group, which mainly consisted of strains from Asian and dominated in recent years. Based on NA gene, H6-subtype AIVs were divided into N6.1 and N6.2 clades according to the NA gene. The ZJ49 isolate was located in the N6.2e clade, which mainly consisted of the H5N6-subtype AIVs. Phylogenetic analysis by Bayesian methods showed that the effective quantity size of H6-subtype AIVs increased around 1990, reached a peak around 2015, declined after 2015, then kept in a stable level after 2018. The reassortment analysis predicted that the PB2, PA, and NA genes of ZJ49 may recombine with H5-subtype AIVs. The amino acid at 222 position of HA gene of ZJ49 strain mutated from A to V, suggesting that ZJ49 has a potential ability to cross species barriers. The four glycosylation sites were highly conserved, implying less impact on the fold and conception of HA stem structure. Our results revealed the complicated evolution, reassortment, and mutations of receptor binding sites of H6-subtype AIVs, which emphasize the importance to continuously monitor the epidemiology and evolution of H6-subtype AIVs.
... The composition of protein in the urine are however, relatively more stable compared to other biofluids, including serum and plasma, which are prone to proteolytic degradation during and after sampling [108]. Urine secretion is one of the steps of urine formation, and it is a consequence of blood filtration, but urine contains also proteins secreted from tubules and kidney specific cells [109][110][111][112][113][114]. The filtration of serum proteins is based on their size and charge at the glomeruli [115]. ...
Article
Full-text available
Novel molecular diagnostic tools that are rapid, accurate, cost-effective, and could be used at point-of-care are urgently required if we are to achieve the goals of the End TB Strategy and reduce TB incidence by 90% by 2035. Early and rapid diagnosis of TB can significantly improve therapeutic outcomes, patient survival rates and reduce recurrence. Studies have examined blood, urine, saliva and sputum to identify new TB biomarker candidates. However, no comparative evaluation of all four fluids as a diagnostic tool to identify active and latent TB has been reported. In the present paper, I peer-reviewed publications from PubMed databases in all languages between 2000 - April 1, 2018. Identifying a sample type, with appropriate microbial, immunologic or molecular biomarkers to accurately diagnose active TB or latent TB, particularly if the sample was easier to collect, such as saliva; offers a unique opportunity to reduce discomfort in patients via the provision of a noninvasive TB detection method. With its ease of access, processing and the invasiveness of collection procedures for TB molecular diagnosis, putting effort and resources into the saliva biomarker investigation may yield a point-of-care molecular diagnostic test that could revolutionize the TB field.
... Thus, many urine proteomic studies have focused on comparing protein abundance between healthy subjects and patients with specific medical conditions but have not considered PTMs. Only urinary glycoproteins were used in some biomarker studies, as high-abundance albumin is not N-glycosylated and can be subsequently removed (Wang et al. 2006;Wang et al. 2008). ...
Preprint
Full-text available
Urine, by accumulating all kinds of changes, was proposed to be a better source for biomarker discovery. As one of the most common post-translational modifications, phosphorylation plays a vital role in many biological activities. However, the urine phosphoproteome has been largely neglected due to the low abundance of phosphoproteins and the presence of various phosphatases in urine. The low level of background phosphorylation in urine is actually advantageous, as urinary phosphopeptides/proteins that are stable to the phosphatases present in urine have the potential to serve as valuable disease biomarkers. Using a TiO2 enrichment strategy, this study aimed to create a comprehensive proteomic profile of human urinary phosphoproteins and to characterize the changes in the urine phosphoproteome after incubation of urine with cell lysates. In total, 106 urine phosphorylation sites corresponding to 64 proteins, including 80 previously unidentified human urine protein phosphorylation sites, were identified by mass spectrometry. Fifteen phosphopeptides, together averaging 47% of the total phosphopeptides, were found in samples from three individuals. Addition of cellular proteins to urine did not significantly change the phosphorylation level of urine proteins. But there were still a few phosphopeptides from cell lysates survived urinary phosphatases; such phosphopeptides represent potential biomarkers in urine.
Article
Glycosylation changes in cancer proteins have been associated with malignant transformation. However, techniques for analyzing site-specific glycosylation changes in target proteins obtained from clinical tissue samples are insufficient. To overcome these problems, we developed a targeted N-glycoproteomic approach consisting of immunoprecipitation, glycopeptide enrichment, LC/MS/MS and structural assignment using commercially available analytical software followed by manual confirmation. This approach was applied to the comparative site-specific glycosylation analysis of lysosome-associated membrane glycoprotein 1 (LAMP1) between breast cancer (BC) tumors and normal tissues adjacent to tumors. Extensive determination of glycan heterogeneity from four N-glycosylation sites (Asn84/103/249/261) in LAMP1 identified 262 glycoforms and revealed remarkable diversity in tumor glycan structures. A significant increase in N-glycoforms with multiple fucoses and sialic acids at Asn84/249 and high-mannose-type glycans at Asn103/261 were observed in the tumor. Principal component analysis revealed that tumors of different subtypes have independent distributions. This approach enables site-specific glycopeptide analysis of target glycoprotein in breast cancer tissue and become a powerful tool for characterizing tumors with different pathological features by their glycan profiles.
Article
Cis-diol containing molecules are of great biological significance. The low abundance of cis-diol containing molecules and the complexity of biosamples have posed great challenges for the analysis of these compounds in biosamples. The pH-controlled, covalent and reversible binding property of boronate affinity materials (BAMs) endows them with huge application potential in specific capture, recognition, labeling and assay of cis-diol containing molecules. Recently, much effort has been devoted to improve the performance of BAMs, enabling various promising applications. This review aims at providing a clear interpretation of the principle for performance improvement of BAMs and tries to establish useful guidance for the engineering BAMs to meet the diverse requirements in bioanalysis. Meanwhile, applications of BAMs in sample preparation, biomolecular recognition and assay, molecular labeling, aptamers selection and cancer therapy are surveyed. Finally, a conclusion on recent progress and prospects for the future development in BAMs are presented.
Article
Post-translational modifications (PTMs) diversify the molecular structures of proteins and play essential roles in regulating their functions. Abnormal PTM status has been linked to a variety of developmental disorders and human diseases, highlighting the importance of studying PTMs in understanding physiological processes and discovering novel nodes and links with therapeutic intervention potential. Classical biochemical methods are suitable for studying PTMs on individual proteins; however, global profiling of PTMs in proteomes remains a challenging task. In this feature article, we start with a brief review of the traditional affinity-based strategies and shift the emphasis to summarizing recent progress in the development and application of chemical and computational proteomic strategies to delineate the global landscapes of functional PTMs. Finally, we discuss current challenges in PTM detection and provide future perspectives on how the field can be further advanced.
Preprint
Full-text available
Biomarker is the measurable change associated with a physiological or pathophysiological process, its nature is change. Contrast to the blood which is under homeostatic controls, urine reflects changes in the body earlier and more sensitive therefore is a better biomarker source. Lysine acetylation is an abundant and highly regulated post-translational modification. It plays a pivotal role in modulating diverse biological processes and is associated with various important diseases. Enrichment or visualization of proteins with specific post-translational modifications provides a method for sampling the urinary proteome and reducing sample complexity. In this study, we used anti-acetyllysine antibody-based immunoaffinity enrichment combined with high-resolution mass spectrometry to profile lysine-acetylated proteins in normal human urine. A total of 629 acetylation sites on 315 proteins were identified, including some very low-abundance proteins. This is the first proteome-wide characterization of lysine acetylation proteins in normal human urine. Our dataset provides a useful resource for the further discovery of the lysine acetylated proteins as biomarker in urine.
Article
Rationale: N-glycosylation is one of the most common protein post-translational modifications, and it is extremely complex with multiple glycoforms from different monosaccharide compositions, sequences, glycosidic linkages, and anomeric positions. Each glycoform functions with a particular site- and structure-specific N-glycan that can be fully characterized using state-of-the-art tandem mass spectrometry (MS/MS) and the intact N-glycopeptide database search engine GPSeeker that we recently developed. Urine has recently gained increasing attention as a non-invasive source for disease marker discovery. In this study, we report our structure-specific N-glycoproteomics study of human urine. Methods: We performed trypsin digestion, Zwitterionic Hydrophilic Interaction chromatography (ZIC-HILIC) enrichment, C18-RPLC/nano ESI-MS/MS using HCD with stepped normalized collisional energies, and GPSeeker database search for a comprehensive site- and structure-specific N-glycoproteomics characterization of the human urinary N-glycoproteome at the intact N-glycopeptide level. For this, we used b/y product ion pairs from the GlcNAc-containing site-determining peptide backbone and structure-diagnostic product ions from the N-glycan moieties, respectively. Results: We identified 2,986 intact N-glycopeptides with comprehensive site and structure information for the peptide backbones (amino acid sequences and N-glycosites) and the N-glycan moieties (monosaccharide compositions, sequences/linkages). The 2,986 intact N-glycopeptide IDs corresponded to 754 putative N-glycan linkage structures on 419 N-glycosites of 450 peptide backbones from 327 intact N-glycoproteins. Next, 146 linkage structures and 200 N-glycosites were confirmed with structure-diagnostic and GlcNAc-containing site-determining product ions, respectively. Conclusions: We found 106 new N-glycosites, not annotated in the current UniProt database. The elution-abundance patterns of urinary intact N-glycopeptide oxonium ions (m/z 138 and 204) of the same subject were temporally stable during the day and over 6 months. These patterns are rather different among different subjects. The results implied an interesting possibility that glycopeptide oxonium ion patterns could serve as distinguishing markers between individuals and/or between physiological and pathological states.
Article
Full-text available
Proteomic techniques have recently become available for large-scale protein analysis. The utility of these techniques in identification of urinary proteins is poorly defined. We constructed a proteome map of normal human urine as a reference protein database by using two differential fractionated techniques to isolate the proteins. Proteins were isolated from urine obtained from normal human volunteers by acetone precipitation or ultracentrifugation, separated by two-dimensional polyacrylamide gel electrophoresis (2D-PAGE) and identified by matrix-assisted laser desorption ionization-time-of-flight (MALDI-TOF) mass spectrometry followed by peptide mass fingerprinting. A total of 67 protein forms of 47 unique proteins were identified, including transporters, adhesion molecules, complement, chaperones, receptors, enzymes, serpins, cell signaling proteins and matrix proteins. Acetone precipitated more acidic and hydrophilic proteins, whereas ultracentrifugation fractionated more basic, hydrophobic, and membrane proteins. Bioinformatic analysis predicted glycosylation to be the most common explanation for multiple forms of the same protein. Combining two differential isolation techniques magnified protein identification from human urine. Proteomic analysis of urinary proteins is a promising tool to study renal physiology and pathophysiology and to determine biomarkers of renal disease.
Article
Full-text available
We have carried out a comprehensive characterization of human bile to define the bile proteome. Our approach involved fractionation of bile by one-dimensional gel electrophoresis and lectin affinity chromatography followed by liquid chromatography tandem mass spectrometry. Overall, we identified 87 unique proteins, including several novel proteins as well as known proteins whose functions are unknown. A large majority of the identified proteins have not been previously described in bile. Using lectin affinity chromatography and enzymatically labeling of asparagine residues carrying glycan moieties by (18)O, we have identified a total of 33 glycosylation sites. The strategy described in this study should be generally applicable for a detailed proteomic analysis of most body fluids. In combination with "tagging" approaches for differential proteomics, our method could be used for identification of cancer biomarkers from any body fluid.
Chapter
IntroductionMaterials and methods MaterialsIsolating glycoproteins using multilectin affinity columnsAnalysis of glycoproteins on LC-LCQ MSAnalysis of glycoproteins on LC-LTQ MSProtein database searchResults and discussion Protein IDs from the plasma and serum samplesComparison between serum and plasma glycoproteomesComparison of the glycoproteins present in the samples collected from three ethnic groupsConcluding remarksReferences MaterialsIsolating glycoproteins using multilectin affinity columnsAnalysis of glycoproteins on LC-LCQ MSAnalysis of glycoproteins on LC-LTQ MSProtein database search Protein IDs from the plasma and serum samplesComparison between serum and plasma glycoproteomesComparison of the glycoproteins present in the samples collected from three ethnic groups
Article
With an emphasis on obtaining a multitude of high quality tandem mass spectrometry spectra for protein identification, instrumental parameters are described for the liquid chromatography-tandem mass spectrometry analysis of trypsin digested unfractionated urine using a hybrid quadrupole-time-of-flight (Q-TOF) mass spectrometer. Precursor acquisition rates of up to 20 distinct precursors/minute in a single analysis were obtained through the use of parallel precursor selection (four precursors/survey period) and variable collision induced dissociation integration time (1 to 6 periods summed). Maximal exploitation of the gas phase fractionated ions was obtained through the use of narrow survey scans and iterative data-dependent analyses incorporating dynamic exclusion. The impact on data fidelity as a product of data-dependent selection of precursor ions from a dynamically excluded field is discussed with regards to sample complexity, precursor selection rates, survey scan range and facile chemical modifications. Operational and post-analysis strategies are presented to restore data confidence and reconcile the greatest number of matched spectra.
Article
The proteome of normal male urine from a commercial pooled source has been examined using direct liquid chromatography-tandem mass spectrometry (LC-MS/MS). The entire urinary protein mixture was denatured, reduced and enzymatically digested prior to LC-MS/MS analysis using a hybrid-quadrupole time-of-flight mass spectrometer (Q-TOF) to perform data-dependent ion selection and fragmentation. To fragment as many peptides as possible, the mixture was analyzed four separate times, with the mass spectrometer selecting ions for fragmentation from a subset of the entire mass range for each run. This approach requires only an autosampler on the HPLC for automation (i.e, unattended operation). Across these four analyses, 1.450 peptide MS/MS spectra were matched to 751 sequences to identify 124 gene products (proteins and translations of expressed sequence tags). Interestingly, the experimental time for these analyses was less than that required to run a single two-dimensional gel.
Article
During my career on the faculty of the Department of Chemistry and Biochemistry at the University of Guelph, I spent two sabbatical years with John Beynon at Swansea. The first of these was in 1975/1976, and was intended to educate me into mass spectrometry as a new research direction for me since shock-tube research was reaching the end of its more productive years. As it turned out I did indeed get myself educated, and we also managed to publish several papers together, a non-trivial feat since John's new instrument did not arrive till after I had returned to my job in Canada! One of the things we worked on together was development of new techniques for obtaining MS/MS data of various kinds using double-focusing instruments. This work was motivated by the work of Graham and Evans on the so-called linked-scan at fixed E2/V, that provided fragment ion spectra of a mass-selected precursor ion but was subject to severe problems from ion source detuning. I had taken it on myself to see whether I could discover a complementary linked-scan that would reveal all parents of a mass-selected fragment ion without the detuning problems of the old Barber-Elliott scan (V-scan); after about four pages of tortuous logic I did succeed in deducing that a scan at fixed B2/E should work. John seemed pleased with this, and undertook to write something up and think about it during a trip to London he and Yvonne were making by train. On his arrival back in Swansea he did not go straight home, but burst into my small guest-scientist's office, just bubbling with what can only be called boyish enthusiasm: “I've cracked it”. Basically, he had cut to the chase, right past my clumsy approach, by a simple algebraic device in which he introduced a variable (restricted to integral values for practical reasons) that described the relationship between the field ratios and the mass ratios of the precursor-fragment ions. This was later developed further into what amounted to an existence theorem for linked-scan relationships that nicely complemented the generalized graphical approach that was being developed (unknown to us) by Mike Lacey and Colin Macdonald in Australia. (And of course the particular B/E scan was also being developed simultaneously by Keith Jennings and his colleagues, again unknown to us.) But the point of this story is the typical way in which all the circumlocutory baggage of my four pages was swept away by a simple (once John had produced it!) insight.
Article
In the last few years mass spectrometry has become the method of choice for characterization of post-translationally modified proteins. Whereas most protein chemical modifications are binary in the sense that only one change can be associated with a given residue, many different oligosaccharides can be attached to a glycosylation site residue. The detailed characterization of glycoproteins in complex biological samples is extremely challenging. However, information on N-glycosylation can be gained at an intermediary level. Here we demonstrate a procedure for mapping N-glycosylation sites in complex mixtures by reducing sample complexity and enriching glycoprotein content. Glycosylated proteins are selected by an initial lectin chromatography step and digested with endoproteinase Lys-C. Glycosylated peptides are then selected from the digest mixture by a second lectin chromatography step. The glycan components are removed with N-glycosidase F and the peptides digested with trypsin before analysis by on-line reversed-phase liquid chromatography mass spectrometry. Using two different lectins, concanavalin A and wheat germ agglutinin, this procedure was applied to human serum and a total of 86 N-glycosylation sites in 77 proteins were identified.
Article
The abundance profile of the human urinary proteome is known to change as a result of diseases or drug toxicities, particularly of those affecting the kidney and the urogenital tract. A consequence of such insults is the ability to identify proteins in urine, which may be useful as quantitative biomarkers. To succeed in discovering them, reproducible urine sample preparation methods and good protein resolution in two-dimensional electrophoresis (2-DE) gels for parallel semiquantitative protein measurements are desirable. Here, we describe a protein fractionation strategy enriching proteins of molecular masses (M(r)) lower than 30 kDa in a fraction separate from larger proteins. The fraction containing proteins with M(r)s higher than 30 kDa was subsequently subjected to immunoaffinity subtraction chromatography removing most of the highly abundant albumin and immunoglobulin G. Following 2-DE display, superior protein spot resolution was observed. Subsequent high-throughput mass spectrometry analysis of ca. 1400 distinct spots using matrix-assisted laser desorption/ionization-time of flight peptide mass fingerprinting and liquid chromatography-electrospray ionization tandem mass spectrometry lead to the successful identification of 30% of the proteins. As expected from high levels of post-translational modifications in most urinary proteins and the presence of proteolytic products, ca. 420 identified spots collapsed into 150 unique protein annotations. Only a third of the proteins identified in this study are described as classical plasma proteins in circulation, which are known to be relatively abundant in urine despite their retention to a large extent in the glomerular blood filtration process. As a proof of principle that our urinary proteome display effort holds promise for biomarker discovery, proteins isolated from the urine of a renal cell carcinoma patient were profiled prior to and after nephrectomy. Particularly, the decrease in abundance of the kininogen 2-DE gel spot train in urine after surgery was striking.
Article
Researchers have long predicted that complex carbohydrates on cell surfaces would play important roles in developmental processes because of the observation that specific carbohydrate structures appear in specific spatial and temporal patterns throughout development. The astounding number and complexity of carbohydrate structures on cell surfaces added support to the concept that glycoconjugates would function in cellular communication during development. Although the structural complexity inherent in glycoconjugates has slowed advances in our understanding of their functions, the complete sequencing of the genomes of organisms classically used in developmental studies (e.g., mice, Drosophila melanogaster, and Caenorhabditis elegans) has led to demonstration of essential functions for a number of glycoconjugates in developmental processes. Here we present a review of recent studies analyzing function of a variety of glycoconjugates (O-fucose, O-mannose, N-glycans, mucin-type O-glycans, proteoglycans, glycosphingolipids), focusing on lessons learned from human disease and genetic studies in mice, D. melanogaster, and C. elegans.
Article
Despite the complete determination of the genome sequence of several higher eukaryotes, their proteomes remain relatively poorly defined. Information about proteins identified by different experimental and computational methods is stored in different databases, meaning that no single resource offers full coverage of known and predicted proteins. IPI (the International Protein Index) has been developed to address these issues and offers complete nonredundant data sets representing the human, mouse and rat proteomes, built from the Swiss-Prot, TrEMBL, Ensembl and RefSeq databases.