ArticlePDF AvailableLiterature Review

The Protein Kinase Family: Conserved Features and Deduced Phylogeny of the Catalytic Domains

Authors:
  • Montana Molecular

Abstract

In recent years, members of the protein kinase family have been discovered at an accelerated pace. Most were first described, not through the traditional biochemical approach of protein purification and enzyme assay, but as putative protein kinase amino acid sequences deduced from the nucleotide sequences of molecularly cloned genes or complementary DNAs. Phylogenetic mapping of the conserved protein kinase catalytic domains can serve as a useful first step in the functional characterization of these newly identified family members.
The Protein Kinase Family: Conserved Features and Deduced Phylogeny of the Catalytic
Domains
Author(s): Steven K. Hanks, Anne Marie Quinn and Tony Hunter
Source:
Science,
New Series, Vol. 241, No. 4861 (Jul. 1, 1988), pp. 42-52
Published by: American Association for the Advancement of Science
Stable URL: http://www.jstor.org/stable/1701319 .
Accessed: 21/04/2013 19:08
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at .
http://www.jstor.org/page/info/about/policies/terms.jsp
.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact support@jstor.org.
.
American Association for the Advancement of Science is collaborating with JSTOR to digitize, preserve and
extend access to Science.
http://www.jstor.org
This content downloaded from 132.239.1.231 on Sun, 21 Apr 2013 19:08:54 PM
All use subject to JSTOR Terms and Conditions
58. M. L. Alexander,
M. A. Johnson,
W. C. Lineberger,
J. Chem.
Phys.
82, 5288
(1985).
59. D. E. Hunton,
M. Hofmtnann,
T. G. Lindeman,
C. R. Albertoni,
A. W. Castleman,
Jr., ibid., p. 2884.
60. W. D. Reents, Jr., A. M. Mujsce, V. E. Bondybey, M. L. Mandich, ibid.
86, 5568
(1987).
61. M. D. Morse,
M. E. Geusic, J. R. Heath,
R. E. Smalley,
ibid.
83, 2293 (1985).
62. E. K. Parks,
K. Liu, S. C. Richtsmeier,
L. G. Pobo, S. J. Riley, ibid.
82, 5470
(1985).
63. S. A. Ruatta,
L. Hanley,
S. L. Anderson,
Chem.
Phys.
Lett.
137, 5 (1987).
58. M. L. Alexander,
M. A. Johnson,
W. C. Lineberger,
J. Chem.
Phys.
82, 5288
(1985).
59. D. E. Hunton,
M. Hofmtnann,
T. G. Lindeman,
C. R. Albertoni,
A. W. Castleman,
Jr., ibid., p. 2884.
60. W. D. Reents, Jr., A. M. Mujsce, V. E. Bondybey, M. L. Mandich, ibid.
86, 5568
(1987).
61. M. D. Morse,
M. E. Geusic, J. R. Heath,
R. E. Smalley,
ibid.
83, 2293 (1985).
62. E. K. Parks,
K. Liu, S. C. Richtsmeier,
L. G. Pobo, S. J. Riley, ibid.
82, 5470
(1985).
63. S. A. Ruatta,
L. Hanley,
S. L. Anderson,
Chem.
Phys.
Lett.
137, 5 (1987).
64. R. D. Levine, R. B. Bernstein, Eds., Molecular Reaction
Dynamics
and Chemical
Reactivity
(Oxford Univ. Press, New York, 1987).
65. B. Koplitz et al., Faraday
Discuss. Chem. Soc. 82, 125 (1986).
66. N. F. Scherer, L. R. Khundkar, R. B. Bernstein, A. H. Zewail,J. Chem.
Phys. 87,
1451 (1987).
67. S. Morgan and A. W. Castleman, Jr.,J. Am. Chem. Soc. 109, 2867 (1987).
68. D. K. Bohme, NATO ASI Ser. C 118, 111 (1984).
69. Financial support by the Army Research Office, grant DAAG29-85-K-0215, the
Department of Energy, grant DE-AC02-82ER60055, and the National Science
Foundation, grant ATM-82-04010, is gratefully acknowledged.
64. R. D. Levine, R. B. Bernstein, Eds., Molecular Reaction
Dynamics
and Chemical
Reactivity
(Oxford Univ. Press, New York, 1987).
65. B. Koplitz et al., Faraday
Discuss. Chem. Soc. 82, 125 (1986).
66. N. F. Scherer, L. R. Khundkar, R. B. Bernstein, A. H. Zewail,J. Chem.
Phys. 87,
1451 (1987).
67. S. Morgan and A. W. Castleman, Jr.,J. Am. Chem. Soc. 109, 2867 (1987).
68. D. K. Bohme, NATO ASI Ser. C 118, 111 (1984).
69. Financial support by the Army Research Office, grant DAAG29-85-K-0215, the
Department of Energy, grant DE-AC02-82ER60055, and the National Science
Foundation, grant ATM-82-04010, is gratefully acknowledged.
The
Protein
Kinase
Family:
Conserved
Features
and
Deduced
Phylogeny
of
the
Catalytic
Domains
STEVEN
K. HANKS,
ANNE
MARIE
QUINN,
TONY HUNTER
The
Protein
Kinase
Family:
Conserved
Features
and
Deduced
Phylogeny
of
the
Catalytic
Domains
STEVEN
K. HANKS,
ANNE
MARIE
QUINN,
TONY HUNTER
In recent years, members of the protein kinase family have
been discovered at an accelerated pace. Most were first
described, not through the traditional biochemical ap-
proach of protein purification and enzyme assay, but as
putative protein kinase amino acid sequences deduced
from the nucleotide sequences of molecularly cloned
genes or complementary DNAs. Phylogenetic mapping of
the conserved protein kinase catalytic domains can serve
as a useful first step in the functional characterization of
these newly identified family members.
HE PROTEIN KINASES ARE A LARGE
FAMILY
OF ENZYMES,
many of which mediate the response of eukaryotic cells to
external stimuli (1, 2). The number of unique members of
the protein kinase family that have been described has recently risen
exponentially (3) and now approaches 100. The surge in the number
of known protein kinases has been largely due to the advent of gene
cloning and sequencing techniques. Amino acid sequences deduced
from nucleotide sequences are considered to represent protein
kinases if they include certain key residues that are highly conserved
in the protein kinase "catalytic
domain."
Two different molecular approaches have been most instrumental
in the isolation of novel protein kinase-encoding genes or cDNAs:
(i) complementation or suppression of genetic defects in inverte-
brate regulatory mutants, and (ii) screening DNA libraries
by using
protein kinase genes as hybridization probes under low stringency
conditions. Recently, an approach that uses degenerate oligonucleo-
tides as probes has led to the identification of several novel putative
In recent years, members of the protein kinase family have
been discovered at an accelerated pace. Most were first
described, not through the traditional biochemical ap-
proach of protein purification and enzyme assay, but as
putative protein kinase amino acid sequences deduced
from the nucleotide sequences of molecularly cloned
genes or complementary DNAs. Phylogenetic mapping of
the conserved protein kinase catalytic domains can serve
as a useful first step in the functional characterization of
these newly identified family members.
HE PROTEIN KINASES ARE A LARGE
FAMILY
OF ENZYMES,
many of which mediate the response of eukaryotic cells to
external stimuli (1, 2). The number of unique members of
the protein kinase family that have been described has recently risen
exponentially (3) and now approaches 100. The surge in the number
of known protein kinases has been largely due to the advent of gene
cloning and sequencing techniques. Amino acid sequences deduced
from nucleotide sequences are considered to represent protein
kinases if they include certain key residues that are highly conserved
in the protein kinase "catalytic
domain."
Two different molecular approaches have been most instrumental
in the isolation of novel protein kinase-encoding genes or cDNAs:
(i) complementation or suppression of genetic defects in inverte-
brate regulatory mutants, and (ii) screening DNA libraries
by using
protein kinase genes as hybridization probes under low stringency
conditions. Recently, an approach that uses degenerate oligonucleo-
tides as probes has led to the identification of several novel putative
S. K. Hanks is a senior research associate at the Molecular Biology Laboratory, Salk
Institute for Biological Studies, Post Office Box 85800, San Diego, CA 92138. A. M.
Quinn is a scientific applications programmer at the Biocomputing Center, Salk
Institute for Biological Studies, Post Office Box 85800, San Diego, CA 92138. T.
Hunter is a professor at the Molecular Biology and Virology Laboratory, Salk Institute
for Biological Studies, Post Office Box 85800, San Diego, CA 92138.
S. K. Hanks is a senior research associate at the Molecular Biology Laboratory, Salk
Institute for Biological Studies, Post Office Box 85800, San Diego, CA 92138. A. M.
Quinn is a scientific applications programmer at the Biocomputing Center, Salk
Institute for Biological Studies, Post Office Box 85800, San Diego, CA 92138. T.
Hunter is a professor at the Molecular Biology and Virology Laboratory, Salk Institute
for Biological Studies, Post Office Box 85800, San Diego, CA 92138.
42 42
protein kinase genes and cDNAs (4, 5). The oligonucleotide probes
are designed to recognize target sequences that encode short amino
acid stretches highly conserved in protein kinase catalytic domains.
In this article,
we present an alignment of catalytic
domain amino
acid sequences from 65 different members of the protein kinase
family, including many putative protein kinase sequences recently
deduced from nucleotide sequence data. Based on this alignment,
we first identify and discuss conserved features of the catalytic
domains and then provide a visual display of the various interse-
quence relations through construction of a catalytic domain phylo-
genetic tree. Catalytic domains from protein kinases having similar
modes of regulation or substrate specificities are found to cluster
together within the tree. This clustering would appear to be of
predictive value in the determination of the properties and function
of novel protein kinases.
Catalytic
Domain Amino Acid Sequences
Protein kinase catalytic domains range from 250 to 300 amino
acid residues, corresponding to about 30 kD. Fairly precise bound-
aries
for the catalytic
domains have been defined through an analysis
of conserved sequences (1, 6, see below) as well as by assay of
truncated enzymes (7, 8). The location of the catalytic domain
within the protein is not fixed but, in most single subunit enzymes it
lies near the carboxyl
terminus, the amino terminus being devoted to
a regulatory role. In protein kinases having a multiple subunit
structure,
subunit polypeptides consisting almost entirely of catalyt-
ic domain are common. All protein kinases thus far characterized
with regard to substrate specificity fall within one of two broad
classes, serine/threonine-specific and tyrosine-specific. Although
both classes of protein kinase have very similar catalytic domain
primary structures, certain short amino acid stretches appear to
characterize
each class (4), and these regions can be used to predict
whether a putative protein kinase will phosphorylate tyrosine or
serine/threonine.
Members of the protein-serine/threonine kinase and protein-
SCIENCE, VOL. 24I
protein kinase genes and cDNAs (4, 5). The oligonucleotide probes
are designed to recognize target sequences that encode short amino
acid stretches highly conserved in protein kinase catalytic domains.
In this article,
we present an alignment of catalytic
domain amino
acid sequences from 65 different members of the protein kinase
family, including many putative protein kinase sequences recently
deduced from nucleotide sequence data. Based on this alignment,
we first identify and discuss conserved features of the catalytic
domains and then provide a visual display of the various interse-
quence relations through construction of a catalytic domain phylo-
genetic tree. Catalytic domains from protein kinases having similar
modes of regulation or substrate specificities are found to cluster
together within the tree. This clustering would appear to be of
predictive value in the determination of the properties and function
of novel protein kinases.
Catalytic
Domain Amino Acid Sequences
Protein kinase catalytic domains range from 250 to 300 amino
acid residues, corresponding to about 30 kD. Fairly precise bound-
aries
for the catalytic
domains have been defined through an analysis
of conserved sequences (1, 6, see below) as well as by assay of
truncated enzymes (7, 8). The location of the catalytic domain
within the protein is not fixed but, in most single subunit enzymes it
lies near the carboxyl
terminus, the amino terminus being devoted to
a regulatory role. In protein kinases having a multiple subunit
structure,
subunit polypeptides consisting almost entirely of catalyt-
ic domain are common. All protein kinases thus far characterized
with regard to substrate specificity fall within one of two broad
classes, serine/threonine-specific and tyrosine-specific. Although
both classes of protein kinase have very similar catalytic domain
primary structures, certain short amino acid stretches appear to
characterize
each class (4), and these regions can be used to predict
whether a putative protein kinase will phosphorylate tyrosine or
serine/threonine.
Members of the protein-serine/threonine kinase and protein-
SCIENCE, VOL. 24I
I I , I I ,
This content downloaded from 132.239.1.231 on Sun, 21 Apr 2013 19:08:54 PM
All use subject to JSTOR Terms and Conditions
tyrosine kinase families with reported catalytic domain amino acid
sequences are listed in Tables 1 and 2, respectively. They are
classified within the tables according to similarities in primary
structure, based on deduced catalytic domain phylogeny. Included
in the tables are all confirmed and putative protein kinases for which
the catalytic domain sequence was available as of November 1987
(9). Presumed functional homologs from different
vertebrate
species
are listed together. Presumed invertebrate functional homologs of
protein kinases also found in vertebrates, however, are given
separate listings as a reflection of greater evolutionary distance and
the possibility of functional divergence. The asterisks indicate
protein kinases that have catalytic domains that are included in the
amino acid sequence alignment. We will use the abbreviated
names
from the tables to refer to individual protein kinases.
Of the 45 unique vertebrate protein kinase family members
included in Tables 1 and 2, 22 are serine/threonine-specific and 23
are tyrosine-specific. Fourteeh of the vertebrate protein-serine/thre-
onine kinases fall within one of the three subgroups that can be
Table 1. Protein-serine/threonine
kinase
family
members.
A. Cyclic
nucleotide-dependent subfamily
cAPK-a:
cAMP-dependent
protein
kinase
catalytic
subunit,
a form
*-bovine
cardiac
muscle
protein
(26)
-mouse S49 lymphoma
cell cDNA (35)
cAPK-P:
cAMP-dependent
protein
kinase
catalytic
subunit,
P form
*-bovine
pituitary
cDNA (36)
-mouse
S49 lymphoma
cell cDNA (37)
SRA3:
cAMP-dependent
protein
kinase from
yeast,
RAS suppressor
*-Saccharomyces
cereviiae
genomic
DNA (38)
TPKI(PK25):
cAMP-dependent protein
kinase from
yeast,
type 1
*-S. cerevisiae
genomic
DNA (39, 40)
TPK2:
cAMP-dependent protein
kinase from
yeast, type 2
*-S. cerevisiae
genomic
DNA (39)
TPK3:
cAMP-dependent
protein
kinase from
yeast, type 3
*-S. cerevisiae
genomic
DNA (39)
cGPK:
guanosine
3',5'-monophosphate (cGMP)-dependent protein
kinase
*-bovine
lung protein
(41)
B. Calcium-phospholipid-dependent
subfamily
PKC-a:
protein
kinase
C, a form
*-bovine brain
cDNA (42)
-rabbit brain cDNA (43)
-human brain
cDNA (partial)
(44)
PKC-P:
protein
kinase
C, 3 form
*-bovine
brain cDNA (44)
-rat brain cDNA (two splice
forms) (45, 46)
-rabbit
brain
cDNA (two splice
forms)
(43)
-human
brain cDNA (44)
PKC-y:
protein
kinase
C, y form
*-bovine brain cDNA (44)
-rat brain
cDNA (45)
-human
brain cDNA (44)
PKC-E:
protein
kinase
C, e form
-rat
brain
cDNA (RP16 clone) (partial)
(46)
DPKC:
Drosophila
gene product
related to protein
kinase
C
*-D. melanogaster
cDNA (47)
C. Calcium-calmodulin-dependent subfamily
CaMII-a:
calcium-calmodulin-dependent
protein
kinase
type II, a
subunit
*-rat brain cDNA (48)
CaMII-3:
calcium-calmodulin-dependent
protein
kinase
type II, P
subunit
*-rat brain
cDNA (49)
PhK-y:
phosphorylase
kinase,
y subunit
*-rabbit skeletal
muscle
protein
and cDNA (50)
-mouse muscle cDNA (51)
MLCK-K:
myosin
light chain
kinase,
skeletal muscle
*-rabbit
skeletal muscle
protein
(52)
MLCK-M:
myosin
light chain
kinase,
smooth
muscle
*-chicken
gizzard
cDNA (53)
PSK-H1:
putative protein-serine
kinase
*-human
HeLa cell cDNA (4, 54)
PSK-C3:
putative protein-serine
kinase
-human
HeLa cell cDNA (partial)
(4)
D. SNF1 subfamily
SNFI: "sucrose
nonfermenting"
mutant
wild-type gene product
*-S. cerevisiae
genomic
DNA (55)
niml+: "new
inducer
of mitosis";
suppressor
of cdc25 mutants
*-Schizosaccharomyces pombe
genomic
DNA (56)
KIN1: putative yeast
protein
kinase
*-Saccharomyces
cerevisiae
genomic
DNA (5)
KIN2: putative yeast
protein
kinase related to KIN1
*-S. cerevisiae
genomic
DNA (5)
E. CDC28-cdc2+
subfamily
CDC28:
"cell-division-cycle"
gene product
in yeast
*-S. cereisiae
genomic
DNA (57)
cdc2+:
"cell-division-cycle"
gene product
in yeast
*-Schizosaccharomyces pombe
genomic
DNA (58)
CDC2Hs: human functional
homolog of cdc2+
*-human
transformed
cell line cDNA (33)
PSK-J3:
putative protein
kinase related
to CDC28-cdc2+
*-human
HeLa cell cDNA (4, 59)
KIN28: putative
protein
kinase
related to CDC28-cdc2+
*-Saccharomyces
cerevisiae
genomic
DNA (60)
F. Casein
kinase
subfamily
CKIIa:
casein kinase
II, a subunit
-bovine
lung protein
(partial)
(61)
DCKII:
Drosophila
casein
kinase
II, a subunit
*-D. melanogaster
cDNA (62)
G. Raf-Mos
proto-oncogene subfamily
Raf: cellular
homolog
of oncogene
products
from 3611 murine
sarcoma virus and
Mill Hill 2 avian acute
leukemia virus
*-human
fetal liver cDNA (63)
A-Raf: cellular
oncogene product closely
related to Raf
*-human
T cell cDNA (64)
-mouse
spleen
cDNA (65)
PKS: cellular
gene product closely
related to Raf
*-human
fetal liver cDNA (66)
Mos: cellular
homolog
of oncogene product
from
Moloney
murine
sarcoma virus
*-human
placenta
genomic
DNA (67)
-mouse NIH 3T3 cell genomic
DNA (68)
-rat
3Y1 cell genomic
DNA (69)
H. STE7
subfamily
STE7: "sterile"
mutant
wild-type
allele
gene product
*-S. cerevisiae
genomic
DNA (70)
PBS2:
polymixin
B antibiotic resistance
gene product
*-S. cerevisiae
genomic
DNA (71)
I. Family
members with no close relatives
CDC7:
"cell-division-cycle"
gene product
*-S. cerevsiae
genomic
DNA (72)
weel+:
"reduced
size at division"
mutant
wild-type
gene product
*-Schizosaccharomyces
pombe
genomic
DNA (73)
ranl+: "meiotic
bypass"
mutant
wild-type
allele
gene product
*-S.
pombe
genomic
DNA (74)
PIM-1:
putative
transforming protein
induced
by murine leukemia
virus integration
*-mouse
BALB/c
cell genomic
DNA (75)
HSVK:
herpes
simplex
virus-US3
gene product
*-herpes simplex
virus
genomic
DNA (76)
*Protein kinases that have catalytic domains included in the amino acid sequence alignment.
ARTICLES 43
I JULY 1988
This content downloaded from 132.239.1.231 on Sun, 21 Apr 2013 19:08:54 PM
All use subject to JSTOR Terms and Conditions
classified according to their mode of regulation: cyclic nucleotide-
dependent, calcium-phospholipid-dependent, and calcium-calmo-
dulin-dependent. Two of the serine/threonine kinases, Mos and Raf
(products of the c-mos and c-raf genes, respectively), are cellular
homologs of transforming proteins encoded by the retroviral
onco-
genes. Other members of the serine/threonine group with demon-
strated oncogenic potential are A-Raf (a distinct Raf-related mem-
ber), and PIM-1 (a putative transforming protein activated by viral
integration). Three vertebrate serine/threonine kinases (CDC2Hs,
PSK-J3, and CKIIa) are closely related, by various degrees, to the
yeast cell cycle control protein kinases CDC28 and cdc2+. A
protein-serine/threonine kinase has been described in herpes sim-
plex virus (HSVK) and, like the retroviral oncogenes, probably
originated as a eukaryotic cellular sequence. The protein-tyrosine
kinases can be further grouped as members of either the Src
subfamily or one of three different growth factor receptor subfami-
lies. The protein-tyrosine kinases encoded by the c-abl and c-fes/fps
genes may be considered distant members of the Src subfamily. At
least nine of the protein-tyrosine kinase genes have been transduced
by retroviruses where they encode transforming proteins.
Twenty-five additional sequences listed in Tables 1 and 2 de-
rive from invertebrate species. Eight are from Drosophila, one
from nematode, and the other 16 are from the budding or fission
yeasts. Many of the Drosophila protein kinases, as well as the
nematode protein kinase, were identified by screening DNA librar-
ies with probes from a vertebrate protein kinase gene or cDNA and
thus are likely to represent functional homologs of the vertebrate
enzymes. The Drosophila
"sevenless"
(7less) protein kinase and most
of the yeast protein kinases were identified through molecular
genetics. All of the yeast protein kinases identified to date fall within
the serine/threonine-specific class, despite directed attempts to
identify protein-tyrosine kinases in yeast (5). This observation,
together with the fact that many of the protein-tyrosine kinase
catalytic domains are components of growth factor receptor mole-
cules, suggests that tyrosine specificity may have been a recent
development in catalytic domain evolution, arising in conjunction
with the acquisition of multicellularity
and serving a role in cell-cell
communication.
Table 2. Protein-tyrosine
kinase
family
members.
* I _ i i i I l i i i i_
A. Src
subfamily
Src: cellular
homolog
of oncogene
product
from
Rous avian
sarcoma
virus
*-human
fetal liver
genomic
DNA (77)
-mouse brain
cDNA; neuronal
alternate
splice
form (78)
-chicken
genomic
DNA (79)
-Xenopus
laevis
ovary
cDNA (partial)
(80)
Yes:
cellular
homolog
of oncogene
product
from
Yamaguchi
73 avian
sarcoma virus
*-human
embryo
fibroblast
cDNA (81) D
Fgr:
cellular
homolog
of oncogene
product
from Gardner-Rasheed
feline
sarcoma
virus
*-human
genomic
DNA (82)
-human
B lymphocyte
cell line cDNA (amino terminus)
(83)
FYN:
putative
protein-tyrosine
kinase
related
to Fgr and
Yes
*-human
fibroblast
cDNA (84)
LYN:
putative
protein-tyrosine
kinase
related
to LCK and Yes
*-human
placenta
cDNA (85)
LCK:
lymphoid
cell protein-tyrosine
kinase
*-human
(JURKAT)
Tcell leukemia
line cDNA (86)
-mouse
(LSTRA)
T cell lymphoma
line cDNA (87)
HCK: hematopoietic
cell putative
protein-tyrosine
kinase
*-human
placenta
and peripheral
leukocyte
cDNAs (88)
Dsrc64:
Drosophila
gene product
related to Src;
polytene
locus
64B
*-D. melanogaster genomic DNA (89, 90)
Dsrc28:
Drosophila
gene product
related
to Src;
polytene
locus 28C
*-D. melanogaster
adult female cDNA (91)
B. Abl subfamily
Abl:
cellular
homolog
of oncogene
product
from Abelson
murine E
leukemia virus
*-human
fetal liver
cDNA (92)
ARG:
putative
protein-tyrosine
kinase
related
to Abl
-human
genomic
DNA (partial)
(93)
Dash:
Drosophila
gene product
related
to Abl
*-D. melanogaster
genomic DNA (90)
Nabl:
nematode
gene product
related
to Abl
*-Caenorhabditis
elegans
genomic DNA (94)
Fes/Fps:
cellular
homolog
of oncogene products
from Gardner-
Arnstein and Snyder-Theilen
feline
sarcoma
viruses and
Fujinami
and PRCII
avian
sarcoma
viruses
*-human
genomic
DNA (95) F
-feline
genomic
DNA (96)
-chicken
genomic
DNA (97)
C. Epidermal
growth
factor
receptor
subfamily
EGFR:
epidermal
growth
factor
receptor;
cellular
homolog
of
*Protein kinases
that have
catalytic
domains
included
in the amino
acid
sequence
alignment.
44
oncogene product
(v-Erb-B)
from
AEV-H avian
erythroblastosis
virus
*-human
placenta
and A431 cell line cDNAs (98)
NEU: cellular
oncogene
product
activated
in induced
rat
neuroblastomas
(also
called ERB-B2
or HER2)
*-human
placenta
and
gastric
cancer
cell line cDNAs (99)
-rat neuroblastoma
cell line cDNA (100)
DER:
Drosophila
gene product
related
to EGFR
*-D. melanogaster
genomic DNA (101)
Insulin
receptor
subfamily
INS.R: insulin
receptor
*-human
placenta
cDNA (102)
IGF1R:
insulin-like
growth
factor 1 receptor
*-humran
placenta
cDNA (103)
DILR:
Drosophila
gene product
related
to INS.R
*-D. melanogaster
embryo cDNA (104)
Ros: cellular
homolog
of oncogene
product
from UR2 avian
sarcoma virus
*-human
placenta genomic
DNA (105)
-chicken
genomic
DNA (106), chicken
kidney
cDNA (107)
7less:
Drosophila
sevenless
gene product
essential
for R7
photoreceptor
cell development
*-D. melanogaster
eye imaginal disc cDNA (108)
TRK: colon carcinoma
oncogene
product
activated
by genetic
recombination
*-human
tumor
cell cDNA (109)
MET:
N-methyl-N'-nitro-N-nitrosoguanidine
(MNNG)-induced
oncogene
product
*-human
HOS cell line cDNA (110)
Platelet-derived
growth
factor
receptor
subfamily
PDGFR:
platelet-derived growth
factor
receptor
*-mouse
NR6 fibroblast cell line cDNA (111)
CSF1R:
colony-stimulating
factor-type
1 receptor;
cellular
homolog
of oncogene
product
(v-Fms)
from
McDonough
feline sarcoma
virus
*-human
placenta
cDNA (112)
Kit:
cellular
homolog
of oncogene product
from
Hardy-Zuckerman
4
feline sarcoma
virus
*-human
placenta
cDNA (113)
RET: cellular
oncogene product
activated
by recombination
*-human
T cell lymphoma
cDNA (114)
Other
receptor-like protein-tyrosine
kinases
TKR11:
putative protein-tyrosine
kinase
-chicken
genomic
DNA (partial)
(115)
TKR16: putative
protein-tyrosine
kinase
-chicken
genomic
DNA (partial)
(115)
Illllll Illll I I 'l I I Illlllllll Ill II IIII Illlllll I I I [~~
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
SCIENCE, VOL. 241
This content downloaded from 132.239.1.231 on Sun, 21 Apr 2013 19:08:54 PM
All use subject to JSTOR Terms and Conditions
Conserved Features of the Catalytic Domains
To compare primary
structures,
we have aligned catalytic
domains
from the 65 protein kinases marked by an asterisk
in Tables 1 and 2
(Fig. 1). The 65 sequences represent each of the separate entries in
the Tables except for six family members that are not included
because their catalytic domain sequences have been only partially
determined. The alignment was made by eye and is parsimonious in
nature; the amount of gapping introduced into the sequences in
order to optimize positional similarities was kept to a minimum.
The alignment clearly demonstrates the overall similarity among the
catalytic
domains. The catalytic domains are not conserved uniform-
ly but, rather, consist of alternating regions of high and low
conservation. Eleven major conserved subdomains are evident (Fig.
1, I to XI), separated by regions of lower conservation wherein fall
the larger gaps or inserts. Very large inserts (in excess of 60 residues)
occur in CDC7 between subdomains VII and VIII and between
subdomains X and XI, and in PDGFR, CSF1R, and Kit between
subdomains V and VI. A similarity profile of the aligned catalytic
domains provides a ready visualization of the subdomain structure
(Fig. 2). Such an arrangement
of alternating regions of high and low
conservation is a common feature of homologous globular proteins
(10) and gives some clues to higher order structure. The conserved
subdomains must be important for catalytic function, either directly
as components of the active site or indirectly by contributing to the
formation of the active site through constraints imposed on second-
ary structure. The nonconserved regions, on the other hand, are
likely to occur in loop structures, where folding allows the essential
conserved regions to come together.
Highly conserved individual amino acids within the catalytic
domains are expected to play important roles in catalysis. We will
refer to amino acid positions using the residue numbering for
bovine adenosine 3',5'-monophosphate (cAMP)-dependent pro-
tein kinase catalytic subunit, (x form (cAPK-(x, Fig. 1). Nine
positions in the alignment contain the identical amino acid residue
in each of the 65 sequences. These invariant residues correspond to
cAPK-I(: Gly52, Lys72,
Glu91,
Asp166,
Asn'71, Asp'84, Gly186,
Glu208,
and Arg280.
An additional five positions contain the identical amino
acid in all but one of the sequences: Gly50, Val'17,
Phe'85, Asp220
and Gly225. Many of these most highly conserved residues directly
participate
in adenosine triphosphate (ATP) binding and phospho-
transfer.
The consensus Gly-X-Gly-X-X-Gly, found in many nucleotide
binding proteins in addition to the protein kinases (11), is found in
subdomain I, very near the catalytic domain amino terminus. The
invariant or nearly invariant residues corresponding to cAPK-(x
Gly5?
and Gly52
fall within this consensus. Only two positions on the
amino-terminal side of this consensus show conservation through-
out the protein kinase family; hydrophobic residues occupy posi-
tions one and seven upstream from the first glycine in the consensus.
The amino terminus of some catalytic domain polypeptides lies as
close as ten residues from the first conserved glycine. A model for
the ATP-binding site of v-Src (12), based on the three-dimensional
structures
from other nucleotide binding proteins, shows the Gly-X-
Gly-X-X-Gly
residues forming an elbow around the nucleotide, with
the first glycine in contact with the ribose moiety and the second
glycine lying near the terminal pyrophosphate. A nearly invariant
valine residue lies within subdomain I, located just two positions on
the carboxyl-terminal side of the Gly-X-Gly-X-X-Gly consensus
(Val57
for cAPK-(x) and may contribute to the positioning of the
conserved glycines.
In subdomain II lies an invariant lysine, corresponding to cAPK-
c( Lys72, that is certainly the best characterized catalytic domain
residue. This lysine appears to be directly involved in the phospho-
I JULY 1988
transfer
reaction, possibly mediating proton transfer (13). In cAPK-
ot (14), v-Src (15), and EGFR (16), Lys72
or its equivalent reacts
with the ATP analog p-fluorosulfonyl 5'-benzoyl adenosine, thereby
inhibiting enzyme activity. Site-directed mutagenesis techniques
have been used to substitute alternate amino acids at this position in
v-Src (13, 17), v-Mos (18), v-Fps (19), EGFR (20), INS.R (21), and
PDGFR (22). All substitutions, including arginine, result in loss of
protein kinase activity. In all but three of the aligned sequences, an
alanine is present two positions on the amino-terminal side of the
invariant
lysine in subdomain II. The invariant lysine lies 14 to 23
residues downstream of the last conserved glycine in subdomain I,
but no mutations have been made to test whether this spacing is
critical.
The central core of the catalytic domain, the region with greatest
frequency of highly conserved residues, consists of subdomains VI
through IX. The invariant
or nearly invariant residues in subdomain
VI (corresponding to Asp'66 and Asn'71) and subdomain VII
(corresponding to Asp'84, Phes,85 and Gly'86) also have been
implicated in ATP binding. These residues are part of a feature
found in a number of bacterial
phosphotransferases
that use ATP as
phosphate donor (23). The aspartic acid residues corresponding to
cAPK-at
Asp'66 and Asp'84 may interact with the phosphate groups
of ATP through Mg2+ salt bridges (23). The triplet corresponding
to Asp'84-Phel85-Gly'86
in subdomain VII is of further interest in
that it represents the most highly conserved short stretch in the
catalytic domains. It is flanked for two positions on either side by
hydrophobic or near-neutral
residues.
Subdomain VIII contains the consensus triplet Ala-Pro-Glu, a
conserved feature often mentioned as a key protein kinase catalytic
domain indicator (1). The invariant residue corresponding to
cAPK-(x
Glu208
contributes to the Ala-Pro-Glu consensus. In addi-
tion to the conservation of these residues, several other lines of
evidence implicate this region as important in catalysis. Mutagenesis
studies have shown that each residue in the Ala-Pro-Glu consensus is
required for activity of v-Src (24). Other studies have provided
evidence that this consensus lies very near the catalytic site. An
affinity
peptide substrate analog reacts with cAPK-(x
Cys'99, thereby
inhibiting enzyme activity (25). Also, sites of autophosphorylation
found in many protein-tyrosine kinases (1) as well as cAMP-
dependent protein kinase [Thr'97 (26)] lie within 20 residues
upstream of the Ala-Pro-Glu consensus. The role of this autophos-
phorylation site is not entirely settled, but for several protein-
tyrosine kinases there is evidence that phosphorylation of this site
leads to increased catalytic activity (27). Autophosphorylation may
result in a conformational change that allows better access of
exogenous substrates to the active site.
Subdomains VI and VIII are of additional interest in that they
contain residues that are specifically
conserved in either the protein-
serine/threonine or the protein-tyrosine kinases and, as such, may
play a role in recognition of the correct hydroxyamino acid. The
most striking indicator of amino acid specificity is found in subdo-
main VI, lying between the invariant residues corresponding to
cAPK-(x
Asp'66 and Asn'71; two of the residues implicated in ATP
binding. The consensus Asp-Leu-Lys-Pro-Glu-Asn in this region is
a strong indicator of serine/threonine specificity, whereas the pro-
tein-tyrosine kinase consensus is either Asp-Leu-Arg-Ala-Ala-Asn
(for the vertebrate members of the Src subfamily) or Asp-Leu-Ala-
Ala-Arg-Asn (for all others). Another such region is found in
subdomain VIII and lies immediately on the amino-terminal side of
the Ala-Pro-Glu consensus. This region is highly conserved among
the protein-tyrosine kinases with a more limited conservation
among the protein-serine/threonine kinases. The protein-tyrosine
kinase consensus through this region is Pro-Ile/Val-Lys/Arg-Trp-
Thr/Met-Ala-Pro-Glu while the protein-serine/threonine kinase
ARTICLES 45
This content downloaded from 132.239.1.231 on Sun, 21 Apr 2013 19:08:54 PM
All use subject to JSTOR Terms and Conditions
consensus is Gly-Thr/Ser-X-X-Tyr/Phe-X-Ala-Pro-Glu.
These re-
gions in subdomains VI and VIII that indicate substrate specificity
have been targeted for the design of degenerate oligonucleotide
probes for use in screening cDNA libraries to identify novel
members of both the protein-serine/threonine (4) and protein-
tyrosine (28) kinase families.
To date, no evidence has been reported concerning the possible
functions of residues in conserved subdomains III, IV, V, IX, X, and
XI. Subdomain IX contains a very well conserved short stretch that
includes the nearly invariant residues corresponding to Asp220
and
Gly225.
Subdomains III and XI each contain an invariant residue,
corresponding to Glu91
and Arg280.
The latter or its equivalent must
lie very near the catalytic domain carboxyl terminus. Arginine
residues occupying this position reside just 16 residues upstream
from both the CDC28 and HSVK polypeptide carboxyl termini,
and just 19 residues upstream from both the Mos and Fes carboxyl
termini. Deletion analysis of v-Src places the carboxyl terminus of
the catalytic domain of the protein-tyrosine kinases at a conserved
hydrophobic residue ten residues downstream of this arginine (8).
The point mutation conferring temperature
sensitivity in some cdc28
mutants replaces this conserved arginine with glutamine (29).
A leap in our understanding of the functional roles of the
conserved catalytic domain residues will come with the solution of a
crystal
structure
for one of the protein kinase catalytic
domains. The
similarities in primary
strucure should carry
over to the higher order
structure and catalytic mechanism as well. Other investigators have
been making progress toward the solution of the three-dimensional
structure of cAPK-o (30).
Catalytic
Domain Phylogeny
Amino acid sequence alignments can be used to deduce phyloge-
netic relationships (31). We have used the alignment data from Fig.
1 to construct a phylogenetic tree of the protein kinase catalytic
domains (Fig. 3). All 65 of the sequences in the alignment are
included in the tree. They derive from both vertebrate and inverte-
brate sources and, in some cases, presumed functional homologs
from both vertebrate and invertebrate sources are represented. The
tree, therefore, reflects catalytic domain evolution stemming from
gene duplication events (for example, when the vertebrate, mostly
human, sequences are compared), speciation events (when verte-
brate and invertebrate
functional homologs are compared), or both.
The tree reveals a relation between catalytic domain sequence and
certain biochemical properties; catalytic domains from protein
kinases having similar modes of regulation or substrate specificities
tend also to have similar primary structures and cluster together
within the tree. Five major branch clusters are present in the tree: (i)
protein-tyrosine kinases, (ii) cyclic nucleotide- and calcium-phos-
pholipid-dependent protein kinases, (iii) calcium-calmodulin-de-
pendent protein kinases, (iv) protein kinases closely related to
SNF1, and (v) protein kinases closely related to CDC28. These
major
clusters account for all but 12 of the 65 sequences included in
the tree. Generally, a sequence found within one of these clusters
shares in excess of 35% identical amino acids with each of the other
sequences in the cluster, whereas the catalytic
domain sequences that
do not map within the same cluster have identities in the range of
20 to 25%.
The most highly populated cluster contains all 27 confirmed or
putative protein-tyrosine kinases. The large number of protein-
tyrosine kinases probably reflects the intense research
effort devoted
to this group, rather than a true indication of their abundance
relative
to the protein-serine/threonine kinases. Branches leading to
the Src subfamily and to each of the three receptor subfamilies
46
diverge from the main line at about the same point. In light of the
oncogenic potential of many of the protein-tyrosine kinases, it is of
interest that the protein-serine/threonine kinases having the least
divergence from this group include Raf and Mos, cellular
homologs
of retroviral oncogene products. However, another potentially
oncogenic protein-serine/threonine kinase, PIM-1, is not closely
related to the protein-tyrosine kinases.
The next most populous cluster in the tree includes two separate
subfamilies that can be classified according to their mode of
regulation: the cyclic nucleotide-dependent protein kinases and the
calcium-phospholipid-dependent protein kinases.
The similarities in
the mode of regulation of the members of these two subfamilies,
namely, activation by "second messengers" released in response to
ligand binding at the cell surface, may be a reflection of their recent
evolutionary divergence.
The third major catalytic
domain cluster contains the subfamily of
protein kinases that have activities regulated by calmodulin. The
calmodulin-dependent cluster falls near the cyclic nucleotide- and
calcium-phospholipid-dependent cluster. All members of the cal-
modulin-dependent subfamily have a calmodulin binding domain,
characterized
by a high proportion of basic amino acid residues and
having a propensity for formation of an amphiphilic a helix, residing
outside the catalytic domain. (Note that the calmodulin binding
domain sequences were not included in the phylogenetic analysis.)
The different
protein kinases thus far described as being regulated by
calmodulin, therefore, appear to have diverged from a common
ancestor after acquisition of the calmodulin binding domain. The
mapping of the putative protein kinase PSK-H1 within this cluster
predicts that this enzyme will also prove to be regulated by
calmodulin.
Also mapping near the cyclic nucleotide- and calcium-phospho-
lipid-dependent protein kinases is a small cluster composed of four
protein kinases recently identified in the budding or fission yeasts;
SNF1, niml+, KIN1, and KIN2. Whether these protein kinases
Fig. 1. Multiple
amino
acid
sequence alignment
of 65 protein
kinase
catalytic
domains. The first 38 sequences
derive from protein-serine/threonine
kinases
(indicated
by asterisks
in Table
1) and
the remaining
27 sequences
in
the alignment
are from protein-tyrosine
kinases
(indicated
by asterisks in
Table
2). cAPK-a
and Src have been chosen
as prototype
protein-serine/
threonine and protein-tyrosine
kinases,
respectively;
their
catalytic
domain
sequences
are
numbered to indicate residue
position
from the polypeptide
amino
terminus.
(Although
the human Src
sequence
is shown,
the number-
ing is actually
taken
from
the chicken Src
sequence
to maintain
established
convention).
The number
of additional
amino- and
carboxyl-terminal
flank-
ing residues
lying
outside
the catalytic
domains
are shown at the beginning
and
end, respectively,
of each
sequence.
In several
cases the sequences
have
not been
determined
through
to the polypeptide
amino
or carboxyl
termini;
for
these,
the
number
of determined
residues
is given
followed
by a plus
(+)
sign.
An asterisk
(*) at the beginning
or end of a sequence
indicates
that no
additional
flanking
residues are
contained
in the polypeptide.
Gaps, repre-
sented by dashes, were introduced
into the sequences
to optimize the
alignment.
In six cases,
long insert
segments
have been excluded
from the
alignment
to shorten the figure.
The positions
and
lengths
of the excluded
inserts
within the alignment
are indicated
by numbers
within braces
(for
example,
{-48-}); the excluded
gap positions
in the other sequences
that
correspond
to these
long inserts are
shown as double
slashes
(//). Residues
conserved in 62 or more of the 65 sequences
are
shown as white letters in
black boxes.
Positions
where
residues of similar
structure
are
conserved
in 63
or more
sequences
are
shown in shaded
boxes.
Structurally
similar
groupings
used for this purpose
are nonpolar
chain
R groups
(M, L, I, V, and C);
aromatic
or ring-containing
R groups;
(F, Y, W, and H); small R groups
with
near neutral
polarity
(A, G, S, T, and
P); acidic
and
uncharged
polar
R
groups
(D, E, N, and Q); and basic
polar
R groups (K, R, and H). The
single-letter
amino acid code is used (A, alanine;
C, cysteine;
D, aspartic
acid; E, glutamic acid; F, phenylalanine;
G, glycine; H, histidine; I,
isoleucine;
K, lysine;
L, leucine;
M, methionine;
N, asparagine;
P, proline;
Q, glutamine;
R, arginine;
S, serine;
T, threonine;
V, valine;
W, tryptophan;
and Y, tyrosine).
Roman numerals
at bottom indicate
conserved
subdo-
mains.
SCIENCE, VOL. 241
This content downloaded from 132.239.1.231 on Sun, 21 Apr 2013 19:08:54 PM
All use subject to JSTOR Terms and Conditions
A Al
1 3
9 1 S-9 A A V A 3 A I 1 1 d 9-- 0 A S 3 V 9 A 1 A I
1 J N 1 1 0-9 A 3 A I d -- 9 I 3 VA 1 1 N A
1 J N 1 1 O- A 3 3
1A
3 1I A A d -----9 H 1 3 V9 1 1 N A
1 A a A 1 a-3 A 3 A 3 1 I I A I d -----9 1 3 V 9 1 1 N A
I j N 1 0-9 H A W
A d 1 A A 1 d S 9----3S A 1 3 I 9 1 1 S 1
1 jA N 1 A-9 H A W
A 3 A A W
1 1 d A-----9 3 1 3 A 9 j j 8 A
1 A S 1 1 a-9 V 3 W
H 3 W
I 1S I S 3-----1 a A 3 I 9 A A
d A
1 A i 1 -9 9 3 W
1 3 1 I I A & d 3-----N 1 1 3 A 9 1 b A 1
1 S 1
J-A
S >- W
1 3 W A A V d ----- S 3 A 9 1 A
1 A S a 1 0-9 a I W
1 3 W
I A 1 1 d -----9 A A 9 1 1 A
I A S A
1 A-9 H V W 3 W A
A 1 d ----- A A A 9 1 1 A
A A a 1 1 3-9 1 d W
1 b I A W W
b ------ s w 3 A V 1 1 N
A H a 1 1 3-9 A d W
1 A
A S------ 1 1 I 9 1 1
A A a 1 1 3-9 J d W
1 I A S------
1 1
A I 9 1 1 d 3
1 1 1 J -9 9A A A 1 3 A I A I d ----- a 1 3 A 9 I 1
3 A
1 A 3 1 1 N-9 N 3 W 3 1 I I A d V----- H 1 3 AA
91 1 d A
1 J 1 1 N-9 H S W J 3 1 I I A d d-----3 3 A
A 9 I 1 A
1 A a 1 1 N-9 A I W J 3 1 I I A d d-----3 1 3 A 9 1 1 A
1 A N 1 S-9 H N W
A 3 1 A I A I -----H 1 3 A
1 J 1 1 S-9 S W
A 3 O A I AId 3-----3 S 3 A
J a 1 S-9 N V J 3 1 I I A I d 3------ A A
1 J a A 1 S-9 N 3 W
A 3 1 I IA I 3------ A A
1 J a 11 S-9 N V W A 3 1 I I A Id 3-----3 A A
1 J a l S-9 ) N W A 31A IAI d ------ 3 S A A
l J a S-9 H 3 W
A 3 A IAI 3------ 3 S A A
1 J a S-9 S W 3 1 A IAI d ------ 3 S A A
1 a 1 S-9 A S W A
IAIA Id 3------ 3 S A
s0? Ot? 0??
9A 1 A
VA1V A
VA 1 A
VAIbA
VA 1 A
A 1d A
I
I
I
I
I
H d H N------A IN
1 AN
N W
H N-----91 A S 1 A
N 3 H A-----91 H S W I ) 1
N I H d-----91 H S W
I N 1
N d H S------A a A W I I 9
H a H A------I WH
1 113 A
H A I a------ 3 ) A V
H H 3 i------J 9 A W A S V
N A H 3-----3 S V I A V
A d S ------A 9 V A A
H d N a------A S V W AA V
N d H S------A a I A) V
N A H H------1 a A w I V V
N d H )------W 3 )t W IV V
N d H A------I 3 w A
AV
N d H
N H
I a H
'd H
X a H
X a H
X O H
X a H
A 3 H
0Z?
ANH
AAH
AAH
b------ 1 I W
A
------J A A W I V V
------ 1
i W A N V
O------1 & w
W
1 N V
O------ I1
i W
1 N V
1------1 >
X W
I a v
------1 1 1 W A a v
------ 1 N W I A
a------1 N W A a
III II
S W 3 3
SW1V3A3
SWHVOA3
SdSVN3 s
J1HVSdAl
a VH V S
SSaViSN
1 S A 3----------------------------9 I a I d N
J 0 0 8 V
u v----------------------------- s 3 S v 3 >
a 1 3 V J 3----------------------------- - S V 9
s d
) 1 A 3 I ) 3----------------------------? a IS 9 >) >
S 1 J N I d 3----------------------------A a I V N 3 N
N 1 J 3 I V. 3----------------------------? W S V V 3 N
N 1 3 I 3---------------------------- S S 3
d 1 j 3 3 S S----------------------------3 V 9 IS X 1
a 1 I 3 N V---------------------------- d S 1 N 3 d
a I I 3 X N V----------------------------? d S i V 3 d
a1 J V I----------------------------a d d 1 3
V 1 J 3 H1H d----------------------------- - W V a 3 >
31 )
j a v ?
v----------------------------- - w i a 3 )I
) 1 J 3 3 A 3----------------------------- - W 1 a 3 )
3 I J a a 3 s----------------------------- - W 1i9 3 )
b1 V V i s----------------------------- - w 1i 9 3
V 1 J V 3 A S----------------------------- - W S 9 d N
V d 1 b 9 V S----------------------------- - W S 9 a )
3 1 A V a A S----------------------------- - W 1 9 d >
3 1
A S 3 d S----------------------------- - W 1 9 d >
3 1 J V > d S----------------------------- - W 1 3 d A
a1 J V 3 d W----------------------------- - W 1 9 d >
1 3 d S----------------------------- - W 1 9 d
00?
A I i A 9---------AVAA1H J V 1 AV A
A W
V ---------aS)I A V I 3 A
A ) I A V---------a03N199 V i V 3 A
A N W
I V---------SHS19 H V V3 A
3 H I )-----N------aa 1 1 H A
A 1 W a--------- 3dllN H 33V l J
A d 3----------3S031 1 9 3 A
A I 3 9--------S9A9I A V 1 9 3 A
3 3 a A---------9NddJS 1 I 9 3 A
A l1 3 d
--------- 3 AA9 V A 9 3 A
A d 1 3 ---------39IIA N 3 A
A d I A----------N393d A M
A 9 'A
A d I A----------N390d I M I 9 N A
A d I A---------- 393d I M
1 9 N A
A 1 1 N a--------------A l 1 A 9 s
I 1 3 a H--------------d . M A 93 A
A 1 N 9 A-------------- M A 3 A
A S A-------------- M
A 3 A
I I ---------------9
A a I S ---------------N
A A 1 H ---------------
A A I 1 -------------9
A >1
IS ---------------N
A > I N ---------------9
A IS ---------------9
A N I1 ---------------
A l 1 --------------9
06Z
J 3
V
J v
Vl
AH
S
V
V
AS
J N
A a
A
A
d
b M
N 9
A3 HA I J N
N A I V W A
33 A
N AA 9 WM 3 3
N M
1 9 W M 3
N M
1 9 1 M a 3
J
N M
1 9 W
M 31 3 3
N M 1 9 W 1313 3
j
I A 9 1 A 1 N A/SAS
)I 9 J S 1 'A
N/98S
N 9
3J 1 N N/08S
1 A 9 I A 1 0 a/s9s
A 3 N J H A I 1/?601
3 M
N 1 A I a /AOES
A 1 1 1 b0
S/Sb9T
1 1 A 1 1 1 N 3/+ESI
Ad V 1 I I N 3/+ZIL
A3 S W I I A 3/996
3 A 1 1 1 I 3/020I
A 9 9 d 1I 3 V/+S89
A A A A
A 1 3 l/LIL
A I
N A 3 1/589
3 91 A 10 3/+I9S
A N H W
I I 3 S/+6SI
A H
A W
I a 1/+ZI
A H A W
I I 1/6?E
3 3 W 1 W 1 a IILZE
A A 1 1 0 I 3 S/18Z
A
) 3 1 A 1 S 3/8EZ
A 3 A 1 A 1i 3/E1 Z
d A A 1 > I S 3
/AZ
A n
I 1 ) 1 s 3/89Z
A A 3 1 1 I S S/09Z
A A 3 1 d 1 S 3/ALZ
A 3
A1 1 S 3/Z9Z
e>z
I A 0 H 1 I-A N 9 9 J 3 W
I I 1 9 1 SNS9Vdl d I S V V A A A
A
1 H H A 1 S-S 9 3 A
M I I V A 9 d------ d I W 3W J d 1
1 H H A S-S 9 3 3 M I I I 9 d------
Ad 1 J 3 W 3 1 1
1 H A 1 S-S 9 3 M
a I A I 1 N a------ I
I W
A 9 W
31 1
I J a aOA
d 3 d
A 3 1 I 1 A S -----d d 3 J MA 1 1 d I
I A 1 A 1 0-9 N d 3 A a 1 A A A I V 0-----3 1 3 1A a H 1 1 I
1 A I A 1 a-- V a A > d 1 A 1 3 1 A 9-----S A A H i a 1 1 d 1
1 J A a 1 S-9 N 3 9 1 3 A a W A 1 9-----9 H 3 M S a W 1 3 A
A I X a 1 S-9 9 a W A 3 W 3 W
A A V A-----3 I A A V 9 A J a A
1 I A a 1 S-9 3 a s A 3 W
1 I I I 3 N--NIHO
N A A V 9 A 1 I
A J I A J 3-3 H d A A d 1 A V I A a a-----A A d > V a 3 1 d V
A 1 A A a
-I N N A H 3
A I 1A d I d------ -S A d a > A
AA
I A
A3 1 a- Id
A J 3 1 A1 N
1 N
1 -----A V W J I a I 1 3 I
1 A I d 1 a-- b a A H 3 J A 1 1 A A I3AlIIS I V 3 A a w 1 l A
1 A I a-- W S 1 A 3 A I 1 A 8l S-----a 0 w A a 0 1 s A
W
A A 1
Ia-- W
a J 3 A 1 A 1 S-----3 V H
1 I ia 1 1 A
W
A d > 1 a-- 1 a 1 J 3 J A 1 A 1
A H----AV S H A I a A 1 A A
I A a 1 1 0-9 9 S A A 3 J 1 W
A J H N-----S W
1 3 W
3 A
1 l 3
I A a 1 0-9 9 S A A 3 J 1 W A J H N-----S 11 3 J
3 1 Il 3
I A H A 1 3-9 a d A A
A3 1 1 A W H A-----H a I M A a A 1 l 1
I A a 1 3-- N 9 V A 3 I A W
I I 3
A-----l S l I A ( A 1 I
I A a J 1 3-9 9 1 V 1 3 w A w A A d 3----- 1 3 J A 3 A 1 a I
I A 3 J 1 3-9 9 S A W 3 1 A W A I -----N 3 3 J V a A A A
I 3 J 1 3-9 9 3 I A 3 W
J 1A I 3 H-----d 1 3 IV A
A1 I
1 A a J 1 3-93 A W a J A1 J J A 1-----N 1 3 A I a >1 I
I a 3 J 1 3-9 3 I A
I a J A 1 A H 3 9-----3 3 S I S a H1 li A
I a 3 J 1 3-9 9 I A 1a
0 1 1 A H H 9-----3 3 S I S a H 1 A
I 0 J I 1 a-9 9 N A A 3 w A J J 1 0
a-----w I 3 H 1 t A
I H A W
1 0-9 9 1 A A 3 W AA A 1 A 0-----d A
b I S H A
1
I H A W 1 a-9 9 N A A 3 W
AH A
A a-----w 1 j 3 s H1 a1
I H A W 1 0-9 9 N AA
A 3 W A A 1 A -----A I A
I S H 1 a
I I I M
1 3-3 9 1 3 V 3 W
1 W
A 1 A -----S a J I
1 1 S A 1 3-9 9 3 I A a w A W J A a0 -----s a
0 I
1 1 S A 1 3-9 3 I A a W
A W
J I N -----V a a A I
1 1 S 3 1 3-9 9 3 I A a W
I W
J I 0 b-----v a
0b J
1 I S A I d-3 A > 1 - - I I HW I 0
-----0 a b
A I
1 H S W
3-9 9 d A A3 W
A W
A 1 N S----N a A
J V
1 H S j W
3-9 9 d A A 3 W A W
A 1 N S-----N a A 3 S
A 1 A
9 MW I
M W I
M
W I
9 W I
A 3 1 A
3 3 1 A
I
I
I
I
f c
A
I
II
i
I
iS
9
11
-I
Ill
IX
N a H ------- 1 V A N 1
N A H d------I 1 A a W
N A H d------I 1 A
A W
N A H d------ I A A 1 A V A
9 S J 0----SSA A 1 1 A A
A d H S-----SA d V H 1 V 1
V d H A------1 A 1 1 Ad
V
H a H 9-----l d a I S A
A d S N------3 ) H 1 A 3 1
N 3 H d-----?lA N X A IS 1
d S S 9-----IW I A 1 1 N 1
1 1 I I------N 1 9 3 d 1 N
N d H a------W 3 a 1 A A A
N d H 3---JV31 l l 1 1 V A
N d H a------1 3 1 1 S I
N S A N--N30NA
3 )i1 1 S I
N a a ------?1 3 1 s I
H d H A------1 I 9 1 S V
H d H A------1 I a 9 1 s
N d H ------1 1 al W 1 I
H d H a------ 1 a 1 A S I
N V H ad------?A 1 A a 1
A d H H------1 3 N W
I S I
N a H N------1 b N W
A 3 I
N d H 9-----SA A
A 1 I a A
N S H A1------1 1 A 3 I A V
N d H A1------1 I1 3 I a V
J d d A-----39 1 V I A 8 ?
J H d B99d993 1 V 1 A A A
J d d A-----9d 1 V 1 A dl A
J d d -----l 01 A 1 A A A
j a S H------V 9 0 W I a A
J d H S------A I S I W ld
3 d H 3------?A 1 >1 Wd d
J d H I------A IS I W
1 A
J d H I------A I S I W
1 A
J d J N------A V b 1 I a >
J d J N------ A V 0 1 I d A
001
V M J S d A S--------------------------ld N 1 3 N
N V --------------------------- 3V d S A
N J V a --------------------------- 3V b S A
N J V J ---------------------------3d Id aAA
A
W
d A d 1 9 N----------------------- d39Ma S I A a A 3
l V b 1 A b
A--------------------------- 3 N 1 90
A1
H S IS 1 A M --------------- - - - - - -
a 1 1 d N 3
---------------------------Ad 9 S J A
AA
W
1I ---------------------------- 3 3
B A I N I I---------------------------S N 3 A d
N A I d S----------------------------- - - S A A
1 I 1 I 3 ld A---------------------------I. )1 N A d
A
b I V S a 1--------------------------- 3
A I S I d -------------- - 99999 9 9 N d A
I V S A---------------------------3 3 S 3
I V S A---------------------------93 S 3 3
I V 1 S d A
------------------------- 3 3 3 1
b A I A )1 a VI13X31lAX3113S3NAdSdlSHOXH1A
V X S V 8 N
b I 1 d a
A0 SI3A31AbdA31Aab3NAdddlWb3NH1l
V A I V d N
9 I S V A -------------------------- - - - - I d
a 3 I -------------------------- A
>1 N
S33A3B
d I N 3
l
W A W 3
H3AA1H
N 3 1 l b H (
----------------------------- 3 A 3
3-------------------- - S A V 1
a----------------------------- A
d 1 0 >
-------------------- A33SS 9 9 A a
aS-- - -- M s >1 A IN
-------- 1 >
I IN,
I W
1 3 3 A a------aa I I A a >
A 1 3 a--------------------------aa A I a
A W
1 3 A a--------------------------aa I A A a
A 13 3 A --------------------------aa I A A
S
a
a
a
a
N
N
6
I H 30
NIH 3 A
NIH 3 A
NIH 3 A
NIH3 A
1 IH3 I
1 I H 3 I
Ul A
AIA A
AWN
A
NlA A
Ald A
A>I A
A1A
A
08
I H 8
A >
I I H
>
A O X
XI
A I 3
A A
A I 3
>
A A
0 I
a
A A
b I
a
I A d A - --------------- A IV A
A aG
- - ---------------9 H M d9 3 I
A A a - - ---------------3 H M d 9 J
3
A a -- --------------- H M 9 A
I A d N -------------- A I S A
A A 1 1 3 a--------------A I a 3 V A
A I A d A--------------a d H S S a
A A A lI A
------------- 3A d a 3A 0
I W I A N I--------------d > H 1 A A S
> A I S --------------d A H VA
1 A A N S 9ANMAHSVJ>NIIN91
I > V A A
A A 3 1 1-------------- I N I 3
I I --------------S H 0 391 A
A H S--------------H d a V A
W A A 9 -------------- H A
AW I l 9 S--------------1 > H V
A A
A A
D 0 9
----------- 09dd A
IA 3 3 I-------------- H >A
A
A 9 A 3 N I-----------A d H AV I A
I V 109 1 ------- H
----- I VHXV
1 A --------------I H AV 1
A d I-------------- H 3A A
M A 1 9 1-------------- A
A
X
11 1--------------S 1313
A A 3 --------------d A H I3 A
d
A 3 H 9 1------------ 1 A 3 d
V A 3 0 9 V
-------------- A > A d
I A 1 3 3 S-------------- 3V 11
I A 1 3 S-------------- 3 1
A A 1 3 1--------------9 3S 1
I A 1 3 3 1 -------------- a
W J I X S 3-------------3S 1 A 1 3
A N--------------H N S I 1 H
I A N------------H A S A 1H
A A N--------------H d S I 1 H
A A 9 N--------------H S I 1 H
W A A 3 I-------------- H A 1 W
A H N 9 1----------- 3 W H A 1
tL 09
13J
9
J S
j s
j3 S
13 s
J 3
1^
A N
A
I
A V
I
A 1
NW S
IW S
1 S
1 S
ij S
S A
13 V
1 s
1 S
A S
13 S
1J 9
13 s
'3 S
13 s
J s
J S
13 S
13 S
T
u& 1 1
93 A b 3/ZS Wo
I A1 1 A 3 S/+91 Al
AI
A 1 I 1 A 3 S/l7E? P>!-V
I S I W A 3 S/9AE $PH
l d 9 A
a A 0 S/S? i-Wld
I S A J d 1 S 9/SI IIIP.
V 9 H I I J 9 W/681 NASH
l I A N i
d 1/?9S 4,a
3 a 1 J 3 1 3 a/zLS ZSgd
91
a A -1 a 0/881 31S
N a I 1 N A 3 N/S 1303
x d A I b A a a/l? I
In
N A 3 N I A 3 W/b 81ZNIN
3 V A d 3 A d S/E? F-)Sd
N 3 I N I A a 3 W* sHZ303
N 3 A N 0 A N 3 WN + 2JpD
> 3 1 A
N A N V/S 8Z303
i1 3 1 J 3 M
a 9/96 ZNIA
13 A 3 M a 9/LII INI
I A 9 1 d M
A 9/6 mtlu
I N A I bA N 9/Z IINS
l A
V I O A
A V/+56 IlH-Sd
3' 3 I N A A a/+?IZ N-A31W
iV 3 S N W S J/+8S X-IW
I 3 N d 3 A N 3/91 L-)qd
a 3 A 1 A
A3 a/IA d-11-IW
33 J 1 A 3 3/01 -I1IWP]
A
A I J N a 1/8?? 3)dO
A W
1 J S J a S/Z? L-JNd
A W
1 J N J a 1/6E?? -3Xd
A W
I1 J N J a i/9?? V-3X>d
1 a I I N J a S/9S? M
I d 1 I 0 a S/S8 ?Edl
I
1 N I b J a H/Z9 ZNdl-
1 1 I b J a 0/s8 lNdl
I
18 1 I A N >/Z6 VYNS
I I N ' 3 J a 9/117 d-dV,
.I d 3 J 0 a/o -Ndv3
Tl
c
\0
00
00 1311
11V
I!
HJ90d
1^M
1:-l4
ssall
sob
B'S#I
S3J
Iqv
SON
NAJ
J6j
sAN
saa
lJqV
H
CAI
I
I
I
I
f
t
f
f
t
t
t
f
f
f
t
t
f
t
t
f
t
I
I
f
t
ols
4 I
4 I
\ I
1 I
1 ;
1 I
[II
[I.
[ .
[ I
I-
iI
4 I
iIl
I
This content downloaded from 132.239.1.231 on Sun, 21 Apr 2013 19:08:54 PM
All use subject to JSTOR Terms and Conditions
140 150
R R I G R-------------------------//-F SEPHARFYAAQIVL
R R I G R-------------------------//-F SE P H A R F Y A A Q I VL
R K S K D-----------------..-- -...-- //-F P N P V A Q I F A A E VCL
R K S Q R-------------------------//-F P NPVAKFYAAEVCL
R K S Q ?R-------------------- //-F PNPVAKFYAAEVIL
R K S R-------------------------//-F PNPVAKFYAAEVCL
R DRGS-------------------------//-F E D STTRFYTAC V V E
Q VG K-------------------------//-FKEPQ AVFYAAEIS I
Q QV G R------------------.-------//-F K E P H A A F Y A A E A
IA
Q QLG K-------------------------//-F KEPHAAFYAAEIAI
Q FG K------------------------- //-F K EPVAVFYAA EIAA
V ARE Y-------------------------//-Y SEADASHCIQQILE
V A R E Y---------------------- //-Y S E A A A S H C I QAQ
I L E
TEKV T
---------------------- -//-L SEKET RKIM RALLE
V D ED YH
-- ------------------- //-LTEVDTMVFVRQICD
I D E D FE------------------------//-L T E R E C I K Y M RQ I S E
I A K G S-----------------------//-F TERDATRVLQMV LD
V QRA
D K----- ?-- ------------ //-M SEQEARRFFQQIIS
R K H G P-- -----------------?//-L S E RAE A A H Y L S Q I L A
I Q H G S--------------- I------//-I A F H A A A K F A R G I A S
I Q H G S-------------------------//-LKEHHA RKFARG IAS
E G I P KDQP----------------------//-L A D I V K K F M M
Q L C K
D R I S ETGATS--------------------//-L DPRL VQ K F T Y Q L V N
D S I P PGQYA--------------------//-M DSS LV K SY L YQ IL Q
D K A P PPGF
---------------------//-L P A E T I K D L M R Q F L R
K DKS IL------------------------//-FTP A DI KA WMLMT LR
Q T - - --------/------------------/-L TDYE IRYY L F E L L K
R D - - --------/ -LP I K G I K K Y I W
E L L R:
S V Y K RFVQRGTVSSKKTW
-- -------- //-F N EL T I S K I A Y G V L N
D E S S EIGG
--------------------//-I D E P Q L A F I A N A VIH.
E E Q G QLSR
----------------------//-L DEFRVWKILVEVAL
S R R LNP
--------------------//-L GRPQIAAVSRQLLS
T E K K VYQ------------------- //-G NSHLIKTVFLQLIS
T ERG A
- --------?-???---?-- -//-L QEDLARG F F W
QVLE
H V Q F TK------------------------//-F QMFQLIDIARQTAQ
H V A TR
------------------------//-F DMVQLIDVARQTAQ
H V A D TR
----------------------//-F D M
V Q L I D V A R Q T A Q
Y G A A GHPEGDAGEPHCRTGGQ---------//-L
S L GKCLKYSLD V V N
360 370
K GETGKY--------------------//-L RLPQLVDMAAQIAS
K E G DGKY----------------- //-L KLPQLVDMAAQIAD
K N P EGQD
-------- -- ??
??---------//-L RLPQLVDMAAQVAE
K DGEGRA---------------------//-LKLPNLVDMAAQVAA
K S D E 5GGK-I-- ? - - //-V LLPKLIDFSAQIAE
K TPSGIK ---------------------1//-L TINKLLDMAAQIAE
K S GSK---------------------//-Q PLPKLIDFS A Q IA E
R E G DGRY
AI------------------//-L HFEDL I YIATQVAS
R R H EKTLI ?I/- N--M-A-F-F-F C//-G NMGLLLDMCIQVSK
R ECN RQE --------------------//-V NAVVLLYMATQISS
R SAGRET ---------------------//-LDAVALLYMATQIAS
R RTD KSL
---------I----------//-L PPIILVQMASQIAS
R TEGAR ------------------------//-L RVKTLLQMVGDAAA
R E H K N
AN-I--- ------//-I GSQYLLNWCVQIAK
R E N RGR ----------------------//-L G S QDLLNWCM QI A K
R N N DK----------------------//-I GSKALLNWSTQIAK
R S L RPEAENNPGRPP
-I-- ---- ?--- //-P TLQEMIQM A A E IA D
R SLR PEMENNPVLAP
- ? I------ -?//-PSLSKMIQMAGEIAD
R A H RPRSGMRPDDVSLIAWMTGNVQP
-- -//-P TYGRIYQVAIEIAD
R K A RMAT'FYGPL
------------------//-L TLVDLVDLCVDISK
R A A RATSTQEPQPTAG
-------------- //-L SLSELLAMCIDVAN
R SHGPDAKLLAGGEDVAPGP
--------//-L GLGQLLAVASQVAA
R N E T HN.-------------- //-P TVKDLIGFGLQVAK
H R N KHTFLQRHSNKHCPPSAELYSNAL{-77-}L
SYTD L VGFSYQVAN
RRK A EAMLGPSLSPGQDPEPGVDYKNI{-48-}L
ELRDLLHFSSQVAQ
R R K RDSFICSKQEDHAEAALYKNLLHS{-54-}L
DLEDLLSFSYQVAK
R E S R KVGPGYLGSGGSRNSSSLDHPDERA-//-L TMGDL I S F A WQISQ
1
A
A
A
A
.A
6
6
6
A
A
v
6
A
A
A
iA
6
6
6
A
A
S
G
6
A
A
At
a
C
C
C
F E
r:F E
ALE
A:E
L E
iL E
AL E
t!-L
E
(I~F
A
L F
L F
iL F
VL F
V L
k
VL
"I C
.v E
,V R
1
V E
V A
,1L I
UL Q
:I A
iV N
MD
I V
L D
L D
:L K
L
MD
V D
: V E
-V R
M D
M
D
M D
L L
6
Ai
.1
LI
61
Ii
:
i
LII
IFII
I'
II
'II
11
MP
MA
MA
MA
MA
MA
M A
ME
MT
ME
MS
MS
ME
MN
MS
MS
MA
MA
MA
CV
CS
MV
MK
MD
MA
MA
MQ
V6L:
;TI
F
L
:
II
'
n i.
?
ii t
'i AL^
ir iH:-
~YR
V!YL
.IC
iV
Ei
i,
Vt
VC
NIl"
EII
BlI
y :!
iF.;i!
F l
V
irl
VCiij
V~Li
V-L-'
VYt1
iY'iii~
VLi
Vtffl
Mt
IA
160
H SL-D L I
H SL-D L I
H SK-D I I
H SK-D I I
H AH-N I I
H SK-D I T
H SK-G I I
H KR-G I I
Q S K-G I I
H N Q-G I I
H TK-G I L
H QM-G V
H Q M-G
V V
H KL-N I V
H KM-R V L
H KQ-G I V
H AL-G I T
H RH-K I V
H RF-R F R
H A N-N I V
H A N-N I V
H S H-R I L
H SR-R I I
H SR-R V L
H AN-C I V
H RN-F I L
H SM-G I M
H SK-G I I
Y R QYK I I
K E QHN I I
H HK-N Y V
H RQ-G I I
H SV-G I Y
H NC-G V L
H AK-N I I
H AK-N I I
H AK-N I I
H SQ-S I V
380
E R M-N YV
E R M-N
Y I
E R M-N Y I
E R M-N
YI
E R K-N YI
E E R-N YI
E Q R-N YI
E S K-Q LI
E R H-N YI
E K K-N F I
E S R-N Y I
E A R-H F I
E S K-C C I
E D R-R L V
E D V-R L V
E E K-R L V
N A K-K F V
N A N-K F V
A A K-K F V
E R M-H F I
E D M-H F V
A G L-H F V
A S K-K F V
A S K-N C V
A S K-N C I
A S K-N C I
A E M-K L V
.i
i
i
it i
V.,
VI
Ni
I'
NI'
li i
Ni
1I
#ii
NIII
Ni
Ni
Nii
.Nii
Ni
N i
Ni
Ii?i
.11l
-NI
I
N:i~
il:I
-I F
#li
1;#1
NI
NI
I
11
1I
RLI
NI
Ni
Ni
Wi
I
Iii
RI
NI
MI
It
IL
?L
It
It
IL
at
It
i.t
IL
It
It
It
It
It
I V
II
IV
IL
II
II
IN
IL
I t
It
Itt
It
IL
III
II
It
IL
17
KPE
KPE
KPE
KPE
K P E
KPE
KPE
KLA
KLA
KLA
KLD
KPE
KPE
KPE
KPE
KPE
KPE
K P E
KPEE
KIE
KIE
K PQ
KPQ
K PQ
KPE
K PN
K P H
K P T
KPS
KPT
K PA
KTE
KPE
K E
KS N
KAN
KAN
K PA
391
A A
A A
A A
S A
RA A
A A
A A
A R
A R
A A R
A A R
A A R
A A R
A A R
A A R
A A R
A A R
A A R
A A R
A A R
A C R
A T R
A A R
ft A R
ft A R
ft A R
ft A RI
L
I
I
I
I
L
V
V
V
V
I
I
A
A
A
L
t
I
A
I
L
L
I
I
L
V
I
V
I
I
I
I
I
I
I
I
I
I
i
V
i
Ii
VI
C?
C
M
B
i
I
180
iL IDQQ G--?- ? Y I C
L IDHQ G-------Y I C
L L DK N G-------H I K
L LDKN GA- --H I K
L LD R N G- ?H I K
L LDKNG-------H I K
I LD H R G-------Y A K
M LA SE
GE -----H I K
IM LD
DSE G?-- ?--H I K
M
L D A E G-------H II K
L L D A D G-------H V K
L LAS K LKGA----A
V l<
L L A S K LKGA
----A V K
L L DDD
M-------N I K
L CV N T TGH
-----L V K
M
CV N K TGT
-----S I K
L YY H P GTDS----K I I
'L L E H L ?-- ?N V K
!L IKV N EQ
------Q I K
M I DS S-----E I K
M ISSS G-------E I K
L I NKD
G-K - -N L K
iL IDKE G-------N L K
L I D K G- S?-T I H
L VTSG G-----QIT V K
L FS PD G----??-Q I K
M I D H E NR
------K L F
L FNLE LG
- ---R G
;LINSK G-A ----QI K
L CSAN QG
----T V
A
M IT F E G-------T L
F I N T P E-------D I I
M VG N D GN
A --N T V
.L I DL S RG
------E I H
F LHE G L-------T V K
F L HE G L-
------T V
FLHE G L------- T V
L IS EQ D------- V C
400
L V GEN L?--- VCK
L V G EN L-------VC K
L V GER L- --AC .
L V GNG L-?-- -ICK
L V S E S L------ C K
L V S DT L------- SC K
L V S A S L---VC
L I G E N N----V A K
L V G S E
NH --- V
L V G E N H-----?-- V Kl
L V G D N K------.LV Kl
L V S E.H N------I.V K
L V TEK N .... L K
L V K TP Q-------H V K
L V KSP N----- HV K
L V RLL --------AG E
MAH
M V A H D F?--??- TV K
M V A E D F-------T V K
M V A DD L-----?TV K
L V S V K DYTSPR--I
V K
L V T E S TGSTDRRRT
V K
L V GQG L----?- VV K
M L DEK F-----TV K
L I CEG K----- LV K
L L TNG H-
----VA K
L L THG
R. ---I T K
L V A E G R--- K M K
VI
V
I
I
I
I
I
I
I
V
V
V
I
I
I
A
?
A
V
I
I
I
V
I
I
I
I
190 200
AKRVKGRT - ------//-W T LCG
AKRVA KGRAT- -------//-W T L C 1
A K YV P DVT- -------//-Y T L C
A KY V PD VT- -------//-Y T L C
A K E V Q TVT- -- //-W T LCG
A K Y V P D V T -------//-Y T L C G
1
A K K I G F G K K T------ //-W T F CG 1
C K E H MMD GV TT
---//-RT F CG T
CK EN IW DGV TT
-----//-KT T CG
C K E N V F P G S TT
- //-R T F C 1
C K E N I V G D K TT ---- //-K T F C G
i
A I E V EG EQ Q AW
---- //-FG FD A 1
A I E V Q GD Q Q AW?- //-F A F AG
A
S C Q L D P G E K L?- I-//-R EV CG
A1
A RR Y N P N E K L- - A//-K
V N F 1
A RR L E S A GS L-- //-K V L F 1
AS A RKKG DD CLM
-- -//-K T T CG 1
SN I MTDGN F L -- //-K T S C G
A TV E P N D S C L---- //-E N Y C G
S NI YDSR K Q L- //-H T F CG 3
SN I FDYRK Q L-- //-H T F CG S
P E
P E
P D
P D
PD
P D
PE
P D
PD
PD
PD
P
P G
PS
PE
P E
PE
PN
L H
L Y
L Y
A R A F G V P L RAY - //-T HE I V T L W
A R S F G V P L R NY
A- I//-T H E IV T L W
A R A F G I P I R VY -- //-T H E VV T L W
ARIYSYQMA L---- //-T PVVV TL W
A R AI PAP HE IL---//-TS NVV
ATR Wi
A E F Y H P GQE
YA
---F ?//-N V R V Ai R Y
A E AQM
MD Y K S MISS{-77-}A N R A G iR Gi
S K K LI N S I A ---//-D TF VG S T
S G N LV A S LA -------//-KTN IG C Q S
A S VW P V P RG ------//-M ER EGDC El
A C F VQ G S R S SPFP---//-Y G I AG T I D
A TTEPY SS ?- ???--//-D F G CG L F
G A L L K D T V Y ---- //-T D F DG R V
A T V K S RWS G SQQV
---//-EQ PTG I V L I
A T V K T R W S G AQPL---//-EQ PSG IIV L
ATS VKT R WS G AQPL
--//-EQPS G
I? V .
SE K L EDLLC FQTPS I//-Y P LAGG Y T
4 10 42
0
A L A R
L I E D N E Y TAR
----//-Q G A K F
A L A R LI E D N E Y TAR
----//-Q G A K F
A L A R L I K D D E Y NPC
- -
-//-Q G S K F
IA L A R L I E D N E Y TAR
----//-Q G A K F
IA L A R V I E D N E Y TAR
----//-E G A K F
A L A R L I E D N EY TAR
----//-E G A K F
SI EL A R I
V F NEAFY
TAR----//-EG A KF
I E VISADADFEY CPK----//-Q G S ARF
SIA L A A YVLDDQY TSS-- -//-GDG T F
I
A L S RLMTGDTY TAH
- -
-//-AG A K F
AI L ARLMR DD T T AH
----//-AG A K F
AI L ASRFMKEDTY TAH
----//-AG A K F
SI M SR EE AD
A A
SGS---//-G L R Q V
T L A K L LG A E EK YHA---//-EG G K V
STI L AAR L ALDIIDET EYHA---//-DG G K V
H LAKLLSS D S N EYKA---//-AG G K M
DIG M TRDIYE TDY YRKG---//-GK G L L
G I H
TARD IY ESTDAY
YRKG---//-GK G L L
I MM
TR I Y ETDY YRKG---//-T K G L L
G L A RDIY KND Y YRKR---//-G E G L L
IG L AR I YKSD Y YRKE---//,'-GEG L L
DI M R IYSTDY YRVG---//-GR T M L
IA L A RDMY DKEY YSVHNK-//-T
G A KL
cI LSA RRDIMARDSN YISK---//-G S T Y L
IG L ARD I MNDSN YIVK---//-GN A R L
l L ARDI KNDSN YVVK
- //-G NA R L
s L S R D V YE
E EDS YVKR---//-SQ G R I
VII
!P
:p,
:P!
~P4
'P
P~
Pi
P
P
P
pI
P?
~P
P.
II
I
II
PI
I
I K
I K
I K
I K
I K
I K
I K
K
I K
V K
I K
I K
VK
I K
I K
I K
V R
V R
V R
V R
V R
I R
V K
L K
V K
V K
V K
V
Y
y~
Y
V
Y
V
V
V
Yi
F
F
V
i
i
F
V
V
V
V
V
F
F
V
V
i.
V
Y:
V
V
I
V
:!
'a
L
I
V
I
I
L
L
L
A
A
A
A
A
A
A
R
R
K
R
M
M
I
N
M
S
M
M
M
R
[i
iV
I
IV'
Vi
:V
I
wI
VI'
V1
VII
VII
II
MI
A
A
A
A
A
A
A
S~
Ai
A
Ai
I
A?
$
A
A
A
A
A
'$
P
?A-
430
I
~Al
Al
iA
A
A
A
P
?I
At
A
A
A
A
A
A
i
210
I I------- //-L S KGYN K-A V
I i------ //-L S KGYN K-A V
V V----- //-S T KPYN K-S I
v VA- ??-//-S TKPYN K-S I
V I------- //-T TKPYN K-S V
V V-------//-S TKPYN K-S V
I I------- //-L NKGHD I-S A
I I-------//-AYQPYG K-S V
I I-------//-A YQPYG K-S V
I I-------//-A YQPYG K-S V
I I--- -//-L SAYQPYG
K-S V
V L--- //-R K DPYG K-P V
V L-------//-R KEAYG K-P V
I IECSMND-//-N
HPGYG K-E V
v V-------//-N YDQISD-K T
V I-------//-N YEPIG Y-E T
V L-------//-V RKPYT N-S V
V I-------//-S GKLYA GPE
V
I V-------//-S H KPYR GAPA
L L----- //-K ANPYT GPE
V
L L-------//-K AQPYT GPE V
V LL
------//-G GKQYS T-G V
V LL
- -- -//-GSRHYST-G V
V LL
-- -//-G SARYS T-PV
V LL
---- //-QST-YAT-P V
L LF?--- //-GAKHYT S-A I
LLV
---?-//-DYQMYD Y-SL
V LM
--- //-K CGAQS T-K I
R I-------//-Q GNVYSI-K G
R IKSLNP--//-D R A T Y T V-Q S
V L-------//-ANHLYD K-PA
V L ----- //-A GDPYT T-T V
C QREVK{-19-}S
S S F A T A-P N
W IR
---- //-YHRYHG R-S A
V IRMQ---
-//-DNNPFS F-Q S
V IRMQ
----//-D PNPYS F-Q S
V TRMQ
---- //-DPNPYS F-Q S
L LS---- //-K GEGVT P-K A
N A-------//-L Y G R F T I-K S
^A
A--- //- L Y GRFT I-K S
A A-------//-L F GRFT I-K S
A A------- //-L YGRFT I-K S
A I-------//-N FGCFT I-K S
A I-------//-N YGTFT I-K S
A I------- //-N FGSFT I-K S
A I ?----//-I YGKFS I-K S
V L------- //-N YTRFS S-K S
S L------- //-A YNKFSI-KS
G L------- //-A YNKFS T-KS
G LS
--- //-AFNTFSS-KS
A L-- --I//-N YGRYS S-E S
SI -------//-L HRIYT H-Q S
S I---- I/- AL
RRRFT H-Q S
C I------- //-R NRVFT S-K S
S LF
----I//-K DGVFT T-S S
S L---- I-//-K D GVFT T-Y S
S L-------//-R DGVYS S-AS
S L-------//-M D GIFT T-Q S
S L------- //-V DGLFT T-Q S
S I ?
I----//-L Y RKFT T-E S
S L------- //-Q TQ K F T T-K S
S I------- //-F NSLYT T-L S
S I---- //-F D C VYT V-Q S
S I-------//-F NCVYTF-E S
S L----- //-F DHIYT T-Q S
VIII 00oo
+-
.)
0
w
Q
z
M
S-
^n
cAPK-a
cAPK-p
SRA3
ITPKI
IPK2
TPK3
cGPK
PKC-a
PKC-p
PKC-7
DPKC
CaM
Il-a
CaMII-p
PhK-7
MLCK-K
MLCK-M
PSK-HI
SNfE
I
niml
KINI
KIN2
CDC28
cdc2+
CDC2Hs
PSK-J3
KIN28
DCKII
CDC7
STIE
PBS2
weeI *
HSVK
PIM-
I
Rdtl
A-Kat
PKS
Mos
Src
Yes
fgr
fYN
IYN
LCK
HCK
Dsrc64
Dsrc28
Abl
Dash
Nab
)
Fes
EGFR
NEU
DER
INS.R
IGFIR
DILR
Ros
7less
IRK
MHT
PDGFR
CSFIR
Kit
RET
I
r~
r
[:
ti
I
:1
.i
I
I
w.
A
d
A
:
1
:1
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
P
I
I
I
I
I
I
I
I
i
) I
I
) I
) I
) I
) I
) I
) I
) I
I
) I
) I
) I
I
I
I
p
p
p
p
p
p
p
p
p
p
p
p
p
L
L
L
p
p
p
p
p
p
L
p
p
p
I
This content downloaded from 132.239.1.231 on Sun, 21 Apr 2013 19:08:54 PM
All use subject to JSTOR Terms and Conditions
230 240 250
LIYEMAA-GYTPFY F ?-? DQ?
S Q ITYEKIT
L I Y E M A A - G Y P P F F A-----------D Q P I Q I Y E K I V
L I Y E M
A A - G Y P P F F A-----------D Q P I Q I Y E K I V
L I Y E M L A - G Y T P F Y D-----------S N T M K T Y E K I L
L I Y E M L A - G Y T P F Y D-----------S N T M K T Y E K I L
L I Y E M L A - G Y T P F Y D-----------T TT P MKT Y E K I L
LIYEMLA-GYTPFY N-----------S N TMKT YENIL
LMYELLT-GSPPFS G-----------P DPMKTYN IIL
260
S G - K-V R F P S H
S G - K-V R F P S H
N A - E-L R F P P F
..N A - E-L R F P P F
Q G - K-V V Y P PY
N A - E-L K F P P F
R G I DMI
E F P K K
- -------------//-----------------F S S D L K D L
- - - - - - -------------//-----------------F S S D L K D L
- - - ------//-----------------FN E D V K D L
- - -------------//-----------------F N E D VK D L
- - --- -//-----------------FQ P D V V D
- - - -------------//------------- F H P D A Q D L
_ _ _ -------------//---------------I A K N A A.N L
LL Y E M L A - Q P P F D G-----------E D E D E LF Q S I M E H - N-V KS - - - - - - -------------//-----------------L S K E A V S I
LL Y E M LA-G Q A P F E G --- E D E DEL F Q S I M E H - N-V A Y P K S - - - - - - -------------//-----------------M SK E A V A I
LY E M L A - Q P P F D -----------E D E E E L F Q A I M E Q - T-V T Y P K S - - - - - - -------------//-----------------L S R E A V A I
L L YE M L V - Q P P F D G-----------E D E E E L F A A I TD H- N-V S K
S - - - - - - -------------//-----------------L S K E A K E A
I L Y I L V - YP P F W
D-----------E D Q H R Y Q Q I K A - A-Y D F P S P E W
D T-- ------------//-----------------V T P E A K D L
I L Y IL V - Y P P F W D-----------E DQH K L Y Q Q I K A G- A-Y D F P S P E W D T - - -------------//-----------------V T P E A K N L
IMYTLLA-G S P P F W
H-----------R K Q ML M M I M S G- N-Y Q FG S P E W D -- ------------//-----------------Y SDTVKDL
I T Y M L L S - L S P F G-----------D DD T E T L N N V L S - N-W Y F D E E F E A - - ------------------------------V S D E A K D F
I C Y I L V S - LS P F M
G-----------D N D N E A N V T S A - T-W
T F D E A F D E -- ------------ ----------------I S D D A K D F
I A Y I S - M P FE D-----------D N R R L Y R Q I L RG - K-Y S Y S G E P W P S -- ------------//-----------------V S N LA K D F
I Y V M C - R P F D D-----------E S I P V L F K N I S N - V-Y T P K F - --------------//--------------- L S P G A A
I Y S S - N K P F G G-----------Q N T DV I Y N K I R H - A-Y D L P S - - - - - - -------------//-----------------I SS A A Q D L
V L F V L V C - G K V P F D D-----------E N S S V L H EK I K Q G - K-V E Y P Q H - - - - - - -------------//-----------------L S I E V I S L
V LYV V C - K V P F D D-----------E N S I L H E K I K K - K-V D Y P S H - - - - - - -------------//-----------------L S I E V I S
I F A E M
C N - R K P I F S GDSEI-------D Q I F K I F R V L G T P N E-A I W P D I V Y L P D F KPS---------- //--FPQWRRKDLSQVVPSL
D P R G I D L
I F A E M I R - R S P L F P GDSEI-------D E I F K I F Q V L G T P N E-E V W P G V T L L Q D Y KST---------- //--FPR WKRMDLHKVVPNG
E E D A I E L
I F A E L A T - K K P L F H GDSEI-------D Q L F R I F R A L G T P N N-E V W P E V E S L Q D Y KNT---------- //-- FPKWKPGSLASHVKNL
D E N G L D L
'I F A E M F R - R K P L F C GNSEA-------D Q L G K I F D L I G L P P E-D D W P R D V S L P R G A------------ //--FPPRGPRPVQSVVPEM
E E S G A Q L
I F A E L M L - R I P Y LP GQNDV-------D Q M E V T F R A T P D-R D W P E V S S F M T YNKLQI--------//YPPSRDELRKRFIAA S E Y A L D F
M AS M I F - R K E P F F HGHDNY------DQ V R I AK V GT E E L-Y AY D K Y N ID L PRFHDILQRHS--//--RKRWERFVHSDNQHLV
S P E ALD F
I LLS LL G - R R F P M F QSL---------D D A DS L E C T I FG W-K
E R K C A AL H G L GFEASGLIWDK{-61-}VAYELKKYQEEIWSD
H Y W C F Q V
MI I E L V T - G E F P L G GHN---------D T P D G I L D L L Q R I V N-E P S P R L P K D R I - -------------//-----------------Y S K E M T D F
S I LE M
A L - GR Y P Y P P-----------E T Y D N I F S Q L S A I V D-G P P P R L P S D K - - -------------//-----------------F S S D A Q D -F
T V FE A A A - N I V P D NG----------QS W Q K R S D L S D AP-R S S T D N S S T SSSRETPANSII-// -----------------G Q L D R
V I FE T A V - HN A S L F SAPRGPKRGPCDS
Q I T R I I R Q A Q V H V D-E F S P H P E S R L T S RYRSRAAGNN---//--RPPYTRPAWTRYYK-M
D ID V E Y L
I L I N L C C - K R N P W K RAC---------S Q T D G T Y R S Y V H N P S-T L L S I L P - - - - - -- -----------//-----------------I S R E L N S L
L YD M V C - D I P F - ------------E H D E E I I K G Q V F F R Q-T -- //-----------------V S S E C Q H
V L YE L M - G E L P Y S HI----------N N R DQ I I F M V G R G Y A-S P.D L S K L Y K N - - -------------//-----------------C P K A M K R L
V L YE L M T - G S L P Y S HI----------G C. R D Q I I F M V G R G Y L-S P D L S K I S S N - - -------------//-----------------C P K A M R R L
V L Y E LM T - G S P Y S HI----------G C R D Q I I F M
V G R Y L-S P D S K I S S N - - -------------//-----------------C P K AM R R L
T L W Q M T T - K Q A P Y S - E R Q H I LY A VV A Y D L-R P SL S A A V F E D S L------------//-----------------P G Q R L D V
LLT ELT
L Q T E L V
L L T ELI
L L T E L V
LL Y E I V
L L TE IV
L L M E I V
LL M ELF
L MW
EI F
460 470 480 490
T K G R V P Y P G-----------M VN R E V L D Q V E R G Y R-M
P C P P E - - - - - - -------------//-----------------C P E S L H D L
T K G R V P Y P G-----------M V N R E V L E Q V E R G Y R-M
P C P Q G - - - - - - -------------//-----------------C P E S L H E L
T K G R I P Y P G-----------M N K R E V L E Q V E Q G Y H-M P C P P G - - - - - - ------------//-----------------C P A S L Y E A
T K G R V P Y P G-----------M N N R E V L E Q V E R G Y R-M
P C P Q D - - - - - - -------------//-----------------C P I S L H E L
T Y G K I P Y P G-----------R T N A D V MT A L S Q G Y R-M P R V E N - - - - - - ------------//-----------------C P D E L Y D I
T H G R I P Y P G-----------M T N P E V I Q N L E R G Y R-M
V R P D N - - - - - - -------------//-----------------C P E E L Y Q L
T Y G R I P Y P G-----------M S N P E V I R A L E R G Y R-M P R P E N - - - - - - -------------//-----------------C P E E L Y N I
T Y G Q V-P Y P G-----------M H S R E V I E N I E R G F R-M P K P T N H Y - - - - -------------//-----------------F P D N I Y Q L
T C'G K'M
PEY
G R-----------L K N T E V V E R V Q R G I I-L E K P K S - - - - - -------------//-----------------C A K E I Y D V
L L W E I A T Y G M
S P Y P G-----------I D P.S Q V Y E L L E K D Y R-M
K R P E G - - - - - - -------------//-----------------C P E K V Y E L
L L W E I A T Y G M S P Y P A-----------I D L T D V Y H K L D K G Y R-M
E R P P G - - - - - - ------------------------------C P P E V Y D L
L L W E I A T Y G M A P Y P G-----------V E L S N V Y.G,L L E N.G F R-M
D G P Q G - - - - - - -------------//-----------------C P P S V Y R L
L L W E T F S L G A S P Y P N----------L S N Q Q T R E F V E K G G R-L P C P E L - - - - - - -------------//-----------------C P D A V F R L
T V W E L M T F G S K P Y D G-----------I P A S E I S S I L E K G E R-L P Q P P I - - - - - - -------------//-----------------C T I D V Y M I
T VW E L M TF G A K P Y D G-----------I P A R E I P D L L E K G E R-LP Q P P I - - - - - - -------------//-----------------C I D V Y M I
T I W E L L T F G Q R P H E N-----------I P A K D I P D L I E V G L K-L E Q P E I - - - - - - -------------//-----------------C S L D I Y C T
V L W E I T S L A E Q PY Q G-----------L S N E Q V L K F V M D G G Y-L D Q P D N - - - - - - -------------//-----------------C P E R V i D L
V L W E I A T L A E Q P Y Q G-----------L S N E Q V L R F V M E G G L-L D K P D N - - - - - - -------------//-----------------C P D M L F E L
V L W E M A TL A A Q P Y Q G-----------L S N E Q V L R Y V ID G G V-M
E R P E N - - - - - - -------------//-----------------C P D F L H K L
L I W E I L T L G H Q P Y P A-----------H S N L D V L N Y V Q T G G R-L E P P R N - - - - - - -------------//-----------------C P D D L W N L
L C W E I L T L G Q Q P Y A A-----------R N N F E V L A H V K E G G R-L Q Q p
P M - - - - - - ------------//-----------------C T E K L Y S L
V L W E I F T Y G K Q P W Y Q-----------L S N T E A I D C I T Q G R E-L E R P R A - - - - - - -------------//-----------------C P P E V Y A I
V L W E L M T R G A P P Y P D-----------V N T F D I T V Y L L Q G R R-L L Q P E Y - - - - - - -------------//-----------------C P D P L Y E V
L L W
E I F T L G G T P Y P EL----------P M N D Q F Y N A I K R G Y R-M
A Q P A H - - - - - - -------------//-----------------A S D E I Y I
L L W E I F S L G L N P Y P GI----------L V N S K F Y K L V K D G Y Q-M A Q P A F - - - - - -------------//-----------------A P K N I Y S I
F L WE L F S L G S S P Y P GM----------P V D S K F Y K M I K E G F R-M
L S P E H - - - - - - -------------//-----------------A P A E M Y D I
L L.W
E I V T L G G N P Y P G-----------I P P E R L F N L L K T G H R-M E R P D N - - - - - ------------------------------C S E E M Y R L
X
2
'T-5
I
t
S
I
V
V
I
I
ii
ijl
if
ft
70
R
R
S
S
K
K
280
N L L Q V D L TK F
N L L Q V D L T K F
RLI T R D L S Q
RLI TRDLS Q L
K LI TAD LT R I
KL I T R D L S E
KLCRDN PS E
KGLMTKHPGK
KG LITKHPG K
KGFLTKHPAK
K G F L T K Q:P N K
N KMLTINPS K
NQM LTIN P A K
SRF LVVQPQK
SN LI VK E Q G A
SNLLKKDMKS
DRLLTVDPGA
K R MLI VN PLN
HRMLDVNPST
SKM LV V D P K R
D K L LAYDPIN
SAMLVYDPAH
SKMLIYDPAK
EEMLT F NPHK
CGMLTMNPQK
DKLLRYDHVD
EQ C F EMD PQ K
NRCCIKNERE
S L C LQ K IPE R
E W
M L S P E P.R N
CKALTFDG A L
N R I F D R N P K T,
KW CLSLRP SD
ADC VKKV KEE
SD CE K FQR E E
SDCLKFQREE
S D C L K F Q R E E
QRCWRPSAAQ
600
C Q CW R K E PEE
NLCWKK D PDE
EQTWRLDPE E
IHCW K KDPEE
KM C W K E K A E E
RLCWKERPED
MRCWKNRPEE
LQCWDAV P E K
KLCWS H G P EE
R A C W
Q W
N P SD
RQCWQWDATD
LQCWNWSPSD
EQCWAYEPGQQ
VKCWMIDADS
VKCWMIDSEC
LSCWHLDAAM
RMCWQFNP NM
.R M CW Q Y N PK M
QRCWHHRSSA
TQCWAQEPDQ
LLCWRTDPWE
RGCWQREPQQ
LQCWKQ E PD K
XI
GN L
GN L
GN L
GN L
G N L
G N L
G NL
290 300
KDG V N D I K N H K W F A TT/48
KN G V S D I K T H K W F A T T/50
QNG T EDV K N H P W F KE V/53
Q NSGSRDI
T E D V K N H P W F K E /53
Q S G S R D I K A H P W F S E V/53
Q NG S EDV K N H P W F N E V/53
K N G V K D I Q K H K W F E G F/50
0\
?Ci
0
LG CG P E.G E R D VR EH A F F R 1/72
LG C GP EGER DI K E H A F F R Y 1/70
L G SGPDG E P T I R A H G F F R W /80
L G C G S S G E E D V R L H P F S R R 1/37
I T A A E A L K - - - - - H P W I S H R/203
I T A H E A L K - - - - - H P W
V C Q R/258
Y T A E EA L A----- H P F F QQ Y/96
M S A A Q C L A-----H P W L N N L/51
L N C T Q C L Q - - - - - H P W L Q K D/27
M T ALQALR -- - - - H P W
VV S M/139
I S I H E I M
Q - - - D D W F K V D/324
I T I P 'E F F S - - - - - H P F L MG C/109
A T L K Q V V E - - - - - H H W
M
V R G/663
A T L K NV V E- - ---H P W
MN R G/773
I S A K R A L Q - - - Q N Y L R D F H
ISARRAAI-----HPYFQES*
ISAKRALQ-----QNYLRD FH*
I S G K M A L N- ----H P Y F N D L/7
I.S A F R A L Q- - - - H S Y L H K D/5
WTAVQCLE------S DYFKEL/13
L T A R E A M
A - - - - - HP Y F L P I/11
S S A E D L L K-----T P F F N E L/35
S S I H E LL H----- D
HD L I M K Y/46
P T Y A ALT E - - - - - H P W L V K Y/84
P T ID I LA T D - - - E V C WV E M/32
P S AAELLC----- L F Q Q K *
I T L P E L S T - - - - - L V S N C K N/170
P S F E E I R N - - - - - H P W
M
Q G-D/20
P L F P Q I L S S I - - -E L L Q H S L/35
P L F P Q I L A T I - - - E L L Q R S L/32
PLFPQI LATI ---ELLQRSL/15
PSARLLLVDL---TSLKAELG*
510
PTFEYLQAFL ---ED
PTFEY I QSFL - - - ED
PTFEYLQSFL - - - ED
PTFEYLQSFL - - - ED
PTFDYLQSVL - - -DD
PTFDYLRSVL - - - ED
PTFEYIQSVL- - -DD
PTFEFLNHY F - - - ES
PAFRVLMDQL---AL
520
Y F T S T/10
Y F T A T/10
Y F T S A/10
Y F T A T/10
F Y T A T/8
F F T A T/8
F Y T A T/8
F S V T S/9
V A Q T L/2
P S F AE I H Q A F - - - E T M F Q ES/630
PTFKSIHHAL - - - H M F Q V
QG/9+
P R F R D I H F N L - - - E N L I SS N/137+
PS FS TI YQ EL - - - QS I R KRHR
P K F REL II E F - - - S K M
A R D P/235
P R F RE L V S E F - - - S R M A R DP/272
PT F K Q L T TV F - - A E F A R D P/183
PT F LE I V N L L - - - K D LH PS/85
PSFLEIISS I - - - K M E PG/94
P S F LD I I A Y L - - - E P Q C P NS/95
P T F HR I Q D Q L - - - Q L F R N FF/38
PSFRR C Y N TL - - - A I S
DHAISTD L/66
HS I KDV H A R L - - - Q AA A P/8
P S FS E L V S R I - - -S A I F S T F/46
P P F SQ LV LLL- - - ER LLGE G/134
P T F Q Q I C S F L - - - Q E Q A Q E D/55
P T F K Q I V Q L I - - - E K Q I S E S/45
PVFADISKDL---EKMMVKR/60 00
_
220
cAPK-a
cAPK-p W
SRA3 W
TPK1 W
TPK2 W
TPK3
cGPK Y
PKC-a
PKC- W
PKC-p W
DPKC W
CaMII-a L
CamlI
-P I
PhK-7 M
MLCK-K M
MLCK-M M
PSK-HI M
SNFI V
niml+ V
KINI D
KIN2
CDC28 T
cdc2+
CDC2Hs I
PSK-J3
KIN28 I
DCK
I M
CDC7
STE7 V
PBS2
weel
HSVK
ranl+ V
PIM-I A V
Raf ' V
A-Rdf V
PKS V
Mos
iII
i::
SI
I
W I
l i
VBi'
B#S
IV-SW
I :s1*
FS'
1
Wi.
ViS
II 'S
VAt
I
I
I
I
I
I
B
I
i
I
M
i
I
-V
?
I
I
T
I
I
I
I
L
. '.
I
y
I
r V{
.
Al,
450
I
I
?
i
i
i
i
i
V
i
i
't
i
i
i
11
I
iii
i
V t
.3'
3'
3'
3'
3'
3'
3'
.3
I'
3'
Src
Yes
Fgr
FYN
LYN
LCK
HCK
Dsrc64
Dsrc28
Abl
Dash
Nabl
Fes
EGFR
NEU
DER
INS.R
IGFR
DILR
Ros
71ess
TRK
MET
PDGFR
CSF1R
Kit
RET
IX
Ii
I:
k
I
I
I
I
I
I
I
I1
I
I1
I
I
I
I
I
I
I
I
I
Ie
(
I
L
L
F
F
L
F
L
Y
F
F
Y
c
C
T
L
I
L
C
C:
F
F
I
V
I
v
V
L
V
L
L
L
A
L
L
y V
V
F
v
v
v
v
ii
i'-
v
ii
v
v
V
v-
v
v
VI
v
m
V.
v
v
v
v
v
v
v
v
Vv
This content downloaded from 132.239.1.231 on Sun, 21 Apr 2013 19:08:54 PM
All use subject to JSTOR Terms and Conditions
Fig. 2. Similarity profile of protein kinase catalytic domains. For each 90
position in the alignment shown in Fig. 1, a relative similarity score was
determined based on the "structure-genetic" scoring matrix (116) for amino
acid similarities. Similarity scores were calculated as the sum of all possible a 80 -
pairwise comparisons between the individual amino acids at each position 8
and expressed as the percentage of the highest possible score (that is, the
score obtained when an identical residue occupies the position in all 65
aligned sequences). To smooth out the curve, a 9-position running average E 70-
of the relative scores was determined, and every third position was plotted. a
Positions that contain gaps for ten or more of the sequences were not ,
included in the profile; however, the locations of the major gap sites are 60 -
indicated by breaks
in the curve. The mean relative score for all the positions
included in the profile is 66 with a standard deviation of 14.9. Relative
similarity scores obtained when the catalytic domain sequences were ran-
domly scrambled had a mean of 47 and standard deviation of 1.85. Roman 50
numerals indicate conserved subdomains.
.o.
Raf
A?Raf
STE7 -
Fig. 3. Deduced phylogeny of
protein kinase catalytic do-
mains. The phylogenetic tree
was constructed from the multi-
ple alignment shown in Fig. 1. I i
The tree-building concept of
Fitch and Margoliash (117) was
used as implemented by Feng and Doolittle (118). Briefly, similarity
scores were obtained
for all possible pairwise comparisons and transformed
into a difference
matrix from which
branch order and length were determined. Programs were run on a VAX-785 computer
equipped with 40 megabytes physical memory under virtual memory operating system
(VMS). Systems limitations required that the branch lengths for the protein-serine/
threonine and protein-tyrosine kinases be calculated
separately,
and the tree shown is thus
a composite of these two determinations. The position of the protcin-tyrosine kinase
cluster was determined by including two protein-tyrosine kinases (Src and EGFR) in the
protein-serine/threonine kinase tree construction. The individual sequences are indicated
by the abbreviated names in Tables 1 and 2. The protein-tyrosine kinases arc not labeled
in (A), but are shown in the cluster enlargement in (B). The tree is shown "unrooted"
in
(A) as the branches are all measured relative to one another with no outside reference
point. The scale bars represent
a branch length corresponding to a relative
difference
score
of 25. The tree depicted is likely to underestimate distances between the least related
members of the family, particularly since the alignment used in its construction is
parsimonious.
50
Amino acid position (cAPK-a)
-cGPK
SCIENCE, VOL. 24I
This content downloaded from 132.239.1.231 on Sun, 21 Apr 2013 19:08:54 PM
All use subject to JSTOR Terms and Conditions
have similar modes of regulation remains to be determined. KIN1
and KIN2 were identified through screening a Saccharomyces
cerevisi-
ae DNA library with probes designed to recognize sequences
characteristic
of protein-tyrosine kinases and, as such, have been
suggested to represent "structural
mosaics" with some features of
catalytic domain structure more indicative of the protein-tyrosine
kinases than the protein-serine/threonine kinases (5). The deduced
phylogeny of KIN1 and KIN2, however, does not suggest a close
evolutionary relationship with protein-tyrosine kinases. In fact, the
probe target used to identify KIN1 and KIN2 encodes the stretch of
amino acids corresponding to cAPK-a Asp220-Gly225
in conserved
subdomain IX, a region of high conservation in all of the catalytic
domains regardless of substrate specificity.
The subfamily related to CDC28 includes functional homologs
from three widely divergent species: CDC28 from the budding
yeast S. cerevisiae,
cdc2+ from the fission yeast Schizosaccharomyces
pombe, and human CDC2Hs. Functional homology was demon-
strated by heterologous complementation of conditional mutants
defective in cell cycle progression (32, 33). The other two sequences
mapping within this cluster are putative protein kinases identified in
Saccharomyces
cerevisiae
(KIN28) and human HeLa cells (PSK-J3).
The members of this cluster are also distinguished by the small sizes
of the catalytic domain-containing polypeptides, suggesting their
activities may be regulated through association with other polypep-
tides in a holoenzyme complex. Indeed, support for this notion has
been obtained for cdc2+ (34).
Perspectives
The tremendous diversity of the protein kinase family is just now
beginning to be appreciated.
Most of the catalytic
domain sequences
referenced
in Tables 1 and 2 were reported within the past 2 years.
With continued characterizations of regulatory mutants in inverte-
brates, along with the recent development of new hybridization
approaches
for the identification of DNA clones that encode novel
protein kinase catalytic
domains, it is likely that the rate of discovery
will continue to accelerate through the next several years. The
difficult tasks will be to confirm protein kinase activities for the
newly identified family members and to elucidate their functional
roles. Clues to function may come through an analysis of catalytic
domain primary
structure and subsequent phylogenetic mapping. A
catalytic domain that has only limited divergence from another,
better characterized,
member of the family can be expected to play a
similar role in cellular physiology. Further clues are likely to come
from an inspection of amino acid sequences lying outside the
catalytic
domain where residues involved in enzyme regulation may
be found.
REFERENCES AND NOTES
1. T. Hunter
and
J. A. Cooper,
in The
Enzymes,
P. D. Boyer
and E. G. Krebs,
Eds.
(Academic Press,
Orlando,
FL, 1986), vol. 17, pp. 191-246.
2. A. M. Edelman,
D. K. Blumenthal,
E. G. Krebs,
Annu. Rev.
Biochem.
56, 567
(1987).
3. T. Hunter,
Cell
50, 823 (1987).
4. S. K. Hanks,
Proc.
Natl.
Acad.
Sci.
U.SA. 84, 388 (1987).
5. D. E. Levin,
C. I. Hammond,
R. 0. Ralston,
J. M. Bishop,
ibid.,
p. 6035.
6. W. C. Barker
and
M. 0. Dayhoff,
ibid.
79, 2836 (1982).
7. A. D. Levinson,
S. A. Courtneidge,
J. M. Bishop,
ibid.
78, 1624 (1981); J. S.
Brugge
and
D. Darrow,J.
Biol.
Chem.
259, 4550 (1984); J. Y. J. Wang
and
D.
Baltimore,
ibid.
260, 64 (1985); R. J. Bold
and D. J. Donoghue,
Mol.
Cell.
Biol.
5, 3131 (1985); I. Sadowski and T. Pawson,
Oncogene
1, 181 (1987).
8. V. W. Wilkerson,
D. L. Bryant,
J.
T. Parsons,J.
Virol.
55, 314 (1985); P. Yaciuk
and D. Shalloway,
Mol. Cell.
Biol.
6, 2807 (1986).
9. We have
attempted
to include all
catalytic
domain
sequence
reports except
those
encoded
by retroviruses.
Reports
of partial
sequences
are included
only if a
complete
catalytic
domain
sequence
is not available.
We have chosen not to
include
retroviral
oncogene
product
sequences
since the catalytic
domains
from
I JULY 1988
this
group
are
effectively
represented
by their
closely
related
cellular
counterparts.
A listing
of references
for retroviral
protein
kinase
sequences
can be found in
MolecularBiology
of
Tumor
Viruses:
RNA
Tumor
Viruses
[R. A. Weiss,
N. Teich,
H.
Varmus,
J. Coffin,
Eds. (Cold
Spring
Harbor
Laboratory,
Cold Spring
Harbor,
NY, ed. 2, 1985)]. Since
the time
these
tables
were
compiled,
complete
sequences
for six additional
members
of the protein
kinase
family
have
been
published:
(i)
EPH, a novel receptor-like
protein-tyrosine
kinase [H. Hirai, Y. Maru, K.
Hagiwara,
J. Nishida,
F. Takaku,
Science
238, 1717 (1987)]; (ii) TKL, a novel
member
of the Src subfamily
[K. Strebhardt,
J. I. Mullins, C. Bruck,
H.
Rubsamen-Waigmann,
Proc.
Natl Acad.
Sci.
U.SA. 84, 8778 (1987)]; (iii)
ninaC
protein,
a
Drosophila
gene product
essential
for
normal
photoreceptor
cell
function
[C. Montell
and G. M. Rubin, Cell 52, 757 (1988)]; (iv) nimA,
a cell cycle
control
protein
kinase
from
Aspergillus
[S. A. Osmani,
R. T. Pu, N. R. Morris,
Cell
53, 237 (1988)]. Further,
in
Drosophila,
a full-length
sequence
of cAMP-
and
a partial
sequence
of cGMP-dependent
protein
kinase
catalytic
domain
have
been
reported
[J. L. Foster,
G. C. Higgins,
F. R. Jackson,
J. Biol.
Chem.
263, 1676
(1988)] as have
catalytic
domain
sequences
from
a number
of vertebrate
protein
kinases
previously
reported
from
other
vertebrate
species
(not cited);
(v) byrl+, a
suppressor
of sporulation
defects in Schizosaccharomycespombe
[S. A. Nadin-Davis
and A. Nasim,
EMBO
J. 7, 985 (1988)]; and (vi) GCN2, a protein
kinase
essential
for
translational
derepression
of GCN4
mRNA
in Saccharomyces
cerevisiae
[I. Roussou,
G. Thireos,
B. M. Hauge,
Mol. Cell.
Biol.
8, 2132 (1988)].
10. C. Chothia
and
A. M. Lesk,
EMBOJ. 5, 823 (1986).
11. R. K. Wierenga
and W. G. J. Hol, Nature
302, 842 (1983).
12. M. J. E. Sternberg
and
W. R. Taylor,
FEBS
Lett. 175, 387 (1984).
13. M. P. Kamps
and
B. M. Sefton,Mol.
Cell.
Biol.
6, 751 (1986).
14. M. J. Zoller,
N. C. Nelson,
S. S. Taylor,J.
Biol.
Chem.
256, 10837 (1981).
15. M. P. Kamps,
S. S. Taylor,
B. M. Sefton,
Nature
310, 589 (1984).
16. M. W. Russo, T. J. Lukas,
S. Cohen, J. V. Staros,
J. Biol.
Chem.
260, 5205
(1985).
17. M. A. Snyder,
J. M. Bishop,
J. P. McGrath,
A. D. Levinson,
Mol. Cell.
Biol.
5,
1772 (1985).
18. M. Hannink
and
D. J. Donoghue,
Proc.
Natl.
Acad.
Sci.
U.SA. 82, 7894 (1985).
19. G. Weinmaster,
M. J. Zoller,
T. Pawson,
EMBOJ. 5, 69 (1986).
20. W. S. Chen et al.,
Nature
328, 820 (1987); A. M. Honegger
et al., Cell
51, 199
(1987).
21. C. K. Chou et al.,J. Biol.
Chem.
262, 1842 (1987).
22. L. T. Williams,
personal
communication.
23. S. Brenner,
Nature
329, 21 (1987).
24. D. Bryant
and
J. T. Parsons,
J. Virol.
45, 1211 (1983); Mol. Cell.
Biol.
4, 862
(1984).
25. H. N. Bramson
et al.,J. Biol.
Chem.
257, 10575 (1982).
26. S. Shoji et al., Proc.
Natl. Acad. Sci. U.SA. 78, 848 (1981); S. Shoji, L. H.
Ericsson,
K. A. Walsh,
E. H. Fischer,
K. Titani,
Biochemistry
22, 3702 (1983).
27. G. Weinmaster,
M. J. Zoller,
M. Smith, E. Hinze, T. Pawson,
Cell 37, 559
(1984); G. Weinmaster
and
T. Pawson,J.
Biol.
Chem.
261, 328 (1986); T. E.
Kmiecik
and D. Shalloway,
Cell 49, 65 (1987); H. Piwnica-Worms,
K. B.
Saunders,
T. M. Roberts,
A. E. Smith,
S. H. Cheng,
ibid.,
p. 75; R. Herrera
and
0. M. Rosen,
J. Biol.
Chem.
261, 11980 (1986); L. Ellis et al., Cell
45, 721
(1986).
28. R. A. Lindberg,
unpublished
data.
29. A. Lorincz
and
S. I. Reed,
Mol. Cell.
Biol.
6, 4099 (1986).
30. J. M. Sowadski,
N. h. Xuong,
D. Anderson,
S. S. Taylor,
J. Mol.
Biol.
182, 617
(1984).
31. R. F. Doolittle,
in The
Proteins,
H. Neurath and
R. L. Hill,
Eds.
(Academic Press,
New York,
ed. 3, 1979), vol. 4, pp. 1-118.
32. D. Beach,
B. Durkacz,
P. Nurse,
Nature
300, 706 (1982); R. Booher
and D.
Beach,
Mol. Cell.
Biol.
6, 3523 (1986).
33. M. G. Lee and
P. Nurse,
Nature
327, 31 (1987).
34. L. Brizuela,
G. Draetta,
D. Beach,
EMBOJ.
6, 3507 (1987).
35. M. D. Uhler
et al., Proc.
Natl.
Acad.
Sci.
U.SA. 83, 1300 (1986).
36. M. O. Showers
and
R. A. Maurer,J.
Biol.
Chem.
261, 16288 (1986).
37. M. D. Uhler,
J. C. Chrivia,
G. S. McKnight,
ibid.,
p. 15360.
38. J. F. Cannon
and
K. Tatchell,
Mol. Cell.
Biol.
7, 2653 (1987).
39. T. Toda,
S. Cameron,
P. Sass,
M. Zoller,
M. Wigler,
Cell
50, 277 (1987).
40. J. Lisziewicz,
A. Godany,
H.-H. Forster,
H. Kuntzel,J.
Biol.
Chem.
262, 2549
(1987).
41. K. Takio
et al., Biochemistry
23, 4207 (1984).
42. P. J. Parker et al., Science
233, 853 (1986).
43. S. Ohno
et al., Nature
325, 161 (1987).
44. L. Coussens
et al., Science
233, 859 (1986).
45. J. L. Knopf
et al., Cell
46, 491 (1986).
46. G. M. Housey,
C. A. O'Brian,
M. D. Johnson,
P. Kirschmeier,
I. B. Weinstein,
Proc.
Natl.
Acad.
Sci.
U.SA. 84, 1065 (1987).
47. A. Rosenthal
et al., EMBOJ.
6, 433 (1987).
48. R. M. Hanley
et al., Science
237, 293 (1987); C. R. Lin et al., Proc.
Natl.
Acad.
Sci.
U.SA. 84, 5962 (1987).
49. M.
K.
Bennett
and
M. B. Kennedy,
Proc.
Natl.Acad. Sci.
U.SA. 84, 1794 (1987).
50. E. M. Reimann
et al., Biochemistry
23, 4185 (1984); E. F. da Cruz e Silva
and
P.
T. W. Cohen,
FEBS
Lett.
220, 36 (1987).
51. J. S. Chamberlain,
P. VanTuinen,
A. A. Reeves,
B. A. Philip,
C. T. Caskey,
Proc.
Natl. Acad. Sci. U.SA. 84, 2886 (1987).
52. K. Takio
et al., Biochemistry
24, 6028 (i985).
53. V. Guerriero, Jr.,
M. A. Russo,
N. J. Olson,
J. A. Putkey,
A. R. Means,
ibid.
25,
8372 (1986).
54. The entire
catalytic
domain
sequence
has been determined
by J. R. Woodgett
(unpublished data).
55. J. L. Celenza
and
M. Carlson,
Science
233, 1175 (1986).
ARTICLES SI
This content downloaded from 132.239.1.231 on Sun, 21 Apr 2013 19:08:54 PM
All use subject to JSTOR Terms and Conditions
56. P. Russell and P. Nurse, Cell 49, 569 (1987).
57. A. T. L6rincz and S. I. Reed, Nature 307, 183 (1984).
58. J. Hindley and G. A. Phear, Gene 31, 129 (1984).
59. The entire catalytic domain sequence has been determined by S. K. Hanks
(unpublished data).
60. M. Simon, B. Seraphin, G. Faye, EMBO J. 5, 2697 (1986).
61. K. Takio et al., Proc.
Natl. Acad. Sci. U.S.A. 84, 4851 (1987).
62. A. Saxena, R. Padmanabha, C. V. C. Glover, Mol. Cell. Biol. 7, 3409 (1987).
63. T. I. Bonner et al., NucleicAcids Res. 14, 1009 (1986).
64. T. W. Beck, M. Huleihel, M. Gunnell, T. I. Bonner, U. R. Rapp, ibid. 15, 595
(1987).
65. M. Huleihel et al., Mol. Cell. Biol. 6, 2655 (1986).
66. G. E. Mark, T. W. Seeley, T. B. Shows, J. D. Mountz, Proc. Natl. Acad. Sci.
U.S.A. 83, 6312 (1986).
67. R. Watson, M. Oskarsson, G. F. Vande Woude, ibid. 79, 4078 (1982).
68. C. Van Beveren et al., Nature 289, 258 (1981).
69. F. A. Van der Hoorn and J. Firzlaff,
NucdeicAcids
Res. 12, 2147 (1984).
70. M. A. Teague et al., Proc. Natl. Acad. Sci. U.SA. 83, 7371 (1986).
71. G. Boguslawski and J. 0. Polazzi, ibid. 84, 5848 (1987).
72. M. Patterson, R. A. Sclafani, W. L. Fangman, J. Rosamond, Mol. Cell. Biol. 6,
1590 (1986).
73. P. Russell and P. Nurse, Cell 49, 559 (1987).
74. M. McLeod and D. Beach, EMABOJ.
5, 3665 (1986).
75. G. Selten et al., Cell 46, 603 (1986).
76. D. J. McGeoch and A. J. Davison, Nucleic
Acids Res. 14, 1765 (1986).
77. S. K. Anderson, C. P. Gibbs, A. Tanaka, H.-J. Kung, D. J. Fujita,
Mol. Cell. Biol.
5, 1122 (1985); A. Tanaka et al., ibid. 7, 1978 (1987).
78. R. Martinez, B. Mathey-Prevot, A. Bemards, D. Baltimore, Science
237, 411
(1987).
79. T. Takeya and H. Hanafusa, Cell 32, 881 (1983).
80. R. E. Steele, NucleicAcidsRes. 13, 1747 (1985).
81. J. Sukegawa et al., Mol. Cell. Biol. 7, 41 (1987).
82. M. Nishizawa et al., ibid. 6,511 (1986); R. C. Parker,
G. Mardon, R. V. Lebo,
H. E. Varmus, J. M. Bishop, ibid. 5, 831 (1985).
83. K. Inoue et al., Oncogene
1, 301 (1987).
84. K. Semba et al., Proc. Natl. Acad. Sci. U.SA. 83, 5459 (1986); T. Kawakami, C.
Y. Pennington, K. C. Robbins, Mol. Cell. Biol. 6, 4195 (1986).
85. Y. Yamanashi et al., Mol. Cell. Biol. 7, 237 (1987).
86. J. M. Trevillyan
et al., Biochim.
Biophys.
Acta 888, 286 (1986); Y. Koga et al., Eur.
J. Immunol. 16, 1643 (1986).
87. J. D. Marth, R. Peet, E. G. Krebs, R. M. Perlmutter, Cell 43, 393 (1985); A. F.
Voronova and B. M. Sefton, Nature 319, 682 (1986).
88. N. Quintrell et al., Mol. Cell. Biol. 7, 2267 (1986); S. F. Ziegler, J. D. Marth, D.
B. Lewis, R. M. Perlmutter, ibid., p. 2276.
89. M. A. Simon, B. Drees, T. Komberg, J. M. Bishop, Cell 42, 831 (1985).
90. F. M. Hoffman,
L. D. Fresco,
H. Hoffman-Falk,
B.-Z. Shilo, ibid.
35, 393
(1983).
91. R. J. Gregory,
K. L. Kammermeyer,
W. S. Vincent
III, S. G. Wadsworth,
Mol.
Cell.
Biol.
7, 2119 (1987).
92. E. Shtivelman,
B. Lifshitz,
R. P. Gale, B. A. Roe, E. Canaani,
Cell
47, 277
(1986).
93. G. D. Kruh
et al., Science
234, 1545 (1986).
94. J. M. Goddard,
J. J. Weiland,
M. R. Capecchi,
Proc.
Natl. Acad. Sci.
U.SA. 83,
2172 (1986).
95. A. J. M. Roebroek
et al., EMBO
J. 4, 2897 (1985).
96. A. J.
M. Roebroek,
J.
A. Schalken,
C. Onnekink,
H. P. J. Bloemers,
W. J. M. Van
de Ven,J. Virol.
61, 2009 (1987).
97. C.-C.
Huang,
C. Hammond,
J. M. Bishop,J.
Mol.
Biol.
181, 175 (1985).
98. A. Ullrich
et al., Nature
309, 418 (1984).
99. L. Coussens
et al., Science
230, 1132 (1985); T. Yamamoto
et al., Nature
319,
230 (1986).
100. C. I. Bargmann,
M.-C.
Hung, R. A. Weinberg,
Nature
319, 226 (1986).
101. E. Livneh,
L. Glazer,
D. Segal,
J. Schlessinger,
B.-Z. Shilo,
Cell
40, 599 (1985).
102. A. Ullrich
et al.,
Nature
313, 756 (1985); Y. Ebina
et al., Cell
40, 747 (1985).
103. A. Ullrich
et al., EMBOJ.
5, 2503 (1986).
104. Y. Nishida,
M. Hata,
Y. Nishizuka,
W. J. Rutter,
Y. Ebina,
Biochem.
Biophys.
Res.
Commun.
141, 474 (1986).
105. H. Matsushime,
L.-H. Wang,
M. Shibuya,
Mol. Cell.
Biol.
6, 3000 (1986); C.
Birchmeier,
D. Birbaum, G. Waitches,
O. Fasano,
M. Wigler,
ibid.,
p. 3109.
106. W. S. Neckameyer,
M. Shibuya,
M.-T. Hsu, L.-H. Wang,
ibid.,
p. 1478.
107. S. B. Podell and B. M. Sefton,
Oncogene
2, 9 (1987).
108. E. Hafen,
K. Basler,
J.-E.
Edstroem,
G. M. Rubin,
Science
236, 55 (1987).
109. D. Martin-Zanca,
S. H. Hughes,
M. Barbacid,
Nature
319, 743 (1986).
110. M. Park
etal., Proc.
Natl.Acad.
Sci. U.SA. 84, 6379 (1987);
A. M.-L. Chan
etal.,
Oncogene
1, 229 (1987).
111. Y. Yarden
et al., Nature
323, 226 (1986).
112. L. Coussens
et al., ibid.
320, 277 (1986).
113. Y. Yarden
et al., EMBOJ.
6, 3341 (1987).
114. M. Takahashi
and G. M. Cooper,
Mol. Cell.
Biol.
7, 1378 (1987).
115. D. A. Foster,
J. B. Levy,
G. Q. Daley,
M. C. Simon,
H. Hanafusa,
ibid.
6, 325
(1986).
116. D.-F. Feng,
M. S. Johnson,
R. F. Doolittle,J.
Mol. Evol.
21, 112 (1985).
117. W. M. Fitch
and E. Margoliash,
Science
155, 279 (1967).
118. D.-F. Feng
and
R. F. Doolittle,J.
Mol.
Evol.
25, 351 (1987).
119. We thank D.-F. Feng and R. F. Doolittle
for much
helpful
advice
concerning
phylogenetic
tree construction,
R. W. Holley for support,
J. R. Woodgett
for
making
unpublished
sequence
data
available,
R. A. Lindberg
and L. T. Williams
for allowing
us to reference their
unpublished
work,
K. Hyde for assistance
in
sequence
data
entry,
and L. Norris for
help
in preparation
of Fig. 1. Supported
by
grant
GM38793
from the NIH (SKH).
SCIENCE, VOL. 241
52
This content downloaded from 132.239.1.231 on Sun, 21 Apr 2013 19:08:54 PM
All use subject to JSTOR Terms and Conditions
... Eukaryotic protein kinases belong to a large gene superfamily. They contain a protein kinase catalytic domain that phosphorylates the serine, threonine, or tyrosine residues of target proteins [11]. In plants, PKs are classified into seven large families, including AGC (PKA-PKG-PKC), CAMK (calcium-and calmodulin-regulated kinase), CMGC (cyclin-dependent kinases, mitogen-activated protein kinases, glycogen synthase kinases, and cyclin-dependent-like kinases), RLK (receptor-like kinase), STE (serine-threonine kinase), and TKL [12]. ...
Article
Full-text available
Background Flowering time has an important effect on regional adaptation and yields for crops. The tyrosine kinase-like (TKL) gene family is widely existed and participates in many biological processes in plants. Furthermore, only few TKLs have been characterized functions in controlling flowering time in wheat. Results Here, we report that TaCTR1, a tyrosine kinase-like (TKL) gene, regulates flowering time in wheat. Based on identification and evolutionary analysis of TKL_CTR1-DRK-2 subfamily in 15 plants, we proposed an evolutionary model for TaCTR1, suggesting that occurrence of some exon fusion events during evolution. The overexpression of TaCTR1 caused early flowering time in transgenic lines. Transcriptomics analysis enabled identification of mass differential expression genes including plant hormone (ET, ABA, IAA, BR) signaling, flavonoid biosynthesis, phenolamides and antioxidant, and flowering-related genes in TaCTR1 overexpression transgenic lines compared with WT plants. qRT–PCR results showed that the expression levels of ethylene (ET) signal-related genes (ETR, EIN, ERF) and flowering-related genes (FT, PPD1, CO, PRR, PHY) were altered in TaCTR1-overexpressing wheat compared with WT plants. Metabonomics analysis showed that flavonoid contents were altered. Conclusions Thus, the results show that TaCTR1 plays a positive role in controlling flowering time by activating various signaling pathways and regulating flowering-related genes, and will provide new insights on the mechanisms of wheat flowering regulation.
... The emergence of TSSK1a and TSSK2a as novel TSSK copies in distinct regulatory contexts may elucidate the substitution of TSSK3, TSSK5, and TSSK6 within this group 47 preferences. The C-terminal domain plays a crucial role in determining the accurate spatial con guration of the serine/threonine protein kinase activity 49 , facilitating the transfer of γ-phosphate residues from ATP to the hydroxyl group of serine, threonine, or tyrosine residues on the target protein 50 . Through bioinformatics analysis, it was revealed that TSSK1, TSSK1a, TSSK2, TSSK2a, and TSSK4 genes exhibit homology, with a highly conserved S-TKc domain. ...
Preprint
Full-text available
Genetic control system at molecular level presents a promising and eco-friendly strategy for the management of pest and insect-transmitted diseases. Although considerable advancements have been achieved in gene drive applications targeting mosquitoes, endeavors to combat agricultural pests have been somewhat restricted. Here, we identi ed that the testis-speci c serine/threonine kinases (TSSKs) family is uniquely expressed in the testes of Cydia pomonella, a prominent global invasive species. We further generated male moths with disrupted the expression of TSSKs and those with TSSKs completely knocked out using RNA interference and CRISPR/Cas 9 genetic editing techniques, resulting in signi cant disruptions in spermiogenesis, decreased sperm motility, and hindered development of fertilized oocytes. Further explorations into the underlying post-transcriptional regulatory mechanisms have revealed the involvement of lnc117962 as a competing endogenous RNA (ceRNA) for miR-3960, thereby regulating TSSKs. Notably, orchard trials have demonstrated that the release of male strains can effectively suppress population growth. Our ndings indicate that targeting TSSKs could serve as a feasible avenue for managing C. pomonella populations, offering signi cant insights and potential strategies for controlling invasive pests through genetic sterile insect technique (gSIT) technology.
... kinases (BYKs), and atypical serine kinases (ASKs) 16 . eSTKs have the same characteristic motifs described for eukaryotic serine/threonine kinases and have been found to control metabolism in a wide range of bacterial species, but not in E. coli [17][18][19] . Arginine kinases have been implicated in control of bacterial protein degradation 20 . ...
Article
Full-text available
Two-component systems, consisting of a histidine kinase and a response regulator, serve signal transduction in bacteria, often regulating transcription in response to environmental stimuli. Here, we identify a tandem serine histidine kinase function for KdpD, previously described as a histidine kinase of the KdpDE two-component system, which controls production of the potassium pump KdpFABC. We show that KdpD additionally mediates an inhibitory serine phosphorylation of KdpFABC at high potassium levels, using not its C-terminal histidine kinase domain but an N-terminal atypical serine kinase domain. Sequence analysis of KdpDs from different species highlights that some KdpDs are much shorter than others. We show that, while Escherichia coli KdpD’s atypical serine kinase domain responds directly to potassium levels, a shorter version from Deinococcus geothermalis is controlled by second messenger cyclic di-AMP. Our findings add to the growing functional diversity of sensor kinases while simultaneously expanding the framework for regulatory mechanisms in bacterial potassium homeostasis.
... STK11 plays a crucial role in regulating cell damage repair, energy metabolism, and tumor immune responses [16]. The LKB1 protein comprises three major domains: The N-terminal non-catalytic domain (encoded by amino acids 1-49), the catalytic kinase domain (encoded by amino acids 49-309), and the C-terminal non-catalytic regulatory domain (encoded by amino acids 309-433) [1,17]. Variations in patients with PJS are predominantly located in the catalytic domain region, leading to kinase activity dysfunction and the disruption of STK11 function [18]. ...
Article
Full-text available
BACKGROUND Peutz-Jeghers syndrome (PJS) is a rare hereditary neoplastic disorder mainly associated with serine/threonine kinase 11 (STK11 /LKB1 ) gene mutations. Preimplantation genetic testing can protect a patient’s offspring from mutated genes; however, some variations in this gene have been interpreted as variants of uncertain significance (VUS), which complicate reproductive decision-making in genetic counseling. AIM To identify the pathogenicity of two missense variants and provide clinical guidance. METHODS Whole exome gene sequencing and Sanger sequencing were performed on the peripheral blood of patients with PJS treated at the Reproductive and Genetic Hospital of Citic-Xiangya. Software was employed to predict the protein structure, conservation, and pathogenicity of the two missense variation sites in patients with PJS. Additionally, plasmids were constructed and transfected into HeLa cells to observe cell growth. The differences in signal pathway expression between the variant group and the wild-type group were compared using western blot and immunohistochemistry. Statistical analysis was performed using one-way analysis of variance. P < 0.05 was considered statistically significant. RESULTS We identified two missense STK11 gene VUS [c.889A>G (p.Arg297Gly) and c.733C>T (p.Leu245Phe)] in 9 unrelated PJS families who were seeking reproductive assistance. The two missense VUS were located in the catalytic domain of serine/threonine kinase, which is a key structure of the liver kinase B1 (LKB1) protein. In vitro experiments showed that the phosphorylation levels of adenosine monophosphate-activated protein kinase (AMPK) at Thr172 and LKB1 at Ser428 were significantly higher in transfected variation-type cells than in wild-type cells. In addition, the two missense STK11 variants promoted the proliferation of HeLa cells. Subsequent immunohistochemical analysis showed that phosphorylated-AMPK (Thr172) expression was significantly lower in gastric, colonic, and uterine polyps from PJS patients with missense variations than in non-PJS patients. Our findings indicate that these two missense STK11 variants are likely pathogenic and inactivate the STK11 gene, causing it to lose its function of regulating downstream phosphorylated-AMPK (Thr172), which may lead to the development of PJS. The identification of the pathogenic mutations in these two clinically characterized PJS patients has been helpful in guiding them toward the most appropriate mode of pregnancy assistance. CONCLUSION These two missense variants can be interpreted as likely pathogenic variants that mediated the onset of PJS in the two patients. These findings not only offer insights for clinical decision-making, but also serve as a foundation for further research and reanalysis of missense VUS in rare diseases.
... The highest number of genes (17) were found to be part of S-TKc domain predicted by SMART. S-TKc is a conserved protein kinase domain which is involved in phosphorylation (Hanks et al., 1988) this is the major molecular function for DEGs has been predicted in current study. The armadillo repeats, ankyrin repeats and leucine-rich repeats together form an extended N-terminal flexible 'solenoid'-like structure composed of tandem repeat modules likely to be important in anchoring to the membrane and cytoskeletal structures as well as binding to other protein ligands (Mills et al., 2012). ...
Article
Full-text available
Parkinson's disease (PD) is a progressive neurodegenerative disorder that affects millions of individuals worldwide, with an increasing prevalence in aging populations. In this study, we retrieved the mRNA expression dataset, GSE165082, for PD through GEOmnibus. Total of 220 downregulated and 354 upregulated genes were identified after data normalization. Functional annotation carried out by DAVID tools, revealed that these DEGs were mainly enriched in biological processes i.e., cell division and protein phosphorylation, and they were localized mostly cytoplasm and nucleus. Two molecular function protein binding and ATP binding were predominant. Additionally KEGG pathway analysis highlighted their involvement in neurodegenerative, cancer, alzheimer's and coronavirus diseases. Armadillo-type fold and Armadillo-like helical domains were found by INTERPRO while TKc domain by SMART. Transcription factors IRF1 was predicted by FunRich tool. Upregulated genes were found expressed in 6 sites i.e., Palate, Ventral striatum, Pluripotent stem cells, Ganglia, Curtilage and Ciliary muscle. A protein-protein interaction network was constructed by using Cytoscape v 6.0. Ten hub genes EFI3A, RPL28, SMG8. UPF2, XAF1, IFITM1, IFIT3, LY63, IFI3 and LY6B were identified by Cytohubba. The expression patterns of hub genes across different organs and immune response cells using a heatmap and expression of EIF3 was found in almost all organs except liver. MicroRNA for were predicted by FunRich tool. Finally, we predicted microRNAs for RPL28, SMG8, UPF2 and EIF3a that could potentially regulate these hub genes, providing insights into post-transcriptional gene regulation. This comprehensive analysis contributes to our understanding of the molecular mechanisms underlying Parkinson's disease and provides a foundation for future research and therapeutic development in this complex and challenging condition
... Reversible AL-phosphorylation is another mechanism of regulation in ePKs and has already been investigated in the context of AL-cysteine modification 24,50 . It was demonstrated that oxidation of C199 in the AL of PKA inactivates the kinase activity 51 while replacement of C199 with alanine does not inhibit activity but does facilitate dephosphorylation of pT197, which in turn is crucial for kinase activity 23,51,52 . ...
Article
Full-text available
Mutations of the human leucine-rich repeat kinase 2 (LRRK2) have been associated with both, idiopathic and familial Parkinson’s disease (PD). Most of these pathogenic mutations are located in the kinase domain (KD) or GTPase domain of LRRK2. In this study we describe a mechanism in which protein kinase activity can be modulated by reversible oxidation or reduction, involving a unique pair of adjacent cysteines, the “CC” motif. Among all human protein kinases, only LRRK2 contains this “CC” motif (C2024 and C2025) in the Activation Segment (AS) of the kinase domain. In an approach combining site-directed mutagenesis, biochemical analyses, cell-based assays, and Gaussian accelerated Molecular Dynamics (GaMD) simulations we could attribute a role for each of those cysteines. We employed reducing and oxidizing agents with potential clinical relevance to investigate effects on kinase activity and microtubule docking. We find that each cysteine gives a distinct contribution: the first cysteine, C2024, is essential for LRRK2 protein kinase activity, while the adjacent cysteine, C2025, contributes significantly to redox sensitivity. Implementing thiolates (R-S ⁻ ) in GaMD simulations allowed us to analyse how each of the cysteines in the “CC” motif interacts with its surrounding residues depending on its oxidation state. From our studies we conclude that oxidizing agents can downregulate kinase activity of hyperactive LRRK2 PD mutations and may provide promising tools for therapeutic strategies.
Chapter
Recently, the successful pharmacological targeting of protein and phosphoinositide kinases has been applied to a wide range of diseases, most notably cancer, immune deficiencies, viral infection, and neurodegenerative disease leading to the successful clinical development of effective and selective inhibitors of phosphoinositide kinases. However, targeting of phosphoinositide kinases is most effective for plasmodium to treat malaria. The phosphatidylinositol 4-kinase (a key member from family of phosphoinositide) type III beta (PI4KIII) is now in phase-2 clinical research as a possible therapeutic target for treatment of malaria. Golgi and trans-Golgi network (TGN) membranes control the cellular regulation and trafficking throughout the Golgi apparatus and are well defined by PI4KII. In this perspective, here in this chapter we discuss the promise of kinases as therapeutic targets for antimalarial drugs, efficiencies, and challenges. Many parasites depend on kinases for various physiological activities, and these enzymes are easily targeted with drugs provided. By virtue of strong conserved nature of all binding sites from ATP molecules, it is difficult to find Plasmodium kinase inhibitors (K+ inhibitor) that are selective against mammalian (human) ortholog(s) and other human kinase enzymes. Despite the numerous significant differences between the Plasmodium and human kinomes, these variations are leveraged to our benefit in drug targeting and the development of resistance to multiple drugs. There is also the possibility of using other medicine and pharmacological products that inhibit multiple plasmodium kinases which come under the subject of polypharmacology. It is crucial to evaluate prospective about kinase targets before beginning the drug discovery to ensure that the desired kinase inhibition would kill the parasites in important phase of their life cycle at faster rate within short period of time. In this publication, we focus on the critical drug target (PI4KIII) for the development of novel and innovative drugs to treat malaria, cGMP-dependent protein kinase, cyclin-dependent-like kinase and summarize the overall progress about targeting of proteins.
Article
Plasmodesmata (PDs) are intercellular organelles carrying multiple membranous nanochannels that allow the trafficking of cellular signalling molecules. The channel regulation of PDs occurs dynamically and is required in various developmental and physiological processes. It is well known that callose is a critical component in regulating PD permeability or symplasmic connectivity, but the understanding of the signalling pathways and mechanisms of its regulation is limited. Here, we used the reverse genetic approach to investigate the role of C‐type lectin receptor‐like kinase 1 (CLRLK1) in the aspect of PD callose‐modulated symplasmic continuity. Here, we found that loss‐of‐function mutations in CLRLK1 resulted in excessive PD callose deposits and reduced symplasmic continuity, resulting in an accelerated gravitropic response. The protein interactome study also found that CLRLK1 interacted with actin depolymerizing factor 3 (ADF3) in vitro and in plants. Moreover, mutations in ADF3 result in elevated PD callose deposits and faster gravitropic response. Our results indicate that CLRLK1 and ADF3 negatively regulate PD callose accumulation, contributing to fine‐tuning symplasmic opening apertures. Overall, our studies identified two key components involved in the deposits of PD callose and provided new insights into how symplasmic connectivity is maintained by the control of PD callose homoeostasis.
Article
Full-text available
p-Fluorosulfonylbenzoyl 5'-adenosine (FSO2BzAdo) was shown previously to be an irreversible inhibitor of the catalytic subunit of cAMP-dependent protein kinase II from porcine skeletal muscle (Zoller, M. J., and Taylor, S. S. (1979) J. Biol. Chem. 254, 8363-8368). The catalytic subunit of porcine heart cAMP-dependent protein kinase was also inhibited following incubation with FSO2[14C]BzAdo, and inhibition was shown to result from the stoichiometric, covalent modification of a single lysine residue. The amino acid sequence in an extended region around the carboxybenzenesulfonyl lysine (CBS-lysine) was elucidated by characterizing both tryptic and cyanogen bromide peptides containing the 14C-modified residue. The sequence in this region was Leu-Val-Lys-His-Lys-Glu-Thr-Gly-Asn-His-Phe-Ala-Met-Lys(CBS)-Ile-Leu-Asp-Lys-Glu-Lys-Val-Val-Lys-Leu-Lys-Gln-Ile. The covalently modified residue corresponded to lysine 71 in the overall polypeptide chain. Homologies to bovine heart catalytic subunit and to a site modified by FSO2BzAdo in phosphofructokinase are considered.
Article
Full-text available
We have isolated cDNAs representing a previously unrecognized human gene that apparently encodes a protein-tyrosine kinase. We have designated the gene as HCK (hemopoietic cell kinase) because its expression is prominent in the lymphoid and myeloid lineages of hemopoiesis. Expression in granulocytic and monocytic leukemia cells increases after the cells have been induced to differentiate. The 57-kilodalton protein encoded by HCK resembles the product of the proto-oncogene c-src and is therefore likely to be a peripheral membrane protein. HCK is located on human chromosome 20 at bands q11-12, a region that is affected by interstitial deletions in some acute myeloid leukemias and myeloproliferative disorders. Our findings add to the diversity of protein-tyrosine kinases that may serve specialized functions in hemopoietic cells, and they raise the possibility that damage to HCK may contribute to the pathogenesis of some human leukemias.
Article
Full-text available
A 1.6-kilobase cDNA (A-raf) has been isolated from a murine spleen cDNA library which encodes part of a protein related to the raf oncogene. Its amino acid sequence has 85% homology to raf in a central portion of 100 amino acids. In contrast to raf, A-raf shows a highly restricted tissue distribution of expression, with highest levels observed in epididymis, followed by intestine. When incorporated into a retrovirus, the resulting gag-A-raf fusion gene causes transformation in vitro and induces tumors in newborn mice. Thus, A-raf represents a new proto-oncogene. Transformation by A-raf is independent of ras gene function, as is the case for raf and mos but not other oncogenes.
Article
This chapter describes viral protein-tyrosine kinases and discusses their roles in viral transformation. For this, the properties of the individual viral protein-tyrosine kinases are described in the chapter and are contrasted to the cognate cellular enzymes that are encoded by the cellular genes homologous to the viral oncogenes. The chapter discusses a number of transformed cell types where altered tyrosine phosphorylation is evident and describes the general properties of protein-tyrosine kinases and their common structural features. In addition, the chapter includes a progress report on the identification of substrates for the viral protein-tyrosine kinases. The chapter discusses the way in which viral protein-tyrosine kinases differ from their cellular counterparts—enzymes that clearly coexist peaceably with normal cells. In the chapter, the properties of the viral enzymes are compared and contrasted to those of the growth factor receptor protein-tyrosine kinases, particularly because of the abnormal growth state of transformed cells.
Article
The complete nucleotide sequence of the transforming gene of a mouse sarcoma virus has been determined. It codes for a protein of 374 amino acids. The nucleotide sequence of the junctions between a murine leukaemia virus and cellular sequences leading to the formation of the viral transforming gene have also been elucidated. The viral transforming sequence and its cellular homologue share an uninterrupted stretch of 1,159 nucleotides, with few base substitutions. The predicted amino acid sequence of the mouse sarcoma virus transforming gene was found to share considerable homology with the proposed amino acid sequence of the avian sarcoma virus oncogene (src) product.
Article
We have deduced the entire 1,370-amino-acid sequence of the human insulin receptor precursor from a single complementary DNA clone. The precursor starts with a 27-amino-acid signal sequence, followed by the receptor α-subunit, a precursor processing enzyme cleavage site, then the β-subunit containing a single 23-amino-acid transmembrane sequence. There are sequence homologies to human epidermal growth factor receptor and the members of the src family of oncogene products.
Article
The epidermal growth factor (EGF) receptor is a transmembrane glycoprotein of relative molecular mass 170,000 with intrinsic ligand-dependent protein tyrosine kinase activity. Binding of EGF to its receptor activates a number of immediate biochemical processes, such as alterations of intracellular free calcium, pH, and increased transcription of several responsive genes, which usually culminate many hours later in DNA replication and cell division. Abolishing the tyrosine kinase activity of three related oncogenes, v-src, v-mos, and v-fps, eliminates their capacity to transform cell. Several reports have suggested that specific aspects of EGF receptor function are independent of the intrinsic tyrosine kinase activity; however, these studies used an antibody against EGF receptor which failed to activate phosphorylation of exogenous substrates and an insertional mutation in the EGF receptor tyrosine kinase domain which had not been shown to abolish protein kinase activity in cells. Because many transmembrane receptors interact with intrinsic membrane proteins to activate second messenger systems, it is important to resolve experimentally whether mechanisms, in addition to activation of the intrinsic tyrosine kinase activity, mediate some EGF actions. From functional analyses of an EGF receptor containing a single amino-acid mutation at a site required for phosphate transfer from ATP, we conclude that the tyrosine kinase activity of the EGF receptor is essential for the diverse biochemical effects of EGF, including rapid alterations in intracellular calcium, activation of gene transcription, receptor down-regulation and the ultimate stimulatory effects on cell proliferation.
Article
We report here the isolation of a human T cell-specific cDNA clone (YT16) that encodes a protein with extensive homology to a family of transformation and proliferation-related protein-tyrosine kinases (PTK). The message size of YT16 is 2.2 kb and it encodes a protein of 507 amino acids with a mol. mass of 57,136 Daltons. The deduced protein sequence has an overall homology of about 50% to the viral oncogene products of src and yes. Around the PTK catalytic domain, the homologies increase to 70-80% between the deduced protein sequence of YT16 and those of the PTK viral oncogenes. Of special interest is the extensive homology between the deduced protein sequence of YT16 and that of a recently reported murine B and T lymphocyte-specific PTK, lskT. It is possible that YT16 either encodes the human homologue of this murine gene, or a protein with similar function(s) in T cells.
Article
We examined the structure of protein kinase C in an attempt to understand the molecular events connecting protein kinase C activation with the cellular response. Rabbit complementary DNA clones coding for three distinct types of protein kinase C, named alpha, beta and gamma, have been identified and sequenced. The deduced amino acid sequence for alpha, beta and gamma (673, 671 and 672 amino acids, respectively) are closely related. Kinases alpha and beta share an identical N-terminal sequence of 621 amino acid residues and their messenger RNAs arise from a single gene. The C-terminal halves of alpha, beta and gamma are protein kinase domains and are highly homologous to other protein kinases. The mRNAs for alpha, beta and gamma are expressed in various tissues with strikingly different tissue specificities. The one for gamma is found ubiquitously among various tissues, while those for alpha and beta predominate in the brain.