
Positive selection drives adaptive diversification of the 4-coumarate: CoA ligase (4CL) gene in angiosperms

Wiley
Ecology and Evolution
Authors:

Haiyan Sun¹˒²˒³˒⁴, Kai Guo²˒³˒⁴, Shengqiu Feng²˒³˒⁵, Weihua Zou²˒³˒⁵, Ying Li²˒³˒⁵, Chunfen Fan²˒³˒⁵ & Liangcai Peng²˒³˒⁴˒⁵
¹ School of Biology and Food Engineering, Changshu Institute of Technology, Changshu 215500, China
² National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China
³ Biomass and Bioenergy Research Centre, Huazhong Agricultural University, Wuhan 430070, China
⁴ College of Life Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
⁵ College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
Keywords
4-Coumarate: coenzyme A ligase, diversification, evolution, phylogeny, positive selection.
Correspondence
Liangcai Peng, National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China.
Tel: 86-27-87281765;
Fax: 86-27-87280016;
E-mail: lpeng@mail.hzau.edu.cn
Funding Information
The National Transgenic Project (2009ZX08009-119B), the 111 Project (B08032), the 973 Specific Pre-project (2010CB134401) and the Youth Foundation of Jiangsu Province (BK20140417).
Received: 11 May 2015; Revised: 25 June 2015; Accepted: 25 June 2015
Ecology and Evolution 2015; 5(16): 3413-3420
doi: 10.1002/ece3.1613
Abstract
Lignin and flavonoids play a vital role in the adaptation of plants to a terrestrial environment. 4-Coumarate: coenzyme A ligase (4CL) is a key enzyme of general phenylpropanoid metabolism, which provides the precursors for both lignin and flavonoid biosynthesis. However, very little is known about how such essential enzymatic functions evolve and diversify. Here, we analyze 4CL sequence variation patterns in a phylogenetic framework to identify the evolutionary forces that lead to functional divergence. The results reveal that lignin-biosynthetic 4CLs are under positive selection. The majority of the positively selected sites are located in the substrate-binding pocket and the catalytic center, indicating that nonsynonymous substitutions might contribute to the functional evolution of 4CLs for lignin biosynthesis. The evolution of 4CLs involved in flavonoid biosynthesis is constrained by purifying selection and maintains the ancestral role of the protein in response to biotic and abiotic factors. Overall, our results demonstrate that protein sequence evolution via positive selection is an important evolutionary force driving adaptive diversification in 4CL proteins in angiosperms. This diversification is associated with adaptation to a terrestrial environment.
Introduction
Lignin and flavonoids are thought to play vital roles in the adaptation of plants to terrestrial environments (Rozema et al. 2002; Weng and Chapple 2010; Agati et al. 2013). 4-Coumarate: CoA ligase (4CL; EC 6.2.1.12) is a key enzyme that functions in an early step of the general phenylpropanoid pathway. 4CL converts 4-coumaric acid and other cinnamic acids, such as caffeic acid and ferulic acid, into the corresponding CoA thiol esters, which are subsequently used for the biosynthesis of numerous secondary metabolites, including flavonoids, isoflavonoids, lignin, suberins, coumarins and wall-bound phenolics (Ehlting et al. 1999; Saballos et al. 2012). The 4CL gene family is typically small, with 4 members in Arabidopsis (Hamberger and Hahlbrock 2004), 5 members in rice (Gui et al. 2011; Sun et al. 2013), and 4 members in soybean (Lindermayr et al. 2002). 4CL isoforms with different substrate specificities may direct the flow from general phenylpropanoid metabolism into the different pathways for specific end products (Souza et al. 2008).
In dicots, 4CLs can be divided into two distinct groups: class I and class II. The disruption of 4CL expression has demonstrated that class I 4CLs participate in lignin formation, while class II 4CLs impact flavonoid metabolism (Lee et al. 1997; Hu et al. 1998; Ehlting et al. 1999; Harding et al. 2002; Nakashima et al. 2008). The remarkable functional diversity of 4CL suggests that it may be subject to positive Darwinian selection. However, how the 4CL genes evolve and functionally diverge and whether natural selection plays a role in their evolution have been poorly studied. In this study, we analyzed nucleotide divergence in the 4CL genes from 16 species and used likelihood methods with various evolutionary models to investigate potential patterns of positive selection.
© 2015 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd.
This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
Methods
Sequence data collection
All known and reported 4CL protein-coding sequences from dicots, monocots, and gymnosperms (loblolly pine) were retrieved from the National Center for Biotechnology Information (NCBI). In total, 42 4CL protein sequences from 16 species were collected and are listed in Table S1.
Phylogenetic analysis
The 4CL protein-coding sequences were aligned using the
program CLUSTALW implemented in MEGA5 (Tamura
et al. 2011) and manually edited. Highly variable regions,
indels, and gaps were excluded. A phylogenetic tree was
constructed using MEGA5 with the neighbor-joining (NJ)
method. The reliability of the branches was evaluated by
1000 bootstrap replicates.
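The bootstrap step can be illustrated with a minimal sketch. This is not the MEGA5 implementation: the toy sequences, the helper names (`bootstrap_alignment`, `p_distance`), and the use of a simple p-distance comparison as a stand-in for "is this grouping recovered in the tree" are all assumptions made for illustration. The sketch only shows the core idea of resampling alignment columns with replacement and counting how often a grouping is supported.

```python
import random


def bootstrap_alignment(alignment, rng):
    """Build one bootstrap replicate by resampling alignment columns with replacement."""
    n_cols = len(next(iter(alignment.values())))
    cols = [rng.randrange(n_cols) for _ in range(n_cols)]
    return {name: "".join(seq[i] for i in cols) for name, seq in alignment.items()}


def p_distance(a, b):
    """Proportion of aligned sites at which two sequences differ."""
    return sum(x != y for x, y in zip(a, b)) / len(a)


# Toy alignment: these sequences are invented and are not 4CL data.
toy = {"seqA": "ATGGCTAAGC", "seqB": "ATGGCTAAGT", "seqC": "ATGACTCAGT"}

rng = random.Random(0)  # fixed seed so the sketch is repeatable
replicates = [bootstrap_alignment(toy, rng) for _ in range(1000)]

# Count the replicates in which seqA is strictly closer to seqB than to seqC.
support = sum(
    p_distance(r["seqA"], r["seqB"]) < p_distance(r["seqA"], r["seqC"])
    for r in replicates
)
print(f"seqA/seqB pairing supported in {100 * support / len(replicates):.1f}% of replicates")
```

In a real analysis each replicate would be used to rebuild a full neighbor-joining tree, and the bootstrap percentage at a node would be the share of replicate trees that contain that clade.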
Test for selection
The nonsynonymous/synonymous substitution rate ratio (ω = dN/dS) provides a measure of the selective pressure at the protein level, where an ω of 1, <1, or >1 indicates neutral evolution, purifying selection, or positive selection, respectively. The hypothesis of positive selection was tested using the CODEML program in the PAML v4.3b package (Yang 2007). Three approaches incorporated into the program were used: branch, site, and branch-site models. In the lineage-specific selection analyses, we employed the recently developed dynamic programming procedure to search for the optimal branch-specific model that had a likelihood equal to or close to the global maximum likelihood for all of the possible models (Zhang et al. 2011). In the site-specific selection analyses, the dataset was fitted to three pairs of codon substitution models (M2a vs. M1a, M3 vs. M0, and M8 vs. M7). The branch-site model A was used to detect positively selected sites along the branches that showed elevated ω ratios. The sites under positive selection were identified by the Bayes Empirical Bayes (BEB) approach.
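Nested codon models such as those listed above are conventionally compared with a likelihood ratio test, in which twice the log-likelihood difference is referred to a chi-square distribution. The following is a minimal sketch of that logic, not part of PAML; the function names and the example log-likelihoods are hypothetical, and only df = 1 and df = 2 are handled because closed-form tail probabilities exist for those cases.

```python
import math


def classify_omega(omega, tol=1e-9):
    """Interpret an omega (dN/dS) estimate as described in the text."""
    if abs(omega - 1.0) <= tol:
        return "neutral evolution"
    return "purifying selection" if omega < 1.0 else "positive selection"


def chi2_sf(stat, df):
    """Upper-tail chi-square probability, closed forms for df = 1 and df = 2 only."""
    if stat <= 0.0:
        return 1.0
    if df == 1:
        return math.erfc(math.sqrt(stat / 2.0))
    if df == 2:
        return math.exp(-stat / 2.0)
    raise ValueError("this sketch only supports df = 1 or df = 2")


def likelihood_ratio_test(lnl_null, lnl_alt, df):
    """Return (2 * delta lnL, P value) for a nested model comparison."""
    stat = 2.0 * (lnl_alt - lnl_null)
    return stat, chi2_sf(stat, df)


# Hypothetical log-likelihoods, chosen only to exercise the functions.
example_stat, example_p = likelihood_ratio_test(lnl_null=-1000.0, lnl_alt=-997.0, df=2)
print(classify_omega(0.089), example_stat, round(example_p, 4))
```

The degrees of freedom equal the difference in the number of free parameters between the two models being compared.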
Results and Discussion
Angiosperm 4CL gene phylogeny
The conserved protein-coding sequences of 42 4CLs from 16 species were used to reconstruct a phylogenetic tree. Analysis revealed that all of the 4CL genes fell into one of two general groups: A and B (Fig. 1). Group A contains representatives from all of the available dicots, including verified 4CL sequences from Arabidopsis, poplar and soybean. The monocot 4CL isoenzymes in group B form a highly supported monophyletic group and are thus separated from the dicot isoforms. The gymnosperm 4CLs, the loblolly pine isoforms Lp4CL1 and Lp4CL2, also formed a separate cluster that was closest to the monocot isoenzymes.
The functional divergence of the 4CL gene family
The dicot 4CLs can be divided into two distinct groups that are designated dicots class I and dicots class II (Fig. 1). Previous studies have demonstrated that 4CL genes in dicots class I are associated with lignin accumulation, while dicots class II 4CLs are involved in the metabolism of other phenolic compounds, such as flavonoids. For example, the genes Pt4CL1, At4CL1, At4CL2, At4CL4, Gm4CL1, and Gm4CL2 in dicots class I are involved in lignin formation (Hu et al. 1998; Ehlting et al. 1999; Lindermayr et al. 2002). However, the genes Pt4CL2, At4CL3, and Gm4CL4 in dicots class II are believed to play a role in flavonoid biosynthesis (Uhlmann and Ebel 1993; Hu et al. 1998; Ehlting et al. 1999).
The 4CLs from monocots can also be classified into two groups, which are designated monocots class I and monocots class II (Fig. 1). The 4CL genes in monocots class I are associated with lignin accumulation. For example, Pv4CL1 in monocots class I is the key 4CL isoenzyme involved in lignin biosynthesis because RNA interference of Pv4CL1 reduces the activity of extractable 4CL by 80%, leading to a reduction in lignin content and a decrease in the guaiacyl unit composition (Xu et al. 2011). The Os4CL3 gene in the same group is also involved in lignin biosynthesis because suppression of Os4CL3 expression results in significant lignin reduction, retarded growth and other morphological changes (Gui et al. 2011). However, the genes in monocots class II (Fig. 1) are likely to participate in the flavonoid biosynthetic pathway. For example, based on phylogenetic analysis, Xu et al. (2011) hypothesized that Pv4CL2 in monocots class II mainly participates in the flavonoid biosynthesis pathway in switchgrass. Recent research (Sun et al. 2013) has demonstrated that the primary function of Os4CL2 is to channel the activated 4-coumarate to chalcone synthase and subsequently to different branched pathways of flavonoid secondary metabolism leading to flower pigments and UV protective flavonols and anthocyanins. The remarkable functional diversity of not only dicot but also monocot 4CLs suggests that 4CL may be subject to positive Darwinian selection.
Evolutionary patterns among lineages and among sites
To test the hypothesis that positive selection acts on 4CLs, we applied branch-specific models to the 4CL dataset. The 40-ratio model (40RM), with 40 different ω ratios, was the optimal branch model (Table S2). The six branches where ω was >1 were defined as branches a, b, c, d, e, and f, respectively (Fig. 1). To examine whether the ω ratio for each branch was significantly greater than the background ratio, the log-likelihood values were calculated from two-ratio models that assigned the ratios ωa, ωb, ωc, ωd, ωe, and ωf to branches a, b, c, d, e, and f, and the ratio ω0 to all other branches. All of these two-ratio models were individually compared with the one-ratio model (M0). The one-ratio model, which assumes the same ω parameter for the entire tree, yielded a log-likelihood value of -28951.48 with an estimated ω0 of 0.089 (Table 1). The low average ratio indicated the dominating role of purifying selection in the evolution of the 4CL genes. The two-ratio models for branches a, c, d, and f fit the data significantly better than the one-ratio model (Table 2), resulting in the rejection of the null hypothesis that the 4CL genes evolved at constant rates along the branches. To test whether the six ω ratios were significantly higher than 1, we calculated the log-likelihood values using the two-ratio models with ωa, ωb, ωc, ωd, ωe, and ωf fixed to 1 (Table 1). Likelihood ratio tests were also implemented to compare each two-ratio model with its corresponding fixed two-ratio model. The likelihood ratio tests in Table 2 revealed that the ω ratios for branches a, b, c, d, e, and f were not significantly greater than one. We therefore conclude that the evolution of the 4CL genes in angiosperms is dominated by purifying selection.

Figure 1. Phylogenetic relationship between 4CL genes from angiosperms based on the neighbor-joining method. The branch lengths are proportional to distances, and the values at the interior nodes are the bootstrap percentages derived from 1000 replicates. The six branches potentially under positive selection are indicated as a, b, c, d, e, and f, respectively.

Table 1. Log-likelihood values and parameter estimates for the 4CL genes.
Model | p | lnL | Parameter estimates | Positively selected sites
M0: one ratio | 1 | -28951.48 | ω0 = 0.089 | None
Branch-specific models
Two ratios (branch a) | 2 | -28949.50 | ω0 = 0.089, ωa = 3.463 |
Two ratios (fixed ωa = 1) | 1 | -28949.58 | ω0 = 0.089, ωa = 1 |
Two ratios (branch b) | 2 | -28950.10 | ω0 = 0.088, ωb = 0.273 |
Two ratios (fixed ωb = 1) | 1 | -28950.49 | ω0 = 0.088, ωb = 1 |
Two ratios (branch c) | 2 | -28948.26 | ω0 = 0.088, ωc = 2.270 |
Two ratios (fixed ωc = 1) | 1 | -28948.29 | ω0 = 0.088, ωc = 1 |
Two ratios (branch d) | 2 | -28948.76 | ω0 = 0.088, ωd = 0.574 |
Two ratios (fixed ωd = 1) | 1 | -28948.79 | ω0 = 0.088, ωd = 1 |
Two ratios (branch e) | 2 | -28950.04 | ω0 = 0.088, ωe = [missing] |
Two ratios (fixed ωe = 1) | 1 | -28950.17 | ω0 = 0.088, ωe = 1 |
Two ratios (branch f) | 2 | -28939.38 | ω0 = 0.088, ωf = [missing] |
Two ratios (fixed ωf = 1) | 1 | -28940.20 | ω0 = 0.088, ωf = 1 |
Site-specific models
M1: neutral (K = 2) | 1 | -28731.92 | p0 = 0.946 (p1 = 0.054) | Not allowed
M2: selection (K = 3) | 3 | -28731.92 | p0 = 0.946, p1 = 0.011 (p2 = 0.042), ω2 = 1 | None
M3: discrete (K = 2) | 3 | -28260.60 | p0 = 0.520 (p1 = 0.480), ω0 = 0.020, ω1 = 0.175 | None
M3: discrete (K = 3) | 5 | -28136.31 | p0 = 0.408, p1 = 0.456 (p2 = 0.136), ω0 = 0.010, ω1 = 0.108, ω2 = 0.320 | None
M7: beta | 2 | -28118.00 | p = 0.542, q = 4.443 | Not allowed
M8: beta and ω | 4 | -28116.60 | p0 = 0.994, p = 0.570, q = 5.015 (p1 = 0.006), ω = 1 | None
Branch-site model A
Model a | 3 | -28731.43 | p0 = 0.912, p1 = 0.052 (p2 + p3 = 0.036), ω2 = 9.870 | None
Model fixed ωa | 2 | -28731.55 | p0 = 0.725, p1 = 0.041 (p2 + p3 = 0.234), ω2 = 1 | Not allowed
Model c | 3 | -28719.14 | p0 = 0.885, p1 = 0.051 (p2 + p3 = 0.064), ω2 = 15.947 | 82C, 291S (at P > 0.95); 379M, 423T (at P > 0.99)
Model fixed ωc | 2 | -28724.93 | p0 = 0.780, p1 = 0.046 (p2 + p3 = 0.155), ω2 = 1 | Not allowed
Model d | 3 | -28723.32 | p0 = 0.922, p1 = 0.053 (p2 + p3 = 0.024), ω2 = 14.873 | 79V, 202S, 211S (at P > 0.95)
Model fixed ωd | 2 | -28726.61 | p0 = 0.825, p1 = 0.047 (p2 + p3 = 0.128), ω2 = 1 | Not allowed
Model f | 3 | -28716.01 | p0 = 0.780, p1 = 0.045 (p2 + p3 = 0.160), ω2 = [missing] | 65L, 69E, 181I, 223L, 234K, 239K (at P > 0.95)
Model fixed ωf | 2 | -28720.97 | p0 = 0.349, p1 = 0.020 (p2 + p3 = 0.631), ω2 = 1 | Not allowed
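The branch-model comparisons described above can be retraced from the log-likelihoods in Table 1. The sketch below is an illustration rather than the authors' script: the variable and function names are invented, the lnL values are transcribed from Table 1 with their negative signs, and significance is judged only against the two chi-square critical values quoted in the Table 2 footnotes (3.84 and 6.63, df = 1).

```python
# Log-likelihoods transcribed from Table 1 (all lnL values are negative).
LNL_M0 = -28951.48
BRANCH_LNL = {  # branch: (lnL with omega free, lnL with omega fixed to 1)
    "a": (-28949.50, -28949.58),
    "b": (-28950.10, -28950.49),
    "c": (-28948.26, -28948.29),
    "d": (-28948.76, -28948.79),
    "e": (-28950.04, -28950.17),
    "f": (-28939.38, -28940.20),
}
CRIT_P05 = 3.84  # chi-square critical value, df = 1, P = 0.05
CRIT_P01 = 6.63  # chi-square critical value, df = 1, P = 0.01


def lrt_statistic(lnl_null, lnl_alt):
    """Twice the log-likelihood gain of the alternative over the null model."""
    return 2.0 * (lnl_alt - lnl_null)


def significance(stat):
    """Flag a statistic against the two critical values used in Table 2."""
    if stat > CRIT_P01:
        return "**"
    if stat > CRIT_P05:
        return "*"
    return "ns"


results = {}
for branch, (lnl_free, lnl_fixed) in BRANCH_LNL.items():
    vs_m0 = lrt_statistic(LNL_M0, lnl_free)         # is omega on this branch different from background?
    vs_fixed = lrt_statistic(lnl_fixed, lnl_free)   # is omega on this branch greater than 1?
    results[branch] = (vs_m0, significance(vs_m0), vs_fixed, significance(vs_fixed))
    print(f"branch {branch}: vs M0 = {vs_m0:.2f} {results[branch][1]}, "
          f"vs fixed = {vs_fixed:.2f} {results[branch][3]}")
```

The first comparison asks whether a branch has its own ω; the second asks whether that ω exceeds 1, which is the comparison that matters for inferring positive selection.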
Because the branch model test averages the ω ratios across all of the sites and is a very conservative test for positive selection, we applied site-specific models to the 4CL dataset. The log-likelihood values and the parameter estimates under models with variable ω ratios among the sites are listed in Table 1. Two site classes (M3, K = 2) fit the data significantly better than one site class (M0) by 690.88 log-likelihood units, revealing significant variation in the selective pressure among the sites. However, none of the site-specific models that allow for the presence of positively selected sites, namely M2a (selection), M3 (discrete), and M8 (beta and ω), suggested the existence of positively selected sites with ω > 1. The majority of the sites in the 4CL sequences appear to be under strong selective constraints.
Evidence for positive selection on lignin-related 4CL genes
Positive selection is difficult to detect because it often operates episodically on just a few amino acid sites and purifying selection may mask the signal. Branch-site models can detect positive selection that affected a small number of sites along prespecified lineages. We used branch-site model A to test the hypothesis. As detailed in Table 3, branch-site model A using branch c as the foreground branch (MAc) fit the data significantly better than M1a (2ΔlnL = 25.56, df = 2, P < 0.01) and than null model A for branch c (2ΔlnL = 11.58, df = 1, P < 0.01). This result also suggested that 5.1% of amino acids are under positive selection in lineage c with ω = 15.95 (Table 1). Branch-site model A using branch d as the foreground branch (MAd) provided a significantly better fit than M1a (2ΔlnL = 17.2, df = 2, P < 0.01) and than null model A for branch d (2ΔlnL = 6.58, df = 1, P < 0.05) (Table 3). This result also suggested that 5.3% of the protein sites are under positive selection in lineage d with ω = 14.873 (Table 1). When the analysis was repeated with branch f as the foreground branch (MAf), model A fit the data significantly better than M1a (2ΔlnL = 31.82, df = 2, P < 0.01) and than null model A for branch f (2ΔlnL = 9.92, df = 1, P < 0.01) (Table 3), which suggested that 4.5% of the amino acids are under positive selection in lineage f with ω = [missing] (Table 1). Model A using branch a as the foreground branch (MAa) did not fit the data better than the two null models in test 1 and test 2 (Table 3). This evidence supports the positive selection hypothesis for lineages c, d, and f.
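The two branch-site tests can be recomputed from the Table 1 log-likelihoods as a check on the statistics quoted above. This is a sketch under stated assumptions: the names are invented, the degrees of freedom follow the text (df = 2 against M1a, df = 1 against the null model A), and the chi-square tail probabilities use closed forms that exist only for df = 1 and df = 2. P values recomputed this way need not match every entry printed in Table 3.

```python
import math

# Log-likelihoods transcribed from Table 1 (all lnL values are negative).
LNL_M1A = -28731.92
BRANCH_SITE_LNL = {  # foreground branch: (lnL model A, lnL null model A with omega2 = 1)
    "a": (-28731.43, -28731.55),
    "c": (-28719.14, -28724.93),
    "d": (-28723.32, -28726.61),
    "f": (-28716.01, -28720.97),
}


def chi2_sf(stat, df):
    """Upper-tail chi-square probability, closed forms for df = 1 and df = 2 only."""
    if stat <= 0.0:
        return 1.0
    if df == 1:
        return math.erfc(math.sqrt(stat / 2.0))
    if df == 2:
        return math.exp(-stat / 2.0)
    raise ValueError("this sketch only supports df = 1 or df = 2")


tests = {}
for branch, (lnl_alt, lnl_null) in BRANCH_SITE_LNL.items():
    stat1 = 2.0 * (lnl_alt - LNL_M1A)   # test 1: model A vs M1a, df = 2
    stat2 = 2.0 * (lnl_alt - lnl_null)  # test 2: model A vs null model A, df = 1
    tests[branch] = (stat1, chi2_sf(stat1, 2), stat2, chi2_sf(stat2, 1))
    print(f"MA{branch}: test 1 = {stat1:.2f} (P = {tests[branch][1]:.2e}), "
          f"test 2 = {stat2:.2f} (P = {tests[branch][3]:.2e})")
```

Test 2 is the more direct test of positive selection, because its null model differs from model A only in fixing the foreground ω2 at 1.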
Based on the BEB method, four and six candidate sites for positive selection were identified in dicots and monocots, respectively (Table 1). These positively selected sites are labeled in Figure 2. Sites 181I, 202S, 211S, 223L, 234K, and 239K are located in the substrate-binding pocket, and 379M and 423T are located in the catalytic centers. Sites 65L, 69E, 79V, and 82C are located between the conserved sequence motifs A2 and A3, which form a phosphate-binding loop. Site 291S is close to motif A6, which is important for the formation of a stable tertiary structure. Thus, amino acid substitutions at these positively selected sites in the 4CL genes might influence the 4CL substrate specificity, activity, or secondary structure, which would in turn have a profound effect on 4CL's function.
Table 2. Likelihood ratio statistics (2ΔlnL) for testing branch hypotheses.
Two-ratio model | vs. M0 (one ratio) | vs. fixed model (ω = 1 on the same branch)
ωa free | 3.96* | 0.16
ωb free | 2.76 | 0.78
ωc free | 6.44* | 0.06
ωd free | 5.44* | 0.06
ωe free | 2.88 | 0.26
ωf free | 24.2** | 1.64
*Significant (P < 0.05, χ² = 3.84).
**Extremely significant (P < 0.01, χ² = 6.63).
Table 3. Likelihood ratio statistics (2ΔlnL) for testing branch-site hypotheses. P values are given in parentheses.
Model | vs. M1a | vs. null branch-site model A (ω = 1 on the same foreground branch)
MAa | 0.98 (0.61) | 0.24 (0.89)
MAc | 25.56 (2.82E-07)** | 11.58 (6.67E-04)**
MAd | 17.2 (1.84E-04)** | 6.58 (1.03E-02)*
MAf | 31.82 (1.23E-07)** | 9.92 (7.01E-03)**
*Significant (P < 0.05).
**Extremely significant (P < 0.01).
We have demonstrated that 4CL genes in branches c and f, which are associated with lignin accumulation, are under positive selection. Interestingly, positive selection is also detected at the At4CL genes in branch d. However, the role of these proteins in lignin formation is similar to that of other proteins from dicots class I (Hu et al. 1998; Ehlting et al. 1999). We hypothesize that positive selection on the At4CL genes may be related to functional specialization.
Selective constraints on flavonoid-related 4CLs in dicots
The 4CL genes involved in flavonoid biosynthesis (dicots class II and monocots class II, Fig. 1) have been largely conserved during plant evolution, suggesting that they are constrained by purifying selection. Land plants evolved from green algae in the mid-Ordovician over 450 million years ago (Langdale 2008). After arriving in terrestrial environments, the pioneering land plants were confronted with several major challenges, such as ultraviolet irradiation and desiccation stress. The presence of flavonoids in the earliest land plants and the associated ability to resist UV irradiation made survival on land possible (Rozema et al. 2002). The flavonoid pathway evolved prior to the lignin pathway. For example, bryophytes do not synthesize lignin but accumulate soluble phenylpropanoids, such as flavonoids and lignans (Weng and Chapple 2010). Flavonoids accumulate in the epidermal layer of extant plants, which has been shown to absorb over 90% of UV-B radiation (Stafford 1991). This evidence suggests that the ancestral role of 4CL was to participate in flavonoid biosynthesis and that this role was maintained during adaptation to a terrestrial environment.
Conclusions
4CLs play important roles in both lignin and flavonoid biosynthesis. 4CLs that play a role in lignin biosynthesis are subject to positive selection. This positive selection resulted in a functional divergence after the monocot-dicot split approximately 200 million years ago. Positive selection could have been involved in the early stages of the evolution of the 4CL genes; 4CL rapidly evolves after speciation events. Strong purifying selection then operates on the novel 4CL genes to maintain the protein's existing function. Based on the BEB method, four and six candidate sites for positive selection were identified in dicots and monocots, respectively (Table 1). Most of the positively selected sites are located in the substrate-binding pocket and the catalytic centers (Fig. 2). Therefore, amino acid replacements at these sites might imply neofunctionalization. This result is in agreement with previous findings that 4CL genes functionally diversified in angiosperms (Hu et al. 1998; Ehlting et al. 1999; Gui et al. 2011; Xu et al. 2011; Sun et al. 2013). Although several positively selected sites were detected using the branch-site model, we find that the 4CL gene family as a whole experiences purifying selection rather than pervasive positive selection throughout evolution. The 4CLs involved in flavonoid biosynthesis have been largely conserved during plant evolution and maintain the ancestral role in response to biotic or abiotic factors. These findings provide deeper insights into the evolutionary mechanisms of 4CL isoforms and their functional diversification.

Figure 2. The deduced amino acid sequence for Arabidopsis At4CL1 referred to in this article. The residues involved in hydroxycinnamate binding are indicated with stars, while those involved in enzymatic functions are labeled with triangles (Hu et al. 2010). The bold-type letters indicate conserved motifs (Gulick 2009), while those on a gray background indicate positively selected sites.
Acknowledgments
We thank Dr. Peng Chen and Dr. Liqiang Wang for reading and discussing the manuscript and Dr. Chengjun Zhang for assistance with statistical analysis. This work was supported in part by grants from the National Transgenic Project (2009ZX08009-119B), the 111 Project (B08032), the 973 Specific Pre-project (2010CB134401), and the Youth Foundation of Jiangsu Province (BK20140417).
Conflict of Interest
None declared.
References
Agati, G., C. Brunetti, M. Di Ferdinando, F. Ferrini, S. Pollastri, and M. Tattini. 2013. Functional roles of flavonoids in photoprotection: new evidence, lessons from the past. Plant Physiol. Biochem. 72:35-45.
Ehlting, J. E., D. Buttner, Q. Wang, C. J. Douglas, I. E. Somssich, and E. Kombrink. 1999. Three 4-coumarate: coenzyme A ligases in Arabidopsis thaliana represent two evolutionarily divergent classes in angiosperms. Plant J. 19:9-20.
Gui, J., J. Shen, and L. Li. 2011. Functional characterization of evolutionarily divergent 4-coumarate: coenzyme A ligases in rice. Plant Physiol. 157:574-586.
Gulick, A. M. 2009. Conformational dynamics in the Acyl-CoA synthetases, adenylation domains of non-ribosomal peptide synthetases, and firefly luciferase. ACS Chem. Biol. 4:811-827.
Hamberger, B., and K. Hahlbrock. 2004. The 4-coumarate: CoA ligase gene family in Arabidopsis thaliana comprises one rare, sinapate-activating and three commonly occurring isoenzymes. Proc. Natl Acad. Sci. USA 101:2209-2214.
Harding, S. A., J. Leshkevich, V. L. Chiang, and C. J. Tsai. 2002. Differential substrate inhibition couples kinetically distinct 4-coumarate: coenzyme A ligases with spatially distinct metabolic roles in quaking aspen. Plant Physiol. 128:428-438.
Hu, W. J., A. Kawaoka, C. J. Tsai, J. Lung, K. Osakabe, H. Ebinuma, et al. 1998. Compartmentalized expression of two structurally and functionally distinct 4-coumarate: CoA ligase genes in aspen. Proc. Natl Acad. Sci. USA 95:5407-5412.
Hu, Y., Y. Gai, L. Yin, X. Wang, C. Feng, L. Feng, et al. 2010. Crystal structures of a Populus tomentosa 4-coumarate: CoA ligase shed light on its enzymatic mechanisms. Plant Cell 22:3093-3104.
Langdale, J. A. 2008. Evolution of developmental mechanisms in plants. Curr. Opin. Genet. Dev. 18:368-373.
Lee, D., K. Meyer, C. Clint, and C. Douglas. 1997. Antisense suppression of 4-coumarate: coenzyme A ligase activity in Arabidopsis leads to altered lignin subunit composition. Plant Cell 9:1985-1998.
Lindermayr, C., B. Mollers, J. Fliegmann, A. Uhlmann, and F. Lottspeich. 2002. Divergent members of a soybean (Glycine max L.) 4-coumarate: coenzyme A ligase gene family. Eur. J. Biochem. 269:1304-1315.
Nakashima, J., F. Chen, L. Jackson, G. Shadle, and R. A. Dixon. 2008. Multi-site genetic modification of monolignol biosynthesis in alfalfa (Medicago sativa): effects on lignin composition in specific cell types. New Phytol. 179:738-750.
Rozema, J., L. O. Bjorn, J. F. Bornman, A. Gaberscik, D. P. Hader, T. Trost, et al. 2002. The role of UV-B radiation in aquatic and terrestrial ecosystems - an experimental and functional analysis of the evolution of UV-absorbing compounds. J. Photochem. Photobiol., B 66:2-12.
Saballos, A., S. E. Sattler, E. Sanchez, T. P. Foster, Z. Xin, C. Kang, et al. 2012. Brown midrib2 (Bmr2) encodes the major 4-coumarate: coenzyme A ligase involved in lignin biosynthesis in sorghum (Sorghum bicolor (L.) Moench). Plant J. 70:818-830.
Souza, C. A., B. Barbazuk, S. G. Ralph, G. Bohlmann, B. Hamberger, and C. J. Douglas. 2008. Genome-wide analysis of a land plant-specific acyl: coenzyme A synthetase (ACS) gene family in Arabidopsis, poplar, rice and Physcomitrella. New Phytol. 179:987-1003.
Stafford, H. A. 1991. Flavonoid evolution: an enzymic approach. Plant Physiol. 96:680-685.
Sun, H., Y. Li, S. Feng, W. Zou, K. Guo, C. Fan, et al. 2013. Analysis of five rice 4-coumarate: coenzyme A ligase enzyme activity and stress response for potential roles in lignin and flavonoid biosynthesis in rice. Biochem. Biophys. Res. Commun. 430:1151-1156.
Tamura, K., D. Peterson, N. Peterson, G. Stecher, M. Nei, and S. Kumar. 2011. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol. Biol. Evol. 28:2731-2739.
Uhlmann, A., and J. Ebel. 1993. Molecular cloning and expression of 4-coumarate: coenzyme A ligase, an enzyme involved in the resistance response of soybean (Glycine max L.) against pathogen attack. Plant Physiol. 102:1147-1156.
Weng, J. K., and C. Chapple. 2010. The origin and evolution of lignin biosynthesis. New Phytol. 187:273-285.
Xu, B., L. L. Escamilla-Trevino, N. Sathitsuksanoh, Z. Shen, H. Shen, Y. H. Zhang, et al. 2011. Silencing of 4-coumarate: coenzyme A ligase in switchgrass leads to reduced lignin content and improved fermentable sugar yields for biofuel production. New Phytol. 192:611-625.
Yang, Z. 2007. PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24:1586-1591.
Zhang, C., J. Wang, W. Xie, G. Zhou, M. Long, and Q. Zhang. 2011. Dynamic programming procedure for searching optimal models to estimate substitution rates based on the maximum-likelihood method. Proc. Natl Acad. Sci. USA 108:7860-7865.
Supporting Information
Additional Supporting Information may be found in the
online version of this article:
Table S1. Members of plant 4-coumarate: coenzyme A ligase (4CL) genes.
Table S2. Likelihood Ratio Statistics (2ΔlnL) for Testing Method1 result.
3420 ª2015 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd.
Evolution of the 4CL Genes H. Sun et al.
... In this study, we identi ed 12 genes (C4H, CCoAOMT, F3'H, UA3'5'GT, PAL, 4CL, CCR, CAD, CALDH, Bglx, SGTase and E1.11.17) that related ower color synthesis pathways. And these genes had been investigated and reported in previous studies [24][25][26][27][28][29][30][31][32][33]. C4H had been reported not only the key enzyme involved in the second step of avonoid synthesis, but also is the rst oxidoreductase of cytochrome P450 in phenylpropane biosynthesis pathway, which catalyzed a speci c hydroxylation reaction and generated coumaric acid that was a precursor of avonoids [24]. ...
... 4CL was a key enzyme that functions early in the phenylpropane pathway. And the proprotein 4CL converts 4-coumaric acid and other cinnamic acids to the corresponding CoA thiol esters, which can be used to many secondary metabolites, such as avonoids, iso avones, lignin, etc [32]. SGTase can be used as a multiple phenylpropanoid glucosylation enzyme, which exhibited signi cant activity with avonoids [35].β-glucosidase ...
Preprint
Full-text available
Background Phalaenopsis is an important ornamental plant, which occupies an important position in the world flower market and has great economic value due to its rich and diverse flower colors. In order to investigate the flower color formation of Phalaenopsis at transcription level, the flower color formation involved genes were identified from RNA-seq in this study. Results White and purple petals of Phalaenopsis were collected in this study, and results were focused on two aspects: (1) the differential expression genes (DEGs) between white and purple flower color; and (2) association between SNP mutations and DEGs in transcriptome level. Results indicated that a total of 1,175 DEGs were identified, and the up- and down-regulation genes were 718 and 457, respectively. Gene Ontology (GO) and pathway enrichment showed that the biosynthesis of secondary metabolites pathway was key responsible for color formation and twelve crucial genes (C4H, CCoAOMT, F3'H, UA3'5'GT, PAL, 4CL, CCR, CAD, CALDH, bglx, SGTase and E1.11.17) from them involved in the regulation of flower color in Phalaenopsis. Conclusion This study firstly reported that the SNP mutations strongly associated with DEGs in color formation at RNA level, and provides a new insight to further investigate the gene expression and its relationship with genetic variants from RNA-seq data in other species.
... 4-Coumarate-CoA ligase (4CL) is a key enzyme in phenylpropane metabolism, which catalyzes 4-coumaric acid, ferulic acid, and other acids to generate corresponding CoA thioesters, providing a substrate for lignin synthesis (Figure 1) Sun et al., 2015;Tang et al., 2019). At present, 4CL gene has been reported in abundant plant species, such as Arabidopsis thaliana (Ehlting et al., 1999), Boehmeria nivea (Tang et al., 2019), and Pinus massoniana (Van Huan et al., 2012). ...
Article
4-Coumarate-CoA ligase (4CL) gene plays vital roles in plant growth and development, especially the regulation of lignin metabolism and flavonoid synthesis. To investigate the potential function of 4CL in the lignin biosynthesis of Ginkgo biloba, this study identified two 4CL genes, Gb4CL1 and Gb4CL2, from G. biloba genome. Based on the phylogenetic tree analysis, Gb4CL1 and Gb4CL2 protein were classified into Class I, which has been confirmed to be involved in lignin biosynthesis. Therefore, it can be inferred that these two genes may also participate in lignin metabolism. The tissue-specific expression patterns of these two genes revealed that Gb4CL1 was highly expressed in microstrobilus, whereas Gb4CL2 was abundant in immature leaves. The onion transient expression assay indicated that Gb4CL1 was predominantly localized in the nucleus, indicating its potential involvement in nuclear functions, while Gb4CL2 was observed in the cell wall, suggesting its role in cell wall-related processes. Phytohormone response analysis revealed that the expression of both genes was upregulated in response to indole acetic acid, while methyl jasmonate suppressed it, gibberellin exhibited opposite effects on these genes. Furthermore, Gb4CL1 and Gb4CL2 expressed in all tissues containing lignin that showed a positive correlation with lignin content. Thus, these findings suggest that Gb4CL1 and Gb4CL2 are likely involved in lignin biosynthesis. Gb4CL1 and Gb4CL2 target proteins were successfully induced in Escherichia coli BL21 with molecular weights of 85.5 and 89.2 kDa, proving the integrity of target proteins. Our findings provided a basis for revealing that Gb4CL participated in lignin synthesis in G. biloba.
... For example, Arabidopsis 4CL1, 4CL2, 4CL4 belong to class I, related to lignin biosynthesis, and At4CL3 belongs to class II, responsible for flavonoid biosynthesis (Ehlting et al. 1999; Li et al. 2015). Rice 4CL2 belongs to class II, can be strongly activated by UV irradiation, and is potentially involved in flavonoid synthesis (Gui et al. 2011; Sun et al. 2015). In addition, there are some 4CL-like genes-derived amino-acid sequences that are highly homologous to "true" 4CLs bearing the same conserved motifs as 4CLs encoding adenylate-forming enzymes (Raes et al. 2003; Hamberger et al. 2007; Wei et al. 2013). ...
Article
Key message Overexpression and antisense expression of Sm4CL2 re-directed the biosynthesis of salvianolic acids and tanshinones in Salvia miltiorrhiza hairy roots. Abstract Danshen (Salvia miltiorrhiza Bunge) is a widely used traditional Chinese medicine and its main active ingredients are water-soluble phenolic acids and lipophilic diterpenoids which are produced through the phenylpropanoid pathway and terpenoid pathway, respectively. 4-Coumaric acid: Coenzyme A ligase (4CL) is a key enzyme in the phenylpropanoid metabolism. We had obtained Sm4CL2-overexpressing (Sm4CL2-OE) and antisense Sm4CL2-expressing (anti-Sm4CL2) danshen hairy roots over ten years ago. In the follow-up study, we found that total salvianolic acids in Sm4CL2-OE-4 hairy roots increased to 1.35 times of the control-3, and that in anti-Sm4CL2-1 hairy roots decreased to 37.32% of the control-3, but tanshinones in anti-Sm4CL2-1 was accumulated to 1.77 ± 0.16 mg/g of dry weight, compared to undetectable in Sm4CL2-OE-4 and the control-3 hairy roots. Interestingly, Sm4CL2-OE-4 hairy roots contained more lignin, 1.36 times of the control-3, and enhanced cell wall and xylem lignification. Transcriptomic analysis revealed that overexpression of Sm4CL2 caused the upregulation of other phenylpropanoid pathway genes and antisense Sm4CL2 expression resulted in the downregulation of other phenylpropanoid pathway genes but activated the expression of terpenoid pathway genes like SmCYP76AK5, SmGPPS.SSUII.1 and SmDXS2. Protein–protein interaction analysis suggested that Sm4CL2 might interact with PAL, PAL4, CSE, CCoAOMT and SmCYP84A60, and appeared to play a key role in the interaction network. The tracking work in this study proved that Sm4CL2 could redirect both salvianolic acids and tanshinones biosynthesis possibly through synergistically regulating other pathway genes. 
It also indicated that genetic modification of plant secondary metabolism with biosynthetic gene might cause other responses through protein–protein interactions.
... Moreover, we captured two hub genes in the light-yellow module, which shared 29 and 44 edges with other candidate genes and were highly correlated with flavonoids. For example, Cp4CL, which is an important branch point for directing metabolites to flavonoids (58), and CpHCT, which is associated with monolignol biosynthesis and phenylpropanoid metabolism (59,60). Interestingly, several TFs were significantly correlated with triterpenoid biosynthesis in the black module, including two MYB-related genes and CpWRKY1. ...
Article
Cyclocarya paliurus, a well-known nutrient and beverage plant, is under development for use in functional health care products and natural and organic foods. We hypothesize that the composition and metabolic accumulation of hypoglycemic nutrient metabolites differ significantly depending on harvest time. Therefore, it is of great significance to establish the best harvest time for C. paliurus leaves for the further development of healthy teas and other products. However, the detailed compositions and molecular mechanisms of nutrient biosynthesis in C. paliurus leaves during different harvest stages remain largely unclear. Metabolome analysis showed that a suitable leaf-harvesting strategy for C. paliurus could be in September or October each year due to the high content of hypoglycemic nutrient metabolites. We found that two of the seven differentially accumulated phenolic acid metabolites have a relatively good inhibitory effect on α-amylase, indicating that they may play a role in the hypoglycemic function. Combined analysis of coexpression, ceRNA network, and weighted gene correlation network analysis (WGCNA) showed that several genes or transcription factors (TFs) in three modules correlated highly with hypoglycemic nutrient metabolites, including CpPMM, CpMan, CpFK, CpSUS, CpbglX, Cp4CL, CpHCT, and CpWRKY1. These findings help in the understanding of the molecular mechanisms and regulatory networks of the hypoglycemic nutrient metabolites in C. paliurus leaves which are dependent on harvest time and provide theoretical guidance in the development of functional health care products and foods from C. paliurus.
... Due to the high connectivity of the hub genes, it was speculated that they played a key role in anthocyanin synthesis. According to the perspective of carbon flow, 4CL, F3H, F3′H and F3′5′H were proposed to catalyze the synthesis of anthocyanins, flavonols and proanthocyanidins [41][42][43][44]; consequently, their high expression laid the foundation for the accumulation of total anthocyanin, which was considered to be the direct reason for the higher TAC in L. spicata fruits than that in O. japonicas fruits (unpublished). In particular, the expression patterns of 4CL, F3′H and F3′5′H were significantly correlated with eight major anthocyanins (R² > 0.7). ...
Article
Liriope spicata is an important ornamental ground cover plant, with a fruit color that turns from green to black during the development and ripening stages. However, the material basis and regulatory mechanism of the color variation remains unclear. In this study, a total of 31 anthocyanins and 2 flavonols were identified from the skin of L. spicata fruit via integrative analysis on the metabolome and transcriptome of three developmental stages. The pigments of black/mature fruits are composed of five common anthocyanin compounds, of which Peonidin 3–O–rutinoside and Delphinidin 3–O–glucoside are the most differential metabolites for color conversion. Using dual-omics joint analysis, the mechanism of color formation was obtained as follows. The expression of structural genes including 4CL, F3H, F3′H, F3′5′H and UFGT were activated due to the upregulation of transcription factor genes MYB and bHLH. As a result, a large amount of precursor substances for the synthesis of flavonoids accumulated. After glycosylation, stable pigments were generated which promoted the accumulation of anthocyanins and the formation of black skin.
... In addition to structural genes, many transcription factors (TFs, which act as modulators of gene expression at the transcriptional level through sequence-specific DNA binding or protein-protein interactions during chromatin remodeling) are required for flavonol and PA biosynthesis [27,28]. The key enzyme 4-coumarate:CoA ligase (4CL) provides the precursors for lignin and flavonoid biosynthesis [29]. The first committed step in flavonoid biosynthesis, which is a major pathway of plant secondary metabolism, is catalyzed by chalcone synthase (CHS) [30]. ...
Article
Flavonols and proanthocyanidins (PAs) are the main pigments in the black spines of cucumber (Cucumis sativus) fruit, and CsMYB60 is a key regulator of the biosynthesis of flavonols and PAs. However, in cucumber, the tissue distribution pattern of flavonols and PAs and the mechanism of their biosynthesis regulated by CsMYB60 remain unclear. In this study, we clarified the tissue-specific distribution of flavonoids and the unique transcriptional regulation of flavonoid biosynthesis in cucumber. CsMYB60 activated CsFLS and CsLAR by binding to their promoters and directly or indirectly promoted the expression of CsbHLH42, CsMYC1, CsWD40, and CsTATA-box binding protein, resulting in the formation of complexes of these four proteins to increase the expression of Cs4CL and interact with CsTATA-box binding protein to regulate the expression of CsCHS, thereby regulating the biosynthesis of flavonols and PAs in cucumber. Our data provide new insights into the molecular mechanism of flavonoid biosynthesis, which will facilitate molecular breeding to improve fruit quality in cucumber.
... The 4CL genes are classified into two types: Type I and Type II. The amino acid sequences of Type I genes are more conserved and are mainly associated with lignin accumulation and Type II regulates flavonoid metabolism (Rao et al. 2014;Sun et al. 2015). Crystal structures of 4CL showed that 4CLs exhibit different isoform distribution patterns allowing recognition of specific substrates for the synthesis of different products (Gao et al. 2015;Li and Nair 2015b). ...
Article
Solanum habrochaites (SH), a closely wild related species of Alisa craig (AC), is an important germplasm resource for modern tomato breeding. Trichomes are developed from epidermal cells, producing a role in defense against insect attacking, and their secretions are of non-negligible value. Here, the glandular heads of type VI trichomes were clearly distinguished in AC and SH under cryo-scanning electron microscopy, such difference that allowed SH to secrete more anti-insect metabolites than AC. Pest preference experiments showed that aphids and mites preferred to feed near AC compared to SH. The integration analysis of transcriptomics and metabolomics data revealed that phenylpropanoid biosynthesis pathway was an important secondary metabolic pathway in plants, and SH secreted large amounts of phenylpropanoids and flavonoids than AC by up-regulating the expression of relevant genes in this pathway, which in turn resisted feeding by phytophagous insects. Notably, virus-induced silencing of the Sl4CLL6 not only decreased the expression of genes downstream of the phenylpropanoid biosynthesis pathway, SlHCT, SlCAD, and SlCHI, but also reduced resistance to mites in tomato. These findings provided new genetic resources for the synthesis of phenylpropanoid compounds and anti-insect breeding in Solanum habrochaites and new theoretical basis for the improvement of important traits in cultivated tomato.
Article
Background: Phalaenopsis is an important ornamental plant that has great economic value in the world flower market as one of the most popular flower resources. Objective: To investigate the flower colour formation of Phalaenopsis at the transcription level, the genes involved in flower color formation were identified from RNA-seq in this study. Methods: In this study, white and purple petals of Phalaenopsis were collected and analyzed to obtained (1) differential expression genes (DEGs) between white and purple flower color and (2) the association between single nucleotide polymorphisms (SNP) mutations and DEGs at the transcriptome level. Results: The results indicated that a total of 1,175 DEGs were identified, and 718 and 457 of them were up- and down-regulated genes, respectively. Gene Ontology and pathway enrichment showed that the biosynthesis of the secondary metabolites pathway was key to color formation, and the expression of 12 crucial genes (C4H, CCoAOMT, F3'H, UA3'5'GT, PAL, 4CL, CCR, CAD, CALDH, bglx, SGTase, and E1.11.17) that are involved in the regulation of flower color in Phalaenopsis. Conclusion: This study reported the association between the SNP mutations and DEGs for color formation at RNA level, and provides a new insight to further investigate the gene expression and its relationship with genetic variants from RNA-seq data in other species.
Article
4-Coumarate:CoA ligases (4CLs, EC 6.2.1.12) are a group of enzymes necessary for maintaining a continuous metabolic flux for the biosynthesis of plant phenylpropanoids, such as lignin and flavonoids, that are essential to the survival of plants. So far, various biochemical and molecular studies of plant 4CLs seem to suggest that 4CL isoforms in plants are functionally indistinguishable in mediating the biosynthesis of these phenolics. However, we have discovered two functionally and structurally distinct 4CL genes, Pt4CL1 and Pt4CL2 (63% protein sequence identity), that are differentially expressed in aspen (Populus tremuloides). The Escherichia coli-expressed and purified Pt4CL1 and Pt4CL2 proteins exhibited highly divergent substrate preference as well as specificity that reveal the association of Pt4CL1 with the biosynthesis of guaiacyl–syringyl lignin and the involvement of Pt4CL2 with other phenylpropanoid formation. Northern hybridization analysis demonstrated that Pt4CL1 mRNA is specifically expressed in lignifying xylem tissues and Pt4CL2 mRNA is specifically expressed in epidermal layers in the stem and the leaf, consistent with the promoter activities of Pt4CL1 and Pt4CL2 genes based on the heterologous promoter-β-glucuronidase fusion analysis. Thus, the expression of Pt4CL1 and Pt4CL2 genes is compartmentalized to regulate the differential formation of phenylpropanoids that confer different physiological functions in aspen; Pt4CL1 is devoted to lignin biosynthesis in developing xylem tissues, whereas Pt4CL2 is involved in the biosynthesis of other phenolics, such as flavonoids, in epidermal cells.
Article
4-Coumarate:coenzyme A ligase (4CL; EC 6.2.1.12) is a key enzyme in the phenylpropanoid metabolic pathways for monolignol and flavonoid biosynthesis. 4CL has been much studied in dicotyledons, but its function is not completely understood in monocotyledons, which display a different monolignol composition and phenylpropanoid profile. In this study, five members of the 4CL gene family in the rice (Oryza sativa) genome were cloned and analyzed. Biochemical characterization of the 4CL recombinant proteins revealed that the rice 4CL isoforms displayed different substrate specificities and catalytic turnover rates. Among them, Os4CL3 exhibited the highest turnover rate. No apparent tissue-specific expression of the five 4CLs was observed, but significant differences in their expression levels were detected. The rank in order of transcript abundance was Os4CL3 > Os4CL5 > Os4CL1 > Os4CL4 > Os4CL2. Suppression of Os4CL3 expression resulted in significant lignin reduction, shorter plant growth, and other morphological changes. The 4CL-suppressed transgenics also displayed decreased panicle fertility, which may be attributed to abnormal anther development as a result of disrupted lignin synthesis. This study demonstrates that the rice 4CLs exhibit different in vitro catalytic properties from those in dicots and that 4CL-mediated metabolism in vivo may play important roles in regulating a broad range of biological events over the course of rice growth and development.
Article
• The lignin content of feedstock has been proposed as one key agronomic trait impacting biofuel production from lignocellulosic biomass. 4-Coumarate:coenzyme A ligase (4CL) is one of the key enzymes involved in the monolignol biosynthetic pathway. • Two homologous 4CL genes, Pv4CL1 and Pv4CL2, were identified in switchgrass (Panicum virgatum) through phylogenetic analysis. Gene expression patterns and enzymatic activity assays suggested that Pv4CL1 is involved in monolignol biosynthesis. Stable transgenic plants were obtained with Pv4CL1 down-regulated. • RNA interference of Pv4CL1 reduced extractable 4CL activity by 80%, leading to a reduction in lignin content with decreased guaiacyl unit composition. Altered lignification patterns in the stems of RNAi transgenic plants were observed with phloroglucinol-HCl staining. The transgenic plants also had uncompromised biomass yields. After dilute acid pretreatment, the low lignin transgenic biomass had significantly increased cellulose hydrolysis (saccharification) efficiency. • The results demonstrate that Pv4CL1, but not Pv4CL2, is the key 4CL isozyme involved in lignin biosynthesis, and reducing lignin content in switchgrass biomass by silencing Pv4CL1 can remarkably increase the efficiency of fermentable sugar release for biofuel production.
Article
Comparative analysis of molecular sequence data is essential for reconstructing the evolutionary histories of species and inferring the nature and extent of selective forces shaping the evolution of genes and species. Here, we announce the release of Molecular Evolutionary Genetics Analysis version 5 (MEGA5), which is a user-friendly software for mining online databases, building sequence alignments and phylogenetic trees, and using methods of evolutionary bioinformatics in basic biology, biomedicine, and evolution. The newest addition in MEGA5 is a collection of maximum likelihood (ML) analyses for inferring evolutionary trees, selecting best-fit substitution models (nucleotide or amino acid), inferring ancestral states and sequences (along with probabilities), and estimating evolutionary rates site-by-site. In computer simulation analyses, ML tree inference algorithms in MEGA5 compared favorably with other software packages in terms of computational efficiency and the accuracy of the estimates of phylogenetic trees, substitution parameters, and rate variation among sites. The MEGA user interface has now been enhanced to be activity driven to make it easier for the use of both beginners and experienced scientists. This version of MEGA is intended for the Windows platform, and it has been configured for effective use on Mac OS X and Linux desktops. It is available free of charge from http://www.megasoftware.net.
Article
The substitution rate in a gene can provide valuable information for understanding its functionality and evolution. A widely used method to estimate substitution rates is the maximum-likelihood method implemented in the CODEML program in the PAML package. A limited number of branch models, chosen based on a priori information or an interest in a particular lineage(s), are tested, whereas a large number of potential models are neglected. A complementary approach is also needed to test all or a large number of possible models to search for the globally optimal model(s) of maximum likelihood. However, the computational time for this search even in a small number of sequences becomes impractically long. Thus, it is desirable to explore the most probable spaces to search for the optimal models. Using dynamic programming techniques, we developed a simple computational method for searching the most probable optimal branch-specific models in a practically feasible computational time. We propose three search methods to find the optimal models, which explored O(n) (method 1) to O(n²) (method 2 and method 3) models when the given phylogeny has n branches. In addition, we derived a formula to calculate the number of all possible models, revealing the complexity of finding the optimal branch-specific model. We show that in a reanalysis of over 50 previously published studies, the vast majority obtained better models with significantly higher likelihoods than the conventional hypothesis model methods.
Article
We discuss the relative significance of different functional roles potentially served by flavonoids in photoprotection, with special emphasis on their ability to scavenge reactive oxygen species (ROS) and control the development of individual organs and whole plant. We propose a model in which chloroplast-located flavonoids scavenge H2O2 and singlet oxygen generated under excess light-stress, thus avoiding programmed cell death. We also draw a picture in which vacuolar flavonoids in conjunction with peroxidases and ascorbic acid constitute a secondary antioxidant system aimed at detoxifying H2O2, which may diffuse out of the chloroplast at considerable rates and enter the vacuole following excess light stress-induced depletion of ascorbate peroxidase. We hypothesize for flavonols key roles as developmental regulators in early and current-day land-plants, based on their ability to modulate auxin movement and auxin catabolism. We show that antioxidant flavonoids display the greatest capacity to regulate key steps of cell growth and differentiation in eukaryotes. These regulatory functions of flavonoids, which are shared by plants and animals, are fully accomplished in the nM concentration range, as likely occurred in early land plants. We therefore conclude that functions of flavonoids as antioxidants and/or developmental regulators are of great value in photoprotection. We also suggest that UV-B screening was just one of the multiple functions served by flavonoids when early land-plants faced an abrupt increase in sunlight irradiance.
Article
Successful modification of plant cell-wall composition without compromising plant integrity is dependent on being able to modify the expression of specific genes, but this can be very challenging when the target genes are members of multigene families. 4-coumarate:CoA ligase (4CL) catalyzes the formation of 4-coumaroyl CoA, a precursor of both flavonoids and monolignols, and is an attractive target for transgenic down-regulation aimed at improving agro-industrial properties. Inconsistent phenotypes of transgenic plants have been attributed to variable levels of down-regulation of multiple 4CL genes. Phylogenetic analysis of the sorghum genome revealed 24 4CL(-like) proteins, five of which cluster with bona fide 4CLs from other species. Using a map-based cloning approach and analysis of two independent mutant alleles, the sorghum brown midrib2 (bmr2) locus was shown to encode 4CL. In vitro enzyme assays indicated that its preferred substrate is 4-coumarate. Missense mutations in the two bmr2 alleles result in loss of 4CL activity, probably as a result of improper folding as indicated by molecular modeling. Bmr2 is the most highly expressed 4CL in sorghum stems, leaves and roots, both at the seedling stage and in pre-flowering plants, but the products of several paralogs also display 4CL activity and compensate for some of the lost activity. The contribution of the paralogs varies between developmental stages and tissues. Gene expression assays indicated that Bmr2 is under auto-regulatory control, as reduced 4CL activity results in over-expression of the defective gene. Several 4CL paralogs are also up-regulated in response to the mutation.
Article
The plant enzyme 4-coumarate:coenzyme A ligase (4CL) is part of a family of adenylate-forming enzymes present in all organisms. Analysis of genome sequences shows the presence of '4CL-like' enzymes in plants and other organisms, but their evolutionary relationships and functions remain largely unknown. 4CL and 4CL-like genes were identified by BLAST searches in Arabidopsis, Populus, rice, Physcomitrella, Chlamydomonas and microbial genomes. Evolutionary relationships were inferred by phylogenetic analysis of aligned amino acid sequences. Expression patterns of a conserved set of Arabidopsis and poplar 4CL-like acyl-CoA synthetase (ACS) genes were assayed. The conserved ACS genes form a land plant-specific class. Angiosperm ACS genes grouped into five clades, each of which contained representatives in three fully sequenced genomes. Expression analysis revealed conserved developmental and stress-induced expression patterns of Arabidopsis and poplar genes in some clades. Evolution of plant ACS enzymes occurred early in land plants. Differential gene expansion of angiosperm ACS clades has occurred in some lineages. Evolutionary and gene expression data, combined with in vitro and limited in vivo protein function data, suggest that angiosperm ACS enzymes play conserved roles in octadecanoid and fatty acid metabolism, and play roles in organ development, for example in anthers.