ArticlePDF Available

Cotton genome: challenge into the polyploidy

Authors:
  • Hainan Yazhou Bay Seed Laboratory
Commentary
Cotton genome: challenge into the polyploidy
Zhi-Wen Chen
a
, Jun-Feng Cao
a
, Xiu-Fang Zhang
a
, Xiao-Xia Shangguan
a
, Ying-Bo Mao
a
, Ling-Jian Wang
a
,
Xiao-Ya Chen
a,b,
a
National Key Laboratory of Plant Molecular Genetics and National Center for Plant Gene Research, Institute of Plant Physiology and Ecology, Shanghai Institutes for
Biological Sciences, Chinese Academy of Sciences, Shanghai 200032, China
b
Plant Science Research Center, Shanghai Chenshan Botanical Garden, Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai 201602, China
Cottons are the most important fiber crops in the world. The cot-
ton genus Gossypium has 52 species, including seven allotetraploid
species and 45 diploids. Four species were domesticated and remain
as crops under cultivation today: the New World allopolyploid spe-
cies G. hirsutum and G. barbadense (2n= 52), and the Old World
diploid species G. arboreum and G. herbaceum (2n= 26). The primary
cultivated species is Upland cotton (G. hirsutum L.), which accounts
for more than 90% of global cotton fiber production.
With the rapid development of genome sequencing technolo-
gies, cotton research has advanced greatly in recent several years,
such that nuclear genome sequences have now been published for
the model diploids (D
5
-genome [1,2],A
2
-genome [3]), and for the
allopolyploids (AD
1
-G. hirsutum [4,5],AD
2
-G. barbadense [6]). In
fact, the majority of crops have at least one genome assembly in
the public database, generating a myriad of new data from plant
genomes which are not only incubating new insights into plant
and agricultural sciences, such as plant evolution and domestica-
tion, but also revolutionizing breeding programs.
Firstly, cotton genome sequences contribute to elucidation of
the mechanism of fiber development. Cotton fibers are unusually
long, single-celled epidermal seed trichomes and a model for
plant cell growth. Despite being the most important biological
event, our understanding of the regulation of fiber cell initiation
and elongation remain limited. The tetraploid G. hirsutum genome
harbors 28 members of the HD-ZIP IV subfamily. Genetic evidence
showed that, among these GL2-type homeodomain-containing
transcription factors, GhHOX3 plays a pivotal role in controlling
fiber elongation, and its activity is regulated by the phytohormone
gibberellin [7].
With the facility of genome sequences, some studies tried to
reveal the mechanism underlying fiber development. For example,
phytosterol content as well as the campesterol:sitosterol ratio
influence cotton fiber development [8], and cotton GhMYB7 is pre-
dominantly expressed in developing fiber cells and regulates sec-
ondary cell wall biosynthesis in transgenic Arabidopsis [9].
Furthermore, variations of transposable elements (TEs) may have
played a role in cotton fiber cell development [10], apart from
being responsible for the divergence of genome sizes between
cotton species.
Secondly, these cotton genome references are available to the
genome-wide analysis and expression profiling of the specific gene
family, such as phospholipase D gene family in G. arboreum [11],
NAC transcription factors gene family in diploid Gossypium [12]
and a mass of secondary metabolites such as gossypol, the cotton
phytoalexins against pathogens and herbivores [6,13].
Development of new genomic platforms and release of diploid
and tetraploid cotton genome sequences have greatly facilitated
the GWAS studies to hunt for candidate genes of important traits
(lint yield and fiber quality traits). However, understanding the
molecular mechanisms of fiber development is often hindered by
the complexity of polyploidy. With the coming of the genomics
era, polyploidy is attracting great attention from plant scientists,
especially those of the crop field.
Following whole genome duplication, duplicated genes gener-
ally have several possible fates: both copies may be retained for
dosage balance, one copy is retained and the other copy has
been lost, silenced, or diverged to gain new function (neofunc-
tionalization or subfunctionalization) [14]. Thus one of the
important features to emerge from polyploid is massive alter-
ations in gene expression and function. In every allopolyploid
species examined to date, some fractions of the duplicate gene
pairs (homoeologs) are expressed unequally, as proved in the
allopolyploid cotton genome with the features of asymmetrical
evolution [5]. This suite of unequally expressed genes may vary
among species, tissues, and even cells, and thus is a fundamental
feature of allopolyploids. The genomic and transcriptomic data,
in combination with modern technologies such as gene editing
[15], shall reshape future breeding approaches for supper cotton
cultivars.
Conflict of interest
The authors declare that they have no conflict of interest.
https://doi.org/10.1016/j.scib.2017.11.022
2095-9273/Ó2017 Science China Press. Published by Elsevier B.V. and Science China Press. All rights reserved.
Corresponding author.
E-mail addresses: chenzw@sippe.ac.cn (Z.-W. Chen), caojunfeng@sibs.ac.cn
(J.-F. Cao), xfzhang1990@163.com (X.-F. Zhang), sgxx@sibs.ac.cn (X.-X. Shangguan),
ybmao@sibs.ac.cn (Y.-B. Mao), ljwang@sibs.ac.cn (L.-J. Wang), xychen@sibs.ac.cn
(X.-Y. Chen).
Science Bulletin 62 (2017) 1622–1623
Contents lists available at ScienceDirect
Science Bulletin
journal homepage: www.elsevier.com/locate/scib
Acknowledgments
The research was supported by Chinese Academy of Sciences
(XDB11030000), Ministry of Science and Technology of China
and Ministry of Agriculture of China (2013CB127000,
2016YFA0500800, 2016ZX08009001-009, 2016ZX08005001-001),
National Natural Science Foundation of China (31690092).
References
[1] Wang KB, Wang ZW, Li FG, et al. The draft genome of a diploid cotton
Gossypium raimondii. Nat Genet 2012;44:1098–103.
[2] Paterson AH, Wendel JF, Gundlach H, et al. Repeated polyploidization of
Gossypium genomes and the evolution of spinnable cotton fibres. Nature
2012;492:423–7.
[3] Li FG, Fan GY, Wang KB, et al. Genome sequence of the cultivated cotton
Gossypium arboreum. Nat Genet 2014;46:567–72.
[4] Li FG, Fan GY, Lu CR, et al. Genome sequence of cultivated upland cotton
(Gossypium hirsutum TM-1) provides insights into genome evolution. Nat
Biotechnol 2015;33:524–30.
[5] Zhang TZ, Hu Y, Jiang WK, et al. Sequencing of allotetraploid cotton (Gossypium
hirsutum L. acc. TM-1) provides a resource for fiber improvement. Nat
Biotechnol 2015;33:531–7.
[6] Liu X, Zhao B, Zheng HJ, et al. Gossypium barbadense genome sequence provides
insight into the evolution of extra-long staple fiber and specialized
metabolites. Sci Rep-UK 2015;5:14139.
[7] Shan CM, Shangguan XX, Zhao B, et al. Control of cotton fibre elongation by a
homeodomain transcription factor GhHOX3. Nat Commun 2014;5:5519.
[8] Deng SS, Wei T, Tan KL, et al. Phytosterol content and the campesterol:
sitosterol ratio influence cotton fiber development: role of phytosterols in cell
elongation. Sci China, Ser C Life Sci 2016;59:183–93.
[9] Huang JF, Chen F, Wu SY, et al. Cotton GhMYB7 is predominantly expressed in
developing fibers and regulates secondary cell wall biosynthesis in transgenic
Arabidopsis. Sci China, Ser C Life Sci 2016;59:194–205.
[10] Wang K, Huang G, Zhu YX. Transposable elements play an important role
during cotton genome evolution and fiber cell development. Sci China, Ser C
Life Sci 2016;59:112–21.
[11] Tang K, Dong CJ, Liu JY. Genome-wide analysis and expression profiling of the
phospholipase D gene family in Gossypium arboreum. Sci China, Ser C Life Sci
2016;59:130–41.
[12] Shang HH, Wang ZN, Zou CS, et al. Comprehensive analysis of NAC
transcription factors in diploid Gossypium: sequence conservation and
expression analysis uncover their roles during fiber development. Sci China,
Ser C Life Sci 2016;59:142–53.
[13] Tian X, Ruan JX, Huang JQ, et al. Gossypol: phytoalexin of cotton. Sci China, Ser
C Life Sci 2016;59:122–9.
[14] Wendel JF. The wondrous cycles of polyploidy in plants. Am J Bot
2015;102:1753–6.
[15] Liu WS, Zhu XH, Lei MG, et al. A detailed procedure for CRISPR/Cas9-mediated
gene editing in Arabidopsis thaliana. Sci Bull 2015;60:1332–47.
Z.-W. Chen et al. / Science Bulletin 62 (2017) 1622–1623 1623
... For the emergence of homologous genes in a single gene family after duplication, homologous genes generally have different fates: both genes retained for dosage effect or diverged to produce new function [58,59]. In METTL3 mutants, the translation of mRNAs containing m 6 A modifications in the 5'UTR was reduced, indicating translation efficiency was influenced by 5'UTR m 6 A in cells [60,61]. ...
Article
Full-text available
N6-methyladenosine (m 6 A) RNA modification plays important regulatory roles in plant development and adapting to the environment, which requires methyltransferases to achieve the methylation process. However, there has been no research regarding m 6 A RNA methyltransferases in cotton. Here, a systematic analysis of the m 6 A methyltransferase (METTL) gene family was performed on twelve cotton species, resulting in six METTLs identified in five allotetraploid cottons, respectively, and three to four METTLs in the seven diploid species. Phylogenetic analysis of protein-coding sequences revealed that METTL genes from cottons, Arabidopsis thaliana, and Homo sapiens could be classified into three clades (METTL3, METTL14, and METTL-like clades). Cis-element analysis predicated the possible functions of METTL genes in G. hirsutum. RNA-seq data revealed that GhMETTL14 (GH_A07G0817/GH_D07G0819) and GhMETTL3 (GH_A12G2586/GH_D12G2605) had high expressions in root, stem, leaf, torus, petal, stamen, pistil, and calycle tissues. GhMETTL14 also had the highest expression in 20 and 25 dpa fiber cells, implying a potential role at the cell wall thickening stage. Suppressing GhMETTL3 and GhMETTL14 by VIGS caused growth arrest and even death in G. hirsutum, along with decreased m 6 A abundance from the leaf tissues of VIGS plants. Overexpression of GhMETTL3 and GhMETTL14 produced distinct differentially expressed genes (DEGs) in A. thaliana, indicating their possible divergent functions after gene duplication. Overall, GhMETTLs play indispensable but divergent roles during the growth of cotton plants, which provides the basis for the systematic investigation of m 6 A in subsequent studies to improve the agronomic traits in cotton.
... The GRF genes have been investigated in a number of main crops, including Zea mays (Zhang et al., 2008), Brassica napus (Liu et al., 2012), Brassica rapa , Oryza sativa (Choi et al., 2004;Kuijt et al., 2014;Luo et al., 2005), Gossypium hirsutum (Cao et al., 2020a;Chen et al., 2017), Brachypodium distachyon (Filiz et al., 2014) and Solanum lycopersicum (Khatun et al., 2017). We identified 8 GRF genes in the sorghum genome and illustrated that the gene family was conserved among different plant species. ...
Article
Sorghum (Sorghum bicolor) is the fifth important cereal and an industrial energy crop in the world. Growth Regulation Factors (GRFs) play an important role in response to environmental stress, however, the knowledge of GRFs relating to the pest resistance is lacking. Here, we identified 8 GRF genes harboring the typical QLQ (glutamine, leucine, glutamine) and WRC (tryptophan, arginine, cysteine) domains in Sorghum, which could be classified into 4 clades through phylogenetic analysis. The SbGRF genes express in most tissues, while more than half of them express at the highest level in inflorescence. To further investigate their possible role in stress response, we analyzed the transcriptomics data. The results showed that SbGRFs could respond to the abiotic stresses including heat, salt and drought stress. Furthermore, combined the data with qRT-PCR, SbGRF1, 2, 4 and 7 were identified as dominant genes response to the aphid-induced stress. SSR markers close to these genes were also searched. Above all, we summarized the SbGRFs and provided their potential roles in aphid response.
... The cotton genus Gossypium) contains more than 50 species, of which the cultivated species are the most important fiber crops in the world [29][30][31]. Recent advances in cotton genomics have produced the resources necessary to analyze gene families in Gossypium. ...
Article
Full-text available
Background The cryptochromes (CRY) are specific blue light receptors of plants and animals, which play crucial roles in physiological processes of plant growth, development, and stress tolerance. Results In the present work, a systematic analysis of the CRY gene family was performed on twelve cotton species, resulting in 18, 17, 17, 17, and 17 CRYs identified in five alloteraploid cottons (Gossypium hirsutum, G. barbadense, G. tomentosum, G. mustelinum and G. darwinii), respectively, and five to nine CRY genes in the seven diploid species. Phylogenetic analysis of protein-coding sequences revealed that CRY genes from cottons and Arabidopsis thaliana could be classified into seven clades. Synteny analysis suggested that the homoeolog of G. hirsutum Gh_A02G0384 has undergone an evolutionary loss event in the other four allotetraploid cotton species. Cis-element analysis predicated the possible functions of CRY genes in G. hirsutum. RNA-seq data revealed that Gh_D09G2225, Gh_A09G2012 and Gh_A11G1040 had high expressions in fiber cells of different developmental states. In addition, the expression levels of one (Gh_A03G0120), 15 and nine GhCRY genes were down-regulated following the PEG, NaCl and high-temperature treatments, respectively. For the low-temperature treatment, five GhCRY genes were induced, and five were repressed. These results indicated that most GhCRY genes negatively regulate the abiotic stress treatments. Conclusion We report the structures, domains, divergence, synteny, and cis-elements analyses systematically of G. hirsutum CRY genes. Possible biological functions of GhCRY genes in differential tissues as well as in response to abiotic stress during the cotton plant life cycle were predicted.
... Upland cotton (Gossypium hirsutum L.), an allopolyploid species, is one of the most economic-important crops, since it provides overwhelming production of cotton fiber in the world (Chen et al. 2017). Two-version upland cotton genomes have been sequentially released in 2015 (Li et al. 2015) and 2019 (Hu et al. 2019). ...
Article
Full-text available
Vacuolar processing enzymes (VPEs) play important roles in plant development, programmed cell death, and the responsiveness to biotic and abiotic stresses. To characterize the VPEs in upland cotton (Gossypium hirsutum), the VPE gene family within four Gossypium species, consisting of G. hirsutum, G. barbadense, G. arboreum, and G. raimondii, together with Arabidopsis thaliana, was comparatively analyzed at the genome-wide level. As a result, a total of 43 VPEs were identified, including 13 GhVPEs, 12 GbVPEs, 7 GaVPEs, and 7 GrVPEs, which are evenly distributed with one gene on a chromosome from four Gossypium species, respectively. The phylogenetic tree showed that the identified VPEs within the four Gossypium species could be categorized into β-type, δ-type, and γ-type VPE clades. Collinearity analysis presented 36 of intraspecies VPE-pairs and 152 of interspecies VPE-pairs, respectively, which are included in synteny blocks on chromosome. These results indicate that VPE duplication events have accorded well with the whole genome duplication. And expression profiles of GhVPEs in G. hirsutum seedlings demonstrated that the GhVPEs from the same clade are not necessarily identical in the pattern of transcriptional expression. Upon abiotic stresses (i.e., waterlogging and salt treatments), three GhVPEs (i.e., Ghir_A05G004610, Ghir_A09G011870, and Ghir_D09G011410) were significantly upregulated in their expression amounts, respectively. The GhVPE genes that presented inducible expression under some abiotic stresses may be applied to the improvement of resilience to abiotic stresses for the cultivated cottons.
... For instance, unequal expressions were observed in A and D-subgenome genes, including GhARF3c, GhARF16c, GhARF18a and GhARF20. These massive alterations in gene expression can cause distinct function and may just be one of the important features emerging [8,51]. During the evolution of allopolyploid, some duplicate gene pairs (homoeologs) are expressed unequally, as also proved in the allopolyploid cotton genome with the features of asymmetrical evolution [42]. ...
Article
Full-text available
Background Cotton fiber is a model system for studying plant cell development. At present, the functions of many transcription factors in cotton fiber development have been elucidated, however, the roles of auxin response factor (ARF) genes in cotton fiber development need be further explored. Results Here, we identify auxin response factor (ARF) genes in three cotton species: the tetraploid upland cotton G. hirsutum , which has 73 ARF genes, and its putative extent parental diploids G. arboreum and G. raimondii , which have 36 and 35 ARFs, respectively. Ka and Ks analyses revealed that in G. hirsutum ARF genes have undergone asymmetric evolution in the two subgenomes. The cotton ARFs can be classified into four phylogenetic clades and are actively expressed in young tissues. We demonstrate that GhARF2b , a homolog of the Arabidopsis AtARF2 , was preferentially expressed in developing ovules and fibers. Overexpression of GhARF2b by a fiber specific promoter inhibited fiber cell elongation but promoted initiation and, conversely, its downregulation by RNAi resulted in fewer but longer fiber. We show that GhARF2b directly interacts with GhHOX3 and represses the transcriptional activity of GhHOX3 on target genes. Conclusion Our results uncover an important role of the ARF factor in modulating cotton fiber development at the early stage.
... The genus Gossypium encompasses more than 52 species, but only 4 have been domesticated over a period of 1-2 million years of polyploidization (Wendel 1989). The widely cultivated New World (NW) allotetraploid (2n = 52), G. hirsutum and G. barbadense, consist of a pair of A and D sub-genomes; whereas, the Old World (OW) G. arboreum and G. herbaceum have diploid (2n = 26) A or D genomes (Chen et al. 2015). The primary source of more than 90% of fiber output is the most widely cultivated G. hirsutum. ...
Chapter
Full-text available
Cotton has a tremendous economic value worldwide due to its high-quality fiber, edible oil and protein contents. However, the intensifying scenario of human population expansion and global environmental changes demand a proportionate increase in cotton production. In the past, several successful attempts have been made by introgression of many quality- and yield-related traits into elite cotton cultivars through conventional breeding. However, those measures are time consuming due to the reliance on introgression of naturally-existing genetic variation through extensive backcrossing. Nonetheless, plant breeding can be accelerated through modern genome editing (GE) tools. Various GE techniques including zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), and clustered regularly interspaced palindromic repeats and CRISPR-associated proteins systems (CRISPR/Cas)-based approaches have been successfully employed for various crop plants. Among them, CRISPR/Cas-based approaches hold great GE potential due to their simplicity, competency and versatility. In cotton, this system can regulate the gene expression associated with quality traits, to circumscribe phytopathogens and/or to stack molecular traits at a desired locus. In gene stacking through site-specific endonucleases, the desired genes can be introduced in close proximity to a specific locus in the cotton genome with a low risk of segregation. However, such executions are tedious to achieve through classical breeding techniques. Moreover, through the CRISPR/Cas-based approaches, transgene-free cotton plants can easily be produced by selfing or backcrossing to meet the current genetically modified organisms (GMO) guidelines. In this chapter, we address the potential application of CRISPR/Cas-based approaches in available whole cotton genomes to sustain cotton productivity, and achieve genetic improvement, pathogen resistance and agronomic traits. Future prospects of GE applications in cotton breeding are also addressed.
... Cotton is the most important fiber crop in the world and comprises 52 Gossypium species including 7 allotetraploid species (amphidiploids [AD] genome [2n = 52]) and 45 diploid species (2n = 26). 1,2 The allopolyploid species, Gossypium hirsutum L (AD1 genome) and Gossypium barbadense (AD2 genome), have been domesticated and G hirsutum cultivars alone account for more than 90% of global cotton fiber production. [3][4][5] However, the high genetic similarity among G hirsutum accessions has hindered opportunities for breeding new cotton cultivars with improved agricultural traits such as higher yield, ease of harvest, and stronger resistance to pest, diseases, and environmental stresses. ...
Article
Full-text available
Determining the genetic rearrangement and domestication footprints in Gossypium hirsutum cultivars and primitive race genotypes are essential for effective gene conservation efforts and the development of advanced breeding molecular markers for marker-assisted breeding. In this study, 94 accessions representing the 7 primitive races of G hirsutum, along with 9 G hirsutum and 12 Gossypium barbadense cultivated accessions were evaluated. The genotyping-by-sequencing (GBS) approach was employed and 146 558 single nucleotide polymorphisms (SNP) were generated. Distinct SNP signatures were identified through the combination of selection scans and association analyses. Phylogenetic analyses were also conducted, and we concluded that the Latifolium, Richmondi, and Marie-Galante race accessions were more genetically related to the G hirsutum cultivars and tend to cluster together. Fifty-four outlier SNP loci were identified by selection-scan analysis, and 3 SNPs were located in genes related to the processes of plant responding to stress conditions and confirmed through further genome-wide signals of marker-phenotype association analysis, which indicate a clear selection signature for such trait. These results identified useful candidate gene locus for cotton breeding programs.
Article
Plant trichomes are specialized epidermal cells that protect plants from insects and pathogens. In Arabidopsis, epidermal hairs decrease as internodes increase in height, with only few epidermal hairs produced on the sepals abaxial surface of the early flowers. TRIPTYCHON (TRY) is known to be a negative regulator of epidermal hair development in Arabidopsis, suppressing the formation of ectopic epidermal hairs in the inflorescence. Here, we reported that the second intron of TRY gene plays a critical role in trichome spatial distribution in Arabidopsis. The expression of TRY rises with the increasing stem nodes and reaches the peak in the inflorescence, while the trichomes distribution decrease. The transgenic plants showed that TRY promoter could only drive the genomic instead of coding sequences combined with GUS reporter gene, which indicates that the regulatory elements of TRY expression in inflorescence could be located in the intron regions. Multiple SPLs and MADS-box binding sites were found in the TRY intron2 sequence. Further genetic and biochemistry assays revealed that the flowering-related genes such as SPL9 could bind to these cis-elements directly, contributing to the TRY spatial expression. Since cotton fiber and Arabidopsis trichomes share similar regulatory mechanism, extended analysis showed that the intron2 of cotton TRY genes also contain the cis-elements. Thus, the introns harboring the transcription element may be the general way to regulate the gene expression in different plants and provides molecular clues for the related crops’ traits design.
Article
Cotton encounters long-term drought stress problems resulting in major yield losses. Transcription factors (TFs) plays an important role in response to biotic and abiotic stresses. The coexpression patterns of gene networks associated with drought stress tolerance were investigated using transcriptome profiles. Applying a weighted gene co-expression network analysis, we discovered a salmon module with 144 genes strongly linked to drought stress tolerance. Based on coexpression and RT-qPCR analysis GH_D01G0514 was selected as the candidate gene because of its consistent expression in response to drought stress in both leaves and roots. Moreover, coexpression network analysis also identifies it as a hub gene in both tissues. For validation of GH_D01G0514, Virus Induced Gene Silencing was performed and VIGS plants showed significantly higher excised leaf water loss and ion leakage, while lower relative water and chlorophyll contents as compared to WT (Wild type) and positive control plants. Furthermore, the WT and positive control seedlings showed higher in CAT and SOD, and low in Hydrogen peroxide and MDA enzymes than the VIGS plants. Gh_D01G0514 (GhNAC072) was localized in the nucleus and cytoplasm. Y2H assay demonstrates that Gh_D01G0514 has a potential of auto activation. It was observed that the Gh_D01G0514 was highly upregulated in both tissues based on RNA Seq and RT-qPCR analysis. Thus, we inferred that, the candidate gene might be responsible for drought stress tolerance in cotton. This finding adds significantly to the existing knowledge of drought stress tolerance in cotton and deep molecular analysis required to understand the molecular mechanisms underlying drought stress tolerance in cotton.
Article
Full-text available
Of the two cultivated species of allopolyploid cotton, Gossypium barbadense produces extra-long fibers for the production of superior textiles. We sequenced its genome (AD) 2 and performed a comparative analysis. We identified three bursts of retrotransposons from 20 million years ago (Mya) and a genome-wide uneven pseudogenization peak at 11-20 Mya, which likely contributed to genomic divergences. Among the 2,483 genes preferentially expressed in fiber, a cell elongation regulator, PRE1, is strikingly At biased and fiber specific, echoing the A-genome origin of spinnable fiber. The expansion of the PRE members implies a genetic factor that underlies fiber elongation. Mature cotton fiber consists of nearly pure cellulose. G. barbadense and G. hirsutum contain 29 and 30 cellulose synthase (CesA) genes, respectively; whereas most of these genes (>25) are expressed in fiber, genes for secondary cell wall biosynthesis exhibited a delayed and higher degree of up-regulation in G. barbadense compared with G. hirsutum, conferring an extended elongation stage and highly active secondary wall deposition during extra-long fiber development. The rapid diversification of sesquiterpene synthase genes in the gossypol pathway exemplifies the chemical diversity of lineage-specific secondary metabolites. The G. barbadense genome advances our understanding of allopolyploidy, which will help improve cotton fiber quality.
Article
Full-text available
The newly developed CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats)/Cas (CRISPR-associated) system has emerged as an efficient tool for genome-editing, providing an alternative to classical mutagenesis and transgenic methods to study gene function and improve crop traits. CRISPR/Cas facilitates targeted gene editing through RNA-guided DNA cleavage followed by cellular DNA repair mechanisms that introduce sequence changes at the site of cleavage. Here we describe a detailed procedure for our previously developed and highly efficient CRISPR/Cas9 method that allows the generation of heritable-targeted gene mutations and corrections in Arabidopsis. This protocol describes the strategies and steps for the selection of targets, design of single-guide RNA (sgRNA), vector construction and analysis of transgenic lines. We also offer a method to target two loci simultaneously using vectors containing two different sgRNAs. The principles described in this protocol can be applied to other plant species to generate stably inherited DNA modifications. © 2015, Science China Press and Springer-Verlag Berlin Heidelberg.
Article
Full-text available
Gossypium hirsutum has proven difficult to sequence owing to its complex allotetraploid (AtDt) genome. Here we produce a draft genome using 181-fold paired-end sequences assisted by fivefold BAC-to-BAC sequences and a high-resolution genetic map. In our assembly 88.5% of the 2,173-Mb scaffolds, which cover 89.6%∼96.7% of the AtDt genome, are anchored and oriented to 26 pseudochromosomes. Comparison of this G. hirsutum AtDt genome with the already sequenced diploid Gossypium arboreum (AA) and Gossypium raimondii (DD) genomes revealed conserved gene order. Repeated sequences account for 67.2% of the AtDt genome, and transposable elements (TEs) originating from Dt seem more active than from At. Reduction in the AtDt genome size occurred after allopolyploidization. The A or At genome may have undergone positive selection for fiber traits. Concerted evolution of different regulatory mechanisms for Cellulose synthase (CesA) and 1-Aminocyclopropane-1-carboxylic acid oxidase1 and 3 (ACO1,3) may be important for enhanced fiber production in G. hirsutum.
Article
The secondary cell wall in mature cotton fibers contains over 90% cellulose with low quantities of xylan and lignin. However, little is known regarding the regulation of secondary cell wall biosynthesis in cotton fibers. In this study, we characterized an R2R3-MYB transcription factor, GhMYB7, in cotton. GhMYB7 is expressed at a high level in developing fibers and encodes a MYB protein that is targeted to the cell nucleus and has transcriptional activation activity. Ectopic expression of GhMYB7 in Arabidopsis resulted in small, curled, dark green leaves and also led to shorter inflorescence stems. A cross-sectional assay of basal stems revealed that cell wall thickness of vessels and interfascicular fibers was higher in transgenic lines overexpressing GhMYB7 than in the wild type. Constitutive expression of GhMYB7 in Arabidopsis activated the expression of a suite of secondary cell wall biosynthesis-related genes (including some secondary cell wall-associated transcription factors), leading to the ectopic deposition of cellulose and lignin. The ectopic deposition of secondary cell walls may have been initiated before the cessation of cell expansion. Moreover, GhMYB7 was capable of binding to the promoter regions of AtSND1 and AtCesA4, suggesting that GhMYB7 may function upstream of NAC transcription factors. Collectively, these findings suggest that GhMYB7 is a potential transcriptional activator, which may participate in regulating secondary cell wall biosynthesis of cotton fibers.
Article
Sesquiterpenoids are a class of 15-carbon secondary metabolites that play diverse roles in plant adaptation to environment. Cotton plants accumulate a large amount of sesquiterpene aldehydes (including gossypol) as phytoalexins against pathogens and herbivores. They are stored in pigment glands of aerial organs and in epidermal layers of roots. Several enzymes of gossypol biosynthesis pathway have been characterized, including 3-hydroxy-3-methylglutaryl coenzyme A reductase (HMGR) and farnesyl diphosphate synthase (FPS) that catalyze the formation of the precursor farnesyl diphosphate (FPP), (+)-δ-cadinene synthase (CDN) which is the first enzyme committed to gossypol biosynthesis, and the downstream enzymes of CYP706B1 and methyltransferase. Expressions of these genes are tightly regulated during cotton plants development and induced by jasmonate and fungi elicitors. The transcription factor GaWRKY1 has been shown to be involved in gossypol pathway regulation. Recent development of new genomic platforms and methods and releases of diploid and tetraploid cotton genome sequences will greatly facilitate the elucidation of gossypol biosynthetic pathway and its regulation.
Article
Phytosterols play an important role in plant growth and development, including cell division, cell elongation, embryogenesis, cellulose biosynthesis, and cell wall formation. Cotton fiber, which undergoes synchronous cell elongation and a large amount of cellulose synthesis, is an ideal model for the study of plant cell elongation and cell wall biogenesis. The role of phytosterols in fiber growth was investigated by treating the fibers with tridemorph, a sterol biosynthetic inhibitor. The inhibition of phytosterol biosynthesis resulted in an apparent suppression of fiber elongation in vitro or in planta. The determination of phytosterol quantity indicated that sitosterol and campesterol were the major phytosterols in cotton fibers; moreover, higher concentrations of these phytosterols were observed during the period of rapid elongation of fibers. Furthermore, the decrease and increase in campesterol:sitosterol ratio was associated with the increase and decease in speed of elongation, respectively, during the elongation stage. The increase in the ratio was associated with the transition from cell elongation to secondary cell wall synthesis. In addition, a number of phytosterol biosynthetic genes were down-regulated in the short fibers of ligon lintless-1 mutant, compared to its near-isogenic wild-type TM-1. These results demonstrated that phytosterols play a crucial role in cotton fiber development, and particularly in fiber elongation.
Article
Determining how function evolves following gene duplication is necessary for understanding gene expansion. Transcription factors (TFs) are a class of proteins that regulate gene expression by binding to specific cis-acting elements in the promoters of target genes, subsequently activating or repressing their transcription. In the present study, we systematically examined the functional diversification of the NAC transcription factor (NAC-TFs) family by analyzing their chromosomal location, structure, phylogeny, and expression pattern in Gossypium raimondii (Gr) and G. arboreum (Ga). The 145 and 141 NAC genes identified in the Gr and Ga genomes, respectively, were annotated and divided into 18 subfamilies, which showed distinct divergence in gene structure and expression patterns during fiber development. In addition, when the functional parameters were examined, clear divergence was observed within tandem clusters, which suggested that subfunctionalization had occurred among duplicate genes. The expression patterns of homologous gene pairs also changed, suggestive of the diversification of gene function during the evolution of diploid cotton. These findings provide insights into the mechanisms underlying the functional differentiation of duplicated NAC-TFs genes in two diploid cotton species.
Article
The plant phospholipase D (PLD) plays versatile functions in multiple aspects of plant growth, development, and stress responses. However, until now, our knowledge concerning the PLD gene family members and their expression patterns in cotton has been limited. In this study, we performed for the first time the genome-wide analysis and expression profiling of PLD gene family in Gossypium arboretum, and finally, a total of 19 non-redundant PLD genes (GaPLDs) were identified. Based on the phylogenetic analysis, they were divided into six well-supported clades (α, β/γ, δ, ε, ζ and φ). Most of the GaPLD genes within the same clade showed the similar exon-intron organization and highly conserved motif structures. Additionally, the chromosomal distribution pattern revealed that GaPLD genes were unevenly distributed across 10 of the 13 cotton chromosomes. Segmental duplication is the major contributor to the expansion of GaPLD gene family and estimated to have occurred from 19.61 to 20.44 million years ago when a recent large-scale genome duplication occurred in cotton. Moreover, the expression profiling provides the functional divergence of GaPLD genes in cotton and provides some new light on the molecular mechanisms of GaPLDα1 and GaPLDδ2 in fiber development.
Article
Transposable elements (TEs) usually occupy largest fractions of plant genome and are also the most variable part of the structure. Although traditionally it is hallmarked as "junk and selfish DNA", today more and more evidence points out TE's participation in gene regulations including gene mutation, duplication, movement and novel gene creation via genetic and epigenetic mechanisms. The recently sequenced genomes of diploid cottons Gossypium arboreum (AA) and Gossypium raimondii (DD) together with their allotetraploid progeny Gossypium hirsutum (AtAtDtDt) provides a unique opportunity to compare genome variations in the Gossypium genus and to analyze the functions of TEs during its evolution. TEs accounted for 57%, 68.5% and 67.2%, respectively in DD, AA and AtAtDtDt genomes. The 1,694 Mb A-genome was found to harbor more LTR(long terminal repeat)-type retrotransposons that made cardinal contributions to the twofold increase in its genome size after evolution from the 775.2 Mb D-genome. Although the 2,173 Mb AtAtDtDt genome showed similar TE content to the A-genome, the total numbers of LTR-gypsy and LTR-copia type TEs varied significantly between these two genomes. Considering their roles on rewiring gene regulatory networks, we believe that TEs may somehow be involved in cotton fiber cell development. Indeed, the insertion or deletion of different TEs in the upstream region of two important transcription factor genes in At or Dt subgenomes resulted in qualitative differences in target gene expression. We suggest that our findings may open a window for improving cotton agronomic traits by editing TE activities.