PosterPDF Available

Highly accelerated rates of genomic rearrangements and nucleotide substitutions in plastid genomes of Passiflora subgenus Decaloba

Authors:
Highly'accelerated'rates'of'genomic'rearrangements'and'nucleotide'
substitutions'in'plastid'genomes'of'Passiflora'subgenus'Decaloba!
Bikash Shresthaa,, Mao-Lun Wengb, Edward C. Theriota, Lawrence E. Gilberta, Tracey A. Ruhlmana, Shawn E. Krosnickc, Robert K. Jansena
aDepartment of Integrative Biology, University of Texas at Austin, Austin, TX, USA; bDepartment of Biology, Westfield State University, Westfield, MA, USA; cDepartment of Biology, Tennessee Tech University, Cookeville, TN, USA
Passiflora
is included among a small group of angiosperm taxa with
highly rearranged plastid genomes (plastomes) that exhibit a syndrome
of features, including biparental inheritance and plastome-genome
incompatibility. However, plastid genomic studies in
Passiflora
are
limited in taxon sampling and confined mainly to the largest subgenus
Passiflora
. To examine phylogenetic extent of plastome rearrangements
in Passifloraceae, 16 new plastomes of species across all five
subgenera and an outgroup
Adenia
were completed. With these newly
completely plastomes along with previously published
Passiflora
plastomes, this study addresses four major questions:
i)Does expanded taxon sampling of plastomes resolve the
incongruence with previously published phylogenies?
ii)Which protein coding genes provide the most phylogenetic
signal?
iii)What is the extent of genomic rearrangement and its
phylogenetic distribution?
iv)What are the patterns of nucleotide substitutions in protein
coding genes?
Introduction
Species
ycf1*
trnN
trnR+
rrn5
rrn4.5
rrn23
trnA
trnI
rrn16
trnV
rps12-3'
rps7
ndhB
trnL
ycf2
rpl23
rpl2
rps19*
Populus ycf1* + + + + + + + + + + + + + + + + + + rpl22*
A.)mannii + + + + + + + + + + + + + + + + + + +
P.)pittieri rps15 + + + + + + + + + + + + + + + + rpl23* - -
P.)arbelaezii rps15 + + + + + + + + + + + + + + + + + + +
P.)foetida + + + + + + + + + + + + + + + + + + +
P.)menispermifolia - - - - - - - - - - + + + +
ycf2Ψ+ + + +
P.)actinia)+)10)others
ycf1Ψ+ + + + + + + + + + + + + ycf2Ψ+ + + +
P.)contracta + + + + + + + + + + + + + + + + psbA trnI
rpl23*
P.)tetrandra + + + + + + + + + + + + + + + + - - -
P.)microstipula + + + + + + + + + + + + + + + + + + +
P.)obovata - - - - - - - - - - + + + + + + + +
P.)auriculata ndhF*
ycf1Ψ+ + + + + + + + + + + + + ycf2Ψ+ + + + rps3 rpl16 rpl14 rps8 rpl36 rps11 rpoA petD petB psbH psbN psbT psbB clpP rps12-5' atpB atpE trnM trnV rbcL accD psaI ycf4 cemA petA
P.)jatunsachensis/rufa ndhF*
ycf1Ψ+ + + + + + + + + + + + + ycf2Ψ+ + + + rps3 rpl16 rpl14 rps8 rpl36 rps11 rpoA petD petB psbH psbN psbT psbB clpP rps12-5' atpB atpE trnM trnV rbcL accD
P.)tenuiloba
ycf1Ψ+ + + + + + + + + + + + + ycf2Ψ+ + + + rps3 rpl16 rpl14 rps8 rpl36 rps11 rpoAΨpetD petB psbH psbN psbT psbB clpP rps12-5' atpB atpE trnM trnV rbcL accD psaI
P.suberosa ndhF*
ycf1Ψ+ + + + + + + + + + + + + ycf2Ψ+ + + + rps3 rpl16 rpl14 rps8 rpl36 rps11 rpoAΨpetD petB psbH psbN psbT psbB clpP rps12-5' atpB atpE trnM trnV rbcL accD psaI
P.)lutea ndhF*
ycf1Ψ+ + + + + + + + + + + + + ycf2Ψ+ + + + rps3 rpl16 rpl14 rps8 rpl36 rps11 rpoA petD petB psbH psbN psbT psbB clpP rps12-5' atpB atpE trnM trnV rbcL accD psaI ycf4 cemA*
P.filipes ndhF*
ycf1Ψ+ + + + + + + + + + + + + ycf2Ψ+ + + + rps3 rpl16 rpl14 rps8 rpl36 rps11 rpoA petD*
P.)affinis rps15
ycf1Ψ+ + + + + + + + + + + + + ycf2Ψ+ + + + rps3 rpl16 rpl14 rps8 rpl36 rps11 rpoA petD*
P.)biflora
ycf1Ψ+ + + + + + + + + + + + + ycf2Ψ+ + + + rps3 rpl16 rpl14 rps8 rpl36 rps11 rpoA petD*
P.misera
ycf1Ψ+ + + + + + + + + + + + + ycf2Ψ+ + + + rps3 rpl16 rpl14 rps8 rpl36 rps11 rpoA
0.02
P. filipes
P. edulis
Populus
P. actinia
P. lutea
P. oerstedii
P. foetida
P. tenuiloba
P. auriculata _S
P. cincinnata
P. cirrhifolia
P. contracta
P. arbelaezii
P. microstipula
P. menispermifolia
P. nitida
P. jatusachensis
P. laurifolia
P. serratodigitata
P. retipetala
P. vitifolia
A. mannii
P. suberosa
P. serratifolia
P. misera
P. biflora
P. rufa
P. tetrandra
P. obovata
P. quadrangularis
P. affinis
P. ligularis
P. auriculata_FG
P. pittieri
9 4
9 7
9 9
3 3
9 3
4 4
9 9
9 8
e,A
a
a, j
a, c, Y, L
b, h
c, Z
c, X
c, E, K
c
c
d
f, i , j, C
f
g, F
c, B
k, D, K
Gene loss
a. rpl20
b. rpl22
c. rpl32
d. rps7
e. rps16
f. ycf1/ycf2
g. rpoA
1, 8-9
9, 13-14
3-6
11-13
16
11-13
10-12
5
Intron loss
h. atpF
i. rpoC1
j. clpP
k. rpl16
D
J
IR expansion
A. JLB rps19
B. rps19 rpl22*
C. JLBrpoA
D. rpoApetD*
E. petD* accD
F. petD*psaI
G. petD*cemA*
H. accDpetA
I. JSA ycf1
J. JSA rps15
K. JSB ndhF*
L. JLApsbA
IR contraction
X. JSArps12-3’
Y. J LBtrnI
Z. JLBrpl23*
Gene blocks
1. psbA – trnH-GUG
2. trnK-UUU psbI
3. trnS-GCU – trnR-UCU
4. atpAatpH
5. atpI – rpoB
6. trnC-GCA – ycf3
7. rps4 – ndhC
8. trnM-CAU – trnV-UAC
9. atpBatpE
10. rbcL
11. psbE – accD
12. rpl20 – petL
13. clpP – rps12-5’
14. psbB – petD
15. rpoA – ycf1
16. ndhFndhH
K
H
Y
J
G
NA
6-7, 10-13,
8-9
3-6, 8-9
8-9
4-7, 12
X
A
C
B
Genome sequencing, assembly and annotation
Plastomes for sixteen Passifloraceae species were sequenced using Illlumina data and assembled
de novo
using Velvet and
annotated with DOGMA/GeSeq and tRNAscan-SE.
Phylogenetic analysis
Sixty-eight protein-coding plastid genes shared by all
Passiflora
and outgroups,
Adenia
and
Populus,
were used to infer
maximum likelihood (ML) tree using IQ-TREE v1.5.2.
Phylogenetic informativeness
PhyDesign was used to estimate the phylogenetic informativeness profile for 68 protein-coding plastid genes using a relative-
time ultrametric tree.
Nucleotide substitution rate analyses
Pairwise estimation: Pairwise synonymous (
dS
) and non-synonymous (
dN
) substitution rates were calculated between the
outgroup
Populus trichocarpa
and all other species using PAML.
Lineage-specific rate analysis: Test for significantly different substitution rates (
dN
and
dS
) and
dN/dS
on the branch leading to
IR expansion within subgenus
Decaloba
, branch models were used in HyPhy and PAML.
Methods
Figure. 1. Distribution of
Passiflora
plastome rearrangements on maximum likelihood tree. Horizontal bar indicates the expected number substitutions per site. Pseudogenes are indicated by
(Ψ ). Species names are color-coded to indicate their generic or subgeneric placement: Red (
Adenia
), cyan (
Astrophea
), orange (
Deidamioides
)
,
blue (
Passiflora
), purple
(
Tetrapathea
), and
green (
Decaloba
).
Results
Expanded taxon sampling generated trees congruent
with previously published phylogenies (Fig. 1A)
.
Ten most phylogenetic informative genes (Fig. 2)
generated a tree congruent with the tree using 68
plastid genes.
Extensive plastome rearrangements were found within
the genus, specifically subgenus
Decaloba
, some of
which occurred multiple times (Fig. 1A-C).
Gene-specific rate heterogeneity was found for
clpP
and
ribosomal subunits genes primarily in subgenus
Decaloba
(Fig. 3)
.
Figure 3. Boxplot showing the variation in pairwise
dS
and
dN,
and
dN
/
dS
for the
Adenia
+
Passiflora
species Outliers for
each category were labeled with gene names. Clades (A-E) used to group species to calculate statistical significance are
labeled on right-hand side.
Conclusions
Acknowledgements
00.0050.010.0150.020.0250.030.035
1012299.4
2024598.8
3036898.2
4049197.6
5061497
6073796.4
7086095.8
8098395.2
9110694.6
atpAccsAmatK ndhDndhF rbcLrpoB rpoC1rpoC2 ycf4
clpP%
Figure 2. Net phylogenetic informativeness profile for 68 plastid
genes. Ten genes with the greatest informativeness are color-coded.
Read more here!
P. gabrielleana
P. microstipula
dS
dN
dN/dS
0.0 0.2 0.4 0.6 0.8 0.0 0.2 0.4 0.6 0.0 0.5 1.0 1.5
A.mannii
P.arbelaezii
P.pittieri
P.actinia
P.cincinnata
P.edulis
P.foetida
P.laurifolia
P.ligularis
P.menispermifolia
P.nitida
P.oerstedii
P.quadrangularis
P.retipetala
P.serratifolia
P.serratodigitata
P.vitifolia
P.cirrhifolia
P.contracta
P.tetrandra
P.affinis
P.auriculata(FG)
P.auriculata(S)
P.biflora
P.filipes
P.jatunsachensis
P.lutea
P.microstipula
P.misera
P.obovata
P.rufa
P.suberosa
P.tenuiloba
Subgenus
Adenia
Astrophea
Decaloba
Deidamioides
Passiflora
Tetrapathea
A
B
C
D
E
psaC/
psaI/
petL/
petL/
psbI/
rps8/
ycf4/
ycf4/
clpP/
clpP/
clpP/
clpP/
ycf4/
ycf4/
clpP/rpl23/
rpl23/
clpP/
rpl23/
ycf4/
ycf4/
clpP/
clpP/
clpP/
rpl23/
clpP/
clpP/
clpP/rpl36/
rps12/
rps12/
clpP/
ycf4/
clpP/
clpP/
ycf4/
rpl36/
clpP/
clpP/
clpP/
clpP/rpl36/
clpP/
Article
Full-text available
The VIOLA ALBIDA complex is a complicated group with taxonomic problems having continuous leaf variations and composed of taxa related to the following names: Viola albida, V. albida var. takahashii, and V. chaerophylloides. As a first step to understanding the genomic nature of this complex, this study identified the whole chloroplast genome of V. albida. The genome is 157,692 bp in length (36.3% of GC content) and contains four subregions: a large single copy region of 86,220 bp, a small single copy region of 17,248 bp, and a pair of inverted regions of 27,112 bp each. An annotation of the gene identifies 111 unique genes, including 77 protein-coding genes, four rRNA genes, and 30 tRNA genes. The phylogenetic analysis of this genome with selected cp genomes from Viola identifies the close relationship between V. albida and V. ulleungdoensis. It is noteworthy that V. chaerophylloides, traditionally recognized as a member of the VIOLA ALBIDA complex, is genetically distant from V. albida and forms a sister group of all other members of the subsection Patellares. Our genome report is expected to serve as a basis for understanding the identity of the VIOLA ALBIDA complex.
Article
Extensive C‐to‐U editing has been reported from plastid genomes (plastomes) and mitochondrial genomes (mitogenomes) of spikemoss. While ‘reverse’ U‐to‐C editing was recorded in other seed‐free plants such as hornworts, quillworts, and ferns, it was not observed in spikemosses. However, no comprehensive study on the association between RNA editing and other genomic features was conducted for the organelle genomes of spikemosses. Here, we report thousands of C‐to‐U editing sites from plastomes and mitogenomes of two species: 1,767 and 2,394 edits in Selaginella remotifolia, and 4,091 and 2,786 edits in S. nipponica, respectively. Comparative analyses revealed two different editing frequencies among plastomes, but one similar frequency in mitogenomes. The different editing frequency in the Selaginella organelle genomes is related to the nonsynonymous substitution rate and the genome structural complexity. The high GC content caused by GC‐biased mutations in organelle genomes might be related to the absence of U‐to‐C editing in Selaginellaceae. Using RNA‐seq and whole‐genome data, we screened the pentatricopeptide repeat (PPR) family and discovered that the number of aspartic acid‐tyrosine‐tryptophan (DYW) domain‐containing PPR proteins corresponded roughly to the editing abundance in the Selaginella organelle genomes. Consequently, we hypothesize that associated evolution among RNA editing, GC‐biased mutation in organelle genomes, and the PPR protein family encoded in the nuclear genome, is probably triggered by the aberrant DNA repair system in Selaginellaceae. Our study provides new insights into the association between organelle and nuclear genomes in Selaginellaceae, which would contribute to understanding the evolution of posttranscriptional modifications of organelle genomes in land plants. This article is protected by copyright. All rights reserved.
Article
Full-text available
The genus Passiflora comprises a large group of plants popularly known as passionfruit, much appreciated for their exotic flowers and edible fruits. The species (∼500) are morphologically variable (e.g., growth habit, size, and color of flowers) and are adapted to distinct tropical ecosystems. In this study, we generated the genome of the wild diploid species Passiflora organensis Gardner by adopting a hybrid assembly approach. Passiflora organensis has a small genome of 259 Mbp and a heterozygosity rate of 81%, consistent with its reproductive system. Most of the genome sequences could be integrated into its chromosomes with cytogenomic markers (satellite DNA) as references. The repeated sequences accounted for 58.55% of the total DNA analyzed, and the Tekay lineage was the prevalent retrotransposon. In total, 25,327 coding genes were predicted. Passiflora organensis retains 5,609 singletons and 15,671 gene families. We focused on the genes potentially involved in the locus determining self‐incompatibility and the MADS‐box gene family, allowing us to infer expansions and contractions within specific subfamilies. Finally, we recovered the organellar DNA. Structural rearrangements and two mitoviruses, besides relics of other mobile elements, were found in the chloroplast and mt‐DNA molecules, respectively. This study presents the first draft genome assembly of a wild Passiflora species, providing a valuable sequence resource for genomic and evolutionary studies on the genus, and support for breeding cropped passionfruit species.
Article
Full-text available
This study was the first report for the complete chloroplast genome of Passiflora serrulata Jacq. (Passifloraceae). The cp genome was 149,683 bp in length contained two inverted repeats (IRs) of 25,470 bp, which were separated by large single-copy (LSC) and small single-copy (SSC) of 86,252 bp and 13,491 bp, respectively. A total of 110 functional genes were encoded, comprised 76 protein-coding genes, 30 tRNA genes, and four rRNA genes. The GC content was 37.0%. The maximum likelihood phylogenetic tree indicated that P. serrulata was recovered as the member of subg. Passiflora and most closely related to the clade formed by P. serratodigitata and P. ligularis.
Article
Full-text available
The evolutionary speed hypothesis (ESH) states that the molecular evolutionary rates of plants in the warm areas are higher than that in the cool areas. Nevertheless, largely due to the scarcity of molecular data from lower taxonomic lineages, ESH remains elusive in plants. In this study, we comprehensively conducted genome-wide comparative analyses of a total of 76 chloroplast (cp) genomes of the bamboos (Poaceae: Bambusoideae), with 10 of them newly generated, to test the ESH via investigating the potential association between their cp genome evolutionary rate and geographical distribution pattern. The results showed extreme heterogeneity of the evolutionary rate within the bamboos, with the lowest value found in temperate bamboos (tribe Arundinarieae) being approximately 1/4 of that in the tropical ones (tribe Olyreae). All the 91 non-coding loci and 75 coding loci evolve consistently faster in tribe Olyreae than that in tribe Arundinarieae. Most of the non-coding (82 out of 91) and coding (73 out of 75) loci in the tropical woody bamboos (tribe Bambuseae) also evolve faster than those in the temperate woody bamboos (tribe Arundinarieae). Moreover, the split of tropical and temperate bamboos was apparently accompanied with the change of cp genome evolutionary rate. This is by far the first case using cp genome data to support the ESH via showing the tropical plants evolve faster than their closely related temperate ones, and will shed new light on our understanding of ESH and the adaptation of plants.
Article
Full-text available
Passion flower (Passiflora edulia Sims) is an important fruit that is of great economic importance. In this study, we presented the chloroplast genome of a purple passion flower variety ‘Pingtang No.1’ in China using BGISEQ-500 sequencing. Its chloroplast genome is 152,621 bp in size. It contains a pair of inverted repeat regions of 25,989 bp, each separating a small single copy region of 13,352 bp and a large single copy region of 85,141 bp. Totally, 111 unique genes, including 77 protein coding genes, 30 tRNAs, and 4 rRNAs, were identified and annotated in the chloroplast genome. Phylogenetic maximum likelihood analysis based on chloroplast genomes of 35 plant species, mainly from the genus Passiflora indicated that ‘Pingtang No.1’ and yellow passion flower ‘IAPAR-123’ (Passiflora edulis) cluster together. The chloroplast genome can be used for a better understanding of the evolutionary relationships of plant species of the family Passifloraceae, especially the genus Passiflora.
ResearchGate has not been able to resolve any references for this publication.