ArticlePDF Available

Comparative chloroplast genomes and phylogenetic analyses of Pinellia

Authors:

Abstract and Figures

Background Pinellia Tenore (Araceae) is a genus of perennial herbaceous plants, all of which have medicinal value. The chloroplast (cp) genome data of Pinellia are scarce, and the phylogenetic relationship and gene evolution remain unclear. Methods and results We sequenced and annotated the Pinellia pedatisecta cp genome and combined it with previously published genomes for other Pinellia species. We used bioinformatics methods to analyse the genomic structure, repetitive sequences, interspecific variation, divergence hotspots, phylogenetic relationships, divergence time estimation and selective pressure of four Pinellia plastomes. Results showed that the cp genomes of Pinellia varied in length between 168,178 (P. pedatisecta MN046890) and 164,013 bp (P. ternata KR270823). A total of 68–111 SSR loci were identified as candidate molecular markers for further genetic diversity study. Eight mutational hotspot regions were determined, including psbI-trnG-UCC, psbM-rpoB, ndhJ-trnT-UGU, trnP-UGG-trnW-CCA, ndhF-trnN-GUU, ndhG-ndhE, ycf1-rps15 and trnR-ycf1. Gene selection pressure suggested that four genes were subjected to positive selection. Phylogenetic inferences based on the complete cp genomes revealed a sister relationship between Pinellia and Arisaema plants whose divergence was estimated to occur around 22.48 million years ago. All Pinellia species formed a monophyletic evolutionary clade in which P. peltata, rather than P. pedatisecta, earlier diverged, indicating that P. pedatisecta is not the basal taxon of Pinellia but P. peltata may be. Conclusions The cp genomes of Pinellia will provide valuable information for species classification, identification, molecular breeding and evolutionary exploration of the genus Pinellia.
This content is subject to copyright. Terms and conditions apply.
Vol.:(0123456789)
1 3
Molecular Biology Reports (2022) 49:7873–7885
https://doi.org/10.1007/s11033-022-07617-5
ORIGINAL ARTICLE
Comparative chloroplast genomes andphylogenetic analyses
ofPinellia
NingCui1 · WeixuChen2· XiwenLi3· PingWang1
Received: 19 October 2021 / Accepted: 18 May 2022 / Published online: 11 June 2022
© The Author(s) 2022
Abstract
Background Pinellia Tenore (Araceae) is a genus of perennial herbaceous plants, all of which have medicinal value. The
chloroplast (cp) genome data of Pinellia are scarce, and the phylogenetic relationship and gene evolution remain unclear.
Methods and results We sequenced and annotated the Pinellia pedatisecta cp genome and combined it with previously
published genomes for other Pinellia species. We used bioinformatics methods to analyse the genomic structure, repetitive
sequences, interspecific variation, divergence hotspots, phylogenetic relationships, divergence time estimation and selective
pressure of four Pinellia plastomes. Results showed that the cp genomes of Pinellia varied in length between 168,178 (P.
pedatisecta MN046890) and 164,013bp (P. ternata KR270823). A total of 68–111 SSR loci were identified as candidate
molecular markers for further genetic diversity study. Eight mutational hotspot regions were determined, including psbI-
trnG-UCC, psbM-rpoB, ndhJ-trnT-UGU, trnP-UGG-trnW-CCA, ndhF-trnN-GUU, ndhG-ndhE, ycf1-rps15 and trnR-ycf1.
Gene selection pressure suggested that four genes were subjected to positive selection. Phylogenetic inferences based on the
complete cp genomes revealed a sister relationship between Pinellia and Arisaema plants whose divergence was estimated
to occur around 22.48 million years ago. All Pinellia species formed a monophyletic evolutionary clade in which P. peltata,
rather than P. pedatisecta, earlier diverged, indicating that P. pedatisecta is not the basal taxon of Pinellia but P. peltata
may be.
Conclusions The cp genomes of Pinellia will provide valuable information for species classification, identification, molecular
breeding and evolutionary exploration of the genus Pinellia.
Keywords Pinellia· Phylogeny· Evolution· Divergence time estimation· Chloroplast genome
Abbreviations
cp Chloroplast
SSRs Simple sequence repeats
WGS Whole-genome sequencing
PE Paired-end
BLAST Basic Local Alignment Search Tool
LSC Large single copy
SSC Small single copy
IR Inverted repeat
indel Insertion-deletion
Pi Nucleotide diversity
CDS Protein-coding sequence
ML Maximum likelihood
Mya Million years ago
ESS Effective sample size
LRT Likelihood ratio test
BEB Bayes Empirical Bayes approach
HPD Highest posterior density
MP Maximum parsimony
LCBs Locally collinear blocks
Introduction
Pinellia Tenore is a small eastern Asian genus in the Araceae
family. Although there are only seven perennial herbaceous
species in Pinellia genus [1], every member it contains
* Xiwen Li
XWLi@icmm.ac.cn
* Ping Wang
wangpingjinan@163.com
1 Central Laboratory, Shandong Academy ofChinese
Medicine, Ji’nan, China
2 Shang Yao Hua Yu (LinYi) Traditional Chinese Medicine
Resources Co., Ltd, Linyi, China
3 Institute ofChinese Materia Medica, China Academy
ofChinese Medical Sciences, Beijing, China
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
7874 Molecular Biology Reports (2022) 49:7873–7885
1 3
is important traditional Chinese medicinal plant and was
recorded in Chinese herb classics more than 2000years ago;
the most famous among them is P. ternata, with an annual
demand of 5500–6000 tons [2]. They have been used for
the treatment of viper bites, lumbago, allergic reaction and
externally to treat traumatic injury, abscesses, neck lympho-
sarcoma, breast mastitis and uterine cancer [3, 4].
In recent years, the phylogeny and evolution of monocots
have come under intense scrutiny with the rapid develop-
ment of molecular phylogenetic systematics, and multiple
studies have highlighted the phylogeny of Pinellia as prob-
lematic [5, 6]. The phylogenetic position of the Pinellia
genus in Araceae has been controversial, and sister groups
of Pinellia show discrepancy under different classification
systems and studies [7, 8]. The six major taxonomic systems
of Araceae, namely, Schott system [9], Engler system [10],
Hutchinson system [11], Grayum system [12], Bogner and
Nicolson system [13] and Mayo etal. system [14], have dif-
ferent views on the sister genus of Pinellia, with four genera
(Crytocoryne, Langenandra, Ambrosina and Arisaema) as
candidates. With the development of molecular systematics,
analyses of the plastome gene and restriction-site sequences
suggested that the Arisaema genus is strongly related to
Pinellia and is the sister genus of Pinellia [7, 15, 16]. Con-
trary to the above classification, Keating [17] and Bogner
and Petersen [18] based on morpho-anatomical data argued
that Arisaema and Pinellia cannot gather into a unique clade
for their morphological discrepancy of stamens.
Members of Pinellia exhibit wide variations in flower,
leaf, bulblet, spathe and ovule characteristics [19, 20]. The
low-level taxonomy and interspecific phylogeny of Pinellia
are difficult to address based on morphological traits. The
most comprehensive below-genus phylogenetic analysis to
date has been provided by Yin, a study using a matrix of all
seven Pinellia species and ITS and trnL-F DNA barcode
sequences [21]. Combined with the morphological traits,
Yin concluded that P. pedatisecta is the basal taxon of Pinel-
lia and suggested that its taxonomic rank should be elevated
to a section. However, our group previously conducted inter-
specific phylogenetic analyses of Pinellia by using four DNA
barcode sequences (ITS, matK, rbcL and trnL-F) with the
same method as Yin. The topologies of four barcodes were
not consistent, with P. pedatisecta in the outer layer of ITS
and matK trees and in the inner layer of rcbL and trnL-F
trees, receiving considerably lower bootstrap values (Sup-
plementary Fig. S1) probably due to insufficient sequence
length and interspecific variations.
The chloroplast (cp) is an important self-replicating
organelle that plays a crucial role in photosynthesis and in
the synthesis of pigment, protein and starch [22]. The cp
contains its own circular double-stranded genome, which is
inherited maternally in most angiosperms or paternally in
some gymnosperms [23]. Unlike the nuclear genome, the cp
lacks meiotic recombination. These properties, along with
adequate levels of polymorphism, make it a suitable mol-
ecule for studies on phylogeny and evolution [24]. Scientists,
especially Henriquez CL and Abdullah research group, have
long been devoted to the plastome phylogeny of the Araceae
family [2528], and sequenced the cp genome of P. pedati-
secta (MN046890) in 2020 [22]. Nevertheless, their studies
focused more on the backbone phylogeny at the taxonomic
level of the entire or subfamilies of Araceae, and local phy-
logenetic relationships of Pinellia genus were rarely paid
attention to. Moreover, we previously performed a compara-
tive genomics analysis within Pinellia genus, and found that
the published P. pedatisecta cp genome was relatively differ-
ent from those of two other species, P. peltata (NC052862)
[29] and P. ternata (KR270823) [30] in sequence length,
gene content, GC content, etc. We determined to identify,
sample and sequence the cp genome of P. pedatisecta inde-
pendently for a reliable phylogeny of Pinellia species. The
plastome evolution of Pinellia was also discussed, including
the estimations of gene selection pressure and divergence
time which have not been studied before.
To reveal the interspecific diversification pattern of Pinel-
lia, we determined the complete cp genome of P. pedati-
secta and compared its sequence features with three other
Pinellia plastomes. The main goals of this study were to (1)
characterize and compare the cp genomes of Pinellia and
detect the sequence differences between Pinellia species and
between published and newly assembled cp genomes of P.
pedatisecta; (2) identify simple sequence repeats (SSRs),
long repeats and genetically variable regions and select
divergence hotspots as candidate DNA barcodes; (3) recon-
struct phylogenetic relationships of Pinellia species based
on the cp genome alignments and verify their phylogenetic
position within Araceae; and (4) estimate genes selection
pressure and divergence time for determining the relative
order and spacing of speciation events of Pinellia. Compara-
tive cp genomic analysis could provide theoretical basis to
further understand the evolution of the Araceae family and
additional insights into the long-standing controversial inter-
genus and intragenus phylogeny of Pinellia.
Materials andmethods
Plant material, DNA extraction, sequencing,
assembly andannotation
In our study, the cp genome of P. pedatisecta was sequenced
to explore the phylogeny and evolution of Pinellia. Fresh
leaves of P. pedatisecta from Linyi City, Shandong Prov-
ince were sampled. Voucher specimens were deposited in
Shandong Academy of Chinese Medicine. Total genomic
DNA was extracted from 100mg of silica-dried leaf by
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
7875Molecular Biology Reports (2022) 49:7873–7885
1 3
using a DNeasy Plant MiniKit (Qiagen, CA, USA) accord-
ing to the manufacturer’s instructions [31]. The quantity and
quality of genomic DNA were examined by using ND-2000
spectrometer (ThermoFisher Scientific, Wilmington, DE,
USA) and 0.8% agarose gel electrophoresis. The chloroplast
genome of Salvia plebeia (NC050929) was assembled in
our previous study [31]. Taking this work as a guidance,
the DNA sample pre-treatment (Covaris M220 [Covaris,
US; 250bp] and VAHTS™ Universal DNA Library Prep
Kit [Vazyme, China]), whole genome sequencing (Illumina
Hiseq 1500 platform [Illumina Inc., USA]), cp genome
assembly (Skewer, Basic Local Alignment Search Tool
[BLAST], SOAPdenovo v.2.04, GapCloser and MUMmer),
junction validation, and annotation (Plann v. 1.1.2, BLAST
and Apollo) were performed in turn with the cp genome of
P. peltata (NC052862) as a reference sequence. The primers
for junction validation are listed in Supplementary TableS1.
The cp genome obtained in this study has been submitted to
the NCBI database (www. ncbi. nlm. nih. gov). The physical
map of P. pedatisecta cp genome was produced with Chlo-
roplot (https:// irsco pe. shiny apps. io/ Chlor oplot/).
Repeat structure identification
SSRs were identified by MISA [32], and the minimum
thresholds for mono-, di-, tri-, tetra-, penta- and hexa-nucle-
otides were set to 10, 6, 5, 5, 5 and 5, respectively. REPuter
[33] was used to detect two kinds of long repeats: forward
and palindromic repeats which have been reported relatively
more prevalent in the cp genomes of Araceae family [22,
26]. Detection parameter settings were used as follows: rep-
find -d -p -h 3 -l 30 -best 50. Tandem Repeats Finder (http://
tandem. bu. edu/ trf/ trf. html) was used to find tandem repeats
with the default settings.
Genome comparative analysis
In addition to the newly sequenced cp genome, 26 avail-
able cp genome sequences of Pinellia and related spe-
cies were downloaded from the NCBI database: P. peda-
tisecta (MN046890), P. ternata (KR270823), P. peltata
(NC052862), Arisaema franchetianum (MN046885),
Arisaema ringens (MK111107), Arisaema erubescens
(MT676834), Arisaema nepenthoides (MW338731),
Typhonium blumei (NC051872), Sauromatum giganteum
(NC050648), Alocasia navicularis (MN046882), Coloca-
sia esculenta (JN105689), Pistia stratiotes (MN885890),
Amorphophallus konjac (MK611803), Calla palus-
tris (MN046887), Epipremnum aureum (KR872391),
Epipremnum amplissimum (MN477424), Aglaonema cos-
tatum (MN046881), Monstera adansonii (MN046888),
Zantedeschia aethiopica (KY792991), Pothos scandens
(MN046891), Symplocarpus renifolius (KY039276),
Symplocarpus nipponicus (MK341566), Lemna minor
(DQ400350), Acorus calamus (AJ879453), Acorus ameri-
canus (EU273602) and Acorus gramineus (KP099646). The
multiple sequence alignment of the 27 cp genome sequences
was performed using MAFFT v.7 with the default settings
and adjusted manually where necessary with BioEdit v.7.2.5
software. On the basis of the aligned sequence matrix of the
cp genomes of Pinellia, interspecific nucleotide diversity (K)
was evaluated by sliding window analysis with a step size
of 1000bp and window length of 2000bp in DnaSP v.5.10
[34]. The evolutionary divergences of the four Pinellia cp
genomes were evaluated using nucleotide differences and
p-distance by MEGA v.10.0.4. The protein-coding sequences
(CDSs) were extracted using Geneious v.2019.1.3 [35].
Phylogenetic analyses
Two datasets (cp genome and coding sequence) were used
to construct the phylogenetic topology of 27 Pinellia and
related species with maximum likelihood (ML) methods,
respectively. ML analyses were performed using RAxML
8.2.9 under the GTRGAMMA model with 1000 rapid boot-
strap replicates. Three species of Acorus (A. calamus, A.
americanus and A. gramineus) were used as outgroups.
Divergence time estimation
Accurate estimation of the divergence time in a taxon is
important to understand its evolutionary history. Divergence
times were estimated using PAML mcmctree (PAML v.4.9j)
[36] with the approximate likelihood calculation method.
The analysis was performed on 27 complete cp genome
sequences used in the phylogenetic analysis with three
known calibration times: (1) divergence between P. peda-
tisecta and T. blumei was 36–32 million years ago (Mya);
(2) divergence between A. navicularis and P. stratiotes was
85–46 Mya; and (3) all species except the genus Acorus from
the Araceae family arose 122–117 Mya inferred from the
published knowledge-based TimeTree (timetree.org). Pos-
terior distributions of parameters were approximated using
two independent mcmctree analyses of 10,000,000 genera-
tions with 20% burn-in. Tracer v.1.4.1 was used for checking
the convergence of the chains through adequate effective
sample sizes (ESSs).
Gene selection site analysis
A total of 74 single-copy protein-coding genes shared by
four Pinellia plastid genomes (Table1) were extracted
and aligned by Geneious v.9.0.5 [35] and MAFFT v.7.
The ML tree was constructed using RAxML V.8.2.9 with
the GTRGAMMA model based on complete cp genomes.
Protein-coding exon and each value of dN, dS and ω were
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
7876 Molecular Biology Reports (2022) 49:7873–7885
1 3
calculated using the site-specific model in the Codeml
program (seqtype = 1, model = 0, Nsites = 0, 1, 2, 3, 7, 8)
of PAML v.4.9j [36]. To determine the selected sites, we
compared the model M0 (one ratio) versus M3 (discrete),
M1 (neutral) versus M2 (positive selection) and M7 (beta)
versus M8 (beta and ω) and carried out the three likelihood
ratio tests (LRTs). Only consistent sites of positive selec-
tion with significant support from posterior probability (p
of (ω > 1) 0.99; Bayes Empirical Bayes approach [BEB])
were identified. BEB recognized by Models M2 and M8
were further considered.
Results
Chloroplast genome features inPinellia
In our study, the cp genome of P. pedatisecta was sequenced,
assembled and validated (Fig. S2). A total of 49,904,104
PE raw reads were generated using the Illumina Sequencing
System. The novel cp genome sequence has been preserved
in GenBank (MZ702636, Table1). The cp genome of P.
pedatisecta was circular double-stranded DNA and displayed
a quadripartite structure (Fig.1). The cp genome assembled
in this study was 164,682bp in length, which was 3,496bp
shorter than the published one (MN046890, 168,178bp)
mainly due to the contraction of the LSC region. Moreo-
ver, the GC content in the SSC region of the cp genome
MZ702636 was 2.58% higher than that of MN046890. The
cp genome MN046890 had the same rRNA gene content
as that of MZ702636 but contained one more tRNA gene
(tRNA-His) (Supplementary TableS2).
After downloading from the NCBI database, protein-
coding gene number variations of 24 cp genomes of the
Araceae family, including four Pinellia cp genomes, were
analysed (Fig. S3). Compared with the cp genome of the
model plant Arabidopsis thaliana, two genes, infA and ycf68,
were inserted among most Araceae plants, while one-copy
ycf1 gene was missing. In the Araceae family, there were
significant differences in the number of ycf68 genes among
species. The ycf68 genes in 60% of Araceae cp genomes
were completely missing, while the remaining 40% of the
cp genomes had two copies. Even within the Pinellia genus,
the number of ycf68 genes was also inconsistent. Never-
theless, there was no discrepancy in the type and number
of protein-coding genes between our newly assembled cp
genome (MZ702636) and the published one (MN046890).
SSR andrepeat sequence analyses
The number of SSRs in the four Pinellia cp genomes ranged
from 68 (P. peltata) to 111 (P. ternata, Fig.2A and Supple-
mentary TableS3). Three kinds of SSRs were discovered,
namely, mononucleotide, dinucleotide and trinucleotide.
Among each Pinellia species, mononucleotide repeats were
the most common, whereas trinucleotide repeats accounted
for the lowest proportion of SSRs. The number of A/T mon-
onucleotide repeats exceeded that of the other three types
combined (Fig.2A). Interspersed repeated sequences were
identified by using REPuter for four plastomes (Fig.2B and
Supplementary TableS4). Except for IR regions, the repeats’
length ranged from 31 to 328bp with forward (F) repeats as
the relative prevalent type. Most interspersed repeats of two
P. pedatisecta cp genomes were located in the LSC regions,
while the long repeats of P. peltata and P. ternata were pre-
dominantly in the IRb/SSC and SSC regions, respectively.
The total number of tandem repeats for four plastid genomes
was in the range of 34–274. P. peltata had the least repeats,
and P. pedatisecta (MN046890) had the most (Supplemen-
tary TableS5).
Comparative genomic analyses
The differences and evolutionary divergences among four
Pinellia cp genomes were compared using nucleotide sub-
stitutions and sequence distance (Supplementary TableS6).
Across all four Pinellia cp genomes, the p-distance was
0.000402–0.013668, and the value of nucleotide differences
was 66–2459. The comparison results of nucleotide substi-
tution and genetic distance were consistent. The sequence
divergence level of two cp genomes of P. pedatisecta was
the lowest. The differentiation between P. pedatisecta and
P. ternata was less than that between P. pedatisecta and P.
peltata. Among three Pinellia species, P. ternata and P. pel-
tata were the most genetically distant. Moreover, the K value
(sequence divergence between species) was calculated, and
Table 1 Summary of features of four Pinellia chloroplast genomes
Taxon Accession number Length (bp) Number of genes GC content (%)
Genome LSC SSC IR Total Protein coding tRNA rRNA Genome LSC SSC IR
P. pedatisecta MZ702636 164,682 90,947 22,523 25,606 129 85 36 8 35.71 33.92 29.43 42.09
P. pedatisecta MN046890 168,178 92,963 23,981 25,617 130 85 37 8 35.08 33.35 26.85 42.09
P. peltata NC052862 164,923 90,089 24,871 24,981 130 86 36 8 36.51 34.53 31.77 42.44
P. ternata KR270823 164,013 89,783 22,980 25,625 131 86 37 8 36.66 34.6 31.66 42.53
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
7877Molecular Biology Reports (2022) 49:7873–7885
1 3
the sliding windows of the K values were constructed by
DnaSP (Fig.2C and Supplementary TableS7). Figure2C
showed that the sequence divergence between P. pedatisecta
and P. ternata was much lower than the two other K val-
ues. The sequence divergence between P. pedatisecta and
P. peltata was not so different from that between P. ternata
and P. peltata. Eight highly variable regions with great K
values were detected, namely, psbI-trnG-UCC, psbM-rpoB,
ndhJ-trnT-UGU, trnP-UGG-trnW-CCA, ndhF-trnN-GUU,
ndhG-ndhE, ycf1-rps15 and trnR-ycf1. Four of these regions
were located in the LSC region, and the remaining four were
located in the SSC region, all of which were present in the
non-coding regions.
Phylogenetic analyses
To analyze the phylogenetic relationship of Pinellia species,
we constructed phylogenetic trees using whole cp genome
Fig. 1 Chloroplast genome map of P. pedatisecta assembled in this
study. The centre of the figure provides length information of the cp
genome. In the first inner circle, the proportion of the shaded parts
represents the GC content of each part. The gene names are labelled
on the outermost layer. The transcription directions for the inner and
outer genes are listed clockwise and anticlockwise, respectively
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
7878 Molecular Biology Reports (2022) 49:7873–7885
1 3
sequences and their CDSs by ML methods. For Pinellia spe-
cies, the topological structure obtained by either complete cp
genome or CDSs was roughly identical (Fig.3). The topol-
ogy based on entire cp genomes showed that Pinellia species
were monophyletic and clustered into three clades: one for
two P. pedatisecta plastomes, one for P. ternata and one
for P. peltata with strong support (bootstrap value 100%).
P. pedatisecta and P. ternata clustered together, exhibiting
the highest genetic similarity among the studied representa-
tives of Pinellia genus. In contrast to the CDSs phylogeny
(Fig.3B), in which Pinellia species were placed most closely
to Arisaema, Sauromatum and Typhonium clade, the whole-
length plastome phylogeny placed some Arisaema species
as sister to the genus Pinellia with 100% bootstrap support
(Fig.3A).
Divergence time estimation
In this study, we used full-length sequences of cp genomes
of 23 Araceae family plants (including four Pinellia cp
Fig. 2 The SSRs (A), interspersed repeated sequences (B), and inter-
specific nucleotide diversity (C) among the chloroplast genomes of
Pinellia. X-axis in Fig.3C: position of a window. Y-axis in Fig.3C:
sequence divergence (K values) between species of each window.
K(a): K values between P. pedatisecta and P. peltata; K(b): K values
between P. pedatisecta and P. ternata; K(c): K values between P. ter-
nata and P. peltata
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
7879Molecular Biology Reports (2022) 49:7873–7885
1 3
genomes) and three outgroups to estimate the divergence
times of major clades in the Araceae family. Our dat-
ing analysis resulted in estimates for the crown node of
the Araceae family of 91.99 Mya (95% highest posterior
density [HPD] = 46.69–125.17 Mya; node 1 in Fig. S4;
Supplementary TableS8) in the early Cretaceous. The
divergence time of the four genera Pinellia, Arisaema,
Sauromatum and Typhonium was estimated at 25.65 Mya
(95% HPD = 12.93–36.35 Mya; node 2) in the Oligocene.
The age estimation for the crown node of Pinellia and
most species of Arisaema was dated to 22.48 Mya (95%
HPD = 11.39–33.32 Mya; node 3) in the late Oligocene
Fig. 3 Phylogenetic trees constructed with the whole cp genomes (A) and their protein-coding sequences (B) by using ML method. Numbers
near each branch are bootstrap values
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
7880 Molecular Biology Reports (2022) 49:7873–7885
1 3
and early Miocene. Pinellia diverged from its sister clade
at 13.09 Mya (95% HPD = 4.91–21.96 Mya; node 4 in Fig.
S4; Supplementary TableS8) in the Miocene.
Gene selection pressure analysis ofprotein
sequences
The site-specific selective pressure on four Pinellia plasto-
mes was assumed using the site model in PAML program.
Three pairs of site model comparisons (M0 vs M3, M1 vs
M2a and M7 vs M8) showed four protein-coding genes sub-
jected to positive selection (LRT of the three comparisons all
p < 0.05; Supplementary Tables S9 and S10), namely, petB,
rpoC1, rps12 and ycf1. Of these genes, ycf1 had the highest
number of sites (21 sites; TableS9), followed by rpoC1 (4
sites), petB (1 site) and rps12 (1 site).
Discussion
Phylogenetic position ofthePinellia genus
inAraceae
In the previous classification systems, the Pinellia genus
belonged to the Araceae family, but its sister genus remains
controversial. In the Schott system [9], which is the first clas-
sification system of the Araceae family, the genus Pinellia
was placed in the tribus Alleluehieae, related to two genera
of Crytocoryne and Langenandra. In the Engler system [10],
Pinellia belonged to the subtrib. Pinelliinae, trib. Areae, sub-
fam. Aroideae, in the same tribus as the Arisaema genus. In
the Hutchinson system [11], this genus was placed in trib.
Areae, related to two genera of Crytocoryne and Ambro-
sina. In the classification system of Grayum [12], Pinellia
belonged to trib. Pinellia, subfam. Aroideae. After modify-
ing the Engler system by Bogner and Nicolson [13], the
Pinellia genus changed to the subtrib. Atherurinae, trib.
Pinellia, subfam. Aroideae. Mayo etal. [14] conducted
taxonomic analysis of 106 taxa in the Araceae family using
63 morphological and anatomical traits and proposed the
latest classification system of the Araceae family. In this
system, Pinellia was placed in trib. Arisaemateae, subfam.
Aroideae with the Arisaema genus as sister group. Since the
21st century, with the development of molecular systemat-
ics, many studies have investigated the phylogenetic position
of the Pinellia genus in Araceae [7, 1518], yet controversy
regarding molecular systematics and morphological clas-
sification persists. French etal. [16], Cabrera etal. [7] and
Cusimano etal. [15] performed molecular phylogenetic
analyses by applying different types and numbers of DNA
marker sequence data, and they discovered that the Arisaema
genus is strongly related to Pinellia among Araceae plants,
and is the sister genus of Pinellia. Contrary to the above
classification, Keating [17] and Bogner and Petersen [18]
based on morpho-anatomical data argued that Arisaema
and Pinellia cannot be grouped into a unique clade. One
significant morphological discrepancy between them was
that almost all Arisaema species have at least partially fused
stamens, whereas Pinellia and other related genera (e.g. Sau-
romatum and Typhonium) have free stamens.
The cp genome is one of three subcellular compartments
in the plant genome, and it is mainly inherited from the
maternal parent [37]. Given its high conservation and abun-
dant interspecific variation, the cp genome has the potential
to provide distinguishing differences that can help molecu-
larly classify closely related species [38]. With advances
in high-throughput sequencing, achieving the cp genome is
easily acquirable at a large scale with low costs. Research-
ers have proposed the entire cp genome as a super barcode
to discriminate and classify closely related species [39]. To
date, phylogenies of several genus-level taxa have been fur-
ther clarified by using cp genome sequences, such as Epi-
medium [40], Paris [41] and Sanguisorba [42]. Hence, we
compared 24 cp genomes of the Araceae family, including
four plastomes of Pinellia, to explore the cp genome molec-
ular phylogeny of Pinellia. The resulting phylogeny based
on entire cp genomes in this study showed that Pinellia and
Arisaema plants were gathered into one branch and sister
groups to each other with a well-supported bootstrap value
(100%; Fig.3A). Although the Araceae family is a major
group of monocotyledons, there is still a limited number
of cp genomes available from Araceae species, which may
result in some congeneric species, such as Arisaema plants,
not clustering together in one phylogenetic branch (Fig.3).
The absence of plastid genomes from three other candidates
of the Pinellia sister groups, Crytocoryne, Langenandra and
Ambrosina [21], could also hinder our discovery of the inter-
genus phylogeny of Pinellia.
Phylogeny withinthePinellia genus
Despite recent advances in molecular phylogenetic stud-
ies, deep evolutionary relationships and below-genus taxo-
nomic classification of Pinellia remain unresolved. Yin
[21] made the first comprehensive phylogenetic analyses
of all seven Pinellia species by using the sequences of
ITS and trnL-F DNA barcodes. The resulting phylogeny
of both ITS and trnL-F sequences supported that P. peda-
tisecta and the six remaining Pinellia species were sis-
ter species, and P. pedatisecta served as a basal taxon in
Pinellia. Moon etal. [43] reconstructed the phylogenetic
trees of three species, P. ternata, P. tripartita and P. peda-
tisecta, with matK and rbcL sequences; they found that
the topology of two trees is inconsistent. The basal taxon
in the matK tree was P. pedatisecta, congruent with that
by Yin [21], while the basal taxon in the rbcL tree was P.
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
7881Molecular Biology Reports (2022) 49:7873–7885
1 3
ternata. Furthermore, after morphological investigation,
Yin found that P. pedatisecta has many characteristics of
the Pinellia genus. These traits include perennial herb with
a small tuber at the top of the main tuber, persistent spathe,
female inflorescence adnate to the spathe, unisexual flow-
ers, no perianth, one straight ovule and green fruit. Hence,
combined with the molecular phylogeny, Yin concluded
that the basal taxon of the Pinellia genus is P. pedatisecta,
the species native to shady woodland areas, forested slopes
and valleys in northern and western China; he suggested
that the taxonomic rank of P. pedatisecta should be ele-
vated to a section of Pinellia [21], which was the first and
only below-genus taxonomic recommendation of Pinellia.
In our opinion, P. pedatisecta indeed has some of the
characteristic traits of Pinellia, but it also possesses so
many characteristics significantly different from other con-
generic species. For example, lanceolate spathe, no con-
striction in the tube and eaves of spathe, no diaphragm
between the female inflorescence and male inflorescence
and no bulblet on the petiole [15, 19, 20]. Our group pre-
viously conducted phylogenetic analyses of Pinellia by
using four different barcode sequences with maximum par-
simony (MP) method. The topologies of the four barcodes
were not consistent, with poor bootstrap values (Fig. S1)
probably due to insufficient sequence length and interspe-
cific variation. Hence, plastid genomes, an ideal model for
evolutionary and comparative genomic studies of related
species, were necessary here for a molecular phylogeny of
Pinellia with a significantly higher resolution.
After comparing the structural organization of three
previously published cp genomes of Pinellia, we found
that the sequence similarity between the published P.
pedatisecta plastome (MN046890) and the cp genomes
of P. peltata (NC052862) and P. ternata (KR270823) was
clearly lower than expected in the following five aspects.
(1) The length of the P. pedatisecta plastome was sig-
nificantly longer than that of the two other species
(Table1), specifically 3.2kb longer than that of P. pel-
tata and even 4.1kb longer than that of P. ternata. The
length of the LSC region in P. pedatisecta was 3.18kb
longer than that of P. ternata.
(2) With regard to protein-coding gene content, all two
ycf68 genes were missing in the published cp genome
of P. pedatisecta (Fig. S3), compared with P. peltata
and P. ternata, which could have the potential to affect
the physiological functions of plants, although some
authors suggested that the ycf68 gene likely does not
encode a protein [44].
(3) The GC content of published P. pedatisecta was lower
than that in P. peltata and P. ternata (Table1), espe-
cially in the SSC region, which was 4.92% lower than
that in P. peltata and 4.81% lower than that in P. ter-
nata.
(4) After the alignment of three published cp genomes
and further estimation of locally collinear blocks
(LCBs) with MAUVE 2.4.0, the gene order compari-
son revealed one rearrangement (~ 128bp) between the
published plastomes of P. pedatisecta and P. ternata
(Fig. S5A and S5B).
(5) A significant insertion/deletion variation was noted
across three cp genomes in this group, located between
tRNA-Ser (GCU) and tRNA-Ser (CGA) genes in
the LSC region with the length of ~ 1.2kb (10,714–
11,950bp in the published P. pedatisecta cp genome
and 10,192–10,234bp in the self-assembled one; Fig.
S5C and S5D).
These five sequence discrepancies do not follow the rule
of “chloroplast genomes of related species, especially those
within the same genus, are generally highly conserved”
[45, 46]. Therefore, we determined to identify, sample and
sequence the plastome of P. pedatisecta independently for
further phylogenetic analyses of Pinellia.
In this study, the phylogenetic tree based on whole cp
genomes showed that the species of Pinellia were clustered
into three clades: one for two P. pedatisecta plastomes, one
for P. ternata and one for P. peltata with high support values
(Fig.3). P. pedatisecta and P. ternata, which possess pedate
and three full-lobed leaves, respectively, grouped together
in topology, exhibiting the highest genetic similarity among
the studied representatives of Pinellia genus. In contrast,
P. peltata, characterized by undivided and peltata leaf and
development of stem tuber, was placed in the outer layer
of Pinellia topology. P. pedatisecta is buried inside the cp
genome phylogeny, incongruent with the phylogeny from
ITS and trnL-F sequences by Yin [21]. Furthermore, com-
parative genomic analyses showed that the nucleotide sub-
stitutions and sequence distance between P. pedatisecta and
P. ternata were the smallest (1275 and 0.007206; TableS6),
while that between P. ternata and P. peltata was the largest
(2459 and 0.013668). The interspecific nucleotide diver-
sity (K value) between P. pedatisecta and P. ternata was
considerably lower than the two other K values (Fig.2C),
both of which confirmed that P. peltata shared less sequence
similarity with P. pedatisecta and P. ternata. Limited by the
number of Pinellia cp genomes published so far, we cannot
determine whether P. peltata is the ancestor of Pinellia spe-
cies, but it is possible. P. peltata has the same floral morpho-
logical characteristics as other Pinellia species, except for P.
pedatisecta [29]. For example, solitary inflorescence, persis-
tent spathe, with constriction in the tube and eaves of spathe,
female inflorescence adnate to the spathe, with diaphragm
between the female inflorescence and male inflorescence,
unisexual flowers, no perianth, one locular ovary and one
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
7882 Molecular Biology Reports (2022) 49:7873–7885
1 3
straight ovule [21]. However, the peltata leaf characteristics
of P. peltata are considerably different. Thus, the below-
genus taxonomic classification of Pinellia is suggested to be
based on differences in leaves or other tissue morphological
characteristics between species, rather than flowers.
Variations andevolution ofPinellia cp genomes
In this study, the cp genome of P. pedatisecta was sequenced
(MZ702636) with the length of 164,682bp, which fell within
the cp genome size range for angiosperms but much smaller
than its published one (MN046890, 168,178bp; Table1).
Among four Pinellia plastid genomes, LSC regions showed
the most difference in size, with the shortest 89,783bp of P.
ternata KR270823 and the longest 92,963bp of P. pedati-
secta MN046890. Additionally, the inferred structures and
protein-coding gene contents were in accordance except
for the infA and ycf68 genes (Fig.1 and S3). Analysis with
DnaSP inferred that some of the most divergent regions of
psbI-trnG-UCC, psbM-rpoB, ndhJ-trnT-UGU, trnP-UGG-
trnW-CCA, ndhF-trnN-GUU, ndhG-ndhE, ycf1-rps15 and
trnR-ycf1, as shown in Fig.2C, were found for further
related species identification of Pinellia.
SSRs in the cp genome are an efficient marker tool for
population genetic structure and phylogeography [47]. In
this study, 68–111 SSR loci were identified between Pinel-
lia species (Supplementary TableS3). These SSR loci could
provide candidate molecular markers for the genetic diver-
sity study of Pinellia. The composition of SSR loci in the cp
genomes of three Pinellia species was similar to that of most
angiosperms, with A/T mononucleotide repeats dominating
all the repeat units. This phenomenon may be one of the rea-
sons for the abundance of A/T bases in cp genomes. Repeat
units in the cp genome can cause sequence polymorphism,
providing information for further genetic diversity study of
Pinellia.
At present, specific studies on the divergence time esti-
mation of Pinellia by using molecular data, particularly
plastid genomes, are lacking. Li etal. [48] first supposed
that species with simple leaves (e.g. P. peltata) are less
evolved than those with pedate or lobed leaves (e.g. P. peda-
tisecta and P. ternata) on the basis of the palynology and
isozyme characteristics of Pinellia species. Yin [21] specu-
lated that the Pinellia genus first appeared in the Paleogene
according to geographic distribution and the origin time
of its related genus (Arisaema). Here, based on whole cp
genome sequences, our divergence time estimation indi-
cated that Pinellia species originated at ~ 22.48 Mya (95%
HPD = 11.39–33.32 Mya; node 3 in Fig. S4; Supplementary
TableS8) in the late Oligocene and early Miocene, con-
gruent with that estimated by Yin and further refined, and
diverged diversely at ~ 13.09 Mya (95% HPD = 4.91–21.96
Mya; node 4 in Fig. S4) in the Miocene. Furthermore, within
the Pinellia genus, P. peltata diverged first at ~ 5.34 Mya
earlier than two other species, P. pedatisecta and P. ternata;
these findings were consistent with the results of Li etal.
[48]. The present estimation of divergence times based on
plastid genomes and fossil data provides new insights and a
hypothetical foundation for future studies on the origin and
earlier evolution of Pinellia.
Genes subjected to positive selection have a significant
impact on the creative effects of populations, changes under
selection stress, and genetic drift led to the rapid transfor-
mation of genes into new common adaptive combinations
[49]. In this study, we calculated the non-synonymous/syn-
onymous substitution rate ratio (ω = dN/dS) for each of the
74 single copy protein-coding genes shared by the analyzed
plastid genomes of Pinellia. Four genes with high posterior
probability of codon sites in the BEB test were acquired
and considered as genes under positive selection (Fig.4
and Tables S9 and S10). These genes included one gene
for cytochrome b/f complex subunit protein in the photo-
system II reaction (petB) [50], one DNA-dependent RNA
Fig. 4 Positive selection sites logo of four Pinellia plastomes
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
7883Molecular Biology Reports (2022) 49:7873–7885
1 3
polymerase gene (rpoC1), one gene for ribosome small
subunit protein (rps12) and ycf1 gene. All these genes were
also detected in other plants [5154], and these genes may
have played a significant role in the adaptive evolution of
Pinellia. The specific role needs to be further studied.
The genus Pinellia includes seven species in total. Only
three Pinellia species were studied here, and as a conse-
quence, it might not fully show the real effect of the phylog-
eny of Pinellia. However, our study was the first compre-
hensive phylogenetic analysis of Pinellia at the cp genome
level, and the resulting topologies based on either whole
cp genome or CDS datasets both definitively show that P.
pedatisecta cannot be the ancestor of Pinellia species. The
main habitat of all three Pinellia species in our study is in
the southeastern portion of Asia, where hybridization and
introgression have been reported [55, 56], which may cause
interspecific gene flow in response to ecological selection
and lead to difficulties in molecular phylogenetic analysis.
Given that the cp and nuclear genomes evolve independently,
the phylogeny of cp genomes alone is insufficient in making
taxonomic decisions about Pinellia. Therefore, more meth-
ods and more species are needed in further study to improve
our ability to better understand the phylogeny and evolution
of Pinellia.
Conclusions
The genus Pinellia Tenore is comprised of seven species,
all of which are important medicinal plant resources. Spe-
cies phylogenetic classification is vital for protecting species
diversity and selecting high quality germplasm resources.
But due to members of Pinellia exhibit rather wide varia-
tions in morphological structures, the low-level taxonomy
and interspecific phylogeny of Pinellia are difficult to
address based on morphology. With the development of
next-generation sequencing technology, complete chloro-
plast genomes have been widely employed to explore phy-
logenetic relationships of intra-or inter-genus. However, the
variation and evolution of the whole chloroplast genomes in
the genus Pinellia have been ignored. Here, we sequenced
the chloroplast genome of P. pedatisecta, compared it with
previously published plastid genomes, and reconstructed
phylogenetic relationships of Pinellia based on chloroplast
genomes and their CDSs. The divergence time estimation
and selective pressure of Pinellia plastomes were also inves-
tigated. The results showed some variations and adaptive
evolution between Pinellia complete chloroplast genomes
including size, structure and nucleotide diversity, which
provided valuable information for species classification and
evolution. In addition, our results revealed a sister relation-
ship between Pinellia and Arisaema plants whose diver-
gence was estimated to occur around 22.48 million years
ago. All Pinellia species formed a monophyletic evolution-
ary clade in which P. peltata, rather than P. pedatisecta,
earlier diverged, indicating that P. pedatisecta is not the
basal taxon of Pinellia but P. peltata may be. In conclu-
sion, our results could provide insight into the chloroplast
genome evolution and phylogeny of Pinellia genus and even
of Araceae family, which would be useful for selecting high
quality Pinellia germplasm resources in the future.
Supplementary Information The online version of this article (https://
doi. org/ 10. 1007/ s11033- 022- 07617-5) contains supplementary mate-
rial, which is available to authorized users.
Acknowledgements The authors thank Baosheng Liao for his support
in data analysis.
Author contributions NC and XL conceived and designed the study;
NC and WC collected and analyzed the data; NC drafted the initial ver-
sion of the manuscript; NC, XL and PW contributed to later versions
of the manuscript.
Funding This study was supported by the Major Special Project of
Scientific and Technological Cooperation of Bijie City (Grant No.
2021-02), Research Incubation Fund Project of Shandong Academy
of Chinese Medicine (Grant No. 2021SACM-3), Medical and Health
Science and Technology Development Project of Shandong Province
(Grant No. 202102041135) and Key Research and Development Pro-
gram of Shandong Province (Grant No. 2020CXGC010505-04).
Data availability The chloroplast genome sequence of Pinellia peda-
tisecta assembled here is accessible via GenBank with the accession
number of MZ702636 and Global Pharmacopoeia Genome Database
(GPGD,
http:// www. gpgen ome. com/ speci es/ 40314). Raw sequencing data
is available at NCBI SRA database with the accession number of
SRR15328795.
Declarations
Conflict of interest The authors declare that they have no conflict of
interest.
Ethical approval This article does not contain any studies with human
participants performed by any of the authors.
Open Access This article is licensed under a Creative Commons Attri-
bution 4.0 International License, which permits use, sharing, adapta-
tion, distribution and reproduction in any medium or format, as long
as you give appropriate credit to the original author(s) and the source,
provide a link to the Creative Commons licence, and indicate if changes
were made. The images or other third party material in this article are
included in the article's Creative Commons licence, unless indicated
otherwise in a credit line to the material. If material is not included in
the article's Creative Commons licence and your intended use is not
permitted by statutory regulation or exceeds the permitted use, you will
need to obtain permission directly from the copyright holder. To view a
copy of this licence, visit http:// creat iveco mmons. org/ licen ses/ by/4. 0/.
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
7884 Molecular Biology Reports (2022) 49:7873–7885
1 3
References
1. Liu Y, Hui RK, Deng RN, Wang JJ, Wang M, Li ZY (2012)
Abnormal male meiosis explains pollen sterility in the poly-
ploid medicinal plant Pinellia ternata (Araceae). Genet Mol Res
11(1):112–120. https:// doi. org/ 10. 4238/ 2012. Janua ry. 17.1
2. Zhu Y, Zhu G, Guo Q, Zhu Z, Wang C, Liu Z (2013) A compara-
tive proteomic analysis of Pinellia ternata leaves exposed to heat
stress. Int J Mol Sci 14(10):20614–20634. https:// doi. org/ 10. 3390/
ijms1 41020 614
3. Kozlov VA, Zhuravkin IN, Tsyrlova IG (1978) Comparative analy-
sis of the migratory capacity of hematopoietic stem cells and of
immunocompetent precursors after acute hypoxia and the admin-
istration of testosterone. Dokl Akad Nauk SSSR 239(3):747–749
4. Nauheimer L, Metzler D, Renner SS (2012) Global history of the
ancient monocot family Araceae inferred with models accounting
for past continental positions and previous ranges based on fossils.
New Phytol 195(4):938–950. https:// doi. org/ 10. 1111/j. 1469- 8137.
2012. 04220.x
5. Renner SS, Zhang LB (2004) Biogeography of the Pistia
clade (Araceae): based on chloroplast and mitochondrial DNA
sequences and Bayesian divergence time inference. Syst Biol
53(3):422–432. https:// doi. org/ 10. 1080/ 10635 15049 04459 04
6. Renner SS, Zhang LB, Murata J (2004) A chloroplast phylogeny
of Arisaema (Araceae) illustrates tertiary floristic links between
Asia, North America, and East Africa. Am J Bot 91(6):881–888.
https:// doi. org/ 10. 3732/ ajb. 91.6. 881
7. Cabrera LI, Salazar GA, Chase MW, Mayo SJ, Bogner J, Davila
P (2008) Phylogenetic relationships of aroids and duckweeds
(Araceae) inferred from coding and noncoding plastid DNA. Am
J Bot 95(9):1153–1165. https:// doi. org/ 10. 3732/ ajb. 08000 73
8. Boyce PC (1996) The Genus Hapaline (Araceae: Aroideae: Cala-
dieae). Kew Bull 51(1):63–82. https:// doi. org/ 10. 2307/ 41187 45
9. Boyce P, Schott H (1860) Schott, H.W. Prodromus Systematis
Aroidearum (1860).
10. Engler A (1920) Araceae. Pars generalis et Index familiae gener-
alis. In. Edited by Engler A, vol. 74 (IV. 23A). Leipzig, Wilhelm
Engelmann.
11. Hutchinson J (1959) The families of flowering plants, arranged
according to a new system based on their probable phylogeny, 2nd
edn. Internet Archive, London
12. Grayum MH (1990) Evolution and phylogeny of the Araceae. Ann
Missouri Bot Gard 77:628–697. https:// doi. org/ 10. 2307/ 23996 68
13. Bogner J, Nicolson DH (1991) A revised classification of Araceae
with dichotomous keys. Willdenowia 21:35–50
14. Mayo SJ, Bogner J, Boyce PC (1998) Araceae. In: Kubitzki K (ed)
Flowering plants monocotyledons. Springer, Berlin, pp 26–74
15. Cusimano N, Bogner J, Mayo SJ, Boyce PC, Wong SY, Hesse M,
Hetterscheid WL, Keating RC, French JC (2011) Relationships
within the Araceae: comparison of morphological patterns with
molecular phylogenies. Am J Bot 98(4):654–668. https:// doi. org/
10. 3732/ ajb. 10001 58
16. French JC, Chung MG, Hur YK (1995) Chloroplast DNA phylog-
eny of the Ariflorae. In: Rudall PJ, Cribb P, Cutler DF, Humphries
CJ (eds) Monocotyledons: systematics and evolution, vol 1. Royal
Botanic Gardens, Kew, pp 255–275
17. Keating RC (2004) Vegetative anatomical data and its relationship
to a revised classification of the genera of Araceae. Ann Missouri
Bot Gard 91:485–494
18. Fontana ME, Cavalcante CAV (2013) Índices baseados no número
de clientes para localização de itens em armazéns. Production
23(3):561–569. https:// doi. org/ 10. 1590/ s0103- 65132 01300 50000
02
19. Li YW, Zhou FQ, Zhang SP, Li ZY, Kong QY (2008) Studies on
morphology and histology of Pinellia pedatisecta. Zhong Yao Cai
31(2):206–209
20. Guo QS, Wang QY, Shi HZ (2006) Study on morphological char-
acteristics of pollen grains of Pinellia ternata in different popula-
tions. Zhongguo Zhong Yao Za Zhi 31(1):27–30
21. Yin T (2002) Phylogeny and biogeography of Pinellia (Araceae)
with special reference to phylogenetic problems of Remusatia.
Kunming Institute of Botany Chinese Academy of Sciences,
Beijing
22. Henriquez CL, Abdullah AI, Carlsen MM, Zuluaga A, Croat
TB, McKain MR (2020) Evolutionary dynamics of chloro-
plast genomes in subfamily Aroideae (Araceae). Genomics
112(3):2349–2360. https:// doi. org/ 10. 1016/j. ygeno. 2020. 01. 006
23. Henriquez CL, Abdullah AI, Carlsen MM, Zuluaga A, Croat TB,
McKain MR (2020) Molecular evolution of chloroplast genomes
in Monsteroideae (Araceae). Planta 251(3):72. https:// doi. org/ 10.
1007/ s00425- 020- 03365-7
24. Zhai W, Duan X, Zhang R, Guo C, Li L, Xu G, Shan H, Kong H,
Ren Y (2019) Chloroplast genomic data provide new and robust
insights into the phylogeny and evolution of the Ranunculaceae.
Mol Phylogenet Evol 135:12–21. https:// doi. org/ 10. 1016/j. ympev.
2019. 02. 024
25. Abdullah HCL, Mehmood F, Shahzadi I, Ali Z, Waheed MT,
Croat TB, Poczai P, Ahmed I (2020) Comparison of chloroplast
genomes among species of unisexual and bisexual clades of the
monocot family Araceae. Plants (Basel) 9(6):737. https:// doi. org/
10. 3390/ plant s9060 737
26. Abdullah HCL, Mehmood F, Carlsen MM, Islam M, Waheed
MT, Poczai P, Croat TB, Ahmed I (2020) Complete chloro-
plast genomes of Anthurium huixtlense and Pothos scandens
(Pothoideae, Araceae): unique inverted repeat expansion and
contraction affect rate of evolution. J Mol Evol 88(7):562–574.
https:// doi. org/ 10. 1007/ s00239- 020- 09958-w
27. Abdullah HCL, Croat TB, Poczai P, Ahmed I (2020) Mutational
dynamics of aroid chloroplast genomes II. Front Genet 11:610838.
https:// doi. org/ 10. 3389/ fgene. 2020. 610838
28. Abdullah HCL, Mehmood F, Hayat A, Sammad A, Waseem S,
Waheed MT, Matthews PJ, Croat TB, Poczai P etal (2021) Chlo-
roplast genome evolution in the Dracunculus clade (Aroideae,
Araceae). Genomics 113(1 Pt 1):183–192. https:// doi. org/ 10.
1016/j. ygeno. 2020. 12. 016
29. Luo R, Du S, Luo X (2021) The complete plastome sequence
of Pinellia peltata Pei (Araceae). Mitochondrial DNA B Resour
6(8):2112-2113. https:// doi. org/ 10. 1080/ 23802 359. 2021. 19234 11
30. Cai Z, Wang H, Wang G (2020) Complete chloroplast genome
sequence of Pinellia ternata (Thunb.) Breit, a medicinal plants to
China. Mitochondrial DNA B Resour 5(3):2107–2108. https:// doi.
org/ 10. 1080/ 23802 359. 2020. 17652 07
31. Cui N, Liao BS, Liang CL, Li SF, Zhang H, Xu J, Li XW, Chen SL
(2020) Complete chloroplast genome of Salvia plebeia: organiza-
tion, specific barcode and phylogenetic analysis. Chin J Nat Medi-
cines 18(8):563–572. https:// doi. org/ 10. 1016/ S1875- 5364(20)
30068-6
32. Beier S, Thiel T, Munch T, Scholz U, Mascher M (2017) MISA-
web: a web server for microsatellite prediction. Bioinformatics
33(16):2583–2585. https:// doi. org/ 10. 1093/ bioin forma tics/ btx198
33. Kurtz S, Choudhuri JV, Ohlebusch E, Schleiermacher C, Stoye J,
Giegerich R (2001) REPuter: the manifold applications of repeat
analysis on a genomic scale. Nucleic Acids Res 29(22):4633–
4642. https:// doi. org/ 10. 1093/ nar/ 29. 22. 4633
34. Librado P, Rozas J (2009) DnaSP v5: a software for compre-
hensive analysis of DNA polymorphism data. Bioinformatics
25(11):1451–1452. https:// doi. org/ 10. 1093/ bioin forma tics/ btp187
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
7885Molecular Biology Reports (2022) 49:7873–7885
1 3
35. Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Stur-
rock S, Buxton S, Cooper A, Markowitz S, Duran C etal (2012)
Geneious Basic: an integrated and extendable desktop software
platform for the organization and analysis of sequence data. Bioin-
formatics 28(12):1647–1649. https:// doi. org/ 10. 1093/ bioin forma
tics/ bts199
36. Yang Z (2007) PAML 4: phylogenetic analysis by maximum like-
lihood. Mol Biol Evol 24(8):1586–1591.
37. McCauley DE, Sundby AK, Bailey MF, Welch ME (2007) Inherit-
ance of chloroplast DNA is not strictly maternal in Silene vulgaris
(Caryophyllaceae): evidence from experimental crosses and natu-
ral populations. Am J Bot 94(8):1333–1337. https:// doi. org/ 10.
3732/ ajb. 94.8. 1333
38. Wu L, Wu M, Cui N, Xiang L, Li Y, Li X, Chen S (2021) Plant
super-barcode: a case study on genome-based identification for
closely related species of Fritillaria. Chin Med 16(1):52. https://
doi. org/ 10. 1186/ s13020- 021- 00460-z
39. Li X, Yang Y, Henry RJ, Rossetto M, Wang Y, Chen S (2015)
Plant DNA barcoding: from gene to genome. Biol Rev Camb
Philos Soc 90(1):157–166. https:// doi. org/ 10. 1111/ brv. 12104
40. Zhang Y, Du L, Liu A, Chen J, Wu L, Hu W, Zhang W, Kim K,
Lee SC, Yang TJ etal (2016) The Complete Chloroplast Genome
Sequences of Five Epimedium Species: Lights into Phylogenetic
and Taxonomic Analyses. Front Plant Sci 7:306. https:// doi. org/
10. 3389/ fpls. 2016. 00306
41. Huang Y, Li X, Yang Z, Yang C, Yang J, Ji Y (2016) Analysis
of Complete Chloroplast Genome Sequences Improves Phyloge-
netic Resolution in Paris (Melanthiaceae). Front Plant Sci 7:1797.
https:// doi. org/ 10. 3389/ fpls. 2016. 01797
42. Meng XX, Xian YF, Xiang L, Zhang D, Shi YH, Wu ML, Dong
GQ, Ip SP, Lin ZX, Wu L etal (2018) Complete Chloroplast
Genomes from Sanguisorba: Identity and Variation Among Four
Species. Molecules 23(9):2137. https:// doi. org/ 10. 3390/ molec
ules2 30921 37
43. Moon BC, Kim WJ, Ji Y, Lee YM, Kang YM, Choi G (2016)
Molecular identification of the traditional herbal medicines, Ari-
saematis Rhizoma and Pinelliae Tuber, and common adulterants
via universal DNA barcode sequences. Genet Mol Res 15(1).
https:// doi. org/ 10. 4238/ gmr. 15017 064
44. Raubeson LA, Peery R, Chumley TW, Dziubek C, Fourcade HM,
Boore JL, Jansen RK (2007) Comparative chloroplast genomics:
analyses including new sequences from the angiosperms Nuphar
advena and Ranunculus macranthus. BMC Genomics 8:174.
https:// doi. org/ 10. 1186/ 1471- 2164-8- 174
45. Daniell H, Lin CS, Yu M, Chang WJ (2016) Chloroplast
genomes: diversity, evolution, and applications in genetic
engineering. Genome Biol 17(1):134. https:// doi. org/ 10. 1186/
s13059- 016- 1004-2
46. Wang Y, Wang S, Liu Y, Yuan Q, Sun J, Guo L (2021) Chloroplast
genome variation and phylogenetic relationships of Atractylodes
species. BMC Genomics 22(1):103. https:// doi. org/ 10. 1186/
s12864- 021- 07394-8
47. Angioi SA, Desiderio F, Rau D, Bitocchi E, Attene G, Papa R
(2009) Development and use of chloroplast microsatellites in Pha-
seolus spp. and other legumes. Plant Biol (Stuttg) 11(4):598–612.
https:// doi. org/ 10. 1111/j. 1438- 8677. 2008. 00143.x
48. Li L (1999) Micromorphological Characteristics and Enzyme
Analysis of Some Chinese Pinellias and Their Taxonomic Sig-
nificance. Acta Botanica Yunnanica 21(4):442–448
49. Xie DF, Yu Y, Deng YQ, Li J, Liu HY, Zhou SD, He XJ (2018)
Comparative Analysis of the Chloroplast Genomes of the Chinese
Endemic Genus Urophysa and Their Contribution to Chloroplast
Phylogeny and Adaptive Evolution. Int J Mol Sci 19(7):1847.
https:// doi. org/ 10. 3390/ ijms1 90718 47
50. Gui L, Jiang S, Xie D, Yu L, Huang Y, Zhang Z, Liu Y (2020)
Analysis of complete chloroplast genomes of Curcuma and
the contribution to phylogeny and adaptive evolution. Gene
732:144355. https:// doi. org/ 10. 1016/j. gene. 2020. 144355
51. Liu ML, Fan WB, Wang N, Dong PB, Zhang TT, Yue M, Li
ZH (2018) Evolutionary Analysis of Plastid Genomes of Seven
Lonicera L. Species: Implications for Sequence Divergence and
Phylogenetic Relationships. Int J Mol Sci 19(12):4039. https:// doi.
org/ 10. 3390/ ijms1 91240 39
52. Chen Y, Zhong H, Zhu Y, Huang Y, Wu S, Liu Z, Lan S, Zhai J
(2020) Plastome structure and adaptive evolution of Calanthe s.l.
species. PeerJ 8:e10051. https:// doi. org/ 10. 7717/ peerj. 10051
53. Gao C, Deng Y, Wang J (2018) The Complete Chloroplast
Genomes of Echinacanthus Species (Acanthaceae): Phylogenetic
Relationships, Adaptive Evolution, and Screening of Molecular
Markers. Front Plant Sci 9:1989. https:// doi. org/ 10. 3389/ fpls.
2018. 01989
54. Petersen G, Darby H, Lam VKY, Pedersen HAE, Merckx V, Zer-
vas A, Seberg O, Graham SW (2019) Mycoheterotrophic Epirix-
anthes (Polygalaceae) has a typical angiosperm mitogenome but
unorthodox plastid genomes. Ann Bot 124(5):791–807. https://
doi. org/ 10. 1093/ aob/ mcz114
55. Carlhoff S, Duli A, Nagele K, Nur M, Skov L, Sumantri I, Okta-
viana AA, Hakim B, Burhan B, Syahdar FA etal (2021) Genome
of a middle Holocene hunter-gatherer from Wallacea. Nature
596(7873):543–547. https:// doi. org/ 10. 1038/ s41586- 021- 03823-6
56. Kinoshita G, Nunome M, Kryukov AP, Kartavtseva IV, Han SH,
Yamada F, Suzuki H (2019) Contrasting phylogeographic histories
between the continent and islands of East Asia: Massive mito-
chondrial introgression and long-term isolation of hares (Lago-
morpha: Lepus). Mol Phylogenet Evol 136:65–75. https:// doi. org/
10. 1016/j. ympev. 2019. 04. 003
Publisher's Note Springer Nature remains neutral with regard to
jurisdictional claims in published maps and institutional affiliations.
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
1.
2.
3.
4.
5.
6.
Terms and Conditions
Springer Nature journal content, brought to you courtesy of Springer Nature Customer Service Center GmbH (“Springer Nature”).
Springer Nature supports a reasonable amount of sharing of research papers by authors, subscribers and authorised users (“Users”), for small-
scale personal, non-commercial use provided that all copyright, trade and service marks and other proprietary notices are maintained. By
accessing, sharing, receiving or otherwise using the Springer Nature journal content you agree to these terms of use (“Terms”). For these
purposes, Springer Nature considers academic use (by researchers and students) to be non-commercial.
These Terms are supplementary and will apply in addition to any applicable website terms and conditions, a relevant site licence or a personal
subscription. These Terms will prevail over any conflict or ambiguity with regards to the relevant terms, a site licence or a personal subscription
(to the extent of the conflict or ambiguity only). For Creative Commons-licensed articles, the terms of the Creative Commons license used will
apply.
We collect and use personal data to provide access to the Springer Nature journal content. We may also use these personal data internally within
ResearchGate and Springer Nature and as agreed share it, in an anonymised way, for purposes of tracking, analysis and reporting. We will not
otherwise disclose your personal data outside the ResearchGate or the Springer Nature group of companies unless we have your permission as
detailed in the Privacy Policy.
While Users may use the Springer Nature journal content for small scale, personal non-commercial use, it is important to note that Users may
not:
use such content for the purpose of providing other users with access on a regular or large scale basis or as a means to circumvent access
control;
use such content where to do so would be considered a criminal or statutory offence in any jurisdiction, or gives rise to civil liability, or is
otherwise unlawful;
falsely or misleadingly imply or suggest endorsement, approval , sponsorship, or association unless explicitly agreed to by Springer Nature in
writing;
use bots or other automated methods to access the content or redirect messages
override any security feature or exclusionary protocol; or
share the content in order to create substitute for Springer Nature products or services or a systematic database of Springer Nature journal
content.
In line with the restriction against commercial use, Springer Nature does not permit the creation of a product or service that creates revenue,
royalties, rent or income from our content or its inclusion as part of a paid for service or for other commercial gain. Springer Nature journal
content cannot be used for inter-library loans and librarians may not upload Springer Nature journal content on a large scale into their, or any
other, institutional repository.
These terms of use are reviewed regularly and may be amended at any time. Springer Nature is not obligated to publish any information or
content on this website and may remove it or features or functionality at our sole discretion, at any time with or without notice. Springer Nature
may revoke this licence to you at any time and remove access to any copies of the Springer Nature journal content which have been saved.
To the fullest extent permitted by law, Springer Nature makes no warranties, representations or guarantees to Users, either express or implied
with respect to the Springer nature journal content and all parties disclaim and waive any implied warranties or warranties imposed by law,
including merchantability or fitness for any particular purpose.
Please note that these rights do not automatically extend to content, data or other material published by Springer Nature that may be licensed
from third parties.
If you would like to use or distribute our Springer Nature journal content to a wider audience or on a regular basis or in any other manner not
expressly permitted by these Terms, please contact Springer Nature at
onlineservice@springernature.com
Article
The dried tuber of Pinellia ternata (Thunb.) Breit, Pinelliae Rhizoma (PR, also named 'Banxia' in Chinese), is widely used in traditional medicine. This review aims to provide detail summary of active ingredients, pharmacological effects, toxic ingredients, detoxification strategies, and omic researches, etc. Pharmacological ingredients from PR are mainly classified into six categories: alkaloids, amino acids, polysaccharides, phenylpropanoids, essential oils, and glucocerebrosides. Diversity of chemical composition determines the broad-spectrum efficacy and gives a foundation for the comprehensive utilization of P. ternata germplasm resources. The pharmacological compounds are involved in inhibition of cancer cells by targeting various pathways, including activation of immune system, inhibition of proliferation and cycle, induction of apoptosis, and inhibition of angiogenesis. The pharmacological components of PR act on nervous system by targeting neurotransmitters, activating immune system, decreasing apoptosis, and increasing redox system. Lectins, one major class of the toxic ingredients extracted from raw PR, possess significant toxic effects on human cells. Inflammatory factors, cytochrome P450 proteins (CYP) family enzymes, mammalian target of rapamycin (mTOR) signaling factors, transforming growth factor-β (TGF-β) signaling factors, and nervous system, are considered to be the target sites of lectins. Recently, omic analysis is widely applied in Pinellia genus studies. Plastome genome-based molecular markers are deeply used for identifying and resolving phylogeny of Pinellia genus plants. Various omic works revealed and functional identified a series of environmental stress responsive factors and active component biosynthesis-related genes. Our review summarizes the recent progress in active and toxic ingredient evaluation, pharmacological effects, detoxification strategies, and functional gene identification and accelerates efficient utilization of this traditional herb.
Preprint
Full-text available
Background Subtribe Swertiinae, belonging to Gentianaceae, is one of the most taxonomically difficult representatives. The intergeneric and infrageneric classification and phylogenetic relationships within Subtribe Swertiinae are controversial and unresolved. Methods With the aim of clarifying the circumscription of taxa within the Subtribe Swertiinae, comparative and phylogenetic analyses were conducted using 34 Subtribe Swertiinae chloroplast genomes (4 newly sequenced) representing 9 genera. Results The results showed that 34 chloroplast genomes of Subtribe Swertiinae were smaller and ranged in size from 149,036 to 154,365 bp, each comprising two inverted repeat regions (size range 25,069 − 26,126 bp) that separated large single-copy (80,432 − 84,153 bp) and small single-copy (17,887 − 18,47 bp) regions, and all chloroplast genomes showed similar gene order, content, and structure. These chloroplast genomes contained 129–134 genes each, including 84–89 protein-coding genes, 30 tRNAs, and 4 rRNAs. The chloroplast genomes of Subtribe Swertiinae appeared to lose some genes, such as the rpl33, rpl2 and ycf15 genes. Nineteen hypervariable regions, including trnC-GCA-petN, trnS-GCU-trnR-UCU, ndhC-trnV-UAC, trnC-GCA-petN, psbM-trnD-GUC, trnG-GCC-trnfM-CAU, trnS-GGA-rps4, ndhC-trnV-UAC, accD-psaI, psbH-petB, rpl36-infA, rps15-ycf1, ycf3, petD, ndhF, petL, rpl20, rpl15 and ycf1, were screened, and 36–63 SSRs were identified as potential molecular markers. Positive selection analyses showed that two genes (ccsA and psbB) were proven to have high Ka/Ks ratios, indicating that chloroplast genes may have undergone positive selection in evolutionary history. Phylogenetic analysis showed that 34 Subtribe Swertiinae species formed a monophyletic clade including two evident subbranches, and Swertia was paraphyly with other related genera, which were distributed in different clades. Conclusion These results provide valuable information to elucidate the phylogeny, divergence time and evolution process of Subtribe Swertiinae.
Article
Full-text available
Much remains unknown about the population history of early modern humans in southeast Asia, where the archaeological record is sparse and the tropical climate is inimical to the preservation of ancient human DNA ¹ . So far, only two low-coverage pre-Neolithic human genomes have been sequenced from this region. Both are from mainland Hòabìnhian hunter-gatherer sites: Pha Faen in Laos, dated to 7939–7751 calibrated years before present (yr cal bp; present taken as ad 1950), and Gua Cha in Malaysia (4.4–4.2 kyr cal bp ) ¹ . Here we report, to our knowledge, the first ancient human genome from Wallacea, the oceanic island zone between the Sunda Shelf (comprising mainland southeast Asia and the continental islands of western Indonesia) and Pleistocene Sahul (Australia–New Guinea). We extracted DNA from the petrous bone of a young female hunter-gatherer buried 7.3–7.2 kyr cal bp at the limestone cave of Leang Panninge ² in South Sulawesi, Indonesia. Genetic analyses show that this pre-Neolithic forager, who is associated with the ‘Toalean’ technocomplex 3,4 , shares most genetic drift and morphological similarities with present-day Papuan and Indigenous Australian groups, yet represents a previously unknown divergent human lineage that branched off around the time of the split between these populations approximately 37,000 years ago ⁵ . We also describe Denisovan and deep Asian-related ancestries in the Leang Panninge genome, and infer their large-scale displacement from the region today.
Article
Full-text available
Background Although molecular analysis offers a wide range of options for species identification, a universal methodology for classifying and distinguishing closely related species remains elusive. This study validated the effectiveness of utilizing the entire chloroplast (cp) genome as a super-barcode to help identify and classify closely related species. Methods We here compared 26 complete cp genomes of ten Fritillaria species including 18 new sequences sequenced in this study. Each species had repeats and the cp genomes were used as a whole DNA barcode to test whether they can distinguish Fritillaria species . Results The cp genomes of Fritillaria medicinal plants were conserved in genome structure, gene type, and gene content. Comparison analysis of the Fritillaria cp genomes revealed that the intergenic spacer regions were highly divergent compared with other regions. By constructing the phylogenetic tree by the maximum likelihood and maximum parsimony methods, we found that the entire cp genome showed a high discrimination power for Fritillaria species with individuals of each species in a monophyletic clade. These results indicate that cp genome can be used to effectively differentiate medicinal plants from the genus Fritillaria at the species level. Conclusions This study implies that cp genome can provide distinguishing differences to help identify closely related Fritillaria species, and has the potential to be served as a universal super-barcode for plant identification.
Article
Full-text available
Background Atractylodes DC is the basic original plant of the widely used herbal medicines “Baizhu” and “Cangzhu” and an endemic genus in East Asia. Species within the genus have minor morphological differences, and the universal DNA barcodes cannot clearly distinguish the systemic relationship or identify the species of the genus. In order to solve these question, we sequenced the chloroplast genomes of all species of Atractylodes using high-throughput sequencing. Results The results indicate that the chloroplast genome of Atractylodes has a typical quadripartite structure and ranges from 152,294 bp ( A. carlinoides ) to 153,261 bp ( A. macrocephala ) in size. The genome of all species contains 113 genes, including 79 protein-coding genes, 30 transfer RNA genes and four ribosomal RNA genes. Four hotspots, rpl22 - rps19 - rpl2 , psbM - trnD , trnR - trnT (GGU) , and trnT (UGU) - trnL , and a total of 42–47 simple sequence repeats (SSR) were identified as the most promising potentially variable makers for species delimitation and population genetic studies. Phylogenetic analyses of the whole chloroplast genomes indicate that Atractylodes is a clade within the tribe Cynareae ; Atractylodes species form a monophyly that clearly reflects the relationship within the genus. Conclusions Our study included investigations of the sequences and structural genomic variations, phylogenetics and mutation dynamics of Atractylodes chloroplast genomes and will facilitate future studies in population genetics, taxonomy and species identification.
Article
Full-text available
Chloroplast (cp) genomes are considered important for the study of lineage-specific molecular evolution, population genetics, and phylogenetics. Our aim here was to elucidate the molecular evolution in cp genomes of species in the Dracunculus clade (Aroideae, Araceae). We report de novo assembled cp genomes for eight species from eight genera and also retrieved cp genomes of four species from the National Center for Biotechnology Information (NCBI). The cp genomes varied in size from 162,424 bp to 176,835 bp. Large Single Copy (LSC) region ranged in size from 87,141 bp to 95,475 bp; Small Single Copy (SSC) from 14,338 bp to 23,981 bp; and Inverted Repeats (IRa and IRb) from 25,131 bp to 32,708 bp. The expansion in inverted repeats led to duplication of ycf1 genes in four species. The genera showed high similarity in gene content and yielded 113 unique genes (79 protein coding, 4 rRNA, and 30 tRNA genes). Codon usage, amino acid frequency, RNA editing sites, microsatellites repeats, transition and transversion substitutions, and synonymous and non-synonymous substitutions were also similar across the clade. A previous study reported deletion of ycf1, accD, psbE, trnL-CAA, and trnG-GCC genes in four Amorphophallus species. Our study supports conservative structure of cp genomes in the Dracunculus clade including Amorphophallus species and does not support gene deletion mentioned above. We also report suitable polymorphic loci based on comparative analyses of Dracunculus clade species, which could be useful for phylogenetic inference. Overall, the current study broad our knowledge about the molecular evolution of chloroplast genome in aroids.
Article
Full-text available
The co-occurrence among single nucleotide polymorphisms (SNPs), insertions-deletions (InDels), and oligonucleotide repeats has been reported in prokaryote, eukaryote, and chloroplast genomes. Correlations among SNPs, InDels, and repeats have been investigated in the plant family Araceae previously using pair-wise sequence alignments of the chloroplast genomes of two morphotypes of one species, Colocasia esculenta belonging to subfamily Aroideae (crown group), and four species from the subfamily Lemnoideae, a basal group. The family Araceae is a large family comprising 3,645 species in 144 genera, grouped into eight subfamilies. In the current study, we performed 34 comparisons using 27 species from 7 subfamilies of Araceae to determine correlation coefficients among the mutational events at the family, subfamily, and genus levels. We express strength of the correlations as: negligible or very weak (0.10–0.19), weak (0.20–0.29), moderate (0.30–0.39), strong (0.40–0.69), very strong (0.70–0.99), and perfect (1.00). We observed strong/very strong correlations in most comparisons, whereas a few comparisons showed moderate correlations. The average correlation coefficient was recorded as 0.66 between “SNPs and InDels,” 0.50 between “InDels and repeats,” and 0.42 between “SNPs and repeats.” In qualitative analyses, 95–100% of the repeats at family and sub-family level, while 36–86% of the repeats at genus level comparisons co-occurred with SNPs in the same bins. Our findings show that such correlations among mutational events exist throughout Araceae and support the hypothesis of distribution of oligonucleotide repeats as a proxy for mutational hotspots.
Article
Full-text available
Calanthe s.l. is the most diverse group in the tribe Collabieae (Orchidaceae), which are pantropical in distribution. Illumina sequencing followed by de novo assembly was used in this study, and the plastid genetic information of Calanthe s.l. was used to investigate the adaptive evolution of this taxon. Herein, the complete plastome of five Calanthe s.l. species (Calanthe davidii, Styloglossum lyroglossa, Preptanthe rubens, Cephalantheropsis obcordata, and Phaius tankervilliae) were determined, and the two other published plastome sequences of Calanthe s.l. were added for comparative analyses to examine the evolutionary pattern of the plastome in the alliance. The seven plastomes ranged from 150,181 bp (C. delavayi) to 159,014 bp (C. davidii) in length and were all mapped as circular structures. Except for the three ndh genes (ndhC, ndhF, and ndhK) lost in C. delavayi, the remaining six species contain identical gene orders and numbers (115 gene). Nucleotide diversity was detected across the plastomes, and we screened 14 mutational hotspot regions, including 12 non-coding regions and two gene regions. For the adaptive evolution investigation, three species showed positive selected genes compared with others, C. obcordata (cemA), S. lyroglossa (infA, ycf1 and ycf2) and C. delavayi (nad6 and ndhB). Six genes were under site-specific positive selection in Calanthe s.l., namely, accD, ndhB, ndhD, rpoC2, ycf1, and ycf2, most of which are involved in photosynthesis. These results, including the new plastomes, provide resources for the comparative plastome, breeding, and plastid genetic engineering of orchids and flowering plants.
Article
Full-text available
The subfamily Pothoideae belongs to the ecologically important plant family Araceae. Here, we report the chloroplast genomes of two species of the subfamily Pothoideae: Anthurium huixtlense (size: 163,116 bp) and Pothos scandens (size: 164,719 bp). The chloroplast genome of P. scandens showed unique contraction and expansion of inverted repeats (IRs), thereby increasing the size of the large single-copy region (LSC: 102,956 bp) and decreasing the size of the small single-copy region (SSC: 6779 bp). This led to duplication of many single-copy genes due to transfer to IR regions from the small single-copy (SSC) region, whereas some duplicate genes became single copy due to transfer to large single-copy regions. The rate of evolution of protein-coding genes was affected by the contraction and expansion of IRs; we found higher mutation rates for genes that exist in single-copy regions as compared to those in IRs. We found a 2.3-fold increase of oligonu-cleotide repeats in P. scandens when compared with A. huixtlense, whereas amino acid frequency and codon usage revealed similarities. The ratio of transition to transversion mutations was 2.26 in P. scandens and 2.12 in A. huixtlense. Transversion mutations mostly translated in non-synonymous substitutions. The phylogenetic inference of the limited species showed the monophyly of the Araceae subfamilies. Our study provides insight into the molecular evolution of chloroplast genomes in the subfamily Pothoideae and family Araceae.
Article
Full-text available
The chloroplast genome provides insight into the evolution of plant species. We de novo assembled and annotated chloroplast genomes of four genera representing three subfamilies of Araceae: Lasia spinosa (Lasioideae), Stylochaeton bogneri, Zamioculcas zamiifolia (Zamioculcadoideae), and Orontium aquaticum (Orontioideae), and performed comparative genomics using these chloroplast genomes. The sizes of the chloroplast genomes ranged from 163,770 bp to 169,982 bp. These genomes comprise 113 unique genes, including 79 protein-coding, 4 rRNA, and 30 tRNA genes. Among these genes, 17-18 genes are duplicated in the inverted repeat (IR) regions, comprising 6-7 protein-coding (including trans-splicing gene rps12), 4 rRNA, and 7 tRNA genes. The total number of genes ranged between 130 and 131. The inf A gene was found to be a pseudogene in all four genomes reported here. These genomes exhibited high similarities in codon usage, amino acid frequency, RNA editing sites, and microsatellites. The oligonucleotide repeats and junctions JSB (IRb/SSC) and JSA (SSC/IRa) were highly variable among the genomes. The patterns of IR contraction and expansion were shown to be homoplasious, and therefore unsuitable for phylogenetic analyses. Signatures of positive selection were seen in three genes in S. bogneri, including ycf2, clpP, and rpl36. This study is a valuable addition to the evolutionary history of chloroplast genome structure in Araceae.
Article
Full-text available
Pinellia ternata (Thunb.) Breit is one of the commonly used traditional Chinese medicine with tuber as medicine. We report herein the complete chloroplast genome sequence of Pinellia ternata (Thunb.) Breit. It is length of 167,280 bp, which contained a small single-copy (SSC) region of 23,618 bp and a large single-copy (LSC) region of 92,450 bp, separated by two copies of an inverted repeat (IR) of 25,606 bp. The chloroplast genome contains 113 unique genes, including 79 PCG, 4 rRNA genes, and 30 tRNA genes. In addition, 19 genes contained one or two introns, which of those including 13 PCG genes possess a single intron and 2 PCG genes harbor two introns; and 6 tRNA genes harbor a single intron. In this study, Pinellia ternata is sister to Pinellia pedatisecta and clustered within the group consisting of the species that belong to Araceae.
Article
Salvia plebeia has been in use as traditional Chinese medicine (TCM) for more than 500 years. In this study, the complete chloroplast (cp) genome of S. plebeia was sequenced, assembled and compared to those of other five published Salvia cp genomes. It was found that the cp genome structure of S. plebeia was well conserved and had a total size of 151 062 bp. Four parameters were used to display the usage conditions of the codons of the amino acids in Salvia genus. Although the number of protein-coding genes in each species was the same, the total number of codons was different. Except for amino acids Trp and Met whose Relative Synonymous Codon Usage (RSCU) value of one condon was equal to 1, the remaining 19 amino acids had 1−3 preferred codons. The preferred codon names of each amino acid were coincident. The period size for the tandem repeats of six species ranged from 9 to 410 bp. Salvia cp genomes mainly possessed tandem repeats with a copy number less than or equal to 3. The sequence length of tandem repeats of the six species ranged from 25 to 824 bp. Highly viarable regions including four intergenic spacers and six partial genes were discovered as potential specific barcodes for Salvia species through cp genome-wide comparison. Finally, we performed phylogenetic analyses based on the complete cp genome and coding sequences respectively. These results provide information to help construct the cp genome library for Salvia, which may support studies of phylogenetics, DNA barcoding, population and transplastomics.