ArticlePDF Available

Sequencing of whole plastid genomes and nuclear ribosomal DNA of Diospyros species (Ebenaceae) endemic to New Caledonia: Many species, little divergence

Authors:

Abstract and Figures

Background and aims: Some plant groups, especially on islands, have been shaped by strong ancestral bottlenecks and rapid, recent radiation of phenotypic characters. Single molecular markers are often not informative enough for phylogenetic reconstruction in such plant groups. Whole plastid genomes and nuclear ribosomal DNA (nrDNA) are viewed by many researchers as sources of information for phylogenetic reconstruction of groups in which expected levels of divergence in standard markers are low. Here we evaluate the usefulness of these data types to resolve phylogenetic relationships among closely relatedDiospyrosspecies. Methods: Twenty-two closely relatedDiospyrosspecies from New Caledonia were investigated using whole plastid genomes and nrDNA data from low-coverage next-generation sequencing (NGS). Phylogenetic trees were inferred using maximum parsimony, maximum likelihood and Bayesian inference on separate plastid and nrDNA and combined matrices. Key results: The plastid and nrDNA sequences were, singly and together, unable to provide well supported phylogenetic relationships among the closely related New CaledonianDiospyrosspecies. In the nrDNA, a 6-fold greater percentage of parsimony-informative characters compared with plastid DNA was found, but the total number of informative sites was greater for the much larger plastid DNA genomes. Combining the plastid and nuclear data improved resolution. Plastid results showed a trend towards geographical clustering of accessions rather than following taxonomic species. Conclusions: In plant groups in which multiple plastid markers are not sufficiently informative, an investigation at the level of the entire plastid genome may also not be sufficient for detailed phylogenetic reconstruction. Sequencing of complete plastid genomes and nrDNA repeats seems to clarify some relationships among the New CaledonianDiospyrosspecies, but the higher percentage of parsimony-informative characters in nrDNA compared with plastid DNA did not help to resolve the phylogenetic tree because the total number of variable sites was much lower than in the entire plastid genome. The geographical clustering of the individuals against a background of overall low sequence divergence could indicate transfer of plastid genomes due to hybridization and introgression following secondary contact.
Content may be subject to copyright.
Sequencing of whole plastid genomes and nuclear ribosomal DNA of Diospyros
species (Ebenaceae) endemic to New Caledonia: many species, little divergence
Barbara Turner
1,
*, Ovidiu Paun
1
,Je´roˆme Munzinger
2
, Mark W. Chase
3,4
and Rosabelle Samuel
1
1
Department of Botany and Biodiversity Research, Faculty of Life Sciences, University of Vienna, Rennweg 14, 1030
Wien, Austria,
2
IRD, UMR AMAP, TA A51/PS2, 34398 Montpellier Cedex 5, France,
3
Jodrell Laboratory,
Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3DS, UK and
4
School of Plant Biology,
The University of Western Australia, Crawley, WA 6009, Australia
* For correspondence. Present address: Department of Integrative Biology and Biodiversity Research, University of Natural
Resources and Life Sciences, Vienna, Gregor-Mendel-Straße 33, 1180 Wien, Austria. E-mail barbara.turner@univie.ac.at
Received: 23 October 2015 Returned for revision: 10 February 2016 Accepted: 26 February 2016
Background and Aims Some plant groups, especially on islands, have been shaped by strong ancestral bottle-
necks and rapid, recent radiation of phenotypic characters. Single molecular markers are often not informative
enough for phylogenetic reconstruction in such plant groups. Whole plastid genomes and nuclear ribosomal DNA
(nrDNA) are viewed by many researchers as sources of information for phylogenetic reconstruction of groups in
which expected levels of divergence in standard markers are low. Here we evaluate the usefulness of these data
types to resolve phylogenetic relationships among closely related Diospyros species.
Methods Twenty-two closely related Diospyros species from New Caledonia were investigated using whole plas-
tid genomes and nrDNA data from low-coverage next-generation sequencing (NGS). Phylogenetic trees were
inferred using maximum parsimony, maximum likelihood and Bayesian inference on separate plastid and nrDNA
and combined matrices.
Key Results The plastid and nrDNA sequences were, singly and together, unable to provide well supported
phylogenetic relationships among the closely related New Caledonian Diospyros species. In the nrDNA, a 6-fold
greater percentage of parsimony-informative characters compared with plastid DNA was found, but the total num-
ber of informative sites was greater for the much larger plastid DNA genomes. Combining the plastid and nuclear
data improved resolution. Plastid results showed a trend towards geographical clustering of accessions rather than
following taxonomic species.
Conclusions In plant groups in which multiple plastid markers are not sufficiently informative, an investigation at
the level of the entire plastid genome may also not be sufficient for detailed phylogenetic reconstruction.
Sequencing of complete plastid genomes and nrDNA repeats seems to clarify some relationships among the New
Caledonian Diospyros species, but the higher percentage of parsimony-informative characters in nrDNA compared
with plastid DNA did not help to resolve the phylogenetic tree because the total number of variable sites was much
lower than in the entire plastid genome. The geographical clustering of the individuals against a background of
overall low sequence divergence could indicate transfer of plastid genomes due to hybridization and introgression
following secondary contact.
Key words: Diospyros, genome skimming, island floras, New Caledonia, next-generation sequencing, nuclear
ribosomal DNA, rapid radiation, complete plastid genomes.
INTRODUCTION
New Caledonia comprises an archipelago in the southern
Pacific known for its characteristic, rich endemic flora
(Lowry, 1998;Morat et al., 2012). Due to its complex geolo-
gical history, New Caledonia features a mosaic of soil types
(Pelletier, 2006;Maurizot and Vende´-Leclerc, 2009), which, in
combination with its elevational and climatic heterogeneity, re-
sults in many different habitats. One of the genera that has
adapted to a wide range of these habitats is Diospyros
(Ebenaceae).
Diospyros is a large genus of woody dioecious plants found
worldwide in the tropics and subtropics, including 31 species in
New Caledonia. Previous studies based on plastid markers
(Duangjai et al.,2009)showedthatDiospyros colonized New
Caledonia at least four times via long-distance dispersal.
Two of the successful dispersal events each resulted in a single
species that still persists; an additional dispersal event led to a
small clade comprising five species; and yet another event gave
rise to a putatively rapidly radiating group of 24 endemic spe-
cies. These 24 species have been shown to be highly similar
genetically using low-copy nuclear and plastid markers
(Duangjai et al., 2009;Turner et al., 2013a). Most of these
closely related species are morphologically and ecologically
clearly differentiated, and current species delimitations (White,
1993) have been generally confirmed by analyses of amplified
fragment length polymorphisms (AFLPs; Turner et al., 2013b)
and restriction site-associated DNA sequencing (RADseq; Paun
et al.,2016).
V
CThe Author 2016. Published by Oxford University Press on behalf of the Annals of Botany Company.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/
by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
Annals of Botany Page 1 of 11
doi:10.1093/aob/mcw060, available online at www.aob.oxfordjournals.org
by guest on April 21, 2016http://aob.oxfordjournals.org/Downloaded from
On New Caledonia, Diospyros species are found in many
habitats, but they often grow in proximity to each other. At
some localities, several species are microsympatric, which
allows interspecific gene flow if reproductive isolation is still
incomplete. Dating analysis based on four plastid and two low-
copy nuclear DNA regions showed that the ancestors of this
group of New Caledonian Diospyros species arrived in New
Caledonia around 9 million years ago (Turner et al.,2013a).
Given that Diospyros includes long-lived perennial plants, it be-
comes obvious that they have evolved relatively recently.
Resolving the phylogenetic relationships in such a young group
of rapidly radiating and potentially hybridizing taxa poses sig-
nificant challenges (Glor, 2010). We test here the usefulness of
next-generation sequencing (NGS)-based genome skimming to
obtain phylogenetic data, in particular by sequencing whole
plastomes and the full-length nuclear ribosomal DNA region
(i.e. nrDNA).
The plastid genome has proved useful for molecular phylo-
genetic investigations of plants at different taxonomic levels. In
the past two decades, sequences from the plastid genome have
been extensively used to infer phylogenetic relationships among
plants (e.g. Chase et al., 1993;Barfuss et al.,2005;Duangjai
et al., 2009;Russell et al.,2010). Uniparental inheritance, low
mutation rates and high copy number are well-known features
of plastid genomes and the basis for their standard usage in
plant systematics. Due to the slow rate of evolution of the plas-
tid genome, the level of variation is often low compared with
nuclear DNA in general and mitochondrial markers in animals
(Schaal et al.,1998). The high level of conservation has pre-
vented the development of a universally applicable single bar-
coding region in plants (CBOL Plant Working Group, 2009).
Recently, whole-plastid genome sequencing has become afford-
able, and this has been employed to generate more highly
resolved phylogenetic trees (e.g. Ku et al., 2013;Yang et al.,
2013;Barrett et al., 2014;Male´et al., 2014).
Genes coding for nuclear ribosomal RNA are found in the
genome in multiple copies arranged in tandem repeats, and
therefore it is feasible to obtain full-length sequences from low-
coverage NGS approaches. Because of concerted evolution,
these thousands of copies mostly behave as single-copy genes,
and potential evidence of hybridization is normally eliminated
within a few generations (Chase et al., 2003). Each nrDNA re-
peat consists of coding and non-coding elements. In plants, the
four rRNA genes are arranged in two clusters. One of these
clusters comprises three genes (18S, 58S and 26S) separated
by two internal transcribed spacers (ITSs), the external tran-
scribed spacer (ETS) and the non-transcribed spacer (NTS).
The second cluster includes the coding region of one rRNA
(5S) and a spacer between the repeats. The nrDNA genes are
relatively conserved but contain enough variation that they
have been used for phylogenetic reconstructions at higher taxo-
nomic levels (Maia et al.,2014). The non-coding spacers, ITS
and ETS, between the genes are much more variable and repre-
sent a useful source of phylogenetic information among closely
related species (e.g. Devos et al.,2006;Sanz et al.,2008;
Akhani et al.,2013;Nu¨rk et al.,2013;Zhu et al.,2013).
Here we aimed to use whole plastid genomes as well as com-
plete sequences of the nrDNA for phylogenetic analyses for
species of the New Caledonian Diospyros to investigate
whether this approach would produce an improved estimate of
phylogenetic relationships in this putatively rapidly radiating
group.
MATERIALS AND METHODS
Leaf material from New Caledonian Diospyros species was col-
lected on the main island, Grande Terre, and on one smaller is-
land in the south, ^
Ile des Pins (Fig. 1). For the widespread
species, at least two representative individuals were sequenced
(Table 1). In total we included here 36 individuals of New
Caledonian Diospyros species (corresponding to 21 identified
and one unidentified species) as well as one individual of
Diospyros olen (from New Caledonia, but not closely related to
the other New Caledonian Diospyros species) and one individ-
ual of Diospyros ferrea from Thailand. Diospyros olen and D.
ferrea were used as outgroups. Wherever possible, we used the
exact same individuals for which Sanger sequence data are
available from previous studies (Turner et al.,2013a). However,
because of poor DNA quality (unsuitable for NGS) or unavail-
ability for some of the accessions, we had to include in this study
another accession from the same population or locality.
DNA was extracted from silica gel-dried leaf material using
a modified sorbitol/high-salt cetyltrimethylammonium bromide
(CTAB) method (Tel-Zur et al., 1999). Extracts were purified
using the NucleoSpin gDNA Clean-up kit (Marcherey-Nagel,
Germany), according to the manufacturer’s protocol.
From each sample, 300 ng of DNA was sheared (in two
cycles of 45 s with 30 s break between the two shearing runs)
using an ultrasonicator (Bioruptor Pico, Diagenode, Belgium),
targeting a mean fragment size of 400 bp. Library preparation
was performed using the NEBNext Ultra DNA Library Prep
Kit for Illumina (New England Biolabs, USA) according to the
manufacturer’s protocol. All individuals were barcoded and
pooled to reach an equal representation of each individual in
the final libraries. In total, two libraries (containing 14 and 24
samples per library, respectively) were prepared for the 38 sam-
ples used. The two libraries were sequenced on an Illumina
HiSeq as 100-bp paired-end reads at the VBCF (Vienna
Biocenter Core Facilities, Vienna, Austria; http://www.vbcf.
ac.at/facilities/next-generation-sequencing/). Demultiplexing of
the raw data was performed, allowing for a maximum of one
mismatch using the Picard BamIndexDecoder (included in the
Picard Illumina2bam package; available from https://github.
com/wtsi-npg/illumina2bam). The number and quality of raw
reads obtained from each individual were evaluated with
FastQC (Andrews, 2010).
Assembling and annotating plastid genomes
Reads originating from the plastid genome (pt DNA) were
filtered using a multistep and iterative in-house established
pipeline. First, the individual raw files were imported into the
CLC GENOMIC WORKBENCH v. 6.5 (Qiagen) and trimmed
by quality at P<005, retaining reads of at least 30 bp. Then
the reads of D. ferrea were mapped on the complete plastid
genome of Camellia sinensis (Theaceae, Ericales, GenBank:
KC143082.1). For this initial mapping of both coding and non-
coding regions (the latter comprising introns and intergenic spa-
cers), a mismatch cost of 2 and insertion and deletion cost of 3
Page 2 of 11 Turner et al. — Plastid genomes and nrDNA of Diospyros species from New Caledonia
by guest on April 21, 2016http://aob.oxfordjournals.org/Downloaded from
were used, requiring at least 80 % of a read to be at least 90 %
similar to the target for each successful mapping. With these
settings, 403 980 reads of D. ferrea mapped to the Camellia
plastome. We re-extracted the initial paired-end read data cor-
responding to these reads using FastQ.filter.pl (Rodriguez-R
and Konstantinidis, 2016, available at https://github.com/lmro
driguezr/enveomics/) and have assembled them de novo in the
CLC GENOMIC WORKBENCH, with automatic optimization
of the word and bubble sizes and updating the contigs after
mapping back the reads. We obtained three contigs, which have
been concatenated by aligning them to the C. sinensis reference
sequence manually in the program BioEdit v. 7.2.5 (Hall,
1999). From the coverage information and by comparison with
the C. sinensis reference we were able to identify both inverted
repeats and confirm their presence in the plastid genome of
Diospyros. Both inverted repeats were reconstructed together
and duplicated to represent a complete plastid genome. The
base composition of the assembled contigs was extracted from
the alignments using BioEdit v. 7.2.5 (Hall, 1999).
The plastid genomes of the rest of the Diospyros species
were obtained in a similar way, but the initial mapping step was
performed on the assembled D. ferrea genome. Finally, annota-
tion of coding regions was performed using DOGMA (Wyman
et al.,2004) using only the D. vieillardii BT025 plastid
genome. The circular plastid genome map (Fig. 2) was visual-
ized with OGDRAW (Lohse et al., 2007).
Due to the difficulties with assembling the mitochondrial
genome of plants as well as the low level of phylogenetic infor-
mation in plant mtDNA (Male´et al.,2014), we did not attempt
to assemble the mitochondrial genome of Diospyros.
Assembly of nrDNA repeats
Reads containing sequences from the nrDNA region were
collected and assembled using the program MITObim v. 1.7
(Hahn et al., 2013). As initial seed, we used previously gener-
ated Sanger sequences from the ITS region of Diospyros vieil-
lardii BT025 (B. Turner, unpubl. res.). Characteristics of
assemblies such as the number of reads assembled in each con-
tig and coverage were inspected from the MITObim output
files. Assembled sequences were aligned using Muscle v. 3.8
(Edgar, 2004). The alignment was manually inspected using the
program BioEdit v. 7.2.5 (Hall, 1999). The beginning and end
of each coding region were estimated by comparing the
Diospyros alignment with annotated sequences of Solanum
lycopersicum (GenBank: AY366529, AY552528). We also ex-
tracted the 5S nrDNA region using the procedure described
100km
25
26
24
5
4
3
12
19 18 19
7
11
12
20
23
7
89
10
21
22
13
14
15
16
Grande Terre
Île des Pins
1 Nakutakoin
2 Dumbea
3 Between Plum and Prony
4 Col de Prony
5 Kuébini
6 La Roche Percée
7 Pindai/Népoui
8 Voh
9 Between Koné and Poindimié
10 Conservàtoire botanique de Tiéa
11 Aoupinié
12 Nétéa
13 Mandjelia
14 Koumac
15 Koumac
16 Paagoumène
17 Baie de Gadji
18 Yahoué
19 Forêt de la Superbe (Montagne des Sources)
20 Moindah
21 Plateau de Tiéa
22 Plateau de Tango
23 Népoui
24 Île Kuébini
25 Île des Pins, Kanuméra
26 Île des Pins, Pic N‘ga
02550
FIG. 1. Map of New Caledonia indicating the 26 sampling localities for this study. Numbered dots indicate sampling sites (see also Table 1). Dots are coloured ac-
cording to sampling region (north, green; middle, blue; south, red).
Turner et al. — Plastid genomes and nrDNA of Diospyros species from New Caledonia Page 3 of 11
by guest on April 21, 2016http://aob.oxfordjournals.org/Downloaded from
above. As initial seed we used the 120-bp coding fragment of
5S nrDNA sequences of Actinidia chinensis (GenBank:
AF394578).
Phylogenetic analyses
For the phylogenetic analyses of the plastid sequence data,
only one copy of the inverted repeat was included in the final
alignment. For analyses of the nrDNA sequences, only the tran-
scribed regions were used.
Parsimony analyses including bootstrapping were performed
using PAUP* v. 4b10 (Swofford, 2003). They were run using a
heuristic search with stepwise addition, random sequence add-
ition (1000 replicates) and tree bisection–reconnection. Gaps
were treated as missing data. To estimate clade support, boot-
strapping with 1000 replicates was performed using the same
settings as above (including 1000 random replicates per boot-
strap replicate). Estimation of consistency index (CI) and reten-
tion index (RI) was done with PAUP. Likelihood analysis
including bootstrapping was performed using RAxML v. 8.1.3
(Stamatakis, 2014). We used the Broyden–Fletcher–Goldfarb–
Shanno (BFGS) method to optimize generalized time reversible
(GTR) rate parameters, the gamma model of rate heterogeneity
and 1000 rapid bootstrap inferences with a subsequent thorough
maximum likelihood (ML) search. The results were visualized
with FigTree 14 (available from http://tree.bio.ed.ac.uk/soft
ware/figtree/). We rooted the trees obtained with D. olen ac-
cording to earlier results (Turner et al.,2013a).
To reduce the file size and speed up analyses in the com-
bined analyses, D. olen and constant positions were removed
using Mesquite v. 3.01 (Maddison and Maddison, 2014). In
trees conducted from the combined data set D. ferrea was used
as outgroup. Parsimony and likelihood analyses were per-
formed as described for the individual data sets. In addition, we
conducted a Bayesian analysis for the combined data set using
BEAST v. 1.8.1 (Drummond et al.,2012). The best evolution-
ary models for the two subsets (ptDNA and nrDNA) were eval-
uated using jModeltest v. 2.1.6 (Guindon and Gascuel, 2003;
Darriba et al., 2012). For the plastid partition, the transversional
TABLE 1. Table of accessions including all individuals used in this study. The identification numbers of sampling localities are given in
Fig. 1. Voucher codes: JMXXXX: collection number J. Munzinger; Tree No. XXXXX: Tree of New Caledonian Plant Inventory and
Permanent Plot Network (NC-PIPPN, Ibanez et al., 2014); KUFF, Herbarium of the Faculty of Forestry Kasertsat University
Bangkok; MPU, Herbarium of the University of Montpellier; NOU, Herbarium of IRD Noume´ a; P, Herbarium of the Natural History
Museum Paris; WU, Herbarium of the University of Vienna
Taxon Accession number Sampling location Voucher
D. calciphila F.White BT313 25 MPU026746
D. cherrieri F.White BT293 23 NOU079547
D. erudita F.White BT280 21 WU062858
D. ferrea (Wild.) Bakh. Eb045 Duangjai 106 (KUFF)
D. flavocarpa (Vieill. ex P.Parm.) F.White BT130 9 MPU026741
BT156 11 MPU026737
D. glans F.White BT093 5 Turner et al. 093 (MPU)
D. impolita F.White BT102 6 NOU019538
D. inexplorata F.White BT308 24 NOU005818
D. labillardierei F.White BT122 9 NOU052188
D. minimifolia F.White BT135 10 NOU019556
BT233 17 NOU019554
BT263 20 NOU079549, WU062872
D. olen Hiern BT001 1 NOU052191
D. pancheri Kosterm. BT028 3 MPU026742
BT031 3 MPU026742
D. parviflora (Schltr.) Bakh. BT041 4 Turner et al. s.n. (NOU)
BT090 5 NOU2519
BT147 10 NOU052175
BT187 13 NOU031409
BT250 19 Tree no. 23109
BT290 22 NOU079550
D. perplexa F.White BT004 1 MPU026738
D. pustulata F.White BT111 7 NOU019572
BT140 10 NOU052177
BT261 20 NOU079544
D. revolutissima F.White BT120 8 NOU023189
BT221 16 NOU084762
D. tridentata F.White BT207 14 NOU052179
D. trisulca F.White BT185 13 NOU031344
D. umbrosa F.White BT176 12 JM6635 (NOU)
BT247 19 NOU023234
D. veillonii F.White BT227 17 NOU019582
D. vieillardii (Hiern) Kosterm. BT025 2 Turner et al. s.n. (NOU)
BT100 5 NOU006676
BT215 15 NOU023242
D. yahouensis (Schltr.) Kosterm. BT238 18 P00057340
D. sp. Pic N’ga BT318 26 NOU054315
Page 4 of 11 Turner et al. — Plastid genomes and nrDNA of Diospyros species from New Caledonia
by guest on April 21, 2016http://aob.oxfordjournals.org/Downloaded from
model with equal frequencies (TVMef; Posada, 2003) showed
the best fit to the data, and for the nrDNA partition Tamura and
Nei’s model (TrN; Tamura and Nei, 1993) with among-site rate
variation modelled with a gamma distribution (TrNþC)showed
the best fit. Base frequencies (uniform), substitution rates
among bases (gamma shape 10) and alpha (gamma shape 10)
were inferred by jModeltest for each data set. For flexibility,
we used a relaxed uncorrelated log-normal clock model
(Drummond et al., 2006). As a speciation model, we used a
Yule model because the investigated group is so young that we
expected a low proportion of lineage extinction (Yule, 1925;
Gernhard, 2008). For further details regarding the parameters
used see Supplementary Data Fig. S1. Two independent
Metropolis-coupled Markov chain Monte Carlo (MCMC)
rpl2
rpl23
trnI (CAU)
ycf2
ycf15
trnL (CAA)
ndhB
rps7
rps12_3end
trnV (GAC)
rrn16
trnI (GAU)
ycf68
trnA (UGC)
orf42
orf56
rrn23
rrn4.5
rrn5
trnR (ACG)
trnN (GUU)
ycf1
rps15
ndhH
ndhA
I
h
d
n
G
h
dn
ndhE
psaC
ndhD
ccsA
trnL (UAG)
rpl32
ndhF
ycf1
trnN (GUU)
trnR (ACG)
rrn5
rrn4.5
rrn23
trnA (UGC)
orf56
orf42
trnI (GAU)
ycf68
rrn16
trnV (GAC)
rps12_3end
rps7
ndhB
trnL (CAA)
ycf15
ycf2
trnH (CAU)
rpl23
rpl2
rps19
rpl22
rps3
rpl16
rpl14
rps8
infA
rpl36
rps11
rpoA
petD
petB
psbH
psbN
psbT
psbB
clpP
rps12
rpl20
rps18
rpl33
psaJ
trnP (UGG)
trnW (CCA)
petG
petL
psbE
psbF
psbL
psbJ
petA
cemA
ycf4
psaI
accD
rbcL
atpB
atpE
trnM (CAU)
trnV (UAC)
ndhC
ndhK
ndhJ
trnF (GAA)
trnL (UAA)
trnT (UGU)
rps4
trnS (GGA)
ycf3
psaA
psaB
rps14
tRNA-fM (CAU)
trnG (GCC)
psbZ
trnS (UGA)
psbC
ps
bD
trnT (GGU)
trnE (UUC)
trnY (GUA)
trnD (GUC)
psbM
petN
trnC (GCA)
rpoB
rpoC1
rpoC2
rps2
atpI
atpH
atpF
atpA
trnR (UCU)
trnG (UCC)
trnS (GCU)
psbI
psbK
trnQ (UUG)
rps16
matK
psbA
trnH (GUG)
Diospyros vieillardii
plastid genome
157,535 bp
Introns
Ribosomal RNAs
Transfer RNAs
ORFs
H
y
pothetical chloroplast readin
g
frames (
y
cf)
Other genes
clpP, matK
Ribosomal proteins (LSU)
Ribosomal proteins (SSU)
RNA polymerase
RubisCO large subunit
NADH dehydrogenase
ATP s
y
nthase
Cytochrome b/f complex
Photosystem II
Photosystem I
I
R
A
I
R
B
S
S
C
L
S
C
FIG. 2. Graphic representation of the annotated plastid genome of Diospyros vieillardii.
Turner et al. — Plastid genomes and nrDNA of Diospyros species from New Caledonia Page 5 of 11
by guest on April 21, 2016http://aob.oxfordjournals.org/Downloaded from
analyses, each with 20 million generations, were run, sampling
each 1000th generation. The initial 10 % of trees obtained from
each MCMC run were removed as burn-in; the remaining trees
of both runs were used to calculate the maximum clade cred-
ibility tree.
For directly comparing the results of the present study with
previous ones (Turner et al., 2013a) based on four plastid
markers (atpB,rbcL,trnK-matK,trnS-trnG; 5979 bp) and two
low-copy nuclear markers (ncpGS, 717 bp; PHYA, 1189 bp), a
subset of those data corresponding to the individuals included
in this study was analysed in the same way as described above
for the Bayesian analysis. In most cases, to represent each spe-
cies we used either the same accession or individuals from the
same population. In the few cases where no data from individ-
uals of the same population were available, then the geograph-
ically closest individual assigned to the same species was used.
Due to these differences, results of the previous studies are not
strictly comparable to those produced here, but we have com-
pared them nonetheless due to their overall similarity.
A general pattern of geographical clustering was visually
observed in the resulting trees, in particular in the plastid data.
Based on the geographical coordinates of samples and the dis-
tance matrix of pairwise uncorrected P-values (calculated with
SplitsTree), we tested the significance of geographical cluster-
ing of the samples in the trees using a Mantel test. We estimated
the correlation of geographical and genetic discontinuities in
the data by performing this analysis on the plastid and the
combined data sets with Isolation by Distance web service
(IBDWS) (Jensen et al., 2005).
To investigate the relationships between populations we con-
structed a neighbour network based on the plastid markers,
using SplitsTree4 v. 4.13.1 (Hudson and Bryant, 2006) and un-
corrected Pdistance estimates. Based on the results from the
Mantel test [Table 2 andRADseqresults(Paun et al.,2016)],
we excluded samples of D. olen,D. ferrea,D. vieillardii,
D. umbrosa,D. flavocarpa,D. cherrieri and D. veillonii from
this analysis to get a clearer picture of the relationships within
the recently and rapidly radiated species group.
RESULTS
After the demultiplexing step, the number of raw Illumina se-
quences ranged from 10 to 29 million paired-end reads per indi-
vidual. Details of sample characteristics are given in Table 3.
Plastid genomes
We obtained between 75 262 (47average coverage, for
D. parviflora BT250) and 857 039 (531coverage, for D. sp.
Pic N’ga BT318) pairs of reads per individual that mapped to
the plastid genome (for details see Table 3). The GC content of
the plastid genomes of Diospyros (37 %) is similar to those of
many other angiosperms [average 37 %; e.g. Ardisia (Ku
et al., 2013); Camellia (Yang et al.,2013); Potentilla (Ferrarini
et al.,2013); Musa (Martin et al., 2013); Zingiberales (Barrett
et al.,2014)].
Thesize(157 kb) and gene order of the plastid genome of
D. vieillardii (Fig. 2) is similar to that of C. sinensis (GenBank:
KC143082.1). This plastid genome is the first fully sequenced
plastid genome of Ebenaceae reported in the literature.
The plastid matrix of Diospyros used for phylogenetic ana-
lyses (including only one of the inverted repeats) included
133 210 characters, of which 1295 variable positions were
parsimony-uninformative and 384 (03 %) were potentially par-
simony-informative. Phylogenetic reconstruction using parsi-
mony produced 1127 equally parsimonious trees (results not
shown) with a CI of 093 and RI of 087. Phylogenetic relation-
ships between the species were in many cases not well sup-
ported, and in several cases individuals of the same species
failed to form well-supported clusters. Similar results were
obtained using maximum likelihood (Supplementary Data
Fig. S2).
Nuclear ribosomal DNA
Between 008 % (D. impolita BT102) and 051 % (D. ferrea
Eb045) of the total reads pertained to the nrDNA region (for de-
tails see Table 3). The contigs obtained had average coverages
ranging from 209(for D. impolita BT102) to 1159(for
D. vieillardii BT025). The NTS of the intergenic spacer was
difficult to align and therefore excluded from further analyses.
The aligned nrDNA matrix of Diospyros included 7233 charac-
ters, of which 368 variable positions were parsimony-unin-
formative and 141 (19 %) were potentially parsimony-
informative. The parsimony phylogenetic reconstruction with
only the nrDNA sequences produced 84 equally parsimonious
trees (results not shown) with a CI of 066 and RI of 053.
Phylogenetic relationships generally disagreed with results ob-
tained from other markers, but these incongruent relationships
are all weakly supported (results not shown). Several species
TABLE 2. The extent of geographical clustering in the data, as evidenced with Mantel tests performed on IBDWS
Dataset Mantel’s rSignificance
Plastid data
Including all individuals, except the outgroups D. olen and D. ferrea 0103 P¼009
Excluding D. olen,D. ferrea and all D. vieillardii 0242 P<0001
Excluding D. olen,D. ferrea,D. vieillardii,D. umbrosa and D. flavocarpa 0383 P<0001
Excluding D. olen,D. ferrea,D. vieillardii,D. umbrosa,D. flavocarpa,D. cherrieri and D. veillonii 0427 P<0001
Combined
Including all individuals except the outgroups D. olen and D. ferrea 0079 P¼010
Excluding D. olen,D. ferrea and all D. vieillardii 0121 P<005
Excluding D. olen,D. ferrea,D. vieillardii,D. umbrosa and D. flavocarpa 0302 P<0001
Excluding D. olen,D. ferrea,D. vieillardii,D. umbrosa,D. flavocarpa,D. cherrieri and D. veillonii 0374 P<0001
Page 6 of 11 Turner et al. — Plastid genomes and nrDNA of Diospyros species from New Caledonia
by guest on April 21, 2016http://aob.oxfordjournals.org/Downloaded from
failed to form unique groups. Similar results were obtained
using maximum likelihood (Supplementary Data Fig. S3).
The assembled 5S nrDNA region containing coding and non-
coding parts was short (less than 500 bp) and therefore did not
contain many informative characters. Phylogenetic trees based
on this fragment were poorly resolved, and therefore the trees
are not presented or discussed further. The 5S nrDNA region
was also not included in the combined analysis.
Analyses of the combined data set
In addition to the individual analyses of the two data sets, we
also combined them to determine whether this approach pro-
vides better resolution/support.
The combined matrix (i.e. reduced to variable positions as
explained in Materials and methods) included 1136 characters,
of which 437 were potentially parsimony-informative.
Phylogenetic reconstruction using parsimony produced a single
most parsimonious tree of 1580 steps (Supplementary Data Fig.
S4) with a CI of 073 and RI of 052. Phylogenetic relationships
depicted in the combined analysis were better resolved than in
the trees obtained from the individual analyses. A comparable
pattern was found in the Bayesian analysis of the combined
data set (Fig. 3). All three individuals of D. vieillardii
clustered together and were sister to the rest of the New
Caledonian endemic species group. Individuals from
D. flavocarpa and D. umbrosa formed unique groups and were
sister to the rest of the remaining accessions, among which
D. cherrieri and D. veillonii were sister to the rest. The results
from the combined analysis were also in agreement with earlier
results based on plastid and nuclear markers (Turner et al.,
2013a).
There is a trend of geographical clustering visible in the
Bayesian tree (Fig. 3) and in the neighbour network
(Supplementary Data Fig. S5). The neighbour network (Fig.
S5) clearly shows that individuals and populations from the
south and the middle of New Caledonia clustered according to
their sampling region. The Mantel test (Table 2) confirmed
a significant geographical clustering of the genetic informa-
tion (Fig. 3), in particular in the plastid data across the crown
group.
TABLE 3. Details of samples used here for sequencing of whole plastid genomes and nrDNA
Raw reads Plastid genome Nuclear ribosomal DNA
Percentage of
reads mapping
Coverage () GC (%) Percentage of
reads mapping
Coverage () GC (%)
D. calciphila BT313 19549968 134 162 3737 017 444 5924
D. cherrieri BT293 15164658 330 309 3736 011 229 5852
D. erudita BT280 12612118 303 236 3736 015 246 5936
D. ferrea Eb045 9943646 481 300 3734 051 559 5846
D. flavocarpa BT130 10129514 099 62 3736 029 331 5707
D. flavocarpa BT156 15656792 053 51 3737 010 219 5853
D. glans BT093 14436316 091 81 3736 019 386 5848
D. impolita BT102 18636398 114 131 3736 008 209 5926
D. inexplorata BT308 16201056 106 106 3736 024 511 5965
D. labillardierei BT122 26574012 095 156 3736 015 538 586
D. minimifolia BT135 26685314 119 199 3736 009 314 5882
D. minimifolia BT233 25526086 084 134 3735 013 384 5824
D. minimifolia BT263 16154630 195 198 3735 029 616 5914
D. olen BT001 24966688 151 236 3744 038 1084 5832
D. pancheri BT028 27590124 080 136 3735 009 316 5938
D. pancheri BT031 29453086 071 130 3736 009 361 5898
D. parviflora BT041 27178316 083 140 3736 009 356 5854
D. parviflora BT090 25432978 040 63 3737 019 656 5934
D. parviflora BT147 17887304 065 71 3736 028 690 5869
D. parviflora BT187 26648984 045 74 3736 021 690 566
D. parviflora BT250 17588828 043 47 3736 037 877 5902
D. parviflora BT290 19187356 039 47 3736 028 737 5908
D. perplexa BT004 18085506 074 82 3736 030 595 5743
D. pustulata BT111 15958418 246 242 3736 028 467 5769
D. pustulata BT140 17029506 180 188 3736 026 580 5952
D. pustulata BT261 15585090 119 114 3736 037 745 5994
D. revolutissima BT221 20867576 150 193 3737 020 551 5798
D. revolutissima BT120 17111770 146 154 3735 021 466 5916
D. tridentata BT207 14190292 089 77 3735 021 356 5779
D. trisulca BT185 17297816 075 80 3736 029 710 5867
D. umbrosa BT176 18856642 085 98 3738 048 632 5592
D. umbrosa BT247 28845756 059 104 3736 010 385 5919
D. veillonii BT227 26594158 180 297 3736 030 1056 5865
D. vieillardii BT025 26595776 091 152 3734 038 1160 5829
D. vieillardii BT100 22649344 210 297 3733 011 327 5852
D. vieillardii BT215 27710030 080 138 3732 014 501 5894
D. yahouensis BT238 29242716 080 144 3736 015 578 5968
D. sp. Pic N’ga BT318 18515350 463 531 3736 021 500 5937
Turner et al. — Plastid genomes and nrDNA of Diospyros species from New Caledonia Page 7 of 11
by guest on April 21, 2016http://aob.oxfordjournals.org/Downloaded from
0·002
D. parviflora BT187
D. perplexa BT004
D. pancheri BT028
D. pustulata BT140
D. calciphila BT313
D. parviflora BT147
D. tridentata BT207
D. ferrea Eb045
D. umbrosa BT176
D. trisulca BT185
D. parviflora BT250
D. glans BT093
D. sp. Pic N‘
g
a BT318
D. vieillardii BT100
D. flavocarpa BT156
D. inexplorata BT308
D. vieillardii BT215
D. minimifolia BT233
D. parviflora BT041
D. labillardierei BT122
D. flavocarpa BT130
D. revolutissima BT120
D. pustulata BT111
D. revolutissima BT221
D. pustulata BT261
D. parviflora BT090
D. vieillardii BT025
D. erudita BT280
D. yahouensis BT238
D. umbrosa BT247
D. cherrieri BT293
D. veillonii BT227
D. parviflora BT290
D. minimifolia BT135
D. impolita BT102
D. minimifolia BT263
D. pancheri BT031
1·00
1·00
1·00
1·00
1·00
1·00
1·00
1·00
1·00
1·00
0·98
1·00
1·00
1·00
1·00
1·00
1·00
1·00
1·00
1·00
1·00
1·00
0·99
D. parviflora BT147
D. veillonii BT229
D. yahouensis BT238
D. pustulata BT137
D. impolita BT102
D. calciphila BT314
D. glans BT093
D. parviflora M2037
D. perplexa BT004
D. minimifolia BT231
D. parviflora M2338
D. minimifolia BT264
D. vieillardii BT213
D. minimifolia BT133
D. sp. Pic N‘ga BT318
D. flavocarpa BT126
D. erudita BT287
D. umbrosa M2265
D. revolutissima BT116
D. ferrea Eb045
D. parviflora M2071
D. inexplorata BT311
D. vieillardii BT100
D. parviflora BT187
D. pancheri BT028
D. labillardierei BT122
D. cherrieri BT297
D. vieillardii BT025
D. pustulata BT259
D. umbrosa BT247
D. pancheri BT031
D. revolutissima BT219
D. trisulca BT185
D. pustulata BT113
D. parviflora BT040
D. flavocarpa BT156
D. tridentata BT203
0·98
1·00
1·00
1·00
1·00
Whole plastid genome and nrDNA
B
Four plastid and two low copy nuclear markers
A
FIG. 3. Phylogenetic trees resulting from the Bayesian analysis of the (A) combined data set of the present study (variable nucleotide positions in the whole plastid genome and nrDNA) and (B) combined data
set of a previous study including four plastid and two low-copy nuclear markers (Turner et al.,2013a). The numbers indicate posterior probabilities >90. Trees are scaled to the same branch length. Samples are
coloured according to sampling region (see Fig. 1).
Page 8 of 11 Turner et al. — Plastid genomes and nrDNA of Diospyros species from New Caledonia
by guest on April 21, 2016http://aob.oxfordjournals.org/Downloaded from
DISCUSSION
Previous standard approaches to phylogenetic analysis of
Diospyros species including samples from New Caledonia used
nine (Duangjai et al.,2009) and four (Turner et al., 2013a)plas-
tid markers (alignment length >8and64 kb, respectively).
They demonstrated low levels of sequence divergence among
these species, indicating a fairly recent and rapid radiation.
Inclusion of low-copy nuclear markers, which have been shown
to be highly informative and useful for resolving phylogenetic
relationships at lower taxonomic levels in some taxa [e.g.
Paeonia (Tank and Sang, 2001)andPassiflora (Yockteng and
Nadot, 2004)], only partly improved resolution among the New
Caledonian Diospyros species (Turner et al.,2013a). Similar
results have been observed in two genera of Cunoniaceae from
New Caledonia [Spiraeanthemum (Pillon et al.,2009a)and
Codia (Pillon et al.,2009b)].
AFLP markers are typically used at low taxonomic level in
closely related species and population analyses (e.g. Despre´
et al.,2003;Tremetsberger et al.,2006;Gaudeul et al., 2012).
In the case of the New Caledonian Diospyros species, we used
AFLPs to evaluate species limits (Turner et al., 2013b), and in
most cases we found congruence with the species concepts of
White (1993). However, the AFLP approach was not useful in
resolving phylogenetic relationships among these species
(Turner et al., 2013b) because there were few of the individual
AFLP fragments shared by two or more species. It would ap-
pear from the AFLP results that fragmentation of an original
widespread population occurred in many regions of New
Caledonia more or less simultaneously, resulting in genetically
distinct populations that correspond to the morphologically
based species delimitations of White (1993) without leaving
much evidence for interspecific relationships.
The phylogenetic tree obtained here based on the whole plas-
tid genome (Fig. S2) is similar to the phylogenetic tree previ-
ously based on four plastid markers (Turner et al., 2013a). The
combined tree of this study (Fig. 3A)is similar both in reso-
lution and structure to the phylogenetic tree based on four plas-
tid and two low-copy nuclear markers (Fig. 3B). Although not
always represented by the same individuals, general relation-
ships of species are in agreement.
The nuclear ribosomal region included a higher percentage
of parsimony-informative characters (19versus03%)than
the plastid DNA, which is in agreement with findings in other
plant groups (Hamby and Zimmer, 1992;Doyle, 1993;Male´
et al., 2014). Despite this variability, nrDNA was still too short
to contain enough variation (141 versus 384 informative sites)
and failed to resolve phylogenetic relationships among the New
Caledonian Diospyros species (Fig. S3).
Our results clearly show that for this group of species, in
which standard plastid and nuclear markers were not helpful for
resolving the phylogenetic relationships, using the whole plas-
tid genome does not greatly increase resolution and support.
There are only a few other studies available in which whole
plastid genomes have been used to resolve phylogenetic rela-
tionships at the intraspecific level among closely related spe-
cies. Some studies [e.g. in Chrysobalanaceae (Male´et al.,
2014) and eucalypts (Myrtaceae) (Bayly et al.,2013)] have re-
vealed that this approach can be useful for resolving phylogen-
etic relationships among genera, but they failed to resolve
phylogenetic relationships among closely related species and to
group together individuals of the same taxonomic species. In
cases of recently radiating species groups, in particular follow-
ing an extreme bottleneck associated with a long-distance dis-
persal event such as the arrival of Diospyros in New Caledonia,
plastid genomes appear to be insufficient for inference of
phylogenetic relationships. The basis of the rapid radiation of
Diospyros in New Caledonia is not yet clear, but it has been
speculated that it is an adaptive origin associated with different
soil types (Paun et al., 2016), as recently shown for the genus
Geissois in Cunoniaceae (Pillon et al., 2014).
The individuals of D. vieillardii,D. umbrosa,D. flavocarpa,
D. cherrieri and D. veillonii form a minimally isolated group in
the combined analyses (Fig. 3). These species form clusters that
are successively sister to the rest of the taxa, which are well
supported collectively but form a highly unresolved central
cluster. This unresolved central cluster is less than 6 million
years old (Paun et al.,2016) and could be the result of two lin-
eages that developed in isolation and then subsequently colon-
ized some of the same habitats. This too could be the result of
simultaneous parallel divergence in different parts of New
Caledonia combined with effects of local and more recent hy-
bridization (as was indicated in the AFLP study; Turner et al.,
2013b). Retention of ancient polymorphisms present in the ori-
ginal colonizing population, which probably also underwent a
severe bottleneck, could not produce such a geographically
structured pattern.
Comparisons of the phylogenetic tree based on plastid se-
quences (Fig. S2) with the tree derived from RAD data [Paun
et al., 2016 (Supplementary Data Fig. S6)] showed several clus-
ters of individuals (D. trisulca and D. parviflora from L13, D.
pustulata and D. minimifolia from L20, D. pancheri and D. par-
viflora from L3 and 4; Table 1,Fig. 1) that occur with high stat-
istical support in the plastid results, but are not present in the
nuclear tree. These clusters consist of individuals found in the
same or very nearby locations, which could indicate introgres-
sive hybridization and transfer of plastid genomes as a relevant
phenomenon (Naciri and Linder, 2015, and references therein).
Geographical rather than taxonomic clustering was observed
for all populations of D. minimifolia and D. parviflora that also
failed to form unique groups in nuclear results [AFLP (Turner
et al., 2013b)RAD(Paun et al.,2016)]. Phenomena like intro-
gressive hybridization and transfer of plastid genomes could
also explain the geographical clustering of individuals observed
in the plastid data set (Figs S2 and S5;Table 2), whereas in nu-
clear-based data sets [AFLP (Turner et al., 2013b), RAD (Paun
et al., 2016)] no such geographical clustering was observed.
Similar geographical clustering for plastid results has previ-
ously been reported in other plant groups [e.g. Nothofagus
(Acosta and Premoli, 2010)].
Although there are more than 140 kb of DNA sequence
included in this study, it effectively corresponds to only two
markers (plastid genome and rDNA region). As not all genes
evolve in the same mode and at the same tempo, phylogenies
based on different genes might show different phylogenetic re-
lationships (Heled and Drummond, 2010). It is therefore im-
portant to use many phylogenetic markers for reconstruction of
relationships to overcome the limitations of individual genes
and produce results as close as possible to the real species tree.
Phylogenetic trees based on plastid data should always be
Turner et al. — Plastid genomes and nrDNA of Diospyros species from New Caledonia Page 9 of 11
by guest on April 21, 2016http://aob.oxfordjournals.org/Downloaded from
viewed as gene trees because of the typical evolutionary path-
ways of these organelles (Naciri and Linder, 2015). The trees
presented here may hence show the evolutionary history of the
particular region investigated, and may differ from the species
trees. In the case of the New Caledonian Diospyros species, we
consider the SNP data derived from RAD (Paun et al.,2016)as
the best approximation of the species tree, because it involves
nearly 8500 independent markers from the nuclear genome.
CONCLUSIONS
Although New Caledonian Diospyros are morphologically and
ecologically diverse, they show little genetic divergence. For
these rapidly radiating Diospyros species, in which standard
plastid and nuclear markers were not helpful for resolving the
phylogenetic relationships, using the whole plastid genome
does not greatly increase resolution and support. Plastid
markers grouped accessions according to geographical proven-
ance, which could result from local transfer of plastid genomes
due to hybridization and introgression following secondary
contact.
We are now conducting additional nuclear genome studies
(both coding and non-coding regions) to determine whether
other approaches could help us determine the potential adaptive
nature of this radiation, which has thus far defeated our at-
tempts using standard and next-generation methods.
SUPPLEMENTARY DATA
Supplementary data are available online at www.aob.oxfordjour
nals.org and consist of the following. Figure S1: BEAST input
file for the Bayesian analysis of the combined data set including
all settings and priors of both data sets. Figure S2: maximum
likelihood tree based on plastid sequences. Figure S3: max-
imum likelihood tree based on nrDNA sequences. Figure S4:
single most parsimonious tree based on the combined data ma-
trix. Figure S5: neighbour network based on the plastid data set.
Individuals are coloured according to their sampling region.
Figure S6: maximum likelihood tree based on RAD SNP data.
ACKNOWLEDGEMENTS
We thank Emiliano Trucchi for his help and ideas concerning
data analysis and IRD’s Noume´a team for assistance with field
work, especially Ce´line Chambrey. Voucher specimens are
deposited in the herbaria of Noumea (NOU), University of
Montpellier (MPU), University of Vienna (WU) and the
Faculty of Forestry of the Kasertsat University Bangkok
(KUFF). This work was supported by the Austrian Science
Fund (P 22159-B16 to R. S.).
LITERATURE CITED
Acosta MC, Premoli AC. 2010. Evidence of chloroplast capture in South
American Nothofagus (subgenus Nothofagus, Notofagaceae). Molecular
Phylogenetics and Evolution 54: 235–242.
Akhani H, Malekhammadi M, Mahdavi P, Gharibiyan A, Chase MW. 2013.
Phylogenetics of the Irano-Turanian taxa of Limonium (Plumbaginaceae)
based on ITS nrDNA sequences and leaf anatomy provides evidence for
species delimitation and relationships of lineages. Botanical Journal of the
Linnean S ociety 171: 519–550.
Andrews S. 2010. FastQC: a quality control tool for high throughpu t sequence
data. http://www.bioinformatics.babraham.ac.uk/projects/fastqc/.
Barfuss MHJ, Samuel R, Till W, Stuessy TF. 2005. Phylogenetic relationships
in subfamily Tillandsioideae (Bromeliaceae) based on DNA sequence data
from seven plastid regions. American Journal of Botany 92: 337–351.
Barrett CF, Specht CD, Leebens-Mack J, Stevenson DW, Zomlefer WB,
David JI. 2014. Resolving ancient radiations: can complete plastid gene
sets elucidate deep relationships among the tropical gingers (Zingiberales)?
Annals of Botany 113: 119–133.
Bayly MJ, Rigault P, Spokevicius A, et al.2013.Chloroplast genome analysis
of Australian eucalypts – Eucalyptus, Corymbia, Angophora, Allosyncarpia
and Stockwellia (Myrtaceae). Molecular Phylogenetics and Evolution 69:
704–716.
CBOL Plant Working Group. 2009. A DNA barcode for land plants.
Proceedings of the National Academy of Sciences of the USA 106:
12794–12797.
Chase MW, Soltis DE, Olmstead RG, et al. 1993. Phylogenetics of seed plants:
an analysis of nucleotide sequences from the plastid gene rbcL.Annals of
the Missouri Botanical Garden 80: 528–548, 550–580.
Chase MW, Knapp S, Cox AV, et al.2003.Molecular systematics, GISH and
the origin of hybrid taxa in Nicotiana (Solanaceae). Annals of Botany 92:
107–127.
Darriba D, Taboada GL, Doallo R, Posada D. 2012. jModelTest 2: more mod-
els, new heuristics and parallel computing. Nature Methods 9: 772.
Despre´ L, Giells L, Redoutet B, Taberlet P. 2003. Using AFLP to resolve
phylogenetic relationships in a morphologically diversified plant species
complex when nuclear and chloroplast sequences fail to reveal variability.
Molecular Phylogenetics and Evolution 27: 185–196.
Devos N, Raspe´O,JacquemartA-L,TytecaD.2006.On the monophyly of
Dactylorhiza Necker ex Nevski (Orchidaceae): is Coleoglossum viride (L.)
Hartman a Dactylorhiza?Botanical Journal of the Linnean Society 152:
261–269.
Doyle JJ. 1993. DNA, phylogeny, and the flowering of plant systematics.
BioScien ce 43: 380–389.
Drummond AJ, Ho SYW, Phillips MJ, Rambaut A. 2006. Relaxed phylogen-
etics and dating with confidence. PLoS Biology 4: 699–710.
Drummond AJ, Suchard MA, Xie D, Rambaut A. 2012. Bayesian phylogen-
etics with BEAUti and the BEAST 1.7. Molecular Biology and Evolution
29: 1969–1973.
Duangjai S, Samuel R, Munzinger J, et al. 2009. A multi-locus plastid phylo-
genetic analysis of the pantropical genus Diospyros (Ebenaceae), with an
emphasis on the radiation and biogeographic origins of the New Caledonian
endemic species. Molecular Phylogenetics and Evolution 52: 602–620.
Edgar RC. 2004. MUSCLE: multiple sequence alignment with high accuracy
and high throughput. Nucleic Acids R esearch 32: 1792–1797.
Ferrarini M, Moretto M, Ward JA, et al.2013.Evaluation of the PacBio RS
platform for sequencing and de novo assembly of a chloroplast genome.
BMC Genomics 14: 670. doi:10.1186/1471-2164-14-670.
Gaudeul M, Rouhan G, Gardner MF, Hollingsworth PM. 2012. AFLP
markers provide insights into the evolutionary relationships and diversifica-
tion of New Caledonian Araucaria species (Araucariaceae). American
Journal of Botany 99: 68–81.
Gernhard T. 2008. The conditioned reconstructed process. Journal of
Theoretical Biology 253: 769–788.
Glor RE. 2010. Phylogenetic insights on adaptive radiation. Annual Review of
Ecology,Evolution and Systematics 41: 251–270.
Guindon S, Gascuel O. 2003. A simple, fast and accurate method to estimate
large phylogenies by maximum-likelihood. Systematic Biology 52:
696–704.
Hahn C, Bachmann L, Chevereux B. 2013. Reconstructing mitochondrial gen-
omes directly from genomic next-generation sequencing reads – a baiting
and iterative mapping approach. Nucleic Acids Research 41: e129.
doi:10.1093/nar/gkt371.
Hall TA. 1999. BioEdit: a user-friendly biological sequence alignment editor
and analysis program for windows 95/98/NT. Nucleic Acids Symposium
Series 41: 95–98.
Hamby RK, Zimmer EA. 1992. Ribosomal RNA as a phylogenetic tool in plant
systematics. In: Soltis PS, Soltis DE, Doyle JJ, eds. Molecular systematics
of plants. New York: Chapman & Hall, 50–91.
Heled J, Drummond AJ. 2010. Bayesian inference of species trees from multi-
locus data. Molecular Biology and Evolution 27: 570–580.
Hudson DH, Bryant D. 2006. Application of phylogenetic networks in evolu-
tionary studies. Molecular Biology and Evolution 23: 254–267.
Page 10 of 11 Turner et al. — Plastid genomes and nrDNA of Diospyros species from New Caledonia
by guest on April 21, 2016http://aob.oxfordjournals.org/Downloaded from
Ibanez T, Munzinger J, Dagostini G, et al.2014.Structural and floristic diver-
sity of mixed tropical rain forest in New Caledonia: new data from the New
Caledonian plant inventory and permanent plot network (NC-PIPPN).
Applied Vegetation Science 17: 386–397.
Jensen JL, Bohonak AJ, Kelley ST. 2005. Isolation by distance, web service.
BMC Genetics 6: 13. doi:10.1186/1471-2156-6-13.
Ku C, Hu J-M, Kuo C-H. 2013.Complete plastid genome sequence of the basal
asterid Ardisia polysticta Miq. and comparative analyses of asterid plastid
genomes. PLoS One 8: e62548. doi:10.1371/journal.pone.0062548.
Lohse M, Drechsel O, Bock R. 2007. OrganellarGenomeDRAW (OGDRAW):
a tool for the easy generation of high-quality custom graphical maps of plas-
tid and mitochondrial genomes. Current Genetics 52: 267–274.
Lowry II PP. 1998. Diversity, endemism and extinction in the flora of New
Caledonia: a review. In: Peng CF, Lowry II PP, eds. Rare, threatened, and
endangered floras of Asia and the Pacific rim. Taiwan: Institute of Botany,
Taipei, 181–206.
Maddison WP, Maddison DR. 2014. Mesquite:a modular system for evolution-
ary analysis. V. 3.01. http://mesquiteproject.org.
Maia VH, Gitzendanner MA, Soltis PS, Wong GK-S, Soltis DE. 2014.
Angiosperm phylogeny based on 18S/26S rDNA sequence data: construct-
ing a large data set using next-generation sequence data. International
Journal of Plant Sciences 175: 613–650.
Male´ P-JG, Bardon L, Besnard G, et al.2014.Genome skimming by shotgun
sequencing helps resolve the phylogeny of a pantropical tree family.
Molecular Ecology Resources 14: 966–975.
Martin G, Baurens F-C, Cardi C, Aury J-A, D’Hont A. 2013. The complete
chloroplast genome of banana (Musa acuminata, Zingiberales): insight into
plastid monocotyledon evolution. PLoS One 8: e67350. doi:10.1371/
journal.pone.0067350.
Maurizot P, Vende´-Leclerc M. 2009. New Caledonia geological map, scale 1/
500000. Direction de l’Industrie, des Mines et de l’Energie, Service de la
Ge´ologie de Nouvelle-Cale´donie, Bureau de Recherches Ge´ologiques et
Minie`res.
Morat P, Jaffre´ T, Tronchet F, et al.2012.Le re´fe´rentiel taxonomique florical
et les caracte´ristiques de la flore vasculaire indige`ne de la Nouvelle-
Cale´donie. Adansonia 34: 177–219.
Naciri Y, Linder HP. 2015. Species delimitation and relationships: the dance of
the seven veils. Taxon 64: 3–16.
Nu¨rk NM, Mandi~
n
an S, Crine MA, Chase MW, Blattner FR. 2013.
Molecular phylogenetics and morphological evolution of St. John’s wort
(Hypericum; Hypericaceae). Molecular Phylogenetics and Evolution 66:
1–16.
Paun O, Turner B, Trucchi E, Munzinger J, Chase MW, Samuel R. 2016.
Processes driving the adaptive radiation of a tropical tree (Diospyros,
Ebenaceae) in New Caledonia, a biodiversity hotspot. Systematic Biology
65: 212–227.
Pelletier B. 2006. Geology of the New Caledonia region and its implications for
the study of the New Caledonian biodiversity. In: Payri C, Richer de Forges
B, eds. Compendium of marine species from New Caledonia. Documents
Scientifiques et Techniques II4. New Caledonia: Institut de Recherche pour
le De´veloppement Noume´a, 17–30.
Pillon Y, Hopkins HCF, Rigault F, Jaffre´ T, Stacy EA. 2014. Cryptic adaptive
radiation in tropical forest trees in New Caledonia. New Phytologist 202:
521–530.
Pillon Y, Hopkins HCF, Munzinger J, Amir H, Chase MW. 2009a.Cryptic
species, gene recombination and hybridization in the genus
Spiraeanthemum (Cunoniaceae) from New Caledonia. Botanical Journal of
the Linne an Society 161: 137–152.
Pillon Y, Munzinger J, Amir H, Hopkins HCF, Chase MW. 2009b.
Reticulate evolution on a mosaic of soils: diversification of the New
Caledonian endemic genus Codia (Cunoniaceae). Molecular Ecology 18:
2263–2275.
Posada D. 2003. Using MODELTEST and PAUP* to select a model of nucleo-
tide substitution. Current Protocols in Bioinformatics 6.5: 1–14. doi:
10.1002/0471250953.bi0605s00.
Rodriguez-R LM, Konstantinidis KT. 2016. The enveomics collection: a tool-
box for specialized analyses of microbial genomes and metagenomes. PeerJ
Preprints. https://doi.org/10.7287/peerj.preprints.1900v1 (last accessed 30
March 2016).
Russell A, Samuel R, Rupp B, et al. 2010. Phylogenetics and cytology of a pan-
tropical orchid genus Polystachya (Polystachyinae, Vandeae, Orchidaceae):
evidence from plastid DNA sequence data. Taxon 59: 389–404.
Sanz M, Vilatersana R, Hidalgo O, et al. 2008. Molecular phylogeny and evo-
lution of floral characters of Artemisia and allies (Anthemideae,
Asteraceae): evidence from nrDNA ETS and ITS sequences. Taxon 57:
66–78.
Schaal BA, Hayworth DA, Olsen KM, Rauscher JT, Smith WA. 1998.
Phylogeographic studies in plants: problems and prospects. Molecular
Ecology 7: 465–474.
Stamatakis A. 2014. RAxML version 8: a tool for phylogenetic analysis and
post-analysis of large phylogenies. Bioinformatics 30: 1312–1313.
Swofford DL. 2003. PAUP*. Phylogenetic analysis using parsimony (*and
other methods). Version 4. Sunderland, MA: Sinauer Associates.
Tamura K, Nei M. 1993. Estimation of the number of nucleotide substitutions
in the control region of mitochondrial DNA in humans and chimpanzees.
Molecular Biology and Evolution 10: 512–526.
Tank DC, Sang T. 2001. Phylogenetic utility of the glycerol-3-phosphate acyl-
transferase gene: evolution and implications in Paeonia (Paeoniaceae).
Molecular Phylogenetics and Evolution 19: 421–429.
Tel-Zur N, Abbo S, Myslabodski D, Mizrahi Y. 1999. Modified CTAB p roced-
ure for DNA isolation from epiphytic cacti of genera Hylocereus and
Selenicereus (Cactaceae). Plant Molecular Biology Reporter 17: 249–254.
Tremetsberger K, Stuessy TF, Kadlec G, et al. 2006. AFLP phylogeny of
South American species of Hypochaeris (Asteraceae, Lactuceae).
Systematic Botany 31: 610–626.
Turner B, Munzinger J, Duangjai S, et al.2013a.Molecular phylogenetics of
New Caledonian Diospyros (Ebenaceae) using plastid and nuclear markers.
Molecular Phylogenetics and Evolution 69: 740–763.
Turner B, Paun O, Munzinger J, Duangjai S, Chase MW, Samuel R. 2013b.
Amplified fragment length polymorphism (AFLP) data suggest rapid radi-
ation of Diospyros species (Ebenaceae) endemic to New Caledonia. BMC
Evolutionary Biology 13: 269. doi:10.1186/1471–2148–13–269.
White F. 1993. Flore de la Nouvelle-Cale´donie et De´ pendances. 19.
Ebe´nace´es.
Paris: Muse´um National d’Histoire Naturelle.
Wyman SK, Jansen RK, Boore JL. 2004. Automatic annotation of organellar
genomes with DOGMA. Bioinformatics 20: 3252–3255.
Yang J-B, Yang S-X, Li H-T, Yang J, Li De-Zhu. 2013. Comparative chloro-
plast genomes of Camellia species.PLoS ONE 8: e73053.
Yockteng R, Nadot S. 2004. Phylogenetic relationships among Passiflora spe-
cies based on the glutamine synthetase nuclear gene expressed in chloroplast
(ncpGS). Molecular Phylogenetics and Evolution 31: 379–396.
Yule GU. 1925. A mathematical theory of evolution,based on the conclusions of
Dr. J. C. Willis, F.R.S. Philosophical Transactions of the Royal Society
London, Series B 213: 21–87.
ZhuW-D,Nie,Z-L,WenJ,SunH.2013.Molecular phylogeny and biogeog-
raphy of Astilbe (Saxifragaceae) in Asia and eastern North America.
Botanical Journal of the Linnean Society 171: 377–394.
Turner et al. — Plastid genomes and nrDNA of Diospyros species from New Caledonia Page 11 of 11
by guest on April 21, 2016http://aob.oxfordjournals.org/Downloaded from
... A previous study has established that the island Diospyros species have been shaped by ancestral bottlenecks, rapid and recent radiations in phenotypic characters, and repeated and convergent evolution of potentially adaptive traits during the diversification (Fernández-Mazuecos et al., 2020). The island Diospyros taxa (New Caledonia) also experienced similar evolutionary pressure (Turner et al., 2016). Studies of Diospyros about macroevolution of migration among the continents indicated its complex evolutionary history of species diversity (Duangjai et al., 2009;Linan et al., 2019). ...
... (collected from Yunan, China; Yu et al., 2017) was most likely misidentified, considering that D. strigosa is found only in Hainan, China (Lee et al., 1996). In the clade VII and XI, we recognized phyloge- (Duangjai et al., 2009;Turner et al., 2016). Elucidating the boundaries between the different Diospyros species would improve our understanding of the cultivated species' origin, phylogeny and help decide the breeding strategy. ...
Article
Full-text available
Diospyros (Ebenaceae) is a widely distributed genus of trees and shrubs from pantropical to temperate regions, with numerous species valued for their fruits (persimmons), timber, and medicinal values. However, information regarding their plastomes and chloroplast evolution is scarce. The present study performed comparative genomic and evolutionary analyses on plastomes of 45 accepted Diospyros species, including three newly sequenced ones. Our study showed a highly conserved genomic structure across the Diospyros species, with 135–136 encoding genes, including 89 protein-coding genes, 1–2 pseudogenes (Ψycf1 for all, Ψrps19 for a few), 37 tRNA genes and 8 rRNA genes. Comparative analysis of Diospyros identified three intergenic regions (ccsA-ndhD, rps16-psbK and petA-psbJ) and five genes (rpl33, rpl22, petL, psaC and rps15) as the mutational hotspots in these species. Phylogenomic analysis identified the phylogenetic position of three newly sequenced ones and well supported a monophylogenetic (sub)temperate taxa and four clades in the pantropical taxa. The analysis codon usage identified 30 codons with relative synonymous codon usage (RSCU) values >1 and 29 codons ending with A and U bases. A total of three codons (UUA, GCU, and AGA) with highest RSCU values were identified as the optimal codons. Effective number of codons (ENC)-plot indicated the significant role of mutational pressure in shaping codon usage, while most protein-coding genes in Diospyros experienced relaxed purifying selection (dN/dS < 1). Additionally, the psbH gene showed positive selection (dN/dS > 1) in the (sub)temperate species. Thus, the results provide a meaningful foundation for further elaborating Diospyros's genetic architecture and taxonomy, enriching genetic diversity and conserving genetic resources.
... Plastid genomes have been widely used for reconstructing phylogenetic relationships in various plants (Gitzendanner et al. 2018;Forrest et al. 2019;Li et al. 2019;Guo et al. 2022). Nevertheless, plastid data may lack informative sites for addressing complex evolutionary radiations (Turner et al. 2016;Huang et al. 2023). Furthermore, since only maternally inherited information is provided, plastid genomes are represented as a single nonrecombinant linkage group. ...
Article
Full-text available
Sequencing strategies have continually advanced, with high-throughput sequencing (HTS) technologies emerging as pivotal tools in plant phylogenomics. As a standard form of target capture sequencing, hybridization target enrichment sequencing (Hyb-seq) is innovative and flexible, and then HTS strategy is widely adopted in phylogenomics. The universal Angiosperms353 probe sets (Angiosperms353) are instrumental in Hyb-seq and have been witnessed increased application in recent years. This review integrates numerous publications and empirical datasets to comprehensively assess the potential applications of Angiosperms353. Through evaluations using 18-taxa and 343-taxa genomic datasets, this review explores potential factors influencing Angiosperms353 gene capture. The RNA-seq strategy yielded the highest number of Angiosperms353 genes, followed by whole-genome sequencing and genome skimming. Increased sequencing depth enhanced gene capture yields, and no evident bias was observed across all tested angiosperm groups. In the genomics era with extensive HTS data, this review provides comprehensive guidelines for the practical application of Angiosperms353, promoting resource recycling and maximizing genomic data sharing and utilization.
... clade (B.B. Liu & al., 2020b). Additionally, the whole plastome has shown great promise in many lineages of angiosperms, such as Diospyros L. (Turner & al., 2016), Fritillaria L. (Bi & al., 2018), Magnoliaceae (G.N. , Panax L. and Araliaceae (Ji & al., 2019;Valcárcel & Wen, 2019), Theobroma L. (Kane & al., 2012), and Vitis L. and Vitaceae (N. Zhang & al., 2015Zhang & al., , 2016Wen & al., 2018. ...
Article
Full-text available
In this comprehensive study, based on extensive taxon sampling, we performed phylogenomic analyses of plastome and nuclear ribosomal DNA (nrDNA) datasets. We employed multiple inference methods, including concatenated and coalescent‐based strategies, to generate an accurate phylogeny of the woody Rosaceae genus Pourthiaea . The nrDNA phylogeny of Pourthiaea strongly supports three major clades, which are consistent with morphology. However, the plastid tree provides an alternative phylogenetic topology, indicating cytonuclear discordance which could be explained by frequent hybridizations between and among the species of Pourthiaea. Considering evidence from morphology and phylogenomic data, we propose a new infrageneric classification for Pourthiaea , consisting of three sections: P. sect. Pourthiaea , P. sect. Amphidoxae sect. nov., and P. sect. Impressivenae sect. nov.
... Compared with single or multiple sequence regions generated by Sanger sequencing, either complete plastomes or nrDNA arrays possess far more variable loci; therefore, the application of both datasets in plant species discrimination is considered to have great potential for improving resolution [21,23,35,[40][41][42][43]. Consistent with this theoretical expectation, empirical studies have shown that although the application of genome skimming is not yet robust enough to distinguish all species, especially in evolutionarily young and complex plant taxa, it can substantially improve the efficacy of species discrimination in most of the studied taxa [38,41,[44][45][46][47][48][49][50][51][52]. Better yet, previous studies have shown that even using trace and highly degraded genomic DNA extracted from processed plant materials to build shotgun libraries, complete plastomes and nrDNA arrays can be recovered by application of high-throughput sequencing technologies [35,37,38], and the use of genome skimming has shown desirable performance in accurate and sensitive identification of species in mixed samples [53,54]. ...
Article
Full-text available
Background: Paris yunnanensis (Melanthiaceae) is a traditional Chinese medicinal plant of significant pharmaceutical importance. Due to previous taxonomic confusion, a congeneric species, Paris liiana, has been mistaken for P. yunnanensis and cultivated on a large scale, leading to the mixing of commercial products (i.e., seedlings and processed rhizomes) of P. yunnanensis with those of P. liiana. This may have adverse effects on quality control in the standardization of P. yunnanensis productions. As the lack of PCR amplifiable genomic DNA within processed rhizomes is an intractable obstacle to the authentication of P. yunnanensis products using PCR-based diagnostic tools, this study aimed to develop a PCR-free method to authenticate commercial P. yunnanensis products, by applying genome skimming to generate complete plastomes and nrDNA arrays for use as the molecular tags. Results: Based on a dense intraspecies sampling of P. liiana and P. yunnanensis, the robustness of the proposed authentication systems was evaluated by phylogenetic inferences and experimental authentication of commercial seedling and processed rhizome samples. The results indicate that the genetic criteria of both complete plastomes and nrDNA arrays were consistent with the species boundaries to achieve accurate discrimination of P. yunnanensis and P. liinna. Owing to its desirable accuracy and sensitivity, genome skimming can serve as an effective and sensitive tool for monitoring and controlling the trade of P. yunnanensis products. Conclusion: This study provides a new way to solve the long-standing problem of the molecular authentication of processed plant products due to the lack of PCR amplifiable genomic DNA. The proposed authentication system will support quality control in the standardization of P. yunnanensis products in cultivation and drug production. This study also provides molecular evidence to clarify the long-standing taxonomic confusion regarding the species delimitation of P. yunnanensis, which will contribute to the rational exploration and conservation of the species.
... Unfortunately, the results inferred from the nrDNA dataset could not provide solid resolution for the phylogenetic relationships of Ceratostigma, perhaps due to its short length and not enough parsimony-informative characters. Especially, the systematic positions of individuals of C. minus and C. griffithii varied from those of plastome and combined datasets, and similar results were also found in other phylogenetic studies [70,71]. This phenomenon could be better explained by different important determinants including incomplete lineage sorting, hybridization/introgression and other genetic processes [72,73]. ...
Article
Full-text available
Background: Ceratostigma, a genus in the Plumbaginaceae, is an ecologically dominant group of shrubs, subshrub and herb mainly distributed in Qinghai-Tibet Plateau and North China. Ceratostigma has been the focal group in several studies, owing to their importance in economic and ecological value and unique breeding styles. Despite this, the genome information is limited and interspecific relationships within the genus Cerotastigma remains unexplored. Here we sequenced, assembled and characterized the 14 plastomes of five species, and conducted phylogenetic analyses of Cerotastigma using plastomes and nuclear ribosomal DNA (nrDNA) data. Results: Fourteen Cerotastigma plastomes possess typical quadripartite structures with lengths from 164,076 to 168,355 bp that consist of a large single copy, a small single copy and a pair of inverted repeats, and contain 127-128 genes, including 82-83 protein coding genes, 37 transfer RNAs and eight ribosomal RNAs. All plastomes are highly conservative and similar in gene order, simple sequence repeats (SSRs), long repeat repeats and codon usage patterns, but some structural variations in the border of single copy and inverted repeats. Mutation hotspots in coding (Pi values > 0.01: matK, ycf3, rps11, rps3, rpl22 and ndhF) and non-coding regions (Pi values > 0.02: trnH-psbA, rps16-trnQ, ndhF-rpl32 and rpl32-trnL) were identified among plastid genomes that could be served as potential molecular markers for species delimitation and genetic variation studies in Cerotastigma. Gene selective pressure analysis showed that most protein-coding genes have been under purifying selection except two genes. Phylogenetic analyses based on whole plastomes and nrDNA strongly support that the five species formed a monophyletic clade. Moreover, interspecific delimitation was well resolved except C. minus, individuals of which clustered into two main clades corresponding to their geographic distributions. The topology inferred from the nrDNA dataset was not congruent with the tree derived from the analyses of the plastid dataset. Conclusion: These findings represent the first important step in elucidating plastome evolution in this widespread distribution genus Cerotastigma in the Qinghai-Tibet Plateau. The detailed information could provide a valuable resource for understanding the molecular dynamics and phylogenetic relationship in the family Plumbaginaceae. Lineage genetic divergence within C. minus was perhaps promoted by geographic barriers in the Himalaya and Hengduan Mountains region, but introgression or hybridization could not be completely excluded.
... plastomes) as next-generation superbarcodes, and its application for plant species identification has been constantly encouraged (Kane et al., 2012;Dodsworth, 2015;Coissac et al., 2016;Hollingsworth et al., 2016). Subsequent empirical studies have revealed that genome skimming to recover complete plastomes as superbarcodes can increase species identification success compared with that using standard DNA barcodes, although they are not robust enough to discriminate all species, particularly in recently radiated and evolutionarily complex plant taxa (e.g., Kane et al., 2012;Ruhsam et al., 2015;Turner et al., 2016;Ji et al., 2019Ji et al., , 2020Ji et al., , 2021Ji et al., , 2022aFu et al., 2022;Yu et al., 2022). To evaluate the robustness of the proposed method that applies genome skimming to generate complete plastomes as molecular tags for authentication of medicinal Polygonatum species, the primary objective of this study was to evaluate the discriminative power of complete plastomes in terms of identifying Polygonatum species. ...
Article
Polygonati Rhizoma, a traditional Chinese medicine important to the pharmaceutical and food industries, is frequently adulterated with morphologically similar congeneric species, posing quality and effectiveness issues for the products. Accordingly, the accurate and sensitive authentication of medicinal Polygonatum species is vital to the standardization of Polygonati Rhizoma products. In view of the difficulties in morphological analysis and the use of standard DNA barcodes for authentication of medicinal Polygonatum species, an alternative authentication method is needed. Based on inter- and intraspecies sampling, phylogenetic and genetic distance analyses were performed to evaluate the discriminative power of complete plastomes for the identification of Polygonatum species. The results indicate that the genetic criteria of complete plastomes are in good correspondence with species boundaries and therefore can be used for unambiguous identification of medicinal Polygonatum species and morphologically similar congeners, which verifies the theoretical prediction that the application of genome skimming to generate complete plastomes as molecular tags can serve as an effective and accurate method to detect whether Polygonati Rhizoma is adulterated with congeneric species. Additionally, in this study, a taxonomically well-curated and high-resolution reference plastome superbarcode library for Polygonatum was developed for the first time, providing a valuable genetic resource for the application of genome skimming to monitor and control the production and trade of Polygonati Rhizoma to guarantee the effectiveness of downstream products. The findings also provide new insights to clarify the taxonomic ambiguities of the generic circumscription of Polygonatum and Heteropolygonatum, as well as the species delimitation of Polygonatum cyrtonema, Polygonatum nodosum, and Polygonatum punctatum, which will contribute to proper exploitation and conservation of Polygonatum species.
... Thus, the results provide a foundation for 38 elaborating Diospyros's genetic architecture and taxonomy, conserving genetic 39 diversity and enriching genetic resources. with enlarged persistent calyx at the base, and a dioecious breeding system ( (Turner et al., 2016). Therefore, to 67 understand the strategies to adapt to different environmental conditions, the research 68 for leaf habits of Diospyros has great significance (Tomlinson, et al. 2013;Yao, et 69 al., 2020). ...
Preprint
Diospyros (Ebenaceae) is a widely distributed genus of trees and shrubs native to tropical and subtropical regions, with numerous species valued for their fruits (persimmons), timber, and medicinal values. However, information regarding their plastomes and chloroplast evolution is scarce. The present study performed comparative genomic and evolutionary analyses on plastomes of 18 accepted Diospyros species, including three newly sequenced ones. Our study showed a highly conserved genomic structure across the species, with plastome size ranging from 157,321 bp (D. jinzaoshi) to 157,934 bp (D. deyangensis). These plastomes encoded 134–138 genes, including 89–91 protein-coding genes, 1–2 pseudogenes (Ψycf1 for all, Ψrps19 for a few), 37 tRNA genes, and 8 rRNA genes. Comparative analysis of Diospyros identified the intergenic regions (trnH-psbA, rps16-trnQ, trnT-psbD, petA-psbJ, trnL-trnF-ndhJ) as the mutational hotspots in these species. Phylogenomic analyses identified three main groups within the genus designated as the evergreen, deciduous, and island groups. The codon usage analysis identified 30 codons with relative synonymous codon usage (RSCU) values greater than 1 and 29 codons ending with A and U bases. A total of three codons (UUA, GCU, and AGA) with highest (RSCU) values were identified as the optimal codons. ENC-plot indicated the significant role of mutational pressure in shaping codon usage, while most protein-coding genes in Diospyros experienced relaxed purifying selection (Ka/Ks < 1). Additionally, the ndhG, rpoC1, and ycf3 genes showed positive selection (Ka/Ks > 1) in the island, deciduous, and both deciduous and evergreen species, respectively. Thus, the results provide a foundation for elaborating Diospyros’s genetic architecture and taxonomy, conserving genetic diversity and enriching genetic resources.
... Consequently, whole plastome sequences and nrDNA sequences are recommended as the next wave of plant DNA barcoding, referred to as ultrabarcodes (Kane et al., 2012), organelle-scale barcodes (Yang et al., 2013) or next-generation barcodes (Ji et al., 2019), with an increase of species discrimination rate or high phylogenetic resolution (Fu et al., 2021;Guo et al., 2021;Ruhsam et al., 2015;Tian et al., 2019;Turner et al., 2016;Zhou et al., 2019). However, most current applications of genome skimming for species identification use organelle genomes and nrDNA sequences, and this approach discards a large portion of the nuclear genome (Bohmann et al., 2020;Tonti-Filippini et al., 2017). ...
Article
Full-text available
Cymbidium is an orchid genus with rapid radiation and high ornamental, economic, ecological and cultural importance, but its classification is controversial based on morphology. The plastid genome (plastome), as an extension of plant standard DNA barcodes, has been widely used as a potential molecular marker for identificating the recently diverged species or complicated plant groups. In this study, we newly generated 237 plastomes of 50 species (at least two individuals per species) by genome skimming, covering 71.4% of the member of the genus Cymbidium. Sequence-based analyses (barcoding gaps and ABGD) and tree-based analyses (ML, BI, and mPTP) methods were conducted for species identification of Cymbidium. Our work provides a comprehensive DNA barcode reference library for Cymbidium species identification. The results show that compared with standard DNA barcodes (rbcL+matK) as well as the plastid trnH-psbA, the species identification rate of the plastome moderately increases from 58% to 68%. At the same time, we propose an optimized identification strategy for Cymbidium species. The plastome cannot completely resolve the species identification of Cymbidium, incomplete lineage sorting, artificial cultivation, nature hybridization and chloroplast capture may be the main reasons. To further explore the potential usage of nuclear data in identifying species, Skmer method was adopted and the identification rate was increased to 72%. It seems that the nuclear genome data acts a vital role in species identification and is expected to be used as next-generation nuclear barcodes.
Article
As a traditional Chinese medicine, rhubarb is used to treat several diseases such as severe acute pancreatitis, sepsis and chronic renal failure. However, few studies focused on the authentication of germplasm for the Rheum palmatum complex, and no studies have been conducted to elucidate the evolutionary history of the R. palmatum complex using plastome datasets. Hence, we aim to develop the potential molecular markers to identify the elite germplasms of rhubarb and explore the divergence and biogeographic history of the R. palmatum complex based on the newly sequenced chloroplast genome datasets. Chloroplast genomes of thirty-five the R. palmatum complex germplasms were sequenced, and the length ranged from 160,858 to 161,204 bp. The structure, gene content and gene order were highly conserved across all genomes. Eight InDels and sixty-one SNPs loci could be used to authenticate the high-quality germplasms of rhubarb in specific areas. Phylogenetic analysis revealed that all rhubarb germplasms were clustered in the same clade with high bootstrap support values and Bayesian posterior probabilities. According to the molecular dating result, the intraspecific divergence of the complex occurred in the Quaternary, which might be affected by climatic fluctuation. The biogeography reconstruction indicated that the ancestor of the R. palmatum complex might originate from the Himalaya-Hengduan Mountains or/and Bashan-Qinling Mountains, and then spread to surrounding areas. Several useful molecular markers were developed to identify rhubarb germplasms, and our study will provide further understanding on speciation, divergence and biogeography of the R. palmatum complex.
Chapter
Persimmon (Diospyros kaki Thunb.) is an ancient fruit tree that originate in East Asia, especially in Southern China. Persimmon is grown in China, Japan and Korea for a long history. Persimmon can be utilized for various purposes with fruits, leaves and its derivates. The genus Diospyros with about 500 species accounts for the largest genus of Ebenaceae. There is a close relationship between D. kaki, D. lotus, D. glandulosa, D. oleifera and ‘Yemaoshi’ based on morphological and molecular evidence. The ancestor of persimmon is not clearly elucidated. Modern hexaploidy persimmon should be evolved from diploid through genome duplication, in which 2n gametes might play an important role. Persimmon cultivars can be horticulturally classified into four types including PCNA (pollination constant and non-astringent), PVNA (pollination variant and non-astringent), PVA (pollination variant and astringent) and PCA (pollination constant and astringent). PCNA is further classified into Chinese PCNA and Japanese PCNA based on their different genetic control of astringency loss traits. Persimmon genotypes with different geographic origins can be distinguished using various molecular markers, while four persimmon cultivar types could not be well-separated by molecular detection. PCNA cultivars from China and Japan exhibit a divergent origin, interestingly, both Chinese PCNA and Japanese PCNA originate in central mountain areas of the two countries, respectively. Most of the genetic resources of Ebenaceae were distributed in tropical and subtropical regions of the world. More than 2000 accessions of persimmon were preserved in China, Japan, Korea, Italy, Spain, etc. Most of the persimmon production was derived from PCA cultivars in China, which is the largest persimmon producer globally. PCNA fruit is more attractive and new orchards prefer to plant PCNA types in Asia, Mediterranean area, Oceania and South America.
Preprint
Full-text available
Genomic and metagenomic analyses are increasingly becoming commonplace in several areas of biological research, but recurrent specialized analyses are frequently reported as in-house scripts rarely available after publication. We describe the enveomics collection, a growing set of actively maintained scripts for several recurrent and specialized tasks in microbial genomics and metagenomics, and present a graphical user interface and several case studies. Our resource includes previously described as well as new algorithms such as Transformed-space Resampling In Biased Sets (TRIBS), a novel method to evaluate phylogenetic under- or over-dispersion in reference sets with strong phylogenetic bias. The enveomics collection is freely available under the terms of the Artistic License 2.0 at https://github.com/lmrodriguezr/enveomics and for online analysis at http://enve-omics.ce.gatech.edu
Article
Full-text available
Due to its special geological history, the New Caledonian Archipelago is a mosaic of soil types, and in combination with climatic conditions this results in a heterogeneous environment across relatively small distances. A group of over 20 endemic species of Diospyros (Ebenaceae) has rapidly and recently radiated on the archipelago after a single long-distance dispersal event. Most of the Diospyros species in the radiating group are morphologically and ecologically well differentiated, but they exhibit low levels of DNA variability. To investigate the processes that shaped the diversification of this group we employed restriction site associated DNA sequencing (RADseq). Over 8,400 filtered SNPs generally confirm species delimitations and produce a well-supported phylogenetic tree. Our analyses document local introgression, but only a limited potential for gene flow over longer distances. The phylogenetic relationships point to an early regional clustering among populations and species, indicating that allopatric speciation with respect to macrohabitat (i.e., climatic conditions) may have had a role in the initial differentiation within the group. A later, more rapid radiation involved divergence with respect to microhabitat (i.e., soil preference). Several sister species in the group show a parallel divergence in edaphic preference. Searches for genomic regions that are systematically differentiated in this replicated phenotypic divergence pointed to loci potentially involved in ion binding and cellular transport. These loci appear meaningful in the context of adaptations to soil types that differ in heavy-metal and mineral content. Identical nucleotide changes affected only two of these loci, indicating that introgression may have played a limited role in their evolution. Our results suggest that both allopatric diversification and (parapatric) ecological divergence shaped successive rounds of speciation in the Diospyros radiation on New Caledonia.
Article
Full-text available
The taxonomic reference base Florical and characteristics of the native vascular flora of New Caledonia. The floristic inventory presented here comprises the evolving computerised database Florical (http://www.botanique.nc/herbier/florical). As of the date of publication, it lists all native vascular plant species (as well as infraspecific taxa) present in the territory of New Caledonia, whether validly published or in press, along with their basionym, organised according to the most recent classifications systems (APG III 2009 for the angiosperms, Smith et at [2006] for the ferns and lycophytes, and Mabberley [2009] for the gymnosperms), accompanied by their status (native or endemic) and the various vegetation types in which they occur. After a brief historical review of our knowledge of New Caledonian botany, the characteristics (richness, composition and endemism) of the different taxonomic groups (ferns and lycophytes, gymnosperms and then angiosperms -mono- and dicotyledons) are analyzed, followed by an assessment of the distribution of these groups among the vegetation types. The vascular flora of New Caledonia is characterised by its high level of richness (3371 species, including 3099 for the flowering plants alone), despite the absence or under-representation of some taxa that are abundantly represented elsewhere in the tropics, and especially by its remarkable distinctiveness (with endemism at the species level reaching 74,7%, and 77.8% for the flowering plants). The flora stands out by the presence of several relictual taxa and a high level of speciation among certain groups, despite the fact that the island was re-colonised relatively recently following the total submersion of its much older basement during the Paleocene and the Oligocene. Details of the measures taken to conserve this extraordinary natural heritage show that only 3.4% of the territory's total surface area is effectively protected. Despite the undeniable progress made in recent years by the authorities concerned, significant work remains to be done, especially in the northern Province.
Article
Full-text available
An increasing number of studies using genetic data at the generic and species levels reveal complex patterns of relationships among populations and species. Incomplete lineage sorting and reticulation are, for instance, commonly observed in low-rank phylogenies. These two processes falsify the simplifying assumptions often used when reconstructing phylogenetic relationships or when assigning specimens to species using barcodes, i.e., the absence of ancestral polymorphism or the obligate dichotomous relationship among species. We recognize seven processes that act as veils to obscure species boundaries and relationships and lead to erroneous interpretations if not diagnosed and analysed properly. These processes include the transfer of genetic material from plastids or mitochondria to the nuclear genome (NuPt, NuMt), hybridization, the stochasticity of lineage sorting within and among species, genome organization, species demography, selection, genetic and geographic structure. Most of these processes have a direct impact on species effective population sizes, a central parameter to be considered. Failure to take the seven processes into account can affect the accuracy of barcode-based identification, in particular between closely related species. It can also bias the delimitation of species, result in inaccurate estimations of species relationships and lead to incorrect estimation of species ages. We suggest that these problems can be identified and, to some extent, mitigated by ensuring that the full spatial and genetic ranges of the species are sampled, and by the use of information from all genomes of the species. By analysing numerous loci, each as a separate entity in a coalescence framework, it is possible to take into account stochasticity as well as variation in genome effective population sizes. Such a critical approach would allow the use of genetic data to make realistic assessments of species delimitation, affinities and ages, and should promote a more biologically realistic view of species and speciation.
Article
The pantropical orchid genus Polystachya is the subject of ongoing taxonomic work. We inferred phylogenetic relationships in the genus using 5.3 kb of plastid DNA, for 83 out of ca. 240 species and 12 out of 15 sections, as well as five outgroup species. We also collected ploidy data using chromosome counts and genome size estimates. Bayesian and parsimony trees were congruent with each other and well resolved. Polystachya appears monophyletic based on current sampling, provided that the name P. neobenthamia is used instead of Neobenthamia gracilis for that species. The current sectional classification does not define monophyletic groups, but the present study can be used as the basis for a future sectional classification. Areas postulated as Pleistocene refugia for wet tropical forests in Africa also form centres of diversity for the genus. Biogeographical analyses using DIVA and Lagrange show an early radiation in eastern Africa, followed by separate radiations in eastern and western Africa. Subsequent dispersal from western to eastern Africa has occurred at a much higher rate than from east to west. Dispersal to the Neotropics occurred more than once, and one lineage has spread recently and rapidly throughout the tropics. Polyploidy has occurred several times during the diversification of the genus, most notably in association with the recent pantropical dispersal.
Article
We investigate a neutral model for speciation and extinction, the constant rate birth-death process. The process is conditioned to have $n$ extant species today, we look at the tree distribution of the reconstructed trees-- i.e. the trees without the extinct species. Whereas the tree shape distribution is well-known and actually the same as under the pure birth process, no analytic results for the speciation times were known. We provide the distribution for the speciation times and calculate the expectations analytically. This characterizes the reconstructed trees completely. We will show how the results can be used to date phylogenies.
Chapter
The traditional classification of plants into respective classes, orders, families, genera, and species has until recently been based on shared morphologic, cytologic, biochemical, and ecologic traits. The development of techniques in molecular biology including those for molecular hybridization, cloning, restriction endo-nuclease digestions, and protein and nucleic acid sequencing have provided many new tools for the investigation of phylogenetic relationships. At the molecular level, the most fundamental comparison possible is of the primary nucleotide sequences of homologous genes in different populations or species.