ArticlePDF Available

Discovery and characterization of sweetpotato’s closest tetraploid relative

Wiley
New Phytologist
Authors:

Abstract and Figures

The origin of sweetpotato, a hexaploid species, is poorly understood, partly because the identity of its tetraploid progenitor remains unknown. In this study, we identify, describe and characterize a new species of Ipomoea that is sweetpotato’s closest tetraploid relative known to date and probably a direct descendant of its tetraploid progenitor. We integrate morphological, phylogenetic, and genomic analyses of herbarium and germplasm accessions of the hexaploid sweetpotato, its closest known diploid relative Ipomoea trifida, and various tetraploid plants closely related to them from across the American continent. We identify wild autotetraploid plants from Ecuador that are morphologically distinct from Ipomoea batatas and I. trifida, but monophyletic and sister to I. batatas in phylogenetic analysis of nuclear data. We describe this new species as Ipomoea aequatoriensis T. Wells & P. Muñoz sp. nov., distinguish it from hybrid tetraploid material collected in Mexico; and show that it likely played a direct role in the origin of sweetpotato’s hexaploid genome. This discovery transforms our understanding of sweetpotato’s origin.
One of several possible scenarios of sweetpotato evolution and origin of current diversity. Tetraploid plants closely related to sweetpotato have two different origins: plants from Ecuador represent direct descendants from the autotetraploid progenitor of hexaploid Ipomoea batatas, whereas plants from Mexico and Central America are the result of a more recent hybridization between hexaploid I. batatas and diploid Ipomoea trifida. (a) One possible scenario, congruent with the data currently available, is presented here. An autotetraploid would have arisen from a whole genome duplication of a diploid common ancestor with I. trifida. This autotetraploid would have hybridized with the diploid ancestor to produce an allohexaploid. Subsequent introgression between the diploid ancestor lineage and the allohexaploid would result in chloroplast capture from I. trifida, explaining the two distinct I. batatas lineages in the chloroplast phylogenies. This separate lineage would keep a hexaploid nuclear genome but a chloroplast most similar to the diploid progenitor, and therefore to modern I. trifida, than to the ancestral sweetpotato lineage. Red and blue colours indicate the proportion of diploid (AA, red) and tetraploid (BBBB, blue) ancestral genomes in the different entities. Small, coloured circles represent the chloroplast. Dashed lines indicate hybridization and dotted line indicates introgression with chloroplast capture. (b) Summary nuclear phylogeny depicting the relationship between modern taxa, with Ipomoea aequatoriensis most closely related to I. batatas. (c) Summary chloroplast phylogeny depicting the relationship between modern taxa, with I. aequatoriensis most closely related to I. batatas lineage 1, the ancestral sweetpotato lineage.
… 
This content is subject to copyright. Terms and conditions apply.
Rapid report
Discovery and characterization of sweetpotato’s closest tetraploid
relative
Author for correspondence:
Robert W. Scotland
Email: robert.scotland@plants.ox.ac.uk
Received: 23 November 2021
Accepted: 16 January 2022
Pablo Mu~noz-Rodrıguez
1
*, Tom Wells
1
*, John R. I. Wood
1,2
,
Tom Carruthers
2
, Noelle L. Anglin
3,4
, Robert L. Jarret
4
and
Robert W. Scotland
1
1
Department of Plant Sciences, University of Oxford, South Parks Road, Oxford, OX1 3RB, UK;
2
Royal Botanic Gardens, Kew,
Richmond, Surrey, TW9 3AB, UK;
3
International Potato Center, Avenida La Molina 1895, Distrito de La Molina, Lima 15023,
Peru;
4
United States Department of Agriculture, 1109 Experiment Street, Griffin, GA 30223, USA
New Phytologist (2022) 234: 1185–1194
doi: 10.1111/nph.17991
Key words: crop wild relatives, Ecuador,
genomics, herbarium specimens, Ipomoea
aequatoriensis, new species, tetraploid.
Summary
The origin of sweetpotato, a hexaploid species, is poorly understood, partly because the
identity of its tetraploid progenitor remains unknown. In this study, we identify, describe and
characterize a new species of Ipomoea that is sweetpotato’s closest tetraploid relative known to
date and probably a direct descendant of its tetraploid progenitor.
We integrate morphological, phylogenetic, and genomic analyses of herbarium and
germplasm accessions of the hexaploid sweetpotato, its closest known diploid relative Ipomoea
trifida, and various tetraploid plants closely related to them from across the American continent.
We identify wild autotetraploid plants from Ecuador that are morphologically distinct from
Ipomoea batatas and I. trifida, but monophyletic and sister to I. batatas in phylogenetic analysis
of nuclear data.
We describe this new species as Ipomoea aequatoriensis T. Wells & P. Mu~
noz sp. nov.,
distinguish it from hybrid tetraploid material collected in Mexico; and show that it likely played a
direct role in the origin of sweetpotato’s hexaploid genome. This discovery transforms our
understanding of sweetpotato’s origin.
Introduction
Sweetpotato, Ipomoea batatas (L.) Lam., is a hexaploid species
thought to have originated via allopolyploidy from a diploid and a
tetraploid ancestor (Yang et al., 2017). Ipomoea trifida (Kunth)
G. Don, a Circum-Caribbean species, was recently confirmed as
sweetpotato’s closest diploid relative and most likely the direct
descendant of its diploid progenitor (Mu~noz-Rodrıguez et al.,
2018). In contrast, the identity of the sweetpotato’s closest
tetraploid relative remains unknown. Identifying this entity is
key to untangling the evolutionary history of sweetpotato,
understanding its contemporary diversity and assembling its large
allohexaploid genome.
Whilst preparing a monograph of all American species of
Ipomoea L. (Wood et al., 2020), our attention was drawn to
herbarium specimens from Ecuador identified as I. batatas but
differing in their shorter and blunter sepals (Fig. 1a,b), sepal
morphology being an important taxonomic character in Ipomoea
(Austin, 1978; Wood et al., 2020). These specimens were restricted
to coastal Ecuador (Fig. 1c) and were of wild provenance, in
contrast to most populations of I. batatas that are only known from
cultivation or as escapes.
As part of our research, in parallel to studying herbarium
specimens, we also grew tetraploid Ipomoea specimens from seeds
available in germplasm collections (Supporting Information
Table S1). Tetraploid collections (2n=4x=60) are of particular
interest because their ploidy is intermediate between hexaploid
sweetpotato (2n=6x=90) and its closest diploid relative, I. trifida
(2n=2x=30), meaning that they may represent intermediate
stages in sweetpotato evolution. The germplasm material studied
by us included other tetraploid specimens from the same areas of
Ecuador as the distinctive herbarium material we had identified
during our studies (Figs S1S5), as well as material of the Mexican
sweetpotato variety I. batatas var. apiculata J.A. McDonald & D.F.
*These authors contributed equally to this work.
Ó2022 The Authors
New Phytologist Ó2022 New Phytologist Foundation
New Phytologist (2022) 234: 1185–1194 1185
www.newphytologist.com
This is an open access article under the terms of the Creative Commons Attribution License, which permits use,
distribution and reproduction in any medium, provided the original work is properly cited.
Research
Austin and the Mexican hybrid species Ipomoea tabascana
J.A. McDonald & D.F. Austin, both of them also tetraploids
(Notes S1). Ipomoea tabascana is a modern hybrid between
I. batatas and I. trifida known from a single collection (McDonald
& Austin, 1990; Srisuwan et al., 2006). Modern tetraploid
hybrids such as this may confound data interpretation, hence the
importance of including in our study examples of known hybrid
origin: it is essential to be able to distinguish between truly
autotetraploid entities and other tetraploids of modern hybrid
origin.
To place the Ecuadorian specimens in a phylogenetic context, we
conducted a preliminary phylogenetic analysis using rpl32-trnL,a
small, noncoding, rapidly-evolving chloroplast DNA region.
Ipomoea batatas contains two different chloroplast lineages (Roul-
lier et al., 2013), the ancestral lineage (chloroplast lineage 1) and a
second, more recent lineage that is most likely the result of
introgression with chloroplast capture from I. trifida (lineage 2)
(Mu~noz-Rodrıguez et al., 2018). The preliminary analysis of this
small chloroplast DNA region showed that the herbarium
specimens and the germplasm material from Ecuador were the
same entity, and that they were more closely related to sweetpotato
chloroplast lineage 1 than to any other lineage (Methods S1;
Fig. S6). A subsequent literature review showed that we were not
the first to recognize these tetraploids from Ecuador (Martin &
Jones, 1972; Martin et al., 1974; Austin et al., 1993), but previous
studies lacked the taxonomic and phylogenetic framework required
to accurately infer their relationship with sweetpotato.
Here, we provide the first comprehensive study of these
Ecuadorian tetraploids and show that they represent a distinct
species that is sweetpotato’s closest wild relative. We describe this
new species as Ipomoea aequatoriensis T. Wells & P. Mu~noz and
show it is most likely the direct descendant of the sweetpotato’s
tetraploid progenitor.
Materials and Methods
Herbarium collections and germplasm material
We studied American material from germplasm collections (CIP
and USDA) and herbaria (AAU, BM, E, FL, FTG, GUAY,
(a)
(c)
(b) (d)
(e)
Fig. 1 Ipomoeaaequatoriensis is morphologically distinct from Ipomoea batatas and Ipomoea trifida. Sepals are (a) oblong/ovate in cultivated I. batatas (Balls
5483) and (b) obovate in I. aequatoriensis (Jativa and Epling. 1191). (c) Map of the Americas showing the distribution of specimens included in the
morphological analysis. Closed symbols indicate specimens also included in the genomic analyses. All hexaploid I. batatas specimens in this study are of
cultivated origin and are not included in the map. (d) Principal component analysis and (e) linear discriminant analysis of 12 quantitative morphological traits
widely used in sweetpotato morphological studies. Ellipses indicate 95% confidence level. In (ce), I. batatas (green dots), I. aequatoriensis (blue triangles),
I. trifida (red squares), hybrids Ipomoea tabascana (black triangle) and I. batatas var. apiculata (orange triangles). The Colombian specimens affinis to
I. aequatoriensis are indicated by light blue triangles.
New Phytologist (2022) 234: 1185–1194
www.newphytologist.com
Ó2022 The Authors
New Phytologist Ó2022 New Phytologist Foundation
Research Rapid report
New
Phytologist
1186
HUEFS, K, LPB, OXF, QAC, QAP, QCA, QCNE, RB, ST, US,
USZ, XAL, acronyms according to Thiers (2018)). We included
specimens of cultivated hexaploid I. batatas (L.) Lam. and diploid
I. trifida (Kunth) G. Don from across their geographical distribu-
tion; representatives of the 14 other close wild relatives of
sweetpotato (Wood et al., 2020) including the Mexican hybrid
tetraploid I. tabascana J.A. McDonald & D.F. Austin , known from
a single collection (Notes S1) (McDonald & Austin, 1990; Austin
et al., 1991); two specimens of the tetraploid sweetpotato variety
I. batatas var. apiculata J.A. McDonald & D.F. Austin, also from
Mexico and apparently restricted to the vicinity of the city of
Veracruz (Notes S1); the tetraploid material from Ecuador; a
tetraploid accession from Colombia; and multiple herbarium
collections of wild plants resembling the tetraploid Ecuadorian
material and collected in the same geographical area (Dodson &
Gentry, 1978; Austin, 1982; Dodson et al., 1985; McDonald &
Austin, 1990; Wood et al., 2020). See Tables S2 and S3 for
passport data of all specimens and indication of analyses they were
included in.
Quantitative morphological analyses
Character selection and measurement We identified and anal-
ysed herbarium specimens and germplasm material of I. trifida (57
specimens), I. batatas (55 specimens), the Ecuadorian tetraploids
(44 specimens), I. tabascana (one specimen) and I. batatas var.
apiculata (five specimens) (Table S2). We measured 12 morpho-
logical characters found to be informative in taxonomic treatments
of Ipomoea or commonly used to study sweetpotato germplasm
collections (Table S2) (Austin, 1978; Huaman, 1991; Wood et al.,
2020). Measurements were taken using digital callipers or, in the
case of digitized herbarium specimens, using the biological-image
analysis software FIJI (Schindelin et al., 2012).
Clustering analyses We first ran a principal component analysis
(PCA) to investigate phenotypic clustering between I. batatas,
I. trifida and the various tetraploid entities. We used FACTOMINER
package v.2.4 (L^eet al., 2008) in R and divided the tetraploid
material into three groups based on geographical distribution and
past determinations: (1) Ecuadorian, (2) I. tabascana, and (3)
I. batatas var. apiculata. We then used R package MASS v.7.3.54
(Venables et al., 2002) to assess how well individual specimens
could be classified into their assigned groups through a linear
discriminant analysis (LDA). We plotted the results of both
analyses using the GGPLOT2 package v.3.3.5 (Wickham, 2016), with
ellipses depicting 95% confidence level added using the stat_ellipse
function.
Analysis of genomic data
We sequenced 13 new specimens using Illumina whole genome
sequencing and incorporated them in our previously-existing
dataset of sweetpotato crop wild relatives (CWRs) (Mu~noz-
Rodrıguez et al., 2018) (Table S2). This material included six
Ecuadorian tetraploids (PI 561246, PI 561248, PI 561255, PI
561258, K300/CH71.3, CH81.2), one Colombian tetraploid
(K500/CH80.3), diploid I. trifida specimen s from Colombia (F. de
la Puente 1054) and Mexico (F. de la Puente 2961), three I. batatas
var. apiculata (D.F. Austin 7480, PI 518474 and K233) and one
I. tabascana (PI 518479).
DNA processing and sequencing We extracted DNA using the
Plant Tissue Mini protocol for Qiagen DNEasy Plant Mini Kit. We
created genomic libraries using the NEBNext Ultra DNA Library
Prep Kit for Illumina v.3.0 (New England BioLabs, Ipswich, MA,
USA). Sequencing was done at Novogene facilities in Cambridge,
UK, using Illumina NovoSeq6000. We obtained 150 bp paired
end whole genome data, on average 11 Gb per sample. We filtered
the sequence files using default parameters in TRIMMOMATIC
(Bolger et al., 2014) and checked the quality of the reads using
FASTQC. We used default settings in BBTOOLS tadpole (https://
sourceforge.net/projects/bbmap/) to correct the reads.
Assembly of single copy nuclear regions for phylogenetic
analysis We assembled 386 putative single copy nuclear DNA
regions of all samples using a reference-guided assembly. A detailed
description of how these nuclear regions were identified is provided
in Methods S2. We mapped the reads to the reference 386 nuclear
probes using BBMAP (paired only =t,local =t). We used
SAMTOOLS (Danecek et al., 2021) to extract all reads mapped to
the reference probes and to remove duplicate reads, and Picard
Tools (http://broadinstitute.github.io/picard) to realign the reads
mapped around indels. We used BCFTOOLS (Danecek et al., 2021)
for variant calling, indel normalization and variant filtering, and
VCFTOOLSvcf-sort (Danecek et al., 2011) to sort the VCF files.
Phylogenetic analysis of nuclear DNA regions We used phylo-
genetic analysis of nuclear data to confirm the close relationship
between the Ecuadorian tetraploids and sweetpotato. We used
consensus sequences and included the tetraploids from Ecuador
and Colombia, 10 I. batatas specimens, 10 I. trifida specimens, one
I. tabascana specimen, two I. batatas var. apiculata specimens, one
specimen of each of the other 14 species closely related to
sweetpotato and one I. cryptica J.R.I. Wood & Scotland as
outgroup (Mu~noz-Rodrıguez et al., 2018, 2019). We obtained
consensus sequences from VCF variant files using BCFTOOLS
consensus (Danecek et al., 2021) and masked all positions in the
consensus sequences with read coverage lower than 59. The use of
consensus sequences in phylogenetic analysis can obscure potential
subgenome differentiation in polyploids. However, the lack of a
reference genome makes it impossible to assign the alleles in the
nuclear regions to specific subgenomes. To minimize the potential
effects of divergent subgenomes, we only included likely homozy-
gous variant positions in this analysis. Heterozygous sites were
therefore masked and not considered in the main phylogenetic
analysis but were included in additional phylogenetic analyses
(Methods S3).
We used the BIOPYTHON script sequence_cleaner to remove sequences
shorter than 500 bp or with more than 10% ambiguous sites. We
excluded three further regions of the analysis (solyc06g073230.2.1_1,
solyc08g043170.2.1_1 and solyc11g012820.1.1_1)asnoneofthe
sequences in those regions passed the filters, as well as one I. batatas
Ó2022 The Authors
New Phytologist Ó2022 New Phytologist Foundation
New Phytologist (2022) 234: 1185–1194
www.newphytologist.com
New
Phytologist Rapid report Research 1187
var. apiculata herbarium specimen (D.F. Austin 7480) with almost
80% missing data.
We aligned each of the regions independently using MAFFT
v.7.310 (Katoh & Standley, 2013) and removed poorly
aligned regions using GBLOCKS (half gaps) (Castresana, 2000;
Talavera & Castresana, 2007). We generated summary files of
all edited alignments using AMAS (Borowiec, 2016) (Table S4).
A further 12 alignments that had no variable sites were
excluded, so this analysis was done using 371 putative single-
copy nuclear DNA regions. We also used AMAS to concatenate
the alignments.
We inferred three different phylogenies: (1) partitioned maxi-
mum likelihood (ML) analysis of concatenated alignments with
automated model selection +merge in IQ-TREE v.1.6.12 (Nguyen
et al., 2015; Kalyaanamoorthy et al., 2017); (2) Approximate ML
analysis of unpartitioned concatenated alignments in multi-
threaded double-precision FASTTREE v.2.1.10
54,71
(GTR +gamma
model); and (3) independent gene tree inference using IQ-TREE
v.1.6.12 with automated model selection followed by species tree
inference using the coalescent in ASTRAL III (Zhang et al., 2018).
We used the GNU parallel tool (Tange, 2011) to parallelize and
speed up several steps in the pipeline.
Principal component analysis We conducted a PCA of I. batatas,
I. trifida, the Ecuadorian tetraploids, the Colombian tetraploid
specimen K500/CH81.3 and the hybrids I. tabascana and I. batatas
var. apiculata. We used a subset of 20 I. batatas and 20 I. trifida
samples to try to minimize bias due to uneven population sizes
compared to the other entities (Priveet al., 2020). We also
conducted additional analyses including all I. batatas and I. trifida
samples instead of a subset (Methods S4).
We mapped the nuclear reads to a sweetpotato sample (accession
CIP 400435) and called variants using the same procedure as
earlier. We filtered out all variants with coverage lower than 59and
ran a linkage disequilibrium pruning step using PLINK (--indep-
pairwise 50 10 0.1). We then used PLINK (--double-id --allow-extra-
chr --set-missing-var-ids @:# --make-bed --pca --geno 0.20 --snps-
only --max-alleles 2) (Chang et al., 2015; Purcell, 2021) for the
PCA and plotted the results using TIDYVERSE (Wickham et al.,
2019) and GGPLOT2 (Wickham, 2016) in RSTUDIO (RStudio Team,
2021). This analysis used 419 single nucleotide polymorphisms
(SNPs) from across the 386 Ipomoea nuclear regions, both
homozygous and heterozygous.
K-mer analyses We used GENOMESCOPE2.0 (Ranallo-Benavidez
et al., 2020) to assess heterozygosity from k-mer frequencies of raw,
unaligned sequencing reads, in a representative Ecuadorian sample
(PI 561248) sequenced at high-coverage. Relative frequency
patterns can then be used to infer whether a tetraploid sample is
autopolyploid or allopolyploid. We carried out initial k-mer
counting and histogram construction on the filtered but unaligned
sequencing reads using JELLYFISH (Marc
ßais & Kingsford, 2011). We
ran both JELLYFISH and GENOMESCOPE2.0 with a maximum
coverage of 100 000 and the default k-mer value of 21. We also
ran the same analysis in three Mexican hybrid tetraploids sequenced
at lower coverage (Methods S5).
Assembly of whole chloroplast genomes We used GETORGANELLE
(-F embplant_pt; SPAdes options: "--threads 20 --only-assembler -k
21,33,55,77,93")(Jinet al., 2018) to de novo assemble the chloro-
plast genomes of the new samples. When GETORGANELLE failed to
produce a circular genome assembly in the first attempt, we ran a
second attempt using --reduce-reads-for-coverage INF and --max-
reads INF options. GETORGANELLE successfully assembled all
samples except one I. trifida sample (F. de la Puente 2961). To
assemble the genome of this one sample, we used a reference-guided
assembly using I. trifida (F. de la Puente 1054) as reference.
Phylogenetic network using chloroplast genomes This analysis
includes all I. batatas,I. trifida and I. tabascana specimens from our
previous study, together with the 15 newly sequenced samples. We
aligned the whole chloroplast genome sequences using MAFFT
v.7.310 (FFT-NS-2) and removed poorly aligned regions using
GBLOCKS (no gaps). We used POPART (http://popart.otago.ac.nz) to
infer a Median Joining Network (reticulation tolerance 0.50
(Bandelt et al., 1999)) with 602 segregating sites, 182 of them
parsimony-informative.
Results
Morphological differentiation
The tetraploid Ecuadorian and Colombian material form a
cluster distinct from I. batatas and I. trifida in PCA and LDA of
12 morphological characters (Fig. 1e,f). The PCA shows three
clusters corresponding to I. batatas,I. trifida and the Ecuado-
rian/Colombian material, with some overlap at the margins,
predominantly between I. trifida and I. batatas (Fig. 1e). Hybrid
specimens from Mexico, i.e. I. tabascana (PI 518479) and
I. batatas var. apiculata (PI 518474), fall close to or within the
clusters of I. trifida and I. batatas. The three distinct clusters
were more pronounced in the LDA trained on 80% of the data
(Fig. 1e), which yielded a 90% success rate in accurately
identifying the test data and recovered the Mexican hybrids
within I. trifida.
Genomic differentiation
The phylogenetic analysis of nuclear regions recovers the six
tetraploid specimens from Ecuador and one from Colombia in a
clade sister to hexaploid I. batatas (Fig. 2a). This relationship is
recovered in all methods of phylogenetic inference with strong
support (Figs S7, S8). In addition, the tetraploid Ecuadorian and
Colombian specimens also form a distinct group from I. batatas and
I. trifida in the different PCA using nuclear SNPs (Figs 2b, S9). The
analysis using a subset of I. batatas and I. trifida samples, shown in
Fig. 2(b), aimed at preventing bias due to uneven population sizes
(Priveet al., 2020). In this analysis, the Ecuadorian and Colombian
tetraploids partially overlap with I. trifida in principal component
one (PC1) but clearly separate from all entities, including I. trifida,
in principal component two (PC2). The single specimen of the
Mexican hybrid I. tabascana and the three I. batatas var. apiculata
New Phytologist (2022) 234: 1185–1194
www.newphytologist.com
Ó2022 The Authors
New Phytologist Ó2022 New Phytologist Foundation
Research Rapid report
New
Phytologist
1188
specimens are intermediate between I. trifida and I. batatas in PC1
but cluster with both species in PC2 (Fig. 2b).
The analysis of nucleotide heterozygosity patterns suggests that
the Ecuadorian tetraploids have a genomic structure consistent
with an autopolyploid origin, with proportions of aaab consistently
higher than aabb (Table S5; Notes S1). This pattern is indicative of
two identical or highly similar subgenomes originating from a
whole genome duplication (Ranallo-Benavidez et al., 2020). The
same analysis for the hybrid I. tabascana and two specimens of
I. batatas var. apiculata, albeit using lower coverage data (Methods
S2), shows instead a higher proportion of aabb than aaab
(Table S5; Notes S1), which suggests that two distinct subgenomes
have been derived from a recent hybridization event.
Analysis of whole chloroplast genomes
The Median Joining phylogenetic network inferred using 602
segregating sites from the alignment of whole chloroplast genomes
shows the Ecuadorian plants are associated with the ancestral
sweetpotato lineage 1, whereas the single Colombian specimen we
sequenced (K500/80.3) is associated with the sweetpotato lineage
2. The hybrid I. tabascana and I. batatas var. apiculata are also
associated with sweetpotato lineage 2.
Discussion
Ipomoea aequatoriensis is a distinct species
We have identified a group of plants from Ecuador that are distinct
from cultivated sweetpotato and from all sweetpotato CWRs
known to date. These tetraploid plants are of wild provenance,
morphologically and geographically coherent, most likely autote-
traploid, isolated in the genetic space, and form a monophyletic
group most closely related to sweetpotato in phylogenetic analysis
of nuclear data. Their distinctiveness justifies recognition as a new
species I. aequatoriensis T. Wells & P. Mu~noz. A formal diagnosis is
presented here. Specimen citation, full description and ecological
notes are provided in the Notes S2. Specimens from Colombia,
although possibly also part of this species, require further study and
are not formally included in I. aequatoriensis (see Notes S2).
Ipomoea aequatoriensis T. Wells & P. Mu~noz, sp. nov. (Illustra-
tion in Fig. S10) TYPE: ECUADOR. Esmeraldas Province,
Quininde. Austin, D.F. 7803 (holotype FTG, Isotype CIP).
Diagnosis This species is most closely related to I. batatas (L.)
Lam. (Figs 2a, 3) which it resembles in corolla size, dense sub-
umbellate inflorescence and pubescent ovary, but differs in
possessing sepals that are consistently shorter (outer: <7vs
>7 mm; inner: <10 mm vs >12 mm) and stems that are thinner
(13mmvs26 mm diameter) with longer internodes (616 cm vs
210 cm), consistent with a twining (rather than trailing) habit. It
also closely resembles I. trifida (Kunth) G. Don, particularly in the
twining habit and chartaceous sepals, but differs in having obtuse
sepals (80°–160°vs 20°–70°) and laxer, more obviously umbellate
inflorescences with a greater number of flowers (524 vs 212) and
mostly entire, larger leaves (414 cm vs 210 cm long).
Identifying the tetraploid progenitor of sweetpotato
A major barrier to understanding the origin and evolution of
sweetpotato remains the difficulty of assembling its large
(b)(a)
Fig. 2 Molecular analyses identify Ipomoea aequatoriensis as a distinct entity, phylogenetically distinct and isolated in the genetic space. (a) Approximate
maximum likelihood analysis of 371 single-copy nuclear DNA regions. Numbers on the branches indicate ShimodairaHasegawa-like support values; black dots
indicate branches with 100% support. (b) Principal component analysis of Ipomoea batatas (green), Ipomoea trifida (red), I. aequatoriensis (blue) and the
hybrids Ipomoea tabascana and I. batatas var. apiculata (black and orange respectively). Principal component analysis inferred using 419 single nucleotide
polymorphisms (SNPs) from across the 386 nuclear probes. Ellipses indicate multivariate t-distribution. The Colombian specimen K500/CH81.3 discussed
throughout the text is indicated in light blue.
Ó2022 The Authors
New Phytologist Ó2022 New Phytologist Foundation
New Phytologist (2022) 234: 1185–1194
www.newphytologist.com
New
Phytologist Rapid report Research 1189
allohexaploid genome (Isobe et al., 2019), which comprises three
subgenomes: two identical (BBBB) and one slightly different (AA)
in an AABBBB structure (Ting & Kehr, 1953; Ting et al., 1957;
Jones, 1965; Magoon et al., 1970; Nishiyama et al., 1975; Shiotani
& Kawase, 1987; Srisuwan et al., 2006; Gao et al., 2011; Yang
et al., 2017). These subgenomes are most likely derived from a
hybridization event between a diploid progenitor that contributed
the AA subgenome and a tetraploid progenitor that contributed the
BBBB subgenomes (Fig. 4). The AA subgenome is most likely
derived from a diploid ancestor shared with I. trifida (Yang et al.,
2017; Mu~noz-Rodrıguez et al., 2018), but the tetraploid progen-
itor that contributed the BBBB subgenomes remains unidentified
(Yang et al., 2017).
The new autotetraploid species I. aequatoriensis is the closest
wild relative of sweetpotato identified to date, and our results
strongly suggest it is the direct descendant of sweetpotato’s
tetraploid progenitor. A possible scenario for this is presented in
Fig. 4, and there are four lines of evidence for this conclusion. First,
the wild provenance of the samples we studied, which were not
cultivated, feral or derived from breeding programmes (Notes S3).
Second, I.aequatoriensis is consistently recovered as monophyletic
and sister to I. batatas in nuclear phylogenies, regardless of the
method of phylogenetic inference, both in our study (Figs 2a, S7,
S8) and in a recent pre-print (Yan et al., 2021). Third, its genetic
structure is indicative of an autopolyploid origin (Table S5; Notes
S1), a requirement for the tetraploid progenitor of the sweetpotato
Ipomoea aequatoriensis
(Ecuador)
Ipomoea 4x
(Colombia)
Ipomoea batatas
(Lineage 1)
Ipomoea batatas
(Lineage 2)
Ipomoea trifida
Ipomoea batatas var. apiculata
Ipomoea tabascana
Fig. 3 The analysis of chloroplast genomes shows Ipomoea aequatoriensis is associated with the sweetpotato ancestral lineage. Median Joining phylogenetic
network inferred using 602 segregating sites (182 parsimony-informative) and showing the relationships between Ipomoea batatas,Ipomoea trifida,
I. aequatoriensis and the hybrid entities, Ipomoea tabascana and I. batatas var. apiculata. The one Colombian specimen sequenced (K500/CH80.3), indicated
with an arrow, seems to carry a chloroplast related to sweetpotato lineage 2 chloroplast; we excluded it from our diagnosis of I. aequatoriensis pending further
investigation. The size of the circles indicates the number of samples, with samples grouping in larger circles being identical for the sites studied.
New Phytologist (2022) 234: 1185–1194
www.newphytologist.com
Ó2022 The Authors
New Phytologist Ó2022 New Phytologist Foundation
Research Rapid report
New
Phytologist
1190
because of the AABBBB structure of the sweetpotato genome.
Fourth, I. aequatoriensis is most closely related to sweetpotato
lineage 1 the ancestral sweetpotato lineage in the analyses of
chloroplast genomes in our study (Fig. 3) and that by Roullier et al.
(2013).
Poor taxonomy and modern hybrids complicate sweetpotato
studies
Previous attempts to identify sweetpotato’s tetraploid progenitor
have been hampered by taxonomic confusion, the lack of a well-
resolved phylogenetic framework for sweetpotato and its closest
relatives or the inclusion of probably feral specimens (Jones, 1967;
Nishiyama, 1971; Martin & Jones, 1972; Austin, 1988; Roullier
et al., 2013) (Table S1).
In addition, the existence of modern hybrids between I. batatas
and its closest diploid relative, I. trifida, further complicates data
interpretation. This is because hybridization between I. batatas (3n
gametes) and I. trifida (1ngametes) will most likely produce a
tetraploid (Orjeda et al., 1991), as in the case of I. tabascana
(Austin, 1977; Jarret et al., 1992; Bohac et al., 1993; Srisuwan et al.,
2006). Because of their parentage, such tetraploids are closely
related to hexaploid I. batatas in nuclear phylogenies (Figs 2a, S7).
Therefore, studies that rely purely on phylogenetic analysis of
nuclear DNA sequence data are likely to confuse these putative
hybrid tetraploids with the autotetraploid progenitor of hexaploid
I. batatas (Yan et al., 2021) (Notes S3). However, the incorpor ation
of other lines of evidence confirms the hybrid origin of these
tetraploid entities and shows they cannot be the tetraploid
progenitor of sweetpotato. First, I. batatas var. apiculata is
recovered with the known hybrid I. tabascana in all phylogenies
(Figs 2a, S7, S8) and both entities are in an intermediate position
between I. trifida and I. batatas in the PCAs using nuclear genomic
variants (Fig. 2b), implying a highly similar genetic structure.
Second, k-mer analysis of these samples suggests that they possess
two distinct subgenomes (Table S5; Notes S1). The k-mer analyses
require confirmation using higher-coverage sequence data (Meth-
ods S2), but our results are consistent with their apparent hybrid
origin (Srisuwan et al., 2006; Mu~noz-Rodrıguez et al., 2018).
Third, the hybrid entities are most closely related in the chloroplast
analysis to the derived sweetpotato chloroplast lineage 2 (Fig. 3),
which is the result of introgression with I. trifida and therefore
I. batatas (6X, lineage 1)
I. batatas (6X)
I. aequatoriensis (4X)
modern hybrids (4X)
I. trifida (2X)
6X
4X
I. batatas
I. aequatoriensi
s
Common
ancestor
2X
6X
6X
Modern hybrids
4X
Nuclear phylogeny Chloroplast phylogeny
I. batatas (6X, lineage 2)
modern hybrids (4x)
I. trifida (2X)
Chloroplast capture
I. trifida
Present diversity
2X 2X 2X
4X
(a)
(b) (c)
I. aequatoriensis (4X)
Fig. 4 One of several possible scenarios of sweetpotato evolution and origin of current diversity. Tetraploid plants closely related to sweetpotato have two
different origins: plants from Ecuador represent direct descendants from the autotetraploid progenitor of hexaploid Ipomoea batatas, whereas plants from
Mexico and Central America are the result of a more recent hybridization between hexaploid I. batatas and diploid Ipomoea trifida. (a) One possible scenario,
congruent with the data currently available, is presented here. An autotetraploid would have arisen from a whole genome duplication of a diploid common
ancestor with I. trifida. This autotetraploid would have hybridized with the diploid ancestorto produce an allohexaploid. Subsequent introgression between the
diploid ancestor lineage and the allohexaploid would result in chloroplast capture from I. trifida, explaining the two distinct I. batatas lineages in the chloroplast
phylogenies. This separate lineage would keep a hexaploid nuclear genome but a chloroplast most similar to the diploid progenitor, and therefore to modern
I. trifida, than to the ancestral sweetpotato lineage. Red and blue colours indicate the proportion of diploid (AA, red) and tetraploid (BBBB, blue) ancestral
genomes in the different entities. Small, coloured circles represent the chloroplast. Dashed lines indicate hybridization and dotted line indicates introgression
with chloroplast capture. (b) Summary nuclear phylogeny depicting the relationship between modern taxa, with Ipomoea aequatoriensis most closely related to
I. batatas. (c) Summary chloroplast phylogeny depicting the relationship between modern taxa, with I. aequatoriensis most closely related to I. batatas lineage
1, the ancestral sweetpotato lineage.
Ó2022 The Authors
New Phytologist Ó2022 New Phytologist Foundation
New Phytologist (2022) 234: 1185–1194
www.newphytologist.com
New
Phytologist Rapid report Research 1191
postdates the origin of I. batatas (Mu~noz-Rodrıguez et al., 2018).
Finally, the PCA and LDA using morphology (Fig. 1d,e) consis-
tently show these specimens cluster with either I. trifida or
I. batatas, instead of forming a distinct group as is the case of
I. aequatoriensis.
In summary, a broader consideration of collection history,
nuclear and chloroplast sequence data, and genomic structure,
enables the identification of modern tetraploid hybrids, such as
I. tabascana and I. batatas var. apiculata, and rules them out as
sweetpotato’s tetraploid progenitor (Fig. 4). Our results also
suggest I. batatas var. apiculata should be treated as a distinct
entity of hybrid origin akin to I. tabascana rather than a subspecies
of I. batatas (Notes S1). Although we have not been able to study all
the material listed in earlier studies of tetraploid plants (Table S1;
Notes S4), future studies that consider the different criteria
presented here should allow the classification of those specimens as
either ancient autotetraploids or modern hybrids, and further
clarify their relationship to sweetpotato.
Ipomoea aequatoriensis, a key finding for sweetpotato
studies
The identification of the closest living relative of the tetraploid
progenitor of sweetpotato is key to untangling its genomic history
and contemporary diversity. Ipomoea aequatoriensis has all the
hallmarks of being that species, and therefore represents an
extraordinary discovery and a key finding for subsequent sweet-
potato studies.
Acknowledgements
This project was funded by a BBSRC research grant to RWS
(T001445/1). TW was funded by an Interdisciplinary DTP
BBSRC scholarship. PM-R was also funded by an Interdisciplinary
DTP BBSRC scholarship at the early stages of this project. The
authors thank botanical artist Rosemary Wise for the illustration of
Ipomoea aequatoriensis. The authors thank all herbarium curators
and germplasm managers for providing access to their material,
especially Masaru Tanaka at NARO Japan, as well as the people
who did fieldwork and collected the specimens in this study.
Author contributions
RWS, JRIW, PM-R, TW and TC conceived the project; PM- R and
TW conducted the analyses; NLA and RLJ contributed material
and information about its provenance; PM-R, TW, RWS, JRIW
and TC wrote the manuscript. PM-R and TW contributed equally
to this work.
ORCID
Noelle L. Anglin https://orcid.org/0000-0002-3454-1142
Tom Carruthers https://orcid.org/0000-0003-1586-3557
Robert L. Jarret https://orcid.org/0000-0002-0426-6186
Pablo Mu~noz-Rodrıguez https://orcid.org/0000-0002-3580-
8136
Robert W. Scotland https://orcid.org/0000-0002-6371-2238
Tom Wells https://orcid.org/0000-0002-4664-7868
John R. I. Wood https://orcid.org/0000-0001-5102-3729
Data availability
Raw reads from the 2018 study and newly generated data are
available in the Sequence Repository Archive, BioProjects
PRJNA453382 and PRJNA796763 respectively. Original and
edited files with morphological and molecular analyses and scripts
are available via the Oxford Research Archive (https://ora.ox.ac.uk/
objects/uuid:055e2f01-bbb1-4a69-a3ae-dac009db31d1). Any
other information required to re-analyse the data is available from
the lead contact upon request.
References
Austin DF. 1977. Hybrid polyploids in Ipomoea section Batatas.Journal of Heredity
68: 259260.
Austin DF. 1978. The Ipomoea batatas complex-I. Taxonomy. Bulletin of the Torrey
Botanical Club 105: 114129.
Austin DF. 1982.Flora of Ecuador. 165. Convolvulaceae. Lund, Sweden: Gleerup;
Stockholm, Sweden: Publishing House of the Swedish Research Council.
Austin DF. 1988. The taxonomy, evolution, and genetic diversity of the sweet
potato and its wild relatives. In: International Potato Center, ed. Exploration,
maintenance and utilization of sweet potato genetic resources. Lima, Peru:
International Potato Center, 2760.
Austin DF, Jarret RL, Tapia C, de la Puente F. 1993. Collecting tetraploid I.
batatas (L.) La m. in Ecuador. FAO/IBPGR P lant Genetic Resources N ewsletter 91:
3335.
Austin DF, de la Puente F, Contreras J. 1991.Ipomoea tabascana, an endangered
tropical species. Economic Botany 45: 435.
Bandelt HJ, Forster P, Rohl A. 1999. Median-joining networks for inferring
intraspecific phylogenies. Molecular Biology and Evolution 16:3748.
Bohac JR, Austin DF, Jones A. 1993. Discovery of wild tetraploid sweetpotatoes.
Economic Botany 47: 193201.
Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for
Illumina sequence data. Bioinformatics 30: 21142120.
Borowiec ML. 2016.A
MAS: a fast tool for alignment manipulation and computing
of summary statistics. PeerJ 4: e1660.
Castresana J. 2000. Selection of conserved blocks from multiple alignments for their
use in phylogenetic analysis. Molecular Biology and Evolution 17: 540552.
Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. 2015. Second-
generation PLINK: rising to the challenge of larger and richer datasets. GigaScience 4:7.
Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, Handsaker
RE, Lunter G, Marth GT, Sherry ST et al. 2011. The variant call format and
VCFtools. Bioinformatics 27: 21562158.
Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, Whitwham A,
Keane T, McCarthy SA, Davies RM et al. 2021. Twelve years of SAMTOOLS and
BCFTOOLS.GigaScience 10: giab008.
Dodson CH, Gentry AH. 1978. Flora of the Rio Palenque Science Center: Los Rios
Province, Ecuador. Selbyana 4: 301302.
Dodson CH, Gentry AH, Valverde FM. 1985.La flora de Jauneche, Los Rıos,
Ecuador. Quito, Ecuador: Banco Central del Ecuador.
Gao M, Ashu GM, Stewart L, Akwe WA, Njiti V, Barnes S. 2011. Wx intron
variations support an allohexaploid origin of the sweetpotato [Ipomoea batatas (L.)
Lam]. Euphytica 177: 111133.
Huaman Z, ed. 1991.Descriptors for sweet potato. Lima, Peru: CIP, AVRDC and
IBPGR.
Isobe S, Shirasawa K, Hirakawa H. 2019. Current status in whole genome
sequencing and analysis of Ipomoea.Plant Cell Reports 38: 13651371.
Jarret RL, Gawel N, Whittemore AT. 1992. Phylogenetic relationships of the
sweetpotato [Ipomoea batatas (L.) Lam.]. Journal of the American Society for
Horticultural Science 117: 633637.
New Phytologist (2022) 234: 1185–1194
www.newphytologist.com
Ó2022 The Authors
New Phytologist Ó2022 New Phytologist Foundation
Research Rapid report
New
Phytologist
1192
Jin J-J, Yu W-B, Yang J-B, Song Y, dePamphilis CW, Yi T-S, Li D-Z. 2018.
GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of
organelle genomes. Genome Biology 21: 241.
Jones A. 1965. Cytological observations and fertility measurements of sweet potato
(Ipomoea batatas (L.) Lam.). Proceedings of the American Society of Horticultural
Science 86: 527537.
Jones A. 1967. Should Nishiyama’s K123 (Ipomoea trifida) be designated I. batatas?
Economic Botany 21: 163166.
Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A, Jermiin LS. 2017.
MODELFINDER: fast model selection for accurate phylogenetic estimates. Nature
Methods 14: 587589.
Katoh K, Standley DM. 2013.MAFFT multiple sequence alignment software v.7:
improvements in performance and usability. Molecular Biology and Evolution 30:
772780.
L^e S, Josse J, Husson F. 2008.FACTOMINER:anRpackage for multivariate analysis.
Journal of Statistical Software 25:118.
Magoon ML, Krishnan R, Vijaya BK. 1970. Cytological evidence on the origin of
sweet potato. Theoretical and Applied Genetics 40: 360366.
Marc
ßais G, Kingsford C. 2011. A fast, lock-free approach for efficient parallel
counting of occurrences of k-mers. Bioinformatics 27: 764770.
Martin FW, Jones A. 1972. The species of Ipomoea closely related to the sweet
potato. Economic Botany 26: 201215.
Martin FW, Ruberte RM, Jones A. 1974. A wild Ipomoea species closely related to
the sweet potato. Economic Botany 28: 287292.
McDonald JA, Austin DF. 1990. Changes and additions in Ipomoea sect. Batatas.
Brittonia 42: 116120.
Mu~noz-Rodrıguez P, Carruthers T, Wood JRI, Williams BRM, Weitemier K,
Kronmiller B, Ellis D, Anglin NL, Longway L, Harris SA et al. 2018.
Reconciling conflicting phylogenies in the origin of sweet potato and dispersal to
Polynesia. Current Biology 28: 12461256.
Mu~noz-Rodrıguez P, Carruthers T, Wood JRI, Williams BRM, Weitemier K,
Kronmiller B, Goodwin Z, Sumadijaya A, Anglin NL, Filer D et al. 2019.A
taxonomic monograph of Ipomoea integrated across phylogenetic scales. Nature
Plants 5: 11361144.
Nguyen L-T, Schmidt HA, von Haeseler A, Minh BQ. 2015.I
Q-TREE: a fast and
effective stochastic algorithm for estimating maximum-likelihood phylogenies.
Molecular Biology and Evolution 32: 268274.
Nishiyama I. 1971. Evolution and domestication of the sweet potato. Botanical
Magazine 84: 377387.
Nishiyama I, Miyazaki T, Sakamoto S. 1975. Evolutionary autoploidy in the sweet
potato (Ipomoea batatas (L.) Lam.) and its progenitors. Euphytica 24: 197208.
Orjeda G, Freyre R, Iwanaga M. 1991. Use of Ipomoea trifida germ plasm for sweet
potato improvement. 3. Development of 4x interspecific hybrids between
Ipomoea batatas (L.) Lam. (2n=6x=90) and I. trifida (H.B.K) G. Don.
(2n=2x=30) as storage-root initiators for wild species. Theoretical and Applied
Genetics 83: 159163.
Prive F, Luu K, Blum MGB, McGrath JJ, Vilhja lmsson BJ. 2020. Efficient toolkit
implementing best practices for principal component analysis of population
genetic data. Bioinformatics 36: 44494457.
Purcell S. 2021.PLINK v. 2.00a3LM AVX2. [WWW document] URL https://www.
cog-genomics.org/plink/2.0/ [accessed 1 October 2021].
Ranallo-Benavidez TR, Jaron KS, Schatz MC. 2020.G
ENOMESCOPE 2.0 and
SMUDGEPLOT for reference-free profiling of polyploid genomes. Nature
Communications 11: 1432.
Roullier C, Duputie A, Wennekes P, Benoit L, Fernandez Bringas VM, Rossel G,
Tay D, McKey D, Lebot V. 2013. Disentangling the origins of cultivated sweet
potato (Ipomoea batatas (L.) Lam.). PLoS ONE 8: e62707.
RStudio Team. 2021.RSTUDIO: integrated development for R, v.1.4.1717. Boston,
MA, USA: RStudio, PBC.
Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T,
Preibisch S, Rueden C, Saalfeld S, Schmid B et al. 2012.F
IJI: an open-source
platform for biological-image analysis. Nature Methods 9: 676682.
Shiotani I, Kawase T. 1987. Synthetic hexaploids derived from wild species related
to sweet potato. Japanese Journal of Breeding 37: 367376.
Srisuwan S, Sihachakr D, Siljak-Yakovlev S. 2006. The origin and evolution of
sweet potato (Ipomoea batatas Lam.) and its wild relatives through the cytogenetic
approaches. Plant Science 171: 424433.
Talavera G, Castresana J. 2007. Improvement of phylogenies after removing
divergent and ambiguously aligned blocks from protein sequence alignments.
Systematic Biology 56: 564577.
Tange O. 2011. GNU Parallel the command-line power tool. The USENIX
Magazine 36:4247.
Thiers B. 2018.Index Herbariorum: a global directory of public herbaria and associated
staff. New York Botanical Garden’s Virtual Herbarium. [WWW document] URL
http://sweetgum.nybg.org/science/ih/ [accessed 10 November 2021].
Ting YC, Kehr AE. 1953. Meiotic studies in the sweet potato: (Ipomoea batatas
Lam.). Journal of Heredity 44: 207211.
Ting YC, Kehr AE, Miller JC. 1957. A cytological study of the sweet potato plant
Ipomoea batatas (L.) Lam. and its related species. The American Naturalist 91:
197203.
Venables WN, Ripley BD, Venables WN. 2002.Modern applied statistics with S.
New York, NY, USA: Springer.
Wickham H. 2016.GGPLOT2: elegant graphics for data analysis. New York, NY, USA:
Springer-Verlag.
Wickham H, Averick M, Bryan J, Chang W, McGowan L, Fran c
ßois R, Grolemund
G, Hayes A, Henry L, Hester J et al. 2019. Welcome to the TIDYVERSE.Journal of
Open Source Software 4: 1686.
Wood JRI, Mu~noz-Rodrıguez P, Williams BRM, Scotland RW. 2020.A
foundation monograph of Ipomoea (Convolvulaceae) in the New World.
PhytoKeys 143:1823.
Yan M, Li M, Moeinzadeh M-H, Quispe-Huamanquispe DG, Fan W, Nie H,
Wang Z, Heider B, Jarret R, Kreuze J et al. 2021. Haplotype-based phylogenetic
analysis uncovers the tetraploid progenitor of sweet potato. Research Square. doi:
10.21203/rs.3.rs-750500/v1.
Yang J, Moeinzadeh M-H, Kuhl H, Helmuth J, Xiao P, Haas S, Liu G, Zheng J,
Sun Z, Fan W et al. 2017. Haplotype-resolved sweet potato genome traces back its
hexaploidization history. Nature Plants 3: 696703.
Zhang C, Rabiee M, Sayyari E, Mirarab S.2018.A
STRAL-III: polynomialtime species
tree reconstruction from partially resolved gene trees. BMC Bioinformatics 19: 153.
Supporting Information
Additional Supporting Information may be found online in the
Supporting Information section at the end of the article.
Fig. S1 Ipomoea aequatoriensis specimen PI 355830/K300/
CH71.3.
Fig. S2 Ipomoea aequatoriensis specimen K500/CH80.3.
Fig. S3 Ipomoea aequatoriensis specimen PI 561248/CIP 403553.
Fig. S4 Ipomoea aequatoriensis specimen PI 561258.
Fig. S5 Ipomoea tabascana specimen PI 518479/CIP 460824 and
Ipomoea batatas var. apiculata specimen PI 518474/CIP 403953.
Fig. S6 trnL-rpl32 chloroplast DNA barcode phylogeny.
Fig. S7 Nuclear phylogenies of Ipomoea Clade A3 indicating the
position of the Ecuadorian tetraploids and the modern hybrids.
Fig. S8 Nuclear phylogenies of Ipomoea Clade A3 indicating the
position of the Ecuadorian tetraploids and the modern hybrids.
Phylogenies inferred including IUPAC characters for heterozygous
sites.
Fig. S9 Additional principal component analyses.
Ó2022 The Authors
New Phytologist Ó2022 New Phytologist Foundation
New Phytologist (2022) 234: 1185–1194
www.newphytologist.com
New
Phytologist Rapid report Research 1193
Fig. S10 Scientific illustration of Ipomoea aequatoriensis T. Wells &
P. Mu~noz.
Methods S1 Preliminary analysis of the trnL-rpl32 chloroplast
DNA region.
Methods S2 K-mer analysis of putative hybrid tetraploids.
Methods S3 Additional phylogenetic analysis of nuclear probes.
Methods S4 Additional principal component analyses.
Methods S5 K-mer analysis of putative hybrid tetraploids.
Notes S1 Modern hybrids closely related to Ipomoea batatas.
Notes S2 K-mer analyses diagrams.
Notes S3 Description and additional information for Ipomoea
aequatoriensis.
Notes S4 Hybrid specimens in other studies.
Table S1 Tetraploid accessions in previous studies, indicating past
and present identifications.
Table S2 Passport data of all samples included in morphological
analyses.
Table S3 Passport data of all samples included in phylogenetic
analyses.
Table S4 Statistics of the putative single copy nuclear regions used
in phylogenetic analysis.
Table S5 Patterns of nucleotide heterozygosity in k-mer spectra of
sequencing reads (k=21).
Please note: Wiley Blackwell are not responsible for the content or
functionality of any Supporting Information supplied by the
authors. Any queries (other than missing material) should be
directed to the New Phytologist Central Office.
See also the Commentary on this article by Sarkinen et al.,234: 1107–1108.
New Phytologist (2022) 234: 1185–1194
www.newphytologist.com
Ó2022 The Authors
New Phytologist Ó2022 New Phytologist Foundation
Research Rapid report
New
Phytologist
1194
... A milestone of our work was the publication of a taxonomic monograph of all 425 Ipomoea species in the Americas alongside comprehensive molecular phylogenies of the tribe Ipomoeeae Hallier f. (Muñoz-Rodríguez & al., 2019). The results of our monographic work also include the description of over 70 American species new to sciencealmost 9% of all species known worldwide -, an extensive nomenclatural review, and several other taxonomic publications (Wood & al., 2015(Wood & al., , 2016a(Wood & al., ,b, 2017a(Wood & al., ,b,c, 2018Wood & Scotland, 2017a,b,c;Jara & al., 2020;Muñoz-Rodríguez & al., 2019, 2022a. The molecular phylogenies also allowed us to address the evolution of the genus at a global scale , 2022aCarruthers & al., 2020a). ...
... The results of our monographic work also include the description of over 70 American species new to sciencealmost 9% of all species known worldwide -, an extensive nomenclatural review, and several other taxonomic publications (Wood & al., 2015(Wood & al., , 2016a(Wood & al., ,b, 2017a(Wood & al., ,b,c, 2018Wood & Scotland, 2017a,b,c;Jara & al., 2020;Muñoz-Rodríguez & al., 2019, 2022a. The molecular phylogenies also allowed us to address the evolution of the genus at a global scale , 2022aCarruthers & al., 2020a). Our results are supported by the study of over 25,000 physical herbarium specimens, with visits to herbaria in the U.K. and other European countries, Argentina, Bolivia, Brazil, Colombia, Cuba, Ecuador, Paraguay, Peru, United States and specimens in virtual herbaria, as well as fieldwork in Argentina, Bolivia, Brazil, Ecuador, and Paraguay. ...
... In our studies of Ipomoea, we have continually prioritised species-level taxonomic accounts as we consider these the priority output for taxonomy, given how little we know about individual species (Scotland & Wood, 2012;Goodwin & al., 2020). A secondary but useful output is the fact that we now have comprehensively sampled phylogenies that can be used for a range of evolutionary studies , 2022aCarruthers, 2019;Carruthers & al., 2020a,b). Taking these phylogenies into account forms a vital part in sorting (classifying) the species in a diagnosable, monophyletic genus within their evolutionary context. ...
Article
Big genera represent a significant proportion of the world's plants. However, comprehensive taxonomic and evolutionary studies of these genera are often complicated by their size and geographic spread. This paper explores the challenges faced in classifying these megadiverse plant groups consequent to the existing tension between diagnosability and increasing levels of resolution from molecular sequence data. We use recent examples from across angiosperms to illustrate how monophyly, diagnosability and completeness interplay with each other in attempts to classify several big genera and, specifically, the genus Ipomoea (Convolvulaceae). Ipomoea and the tribe Ipomoeeae have been the object of recent taxonomic and phylogenetic studies that highlight the limitations of previous attempts to classify the group, and show that the smaller segregate genera traditionally recognised in Ipomoeeae are nested within Ipomoea and are neither monophyletic nor diagnosable. We argue that existing classifications must be abandoned, and that recognising an expanded Ipomoea that incorporates all segregate genera of the Ipomoeeae is the most appropriate solution as it reconciles the properties of monophyly, diagnosability and completeness, and favours nomenclatural stability.
... However, Muñoz-Rodríguez et al. 23 identi ed I. aequatoriensis as the tetraploid progenitor of sweetpotato based on morphological and phylogenetic analyses. Therefore, the origin of sweetpotato is still controversial and needs to be determined. ...
... There are two speculations regarding the relationship between the basal 4x and sweetpotato. The rst one considers the basal 4x as the tetraploid progenitor of sweetpotato 22 , whilst the second one treats the basal 4x as hybrid offspring of crosses between sweetpotato and I. tri da 23 . The key to con rming the tetraploid progenitor is to establish an effective standard to distinguish the tetraploid progenitor and the hybrid offspring 21 . ...
... The origin of sweetpotato is still the subject of erce debate. Competing hypotheses have been put forward proposing that sweetpotato is an autopolyploid, a segmental allopolyploid, or an allopolyploid 8, 9,16,18,19,22,23,40,41 . The genetic origin of sweetpotato has remained unresolved because of the high complexity of the genome, due to its hexaploid nature and high degree of heterozygosity 5,24 , and limited plant materials available for analysis. ...
Preprint
Full-text available
The hexaploid sweetpotato is one of the most important root crops worldwide. However, its genetic origins remain controversial. In this study, we identified two likely progenitors of sweetpotato by analyzing the horizontally transferred Ib T-DNA and a haplotype-based phylogenetic analysis. The diploid form of I. aequatoriensis contributed the B 1 subgenome, the Ib T-DNA2 and the lineage 2 type of the chloroplast genome to sweetpotato. The tetraploid progenitor of sweetpotato is I. batatas 4x, donating the B 2 subgenome, Ib T-DNA1 and the lineage 1 type of chloroplast genome. Sweetpotato is derived from reciprocal crosses between the diploid and the tetraploid progenitor, and a subsequent whole genome duplication. We also detected biased gene exchanges between subgenomes. The B 1 to B 2 subgenome conversions were almost 3-fold higher than the B 2 to B 1 subgenome conversions. This study sheds lights on the evolution of sweetpotato and paves the way for the improvement of the crop.
... Understanding the ancestry and evolutionary history of crops is challenging, however, and requires the unification of two, often widely separated, knowledge systemsplant breeding and biodiversity science. In this issue of New Phytologist, Muñoz-Rodr ıguez et al. (2022;pp. 1185-1194 provide a compelling example of how the two can meet with their investigation into the ancestry of the cultivated sweetpotato (Ipomoea batatas (L.) Lam.; Convolvulaceae). ...
... Ladefoged et al., 2005;Roullier et al., 2013a, b). Muñoz-Rodr ıguez et al. (2022) identify and name a new species that represents the closest tetraploid relative of the cultivated sweetpotato. Sweetpotato is a hexaploid crop and was originally thought to be of autoploid origin, where genome doubling without hybridisation in I. trifida, a diploid ancestor native to the Caribbean (Roullier et al., 2013b;Muñoz-Rodr ıguez et al., 2018), gave rise to the cultivated species. ...
... Evidence from cytogenetic work, however, challenged this view by suggesting an alloploid origin of the sweetpotato where an unknown tetraploid species hybridised with I. trifida and, in doing so, created the hexaploid I. batatas (Yang et al., 2017). Findings by Muñoz-Rodr ıguez et al. (2022) provide solid morphological and molecular evidence of the alloploid origin of the sweetpotato and establish I. aequatoriensis from Pacific Ecuador and Colombia as the name of its hitherto enigmatic closest tetraploid relative. They used multiple independent lines of evidence from high-throughput sequence data, morphology, and geography to identify these plants as the missing piece of the complicated puzzle of sweetpotato origins. ...
Article
Full-text available
This article is a Commentary on Muñoz‐Rodríguez et al. (2022), 234: 1185–1194.
... This taxon, commonly known as 'morning glory' , is a pantropical genus that grows naturally in warm www.nature.com/scientificreports/ temperate and subtropical regions of the world 36,37 . Traditionally an American origin was suggested for Ipomoea based on the most ancient fossil record 38,39 , or in East Gondwana based on molecular data 40 . ...
Article
Full-text available
We report a new type of fossil margin galls arranged in a linear series on dicot leaf impressions from the latest Neogene (Pliocene) sediments of the Chotanagpur Plateau, Jharkhand, eastern India. We collected ca. 1500 impression and compression leaf fossils, of which 1080 samples bear arthropod damage referable to 37 different damage types (DT) in the ‘Guide to Insect (and Other) Damage Types in Compressed Plant Fossils’. A few leaf samples identified as Ipomoea L. (Convolvulaceae) have specific margin galls that do not match any galling DT previously described. This type of galling is characterized by small, linearly arranged, irregular, sessile, sub-globose, solitary, indehiscent, solid pouch-galls with irregular ostioles. The probable damage inducers of the present galling of the foliar margin might be members of Eriophyidae (Acari). The new type of gall suggests that marginal gall- inducing mites on leaves of Ipomoea did not change their host preference at the genus level since the Pliocene. The development of marginal leaf galling in Ipomoea is linked to extrafloral nectaries that do not offer protection against arthropod galling but indirectly protect the plant against herbivory from large mammals.
... The second hypothesis follows the same initial autopolyploidization of a species followed by tetraploid-diploid hybridization with a different species (likely I. trifida and I. batatas) (Austin, 1988, Oracion et al., 1990Roullier et al., 2013;Yang et al., 2017;Muñoz-Rodrıǵuez et al., 2018). Recently, Mollinari et al. proposed a hexasomic-bivalent inheritance model that may help elucidate sweetpotato origins and allow for efficient application of molecular techniques (Mollinari et al., 2020) and Muñoz-Rodrıǵuez et al. identified a new tetraploid species, Ipomoea aequatoriensis, that may be sweetpotato's closest tetraploid relative known to date (Muñoz-Rodrıǵuez et al., 2022). Due to the complexity of polyploid genetics and segregation ratios (Lebot, 2010), sweetpotato can be notoriously difficult to breed. ...
Article
Full-text available
Sweetpotato (Ipomoea batatas) is the sixth most important food crop and plays a critical role in maintaining food security worldwide. Support for sweetpotato improvement research in breeding and genetics programs, and maintenance of sweetpotato germplasm collections is essential for preserving food security for future generations. Germplasm collections seek to preserve phenotypic and genotypic diversity through accession characterization. However, due to its genetic complexity, high heterogeneity, polyploid genome, phenotypic plasticity, and high flower production variability, sweetpotato genetic characterization is challenging. Here, we characterize the genetic diversity and population structure of 604 accessions from the sweetpotato germplasm collection maintained by the United States Department of Agriculture (USDA), Agricultural Research Service (ARS), Plant Genetic Resources Conservation Unit (PGRCU) in Griffin, Georgia, United States. Using the genotyping-by-sequencing platform (GBSpoly) and bioinformatic pipelines (ngsComposer and GBSapp), a total of 102,870 polymorphic SNPs with hexaploid dosage calls were identified from the 604 accessions. Discriminant analysis of principal components (DAPC) and Bayesian clustering identified six unique genetic groupings across seven broad geographic regions. Genetic diversity analyses using the hexaploid data set revealed ample genetic diversity among the analyzed collection in concordance with previous analyses. Following population structure and diversity analyses, breeder germplasm subsets of 24, 48, 96, and 384 accessions were established using K-means clustering with manual selection to maintain phenotypic and genotypic diversity. The genetic characterization of the PGRCU sweetpotato germplasm collection and breeder germplasm subsets developed in this study provide the foundation for future association studies and serve as precursors toward phenotyping studies aimed at linking genotype with phenotype.
Article
Full-text available
Two interrelated aspects of the sweetpotato genome, its polyploid origin and inheritance type, remain uncertain. We recently proposed a segmental allohexaploid sweetpotato and thus sought to clarify its inheritance type by direct analyses of homoeolog segregations at selected single-copy loci. For such analyses, we developed a digital quantitative PCR genotyping method using one nondiscriminatory and three discriminatory probes for each selected locus to discriminate and quantify three homoeolog-differentiating variation types (homoeolog-types) in genomic DNA samples for genotype fitting and constructed a F2 population for segregation analyses. We confirmed inter-subgenomic distinctions of three identified homoeolog-types at each of five selected loci by their interspecific differentiations among 14 species in Ipomoea section batatas and genotyped the loci in 549 F2 lines, selected F1 progenies, and their founding parents. Segregation and genotype analyses revealed a locus-dependent mixed inheritance (disomic, polysomic, and intermediate types) of the homoeolog-types at 4 loci in the F2 population, displaying estimated disomic-inheritance frequencies of 0, 2.72%, 14.52%, and 36.92%, and probably in the F1 population too. There were also low-frequency non-hexaploid F1 and F2 genotypes that were probably derived from double-reduction recombination or partially unreduced gametes, and F2 genotypes of apparent aneuploids/dysploids with neopolyploid-like frequencies. Additional analyses of homoeolog-type genotypes at the 5 loci in 46 lines from various regions revealed locus-dependent selection biases, favoring genotypes having more of one homoeolog-type, i.e. more of di- or homogenized homoeolog-type composition, and one-direction ploidy trending among apparent aneuploids/dysploids. These inheritance features pointed to an evolving segmental allohexaploid sweetpotato impacted by selection biases.
Preprint
Full-text available
The hexaploid sweetpotato is one of the most important root crops worldwide. However, its genetic origins are controversial. In this study, we identified two progenitors of sweetpotato by horizontal gene transferred Ib T-DNA and haplotype-based phylogenetic analysis. The diploid progenitor is the diploid form of I. aequatoriensis , contributed the B 1 subgenome, Ib T-DNA2 and lineage 2 type of chloroplast genome to sweetpotato. The tetraploid progenitor of sweetpotato is I. batatas 4x, donating the B 2 subgenome, Ib T-DNA1 and lineage 1 type of chloroplast genome. Sweetpotato derived from the reciprocal cross between the diploid and tetraploid progenitors and a subsequent whole genome duplication. We also detected biased gene exchanges between subgenomes. The B 1 to B 2 subgenome conversions were almost 3-fold higher than the B 2 to B 1 subgenome conversions. This study sheds lights on the evolution of sweetpotato and paves a way for the improvement of sweetpotato.
Article
Full-text available
Sweetpotato (Ipomoea batatas (L.) Lam.) is one of the most important root crops cultivated worldwide. Due to its adaptability, high yield potential, and nutritional value, sweetpotato has become an important food crop, particularly in developing countries. To ensure adequate crop yields to meet increasing demand, it is essential to enhance the tolerance of sweetpotato to environmental stresses and other yield-limiting factors. The highly heterozygous hexaploid genome of I. batatas complicates genetic studies and limits the improvement of sweetpotato through traditional breeding. However, the application of next-generation sequencing (NGS) and high-throughput genotyping and phenotyping technologies to sweetpotato genetics and genomics research have provided new tools and resources for crop improvement. In this review, we discuss the genomics resources that are available for sweetpotato, including the current reference genome, databases, and available bioinformatics tools. We systematically review the current state of knowledge on the polyploid genetics of sweetpotato including studies of its origin and germplasm diversity, and also the associated mapping of important agricultural traits. We then outline the conventional and molecular breeding approaches that have been applied to sweetpotato. Finally, we discuss future goals for genetic studies in sweetpotato and crop improvement via breeding in combination with state-of-the-art multi-omics approaches such as genomic selection and gene editing. These approaches will advance and accelerate the genetic improvement of this important root crop and facilitate its sustainable global production.
Preprint
Full-text available
The hexaploid sweet potato is one of the most important root crops worldwide. However, its genetic origins, especially that of its tetraploid progenitor, are unclear. In this study, we conceived a pipeline consisting of a genome-wide variation-based phylogeny and a novel haplotype-based phylogenetic analysis (HPA) to determine that the tetraploid accession CIP695141 of Ipomoea batatas 4x from Peru is the tetraploid progenitor of sweet potato. We detected biased gene exchanges between subgenomes. The B 1 to B 2 subgenome conversions were almost 3-fold higher than the B 2 to B 1 subgenome conversions. Our analyses revealed that the genes involved in storage root formation, sugar transport, stress resistance, and maintenance of genome stability have been selected during the speciation and domestication of sweet potato. This study sheds lights on the evolution of sweet potato and paves a way for the improvement of sweet potato.
Article
Full-text available
Background: SAMtools and BCFtools are widely used programs for processing and analysing high-throughput sequencing data. They include tools for file format conversion and manipulation, sorting, querying, statistics, variant calling, and effect analysis amongst other methods. Findings: The first version appeared online 12 years ago and has been maintained and further developed ever since, with many new features and improvements added over the years. The SAMtools and BCFtools packages represent a unique collection of tools that have been used in numerous other software projects and countless genomic pipelines. Conclusion: Both SAMtools and BCFtools are freely available on GitHub under the permissive MIT licence, free for both non-commercial and commercial use. Both packages have been installed >1 million times via Bioconda. The source code and documentation are available from https://www.htslib.org.
Article
Full-text available
GetOrganelle is a state-of-the-art toolkit to accurately assemble organelle genomes from whole genome sequencing data. It recruits organelle-associated reads using a modified “baiting and iterative mapping” approach, conducts de novo assembly, filters and disentangles the assembly graph, and produces all possible configurations of circular organelle genomes. For 50 published plant datasets, we are able to reassemble the circular plastomes from 47 datasets using GetOrganelle. GetOrganelle assemblies are more accurate than published and/or NOVOPlasty-reassembled plastomes as assessed by mapping. We also assemble complete mitochondrial genomes using GetOrganelle. GetOrganelle is freely released under a GPL-3 license (https://github.com/Kinggerm/GetOrganelle).
Article
Full-text available
Motivation Principal component analysis (PCA) of genetic data is routinely used to infer ancestry and control for population structure in various genetic analyses. However, conducting PCA analyses can be complicated and has several potential pitfalls. These pitfalls include (i) capturing linkage disequilibrium (LD) structure instead of population structure, (ii) projected PCs that suffer from shrinkage bias, (iii) detecting sample outliers and (iv) uneven population sizes. In this work, we explore these potential issues when using PCA, and present efficient solutions to these. Following applications to the UK Biobank and the 1000 Genomes project datasets, we make recommendations for best practices and provide efficient and user-friendly implementations of the proposed solutions in R packages bigsnpr and bigutilsr. Results For example, we find that PC19–PC40 in the UK Biobank capture complex LD structure rather than population structure. Using our automatic algorithm for removing long-range LD regions, we recover 16 PCs that capture population structure only. Therefore, we recommend using only 16–18 PCs from the UK Biobank to account for population structure confounding. We also show how to use PCA to restrict analyses to individuals of homogeneous ancestry. Finally, when projecting individual genotypes onto the PCA computed from the 1000 Genomes project data, we find a shrinkage bias that becomes large for PC5 and beyond. We then demonstrate how to obtain unbiased projections efficiently using bigsnpr. Overall, we believe this work would be of interest for anyone using PCA in their analyses of genetic data, as well as for other omics data. Availability and implementation R packages bigsnpr and bigutilsr can be installed from either CRAN or GitHub (see https://github.com/privefl/bigsnpr). A tutorial on the steps to perform PCA on 1000G data is available at https://privefl.github.io/bigsnpr/articles/bedpca.html. All code used for this paper is available at https://github.com/privefl/paper4-bedpca/tree/master/code. Supplementary information Supplementary data are available at Bioinformatics online.
Article
Full-text available
An important assessment prior to genome assembly and related analyses is genome profiling, where the k-mer frequencies within raw sequencing reads are analyzed to estimate major genome characteristics such as size, heterozygosity, and repetitiveness. Here we introduce GenomeScope 2.0 (https://github.com/tbenavi1/genomescope2.0), which applies combinatorial theory to establish a detailed mathematical model of how k-mer frequencies are distributed in heterozygous and polyploid genomes. We describe and evaluate a practical implementation of the polyploid-aware mixture model that quickly and accurately infers genome properties across thousands of simulated and several real datasets spanning a broad range of complexity. We also present a method called Smudgeplot (https://github.com/KamilSJaron/smudgeplot) to visualize and estimate the ploidy and genome structure of a genome by analyzing heterozygous k-mer pairs. We successfully apply the approach to systems of known variable ploidy levels in the Meloidogyne genus and the extreme case of octoploid Fragaria × ananassa. Prior to genome assembly, the raw sequencing reads must be analyzed for assessment of major genome characteristics such as genome size, heterozygosity, and repetitiveness. For this purpose, the authors introduce GenomeScope 2.0, an extension of GenomeScope for polyploid genomes, and Smudgeplot, which can estimate a genome’s ploidy.
Article
Full-text available
A monograph of the 425 New World species of Ipomoea is presented. All 425 species are described and information is provided on their ecology and distribution, with citations from all countries from which they are reported. Notes are provided on salient characteristics and taxonomic issues related to individual species. A full synonymy is provided and 272 names are lectotypified. An extensive introduction discusses the delimitation and history of Ipomoea arguing that a broad generic concept is the only rational solution in the light of recent phylogenetic advances. Although no formal infrageneric classification is proposed, attention is drawn to the major clades of the genus and several morphologically well-defined clades are discussed including those traditionally treated under the names Arborescen s, Batatas , Pharbitis , Calonyction and Quamoclit , sometimes as distinct genera, subgenera, sections or series. Identification keys are provided on a regional basis including multi-entry keys for the main continental blocks. Six species are described as new, Ipomoea nivea J.R.I. Wood & Scotland from Peru, I. apodiensis J.R.I. Wood & Scotland from Brazil, I. calcicola J.R.I. Wood & Scotland, I. pochutlensis J.R.I. Wood & Scotland, I. zacatecana J.R.I. Wood & Scotland and I. ramulosa J.R.I. Wood & Scotland from Mexico, while var. australis of I. cordatotriloba is raised to specific status as I. australis (O’Donell) J.R.I. Wood & P. Muñoz. New subspecies for I. nitida (subsp. krapovickasii J.R.I. Wood & Scotland) and for I. chenopodiifolia (subsp. bellator J.R.I. Wood & Scotland) are described. The status of previously recognized species and varieties is changed so the following new subspecies are recognized: I. amnicola subsp. chiliantha (Hallier f.) J.R.I. Wood & Scotland, I. chenopodiifolia subsp. signata (House) J.R.I. Wood & Scotland, I. orizabensis subsp. collina (House) J.R.I. Wood & Scotland, I. orizabensis subsp. austromexicana (J.A. McDonald) J.R.I. Wood & Scotland, I. orizabensis subsp. novogaliciana (J.A. McDonald) J.R.I. Wood & Scotland, I. setosa subsp. pavonii (Hallier f.) J.R.I. Wood & Scotland, I. setosa subsp. melanotricha (Brandegee) J.R.I. Wood & Scotland, I. setosa subsp. sepacuitensis (Donn. Sm.) J.R.I. Wood & Scotland, I. ternifolia subsp. leptotoma (Torr.) J.R.I. Wood & Scotland. Ipomoea angustata and I. subincana are treated as var. angustata (Brandegee) J.R.I. Wood & Scotland and var. subincana (Choisy) J.R.I. Wood & Scotland of I. barbatisepala and I. brasiliana respectively. Attention is drawn to a number of hitherto poorly recognized phenomena in the genus including a very large radiation centred on the Parana region of South America and another on the Caribbean Islands, a strong trend towards an amphitropical distribution in the New World, the existence of a relatively large number of species with a pantropical distribution and of many species in different clades with storage roots, most of which have never been evaluated for economic purposes. The treatment is illustrated with over 200 figures composed of line drawings and photographs.
Article
Full-text available
Taxonomic monographs have the potential to make a unique contribution to the understanding of global biodiversity. However, such studies, now rare, are often considered too daunting to undertake within a realistic time frame, especially as the world’s collections have doubled in size in recent times. Here, we report a global-scale monographic study of morning glories (Ipomoea) that integrated DNA barcodes and high-throughput sequencing with the morphological study of herbarium specimens. Our approach overhauled the taxonomy of this megadiverse group, described 63 new species and uncovered significant increases in net diversification rates comparable to the most iconic evolutionary radiations in the plant kingdom. Finally, we show that more than 60 species of Ipomoea, including sweet potato, independently evolved storage roots in pre-human times, indicating that the storage root is not solely a product of human domestication but a trait that predisposed the species for cultivation. This study demonstrates how the world’s natural history collections can contribute to global challenges in the Anthropocene. Taxonomic monographs have been considered too vast and daunting as a source for studying biodiversity, but this novel study of morning glories combines herbarium specimens with DNA barcodes and high-throughput sequencing to describe new species and discover hidden traits.
Article
Full-text available
The recent advances of next-generation sequencing have made it possible to construct reference genome sequences in divergent species. However, de novo assembly at the chromosome level remains challenging in polyploid species, due to the existence of more than two pairs of homoeologous chromosomes in one nucleus. Cultivated sweet potato (Ipomoea batatas (L.) Lam) is a hexaploid species with 90 chromosomes (2n = 6X = 90). Although the origin of sweet potato is also still under discussion, diploid relative species, I. trifida and I. triloba have been considered as one of the most possible progenitors. In this manuscript, we review the recent results and activities of whole-genome sequencing in the genus Ipomoea series Batatas, I. trifida, I. triloba and sweet potato (I. batatas). Most of the results of genome assembly suggest that the genomes of sweet potato consist of two pairs and four pairs of subgenomes, i.e., B1B1B2B2B2B2. The results also revealed the relation between sweet potato and other Ipomoea species. Together with the development of bioinformatics approaches, the large-scale publicly available genome and transcript sequence resources and international genome sequencing streams are expected to promote the genome sequence dissection in sweet potato.
Article
Full-text available
Background: Evolutionary histories can be discordant across the genome, and such discordances need to be considered in reconstructing the species phylogeny. ASTRAL is one of the leading methods for inferring species trees from gene trees while accounting for gene tree discordance. ASTRAL uses dynamic programming to search for the tree that shares the maximum number of quartet topologies with input gene trees, restricting itself to a predefined set of bipartitions. Results: We introduce ASTRAL-III, which substantially improves the running time of ASTRAL-II and guarantees polynomial running time as a function of both the number of species (n) and the number of genes (k). ASTRAL-III limits the bipartition constraint set (X) to grow at most linearly with n and k. Moreover, it handles polytomies more efficiently than ASTRAL-II, exploits similarities between gene trees better, and uses several techniques to avoid searching parts of the search space that are mathematically guaranteed not to include the optimal tree. The asymptotic running time of ASTRAL-III in the presence of polytomies is [Formula: see text] where D=O(nk) is the sum of degrees of all unique nodes in input trees. The running time improvements enable us to test whether contracting low support branches in gene trees improves the accuracy by reducing noise. In extensive simulations, we show that removing branches with very low support (e.g., below 10%) improves accuracy while overly aggressive filtering is harmful. We observe on a biological avian phylogenomic dataset of 14K genes that contracting low support branches greatly improve results. Conclusions: ASTRAL-III is a faster version of the ASTRAL method for phylogenetic reconstruction and can scale up to 10,000 species. With ASTRAL-III, low support branches can be removed, resulting in improved accuracy.