ArticlePDF Available

Genomic and enzymatic evidence for acetogenesis among multiple lineages of the archaeal phylum Bathyarchaeota widespread in marine sediments

Authors:

Abstract and Figures

Members of the archaeal phylum Bathyarchaeota are widespread and abundant in the energy-deficient marine subsurface sediments. However, their life strategies have remained largely elusive. Here, we provide genetic evidence that some lineages of Bathyarchaeota are acetogens, being capable of homoacetogenesis, a metabolism so far restricted to the domain Bacteria. Metabolic reconstruction based on genomic bins assembled from the metagenome of deep-sea subsurface sediments shows that the metabolism of some lineages of Bathyarchaeota is similar to that of bona fide bacterial homoacetogens, by having pathways for acetogenesis and for the fermentative utilization of a variety of organic substrates. Heterologous expression and activity assay of the acetate kinase gene ack from Bathyarchaeota, demonstrate further the capability of these Bathyarchaeota to grow as acetogens. The presence and expression of bathyarchaeotal genes indicative of active acetogenesis was also confirmed in Peru Margin subsurface sediments where Bathyarchaeota are abundant. The analyses reveal that this ubiquitous and abundant subsurface archaeal group has adopted a versatile life strategy to make a living under energy-limiting conditions. These findings further expand the metabolic potential of Archaea and argue for a revision of the role of Archaea in the carbon cycle of marine sediments.
Content may be subject to copyright.
Genomic and enzymatic evidence for acetogenesis
among multiple lineages of the archaeal phylum
Bathyarchaeota widespread in marine sediments
Y. He1,2,M.Li
3, V. Perumal1, X. Feng1, J. Fang1, J. Xie1, S. M. Sievert4and F. Wang1,2*
Members of the archaeal phylum Bathyarchaeota are widespread and abundant in the energy-decient marine subsurface
sediments. However, their life strategies have remained largely elusive. Here, we provide genetic evidence that some
lineages of Bathyarchaeota are acetogens, being capable of homoacetogenesis, a metabolism so far restricted to the
domain Bacteria. Metabolic reconstruction based on genomic bins assembled from the metagenome of deep-sea
subsurface sediments shows that the metabolism of some lineages of Bathyarchaeota is similar to that of bona de
bacterial homoacetogens, by having pathways for acetogenesis and for the fermentative utilization of a variety of organic
substrates. Heterologous expression and activity assay of the acetate kinase gene ack from Bathyarchaeota, demonstrate
further the capability of these Bathyarchaeota to grow as acetogens. The presence and expression of bathyarchaeotal
genes indicative of active acetogenesis was also conrmed in Peru Margin subsurface sediments where Bathyarchaeota
are abundant. The analyses reveal that this ubiquitous and abundant subsurface archaeal group has adopted a versatile
life strategy to make a living under energy-limiting conditions. These ndings further expand the metabolic potential of
Archaea and argue for a revision of the role of Archaea in the carbon cycle of marine sediments.
Alarge number of bacteria and archaea reside in energy-
decient subsurface sediments at depths greater than one
metre below the seaoor (mbsf) and thus represent a con-
siderable portion of the Earths biomass1. Most of these microorgan-
isms do not have cultivated representatives, and this poses a major
challenge when elucidating the mechanisms allowing life to exist
under the extreme energy-limiting conditions prevailing in the
deep biosphere2. It has been proposed that these subsurface
microbes are mainly fuelled by recalcitrant organic carbon buried
in the marine sediments3. However, it remains unexplained how
these subsurface microorganisms subsist under conditions where
the available energy ux appears to be 1,000-fold lower than the
minimum value for maintenance energy estimated from laboratory
cultures2,4. It is estimated that 90% of the cells reside below the
sulfate methane transition zone (SMTZ) of continental shelf sedi-
ments, where fermentation and methanogenesis are assumed to
occur5. Although acetogenesis has been found to co-occur with
sulfate reduction and methanogenesis in terrestrial groundwater
and subseaoor sediments6,7, the microbes associated with this
remain unknown.
Members of Archaea, the third domain of life distinct from
Bacteria and Eukarya, constitute a major component of subsurface
microbial communities4,8. Globally, members of the Miscellaneous
Crenarchaeota Group (MCG, recently assigned to a novel phylum
Bathyarchaeota) often dominate marine subsurface archaeal com-
munities9,10 and have been identied to be one of the most active
microbial groups in the deep marine biosphere based on the esti-
mation of chemical rates11. Bathyarchaeota have been found to
account for between 10 and 100% of all archaea (on average
3060%) in the deep marine biosphere9. Assuming that bacteria
and archaea are present in equal abundance in the deep biosphere12
and that 50% of the deep biomass (in total 2.9 ×10
29
cells globally1)
occurs at continental margins (at water depths of <3,500 m), with
approximately 90% below the SMTZ5, then Bathyarchaeota cells
would account for between 2 ×10
28
and 3.9 ×10
28
cells globally,
making it one of the most abundant microbial groups on the
planet. Bathyarchaeota were initially considered to be heterotrophs4,
which has been partially conrmed by single-cell genomic and
metagenomic analyses10,13, as well as stable isotope probing exper-
iments14. However, the phylum Bathyarchaeota is composed of a
large number of diverse lineages, most probably also with a high
metabolic diversity9,15. Recently, members of two lineages of
Bathyarchaeota were suggested to be capable of methane meta-
bolism based on two genomes obtained from metagenomic
analysis16. However, it is the paucity of genomes across this huge,
diverse phylum10,15,16 that currently impedes a comprehensive under-
standing of the metabolic capabilities and potential biogeochemical
roles of this phylum9.
Results and discussion
We conducted shotgun metagenomic sequencing of a marine
sediment sample collected from the Guaymas Basin (GB) in the
Gulf of California. Quantitative-PCR (q-PCR) analyses of the 16S
rRNA gene demonstrated that the sediment sample contained
2.8 ×10
8
archaeal cells per gram of sediment, of which 74% were
Bathyarchaeota (Supplementary Fig. 1). De novo assembly of metage-
nomic reads and binning by tetranucleotide signatures (for details see
Methods) revealed six Bathyarchaeota genomic bins (Supplementary
Fig. 2). The bin size of the obtained Bathyarchaeota ranged from
0.9 Mb to 2.0 Mb, with coverage ranging from 10×to 28×
(Table 1 and Supplementary Table 1). Genome completeness
within each bin was between 60 and 94% based on the presence of
1State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China. 2State Key
Laboratory of Ocean Engineering, Shanghai Jiao Tong University, Shanghai, China. 3Institute for Advanced Study, Shenzhen University, Shenzhen,
Guangdong, China. 4Biology Department, Woods Hole Oceanographic Institution, Woods Hole, Massachusetts 02543, USA. These authors contributed
equally to this work. *e-mail: fengpingw@sjtu.edu.cn
ARTICLES
PUBLISHED: 4 APRIL 2016 | ARTICLE NUMBER: 16035 | DOI: 10.1038/NMICROBIOL.2016.35
NATURE MICROBIOLOGY |www.nature.com/naturemicrobiology 1
© 2016 Macmillan Publishers Limited. All rights reserved
single-copy genes (Table 1), and 148 genes were shared by the ident-
ied Bathyarchaeota genomic bins (Supplementary Table 2).
The phylogeny based on 53 concatenated archaeal conserved
single-copy genes17 conrmed the placement of Bathyarchaeota
into a novel archaeal phylum (Fig. 1a and Supplementary
Table 3), which is consistent with previous results13. A separate
analysis with 16S rRNA genes from all publicly available
Bathyarchaeota genomes15,16 indicated that they represent separate
groups that diverge widely within Bathyarchaoeta: B23 belongs to
subgroup MCG-15; B26-1 and B26-2 belong to MCG-16; and B24
and B25 represent members of unclassied novel groups (Fig. 1b).
Metabolic reconstruction showed that the Bathyarchaeota ident-
ied in the present study (if not all the sublineages within it15,16)
have the genetic potential for inorganic carbon xation via the reduc-
tive acetyl-CoA (WoodLjungdahl, WL) pathway (Fig. 2). Genes of
all ve subunits of the carbon monoxide dehydrogenase/acetyl-CoA
synthase complex (Codh/Acs), the key enzyme for the WL
pathway, were found in ve of the genomic bins (but not in B23,
Supplementary Table 4). In addition, most of the genes that encode
enzymes for the methyl branch of the WL pathway in archaea,
through which CO
2
is reduced to methyltetrahydromethanopterin,
were found (Fig. 2 and Supplementary Table 4). The enzyme formyl-
methanofuran dehydrogenase (FwdA-F), which is involved in the
reduction of CO
2
, was detected in all six Bathyarchaeota genomic
bins. Genes encoding for enzymes catalysing the subsequent
reactions, that is, formylmethanofuran:tetrahydromethanopterin
formyltransferase (Ftr), N5,N10-methenyltetrahydromethanopterin
cyclohydrolase (Mch), methylenetetrahydromethanopterin dehy-
drogenase (Mtd) and N5,N10-methylenetetrahydromethanopterin
reductase (Mer), were all identied in three genomic bins (B24,
B26_1 and B26_2). B63 included all of these genes except mer
(Supplementary Table 4). The complete set of genes encoding the
membrane-associated methyltransferase (MtrA-H) was only found
in B25, and was absent in all the other ve bins. Similarly, only
B25 contained the gene coding for methanol coenzyme M methyl-
transferase (MtaA)(Supplementary Table 4). These results suggest
that most members of the Bathyarchaeota possess the key
enzymes required for a functional WL pathway. Interestingly,
none of these six Bathyarchaeota genomic bins (which represent
at least four Bathy-subgroups within the Bathyarchaeota) contains
genes coding for McrABG (Supplementary Table 4), the key gene
for methane metabolism, indicating that only a few groups of
Bathyarchaeota have the potential for methane metabolism16.
The present data further provide evidence that members of the
Bathyarchaeota are able to metabolize acetate. Phosphate acetyltrans-
ferase (Pta) and acetate kinase (Ack) were found in three of the six
Bathyarchaeota genomic bins (Supplementary Table 4). The Pta-Ack
pathway for acetate production/assimilation is widely distributed in
bacteria, whereas archaea usually use the acetyl-CoA synthetase (ade-
nosine diphosphate (ADP)-forming)(ACD),includingthepotentially
acetogenic Bathyarchaeota lineages reported previously15.ACDwas,
until recently, regarded as being specicforArchaea,whenitwas
also found in a few bacterial genomes, such as the propionic acid-pro-
ducing bacteria Propionibacterium acidipropionici18,19.Inarchaea,the
Pta-Ack pathway has so far only been identied in the methanogenic
genus Methanosarcina,whichiscapableofproducingmethanefrom
acetate20.Thus,thepresentstudyistherst to identify genes coding
for Pta-Ack in non-methanogenic archaea. The pta-ack genes in
Methanosarcina are assumed to have been acquired by a single hori-
zontal gene transfer event, probably from a clade of cellulolytic
Clostridia21.However,phylogeneticanalysisofthetranslatedack and
pta sequences of Bathyarchaeota indicated that these genes form a
monophylogenetic clade that is distinct from all other known
sequences (Fig. 3a,b). Thus, the Pta-Ack pathway in Bathyarchaeota
appears to have evolved independently from that of Bacteria.
Bona de homoacetogenesis, that is, the production of acetate solely
from hydrogen and CO
2
,istraditionallyseenasametabolismrestricted
to Bacteria22,althoughtwoarchaealspeciesthe methanogen
Methanosarcina acetivorans and the sulfate-reducing archaeon
Archaeoglobus fulgidushave been shown to produce acetate from
CO in addition to formate under certain growth conditions20,23.In
support of the acetogenic lifestyle of Bathyarchaeota suggested by the
genomic data presented here, we were able to clone and express the
bathyarchaeotal ack gene in Escherichia coli (Supplementary Fig. 3).
The puried enzyme showed catalysing activity for the reversible phos-
phorylation of acetate to acetyl phosphate. The bathy-Ack was approxi-
mately 15-fold more active (higher catalytic efciency) in the reaction
of acetate synthesis than the reverse reaction, and it showed a much
higher afnity to acetyl phosphate (K
m
=0.48mM) than to acetate
(K
m
=3.9mM)(SupplementaryTable5).
The Bathyarchaeota analysed in the present study also demon-
strated the potential to use a number of complex organic compounds
as growth substrates, including proteins, cellulose, chitin and aromatic
compounds, in line with previous ndings10,13,15. This exibility
suggests that these widespread archaea9,10 are heterotrophs with
extensive abilities to efciently use organic matter, a strategy well
suited to make a living in carbon-starved deep marine sediments2.
Genes encoding glycolysis/gluconeogenesis and beta-oxidation path-
ways were present in all genomic bins (Supplementary Tables 6 and 7),
highlighting the heterotrophic lifestyle of Bathyarchaeota. A
partial tricarboxylic acid (TCA) cycle only missing citrate synthase
(Cit) was identied in genomic bins, similarly to what has been
Table 1 | Overview of Bathyarchaeota genomic bins.
Bathyarchaeota bin B23 B24 B25 B26-1 B26-2 B63
Number of contigs 108 37 92 136 84 158
Bin size (Mb) 0.898 1.73 0.939 1.384 1.969 1.22
Number of predicted genes 1,093 1,791 1,079 1,454 1,948 1,391
N50 value (kb) 8,536 64,843 12,095 12,086 33,188 8,025
Number of genes annotated by Pfam 812 1,437 867 1,051 1,452 1,008
Number of CSCGs* 97 152 114 119 124 95
Expected complete genome size based on CSCGs (Pfam, in Mb)* 1.5 1.84 1.334 1.884 2.572 2.08
Achieved genome coverage (Pfam)* 59.90% 93.80% 70.40% 73.50% 76.60% 58.70%
Number of genes annotated by COG 836 1,482 890 1,083 1,463 1,037
Number of CSCGs
31 42 38 36 43 29
Expected complete genome size based on CSCGs (COG, in Mb)
1.275 1.81 1.087 1.692 2.015 1.851
Achieved genome coverage (COG)
70.40% 95.50% 86.40% 81.80% 97.70% 65.90%
Completeness
60.9% 96.7% 74.3% 90.7% 93.0% 59.8%
Contamination
0.0% 2.8% 1.3% 6.1% 3.8% 0.9%
Estimation of genome size wasconducted with two separateapproaches: *with162 identied archaea conserved single copy genes (CSCGs) annotated by Pfam as in ref. 53;
for 44 CSCGs annotatedby COG as
in ref. 10.
Completeness and contamination were estimated by CheckM (ref. 46).
ARTICLES NATURE MICROBIOLOGY DOI: 10.1038/NMICROBIOL.2016.35
NATURE MICROBIOLOGY |www.nature.com/naturemicrobiology2
© 2016 Macmillan Publishers Limited. All rights reserved
0.1
Bathyarchaeota
Aarhus Bay, Denmark10
White Oak River, USA15
Surat Basin, Australia16
Guaymas Basin, Mexico
Nanoarchaeota
Woesearchaeota
Pacearchaeota
Diapherotrites
Aenigmarchaeota
Nanohaloarchaeota
Parvarchaeota
Euryarchaeota
Lokiarchaeota
Korarchaeota
Geoarchaeota
Crenarchaeota
B25
B24
SMTZ-80
B63
B26-1
B26-2
BA1
BA2
SG8-32-1-MCG1
SG-32-3-MCG1
AD8-1-MCG6
SMTZ1-55
B23
DG-45-MCG15
E09
Thaumarchaeota
Aigarchaeota
0.1
Subgroup 15
Subgroup 17
Subgroup 16
Subgroup 3
Subgroup 6
Subgroup 8
Aigarchaeota
Thaumarchaeota
B25
B24
AB177267
FJ404038
AB362547
EU385859
E09
DQ363811
AM697965
DQ641883 (BLAST: DG-45-MCG15)
GU567019
B23
SMTZ80
FJ536696
FJ649521
AM942165
EU713901
FJ264803 (BLAST: SMTZ1-55)
AY354118
AY396672
AY592009
FJ484299
DQ641807
Subgroup 13
AB237758
B26-1
AB355130
EF367568
FJ649531
FJ902705
AM942159
GQ848427
AY354121
Subgroup 14
Subgroup 12
Subgroup 11
Subgroup 1
Subgroup 7
Subgroup 5a
Subgroup 5b
AY501700
AB161338
AY464796
EU155996 (BLAST: AD8-1-MCG6)
AB161343
AB538512
AB183856
FJ902830
AY454600
BA1
EF367492
FJ649517
EF680209
BA2
FJ484638
FJ901676
AY592006
EF367607
FJ485506
FJ853479
GU972264
Subgroup 10
Subgroup 4
Subgroup 9
Subgroup 2
a
b
Figure 1 | Phylogenetic trees showing the placement of genomic bins in the archaeal phylum Bathyarchaeota. a, Maximum-likelihood phylogeny of
concatenated alignments constituting 53 conserved archaeal single-copy genes (Supplementary Table 3), rooted with the DPANN superphyla
(Diapherotrites, Parvarchaeota, Aenigmarchaeota, Nanoarchaeota and Nanohaloarchaeota). b, Maximum-likelihood 16S rRNA gene tree showing the
placement of bathyarchaeotal representatives from all publicly available Bathyarchaeota genomic and environmental sequences, with the previously dened
classication9. Sequences from Aigarchaeota and Thaumarchaeota are selected as outgroups. Bootstrap values are based on 1,000 replicates, and
percentages are shown with open (80%) and lled (90%) circles. Phylotypes identied in the present study are named by their genomic bin index and
indicated by red circles, while phylotypes reported in the studies described in refs 10, 15 and 16 are indicated by blue stars, green squares and orange
triangles, respectively.
NATURE MICROBIOLOGY DOI: 10.1038/NMICROBIOL.2016.35 ARTICLES
NATURE MICROBIOLOGY |www.nature.com/naturemicrobiology 3
© 2016 Macmillan Publishers Limited. All rights reserved
found in autotrophic Methanobacteriales24. The presence of both
an archaeal-type phosphoenolpyruvate carboxylase (Pepc) and
phosphoenolpyruvate carboxykinase (Pck) in the Bathyarchaeota
genomic bins suggests that phosphoenolpyruvate (PEP) might
serve a central role in carbon metabolism by maintaining the con-
tinuity of the TCA cycle and also by connecting glycolysis to the
TCA cycle24 (Fig. 2). As previously identied by single-cell
genome analysis10, Bathyarchaeota are well equipped to degrade
proteins, as 17 different families of extracellular peptidases, such
as C1A, C25 and S8A, could be identied (Supplementary
Table 8). Diverse genes for carbohydrate-activating enzymes were
also identied, such as those for amylase, cellulase, chitinase, ester-
ase, fucosidase galactosidase, hexosaminidase, lipase and xylanase
(Supplementary Table 9). The presence of genes for benzoyl degra-
dation conrmed that members of Bathyarchaeota are capable of
degrading aromatic compounds (Supplementary Table 10). The
presence of a wide variety of transporters of organic compounds
(Supplementary Table 11) further supports the versatile hetero-
trophic metabolism of Bathyarchaeota (Fig. 2), a strategy well
suited to make a living in carbon-starved deep marine sediments2.
The analysed Bathyarchaeota appear to be capable of using a
avin-based electron bifurcation involving heterodisulde reductase
(HdrABC) and methyl-viologen-reducing hydrogenase (MvhADG),
which are proposed to be used repeatedly in different contexts. In
Bathyarchaeota genomic bins, genes of HdrABC and MvhADG
were located adjacent to one another (no Mvh in B23 and B63,
see Supplementary Table 12 for details). They might also be
involved in methylenetetrahydrofolate reduction, as proposed for
Moorella thermoacetica25, which will require further investigation.
In addition, Fpo subunits (F
420
H
2
dehydrogenase, missing in
B26_1 and B63) were also present (Fig. 2 and Supplementary
Table 12). Genes for the hydrogenase maturation enzyme, such as
HypA-F, were identied in all genomic bins. B25 had the complete
gene cluster for the synthesis of F
420
/F
420
H
2
(the oxidized/reduced
forms of a 5-deazaavin derivative), and part of this gene cluster
was also found in the other genomic bins (Supplementary
Table 12). The beta subunit of F
420
-reducing NiFe-hydrogenase
(FrhB) was found to be present in four out of six genomic bins,
while the other two subunits, FrhA and FrhG, were not present in
any of the genomic bins. The proton-translocating ATPase was
present in three genomic bins (B23, B25 and B63; Fig. 2 and
Supplementary Table 12). The available genomic data for
Bathyarchaeota suggest that H
2
serves as the electron donor, and
reduced F
420
serves as the potential electron acceptor for the
reduction of ferredoxin. However, different groups of
Bathyarchaeota may use different pathways, and further analyses
TCA
Oxaloacetate
WL pathway
Beta
oxidation
ABC transporter
Glycolysis
ABC transporter
Cit
Fdox
Fdred
Cu2+
CopFeo
Fe2+
Trk
K+
Hem
Fe3+
Pst
Pi
ATPase
ATPADP
H+
H+
Aromatic
compounds
ABC transporter
PEP
Acetyl-CoA
CO2
CO2
CHO-MFR
CHO-THMPT
CHTHMPT
CH2=THMPT
CH3-THMPT
Ftr
Mch
Fwd
Mtd Mer
CO2
CO
Codh
Fdred
Fdred
Fdox
Fdox
Acs
Pfor
Fpo
F420
F420H2
Lipids
Fatty acids
Pyruvate
Malate
Citrate
Pck/
Pepk
Ps/Pk
Acetyl-
phosphate
Acetate Ack
Acd
Pta
Acetogenesis
Peptide
degradation
Proteins
Extracellular
peptidases
Oligo/dipeptides
Amino acids
Amyloses Hemicelluloses CellulosesChitins
ABC transporter
Mono/
oligosaccharides
Amino
acids
F420H2
F420H2
F420
F420
Hydrogenase
Hydrogenase
Hydrogenase
Hydrogenase
Hydrogenase
B24
B26-1
B26-2
B24
B26-1
B26-2
Figure 2 | Overview of pathways reconstructed in Bathyarchaeota. Protein abbreviations are described, with full entry information, in Supplementary Tables
412. Grey solid lines refer to genes found in fewer than three Bathyarchaeota genomic bins, and solid lines in other colours refer to genes identied in the
majority (that is, at least three of six) of the Bathyarchaeota genomic bins: green solid lines, acetogenesis (including the WL pathway); purple, TCA cycle;
orange, organic carbon degradation; black, transporters. Grey dotted lines refer to genes not identied in any genome. Orange dashed lines refer to processes
where multiple steps are involved. Each step is fully described in the Supplementary Tables 412.
ARTICLES NATURE MICROBIOLOGY DOI: 10.1038/NMICROBIOL.2016.35
NATURE MICROBIOLOGY |www.nature.com/naturemicrobiology4
© 2016 Macmillan Publishers Limited. All rights reserved
are required to fully understand the mechanisms of electron
bifurcation and energy production in this phylum.
In all Bathyarchaeota genomic bins, no potential enzymes were
detected that would be indicative of the use of external electron accep-
tors, such as oxygen, sulfate and nitrate (nitrite). Collectively, the
available data suggest that some lineages of Bathyarchaeota (such as
MCG-16 and some novel groups to which B24, B26_1 and B26_2
belong) are archaeal acetogens that generate energy by fermenting
various organic carbon compounds and/or acetogenesis: they have
the WL pathway and genes for acetate production/consumption
(mainly through the Pta-Ack pathway) and they also possess pyruvate
ferredoxin oxidoreductase (Pfor, Supplementary Table 4 and
Supplementary Fig. 4), a key genethatlinkstheheterotrophic
metabolism to the WL pathway in canonical acetogenic bacteria26.
Interestingly, the acetogenic Bathyarchaeota identied in the present
study appear to be a hybrid using an archaeal-type methyl-transferring
pathway via tetrahydromethanopterin (H4MPT) as carrier and a
unique bacterial-type ACK-PTA system for acetate production from
acetyl-CoA. This nding has signicant implications considering the
evolution of bacteria and archaea. Methanogenesis and acetogenesis
are regarded as the most ancient biochemical reactions, mainly as
they use the simple WL pathway for both CO
2
-xation and adenosine
triphosphate (ATP) generation27.Hence,acetogensandmethanogens
without cytochromes and quinones have been considered founders
of bacteria and archaea, respectively28.Methanogenesishasbeen
traced back to about 3.5 billion years ago, soon after the origin of
life29.Acetoclasticmethanogensareproposedtohaveevolvedaround
250300 million years ago by obtaining the bacterial ack-pta,
7; Actinobacteria
4; Bacteroidetes
9; Epsilonproteobacteria
6; Deltaproteobacteria
6; Planctomycetia
Bathyarchaeota
Bathyarchaeota
Spirochaetia
9; Mollicutes
9; Erysipelotrichia
5; Bacilli
13; Cyanobacteria
15; Alpha/Betaproteobacteria
7; Chlorobi
5; Acidobacteria
8; Gammaproteobacteria
4; Deinococci
10; Thermotogae
5; Spirochaetia
5; Bacteroidetes
10; Synergistia
4; Clostridia
4; Methanosarcinales
9; Negativicutes
10; Fusobacteriia
100
99
37
98
97
97
86
78
100
100
99
99
93
89
97
57
100
99
99
99
59
100
93
100
99
98
85
86
99
100
100
99
51
95
100
87
53
85
99
84
92
53
98
88
0.2
0.2
B24
B26-1
B26-2
7; Fusobacteriia
8; Negativicutes
B26-1
B26-2
B24
Spirochaeta thermophila DSM 6578
Treponema primitia ZAS-2
Treponema denticola SP23
Treponema denticola SP32
Treponema denticola US-Trep
Treponema maltophilum ATCC 51939
9; Alpha/Beta/Gammaproteobacteria
6; Erysipelotrichia
13; Bacilli
8; Epsilonproteobacteria
8; Oscillatoriophycideae
10; Deltaproteobacteria
8; Planctomycetia
5; Actinobacteria
3; Betaproteobacteria
7; Gammaproteobacteria
8; Alpha/Betaproteobacteria
9; Mollicutes
5; Deltaproteobacteria
8; Bacteroidetes
Lachnoclostridium phytofermentans ISDg
Caldicellulosiruptor saccharolyticus DSM 8903
Methanosarcina barkeri str. Fusaro
Methanosarcina acetivorans C2A
Methanosarcina mazei Tuc01
Ruminiclostridium thermocellum ATCC 27405
Clostridium cellulolyticum H10
Clostridium papyrosolvens C7
Clostridium cellulovorans 743B
Clostridium perfringens SM101
Clostridium sp. 7_2_43FAA
Clostridium sartagoforme AAU1
97
97
99
98
100
65
100
47
98
99
60
68
99
93
99
98
100
93
98
97
84
77
99
99
95
100
84
99
78
93
97
96
91
97
100
100
90
100
99
30
97
64
99
99
95
100
79
91
92
75
Methanosarcinales
Clostridia
Clostridia
ab
Figure 3 | Phylogenies of key genes known to be involved in acetogenesis. a,b, Maximum-likelihood phylogenies of the alignment of 407 amino-acid
residues of Ack homologues (a) and 331 amino-acid residues of Pta homologues (b) identied in Bathyarchaeota and all other archaea and bacteria.
Bootstrap values are based on 1,000 replicates, and percentages are shown at the nodes. Genes identied in this study are named by their genomic bin
index. Numbers of genomes in each collapsed clade are displayed before the clade name.
NATURE MICROBIOLOGY DOI: 10.1038/NMICROBIOL.2016.35 ARTICLES
NATURE MICROBIOLOGY |www.nature.com/naturemicrobiology 5
© 2016 Macmillan Publishers Limited. All rights reserved
supposedly contributing to the mass extinction at that time30.Thus,the
nding of acetogenic archaea with the pta-ack system described in the
present study, combined with the previous report of methanogenesis in
the same phylum16,raisesinterestingquestionsandopensanew
window for investigating the evolution of early life.
In the subsurface sediments, evidence is accumulating suggesting
that acetogenesis is playing an important role in organic carbon
cycling in the deep biosphere7,31,32. Our results suggest that
Bathyarchaeota, which are ubiquitous and dominant members of
the deep biosphere, have the potential to grow as acetogens and
thus could potentially account for the observed acetogenic activities.
Acetogens are well adapted to low-energy environments, using
the simplest CO
2
-xing pathwaythe reductive acetyl CoA
pathwayfor both energy production and biosynthesis.
Acetogenesis is thermodynamically favourable over a wide range
of organic substrates, as calculated for typical deep subsurface con-
ditions32. Furthermore, acetogenic archaea might have an energetic
advantage over acetogenic bacteria, as they do not have to invest
ATP to activate formate22, which could be a competitive advantage
under the energy-limiting conditions prevailing in the deep subsur-
face. Screening the publicly available metagenomes33 and metatran-
scriptomes34 from the Peru Margin deep subsurface sediments for
key genes associated with inorganic carbon oxidation and acetate
turnover lends further support to this hypothesis, as a high pro-
portion of the genes previously considered to be from methanogens
probably originate from Bathyarchaeota, as they display the highest
identity to bathyarchaeotal sequences (Supplementary Tables 13
and 14 and Supplementary Fig. 5a,b). The discovery that some
lineages of Bathyarchaeota are acetogens can also explain previous
paradoxical observations, such as the simultaneous evidence for het-
erotrophy10,13,14 and
13
C-depletion in the signature lipid biomarkers
of Bathyarchaeota35. Similar to acetogenic bacteria, bathyarchaeal
acetogens appear to use a wide range of carbon compounds as
energy substrates, such as CO
2
/H
2
, CO, fatty acids, alcohols, alde-
hydes, aromatic compounds and complex organic polymers36. This
suggests that acetogenic archaea may play key roles in organic
carbon transformations in deep marine sediments, degrading
complex organic carbon and producing acetate, fueling the subsur-
face ecosystem with substrates for heterotrophy and acetoclastic
methanogenesis, and changing our view of the roles of archaea in
the carbon cycle of marine sediments (Fig. 4). It should be noted
that the metabolic capabilities and ecological roles of
Bathyarchaeota are still far from being fully understood, as there
appears to be a high metabolic diversity among members of this
new archaeal phylum, as evidenced by the recent discovery of dissim-
ilatory nitrite reduction to ammonium and methane metabolism15,16.
In conclusion, our work demonstrates that some lineages of
Bathyarchaeota are likely to be acetogens with a versatile metab-
olism, which expands the metabolic repertoire of Archaea. By
being able to x inorganic carbon and to use a wide range of
organic substrates, the metabolism of the acetogenic
Bathyarchaeota is similar to canonical homoacetogenic bacteria,
but with the added advantage of using a more energy-efcient
pathway, making them key players in the carbon cycle of deep
marine sediments. Both acetogenesis and methanogenesis are
thought to represent ancient metabolic pathways. Thus, the identi-
cation of both pathways in the Bathyarchaeota phylum also has
important implications for the evolution of life on Earth.
Methods
Sample description. The sediment sample used in this study was collected by push
cores at a deep-sea vent site at Guaymas Basin (Gulf of California, Mexico) in
October 2007 during cruise AT15-25 on Alvin dive 4,358 (latitude 27° 0.71238 N,
longitude 111° 24.3237 W, depth 2,016 m). The sediment was oily, and the sediment
surface was covered witha white microbial mat. The temperature was measured withthe
Acetoclastic
methanogens
Heterotrophic
bacteria
Detrital
organic
matter
Depth
CH4
Bathyarchaeota
Refractory
compounds Acetogenic archaea
Fermentation
WL pathway
Aromatic
compounds
Chitin
Cellulose
Protein
CO2
CO2
CO2
H2
H2
B24, subgroup 16
Acetate
Subgroups 1,6,15,7/17
Hydrolysis
n
OO
CH3
OO
HO NH
O
OH
OCH3
NH
OH
HO
O
n
C
N
H
R
O
n
O
O
O
HO
OH
OH
OH
OH
O
OH
H
Figure 4 | Proposed roles of acetogenic archaea in the deep biosphere. Ovals represent acetogenic archaea (Bathyarchaeota) and different colours (purple,
blue and orange) refer to Bathyarchaeota that differ in their abilities to use refractory compounds, such as aromatic compounds, extracellular protein, chitin
and cellulose. Both heterotrophic bacteria and acetoclastic methanogens may consume acetate produced by acetogenic Bathyarchaeota (B24 and MCG-16
from this study, highlighted in red; 1, 6, 15 and 7/17 from the study reported in ref. 15 highlighted in green; and others to be identied).
ARTICLES NATURE MICROBIOLOGY DOI: 10.1038/NMICROBIOL.2016.35
NATURE MICROBIOLOGY |www.nature.com/naturemicrobiology6
© 2016 Macmillan Publishers Limited. All rights reserved
Alvin temperature probe and increased from 5 °C at the surface to 44 °C at a sediment
depth of 10 cm. Sediment from the upper 10 cm of the push core was transferred to a
Pyrex bottle with a butyl rubber stopper, gassed with an anaerobic N
2
/CO
2
gas mixture,
and stored anaerobically at 4 °C until analyses were conducted in 2014.
SDS-based DNA extraction. Sediment (300 mg) was mixed with an equal weight of
0.1 mm glass beads and 670 µl extraction buffer consisting of 100 mM Tris-HCl
(pH 8.0), 100 mM sodium EDTA (pH 8.0), 100 mM sodium phosphate (pH 8.0),
1.5 m NaCl and 1% hexadecyltrimethylammonium bromide. The sediment was
mixed with the extraction buffer using a low-speed vortex for 5 min. The mixture
was homogenized with a tissue lyser for 2 ×30 s at 30 Hz with a2 min interval. After
homogenization, 50 µl lysozyme (20 mg ml
1
) and proteinase-K (20 mg ml
1
) were
added, and the mixture was incubated for 30 min at 37 °C. Then, 70 µl of 20% SDS
was added and incubated at 65 °C for 2 h with a 10 min mingling interval.
The supernatant was collected after centrifuging at 1,000gfor 10 min at room
temperature and then transferred into a 2 ml microcentrifuge tube. The sediment
pellets were extracted again by adding 500 µl extraction buffer and repeating the
previous steps, with the only difference being that the incubation at 65 °C was
carried out for only 1 h. The supernatants from the two cycles of extraction were
combined and mixed with equal volumes of phenol:chloroform:iso-amyl alcohol
(24:25:1). The aqueous phase was recovered by centrifugation and precipitated
overnight 4 °C at with a 0.6 volume of ice-cold propyl alcohol and 0.3 M sodium
acetate (pH 5.2). The nucleic acid was pelleted by centrifuging at 16,000gfor 20 min
at room temperature. The supernatant was removed and the nucleic acid pellet was
washed with 70% ice-cold ethanol, then resuspended in distilled water. The nucleic
acids from three sets of independent extractions were combined for further
downstream applications.
Archaeal and bacterial 16S rRNA gene quantication. Copy numbers of archaeal
and bacterial small subunit ribosomal genes were quantied by qPCR with an Applied
Biosystems 7500 Real Time system. Archaeal 16S rRNA gene copies were quantied
with the universal primer Uni 519F (5-CAGCMGCCGCGGTAA-3)37 and the
archaeal-specic primer Arch 908R (5-CCCGCCAATTCCTTTAAGTT-3)38.
The qPCR conditions included SYBR Green Premix Ex Taq II (TAKARA), 0.8 µM
each of forward and reverse primers and 1 µl template DNA. The qPCR, with a 25 µl
reaction volume for each sample together with standard series and negative control,
was run in triplicate. The thermal cycling program was 15 min at 95 °C and
40 cycles of 95 °C for 30 s, 60 °C for 30 s and 72 °C for 45 s. The standard curve
contained a series of diluted known linearized plasmids containing between 10
and 1 ×10
7
copies of 16S rRNA genes µl
1
. Bacterial genomic DNA was used as a
negative control. The amplication efciency was 96%, the R
2
values of the standard
curve were 0.99, and the slope value was 3.26. Bacterial 16S rRNA gene copies
were quantied using the same SYBR Green Premix Ex Taq II mix, bacterial
specic primer Bac431F (5-CCTACGGGWGGCWGCA-3)38 and prokaryotic 519R
(5-TTACCGCGGCKGCTG-3)37. The standard curve consisted of a dilution series
ranging from 1 ×10
2
to 1 ×10
7
copies µl
1
of a known amount of linearized plasmid
containing bacterial 16S rRNA genes. Pyrococcus yayanosii CH1 genomic DNA was
used as a negative control. The estimated amplication efciency was 100%, the R
2
value of the standard curve was 0.99, and the slope value was 3.03. The thermal cycle
program was 15 min at 95 °C, and 35 cycles at 95 °C for 30 s and 72 °C for 30 s.
Tag sequencing of archaeal V4 regions of 16S rRNA genes. The archaeal
hypervariable V4 regions of 16S rRNA genes were amplied with Uni519F
(5-XXXXXXXXXXXY MGCCRCGGKAAHACC-3)39 and Arc806R
(5-GGACTACNSGGGTMTCTAAT-3)39 primer sets fused to variable
11-nucleotide key tags for multiplexing, where the xregion represents various key
tags for each sample. An initial denaturation step of 3 min at 95 °C was followed
by 35 cycles at 94 °C for 40 s, 56 °C for 1 min and 72 °C for 1 min. The nal
extension step was 72 °C for 10 min. The bacterial V4 region was amplied by 520F
(5-XXXXXXXXAYTGGGYDTAAAGNG-3)39 with 8-nucleotide key tags for each
sample and 802R (5-TACNVGGGTATCTAATCC-3)39. The amplication mix
contained 1 µl each of forward and reverse primers, 1 µl template genomic DNA,
5 µl 10×Ex Taq buffer, 0.25 µl Ex Taq (TAKARA), 4 µl 2.5 mM dNTP mix and 5 µl
BSA (25 mg ml
1
). An initial denaturation step of 3 min at 95 °C was followed by
35 cycles at 94 °C for 40 s, 56 °C for 40 s and 72 °C for 2 min. The nal extension step
was 72 °C for 10 min. The amplied 290 bp PCR products were excised from 1.8%
agarose gel and puried using a gel extraction kit (EZNA-Omega) and sequenced with
the Illumina Miseq platform. Sequence demultiplexing, trimming, clustering and
classication were performed in mothur40. In brief, sequence reads were initially
denoised using the shhh.ows command to prevent mismatches of barcodes or
primers, and reads <200 bp were removed. Putative chimeric sequences were
identied and removed by the chimera.uchime command using the most abundant
reads in the respective sequence data sets as references. Finally, the sequence reads
were classied according to the Silva taxonomy41 using the classify.seqs command.
Metagenome library construction and sequencing. Between 2.5 and 5 µg of DNA
were initially fragmented by Covaris and then tested by electrophoresis using 1%
agarose gel. Fragmented DNA was combined with an end repair mix and incubated
at 20 °C for 30 min. The end-repaired DNA was then puried with a QIAquick PCR
Purication Kit (Qiagen). Subsequently, the A-tailing mix was added and the mixture
was incubated at 37 °C for 30 min. The puried 3ends adenylate DNA with adapter
and ligation mix were incubated for the ligation reaction at 20 °C for 15 min. After
this process, the adapter-ligated DNA was selected by running a 2% agarose gel to
recover the target fragments. The gel was puried with a QIAquick Gel Extraction kit
(Qiagen) and several rounds of PCR amplication with a PCR primer cocktail and
PCR master mix were performed to enrich the adapter-ligated DNA fragments. The
PCR products were then selected by running another 2% agarose gel to recover the
target fragments. Gel purication was achieved using a QIAquick Gel Extraction kit
(Qiagen) and the nal libraries were validated by theaverage molecule length using an
Agilent 2100 Bioanalyzer instrument (Agilent DNA 1000 Reagents) and by real-time
q-PCR (TaqMan Probe). The qualied libraries (with a 350 bp insert size) were
amplied on cBot to generate a cluster on the owcell (TruSeq PE Cluster Kit
V3-cBot-HS, Illumina). Pair-end sequencing of the amplied ow cell was performed
using a HiSeq 2000 System (TruSeq SBS KIT-HS V3, Illumina, at BGI-Shenzhen).
De novo assembly and binning of metagenomic sequences. The raw shotgun
sequencing metagenomic reads were dereplicated (100% identity over 100% lengths)
and trimmed with Sickle (https://github.com/najoshi/sickle) using the seoption
and default setting. The dereplicated, trimmed and paired-end DNA reads were
assembled using IDBA-UD42 with the following parameters: pre_correction, mink
52, maxk 92, step 8 and seed kmer 52. Binning of assembled metagenomic sequences
was initially performed using tetranucleotide frequencies in ESOM using 4 to 8 K as
fragment cutoffs43. The tetranucleotide frequencies of each DNA fragment of
assemble scaffold and reference genomes were determined using a custom Perl
script. DNA fragments were then clustered by tetranucleotide frequency using Data
bionics ESOM tools. An assembled scaffold was designed in a specic genomic bin if
all fragments of the assembled sequence were placed into this genomic bin during
the ESOM binning process. Initially, ve Bathyarchaeota genomic bins were
identied from ESOM binning. The completeness of the genomic bins was then
estimated by counting universal single-copy genes10. One initial Bathyarchaeota
genomic bin (B26) was shown to contain a least two closely related genomes based
on the number of single copy genes. These scaffolds were further separated by
plotting the differential guanine and cytosine content and assembled sequence
coverage within the bin. The coverage of each sequence was determined by recruiting
original DNA short reads to scaffolds using the BurrowsWheeler Aligner44. After
this step, B26 was divided into two bins (B26_1 and B26_2). In the end, we obtained
six Bathyarchaeota genomic bins (Table 1 and Supplementary Table 1). To rule out
contamination, all identied genomic bins were rebinned with the method described
by Wrighton and colleagues45 a binning method considering both sequence
composition and sequencing coverage. After this step, some reads were identied
as potential contamination from bacteria and other archaea and thus removed
from identied Bathyarchaeota. Detailed estimates of genome contamination
(shown as Completenessand Contaminationin Table 1) were assessed based
on lineage-specic marker sets with CheckM46.
Metagenome annotation. Protein coding sequences (CDS) were determined using
MetaGene47 with the default settings. Ribosomal RNA (rRNA) genes were called
with RNAmmer48, and the archaeal model was used to identify all three rRNA
subunits. For each predicted CDS, functional information was collected by a
sequence-similarity search against the NCBI NR database using BLASTP with
E-values of <1 ×10
5
. The sequences with reliable hits in the NR database were also
compared against KEGG, COG and Pfam with E-values <1 ×10
5
. The metagenome
was also annotated using IMG-M (https://img.jgi.doe.gov/cgi-bin/m/main.cgi). All
of the predicted CDS and functional annotations were subject to manual inspection
because the annotations identied by more than one source database were
considered for the assigned functions. For genes encoding peptidases and
carbohydrate-active enzymes, all of the annotated genes in the Bathyarchaeota bins
were searched against public databases of peptidases (MEROPS)49 and the CAZy
database50 with E-values <1 ×10
10
by BLASTP. All of the hits were compared
against the NCBI NR protein database, but only top hits for peptidases and
carbohydrate-active enzymes were considered. The extracellular peptidases were
further conrmed based on the identication of extracellular transport signals using
SignalP51. Genes related to carbohydrate-active enzymes were further classied into
different groups according to the CAZy predictions. Genes involved in the lipid
metabolic pathway of Bathyarchaeota were identied using similar procedures as
well as with the NCBI NR protein database and LIPID Metabolites and Pathways
Strategy (LIPID MAPS) database52.
Estimation of the completeness of genomic bins. The completeness of each
Bathyarchaeota genomic bin was estimated based on a conserved single copy gene
(CSCG) analysis similar to that carried out in previous work53. Two different archaeal
CSCG data sets were used in this study to provide information on the estimated
genome size. We collected 44 CSCGs annotated by the COG database as previously
used by Lloyd and colleagues10 (COG0016, COG0018, COG0049, COG0052, COG
0072, COG0080, COG0081, COG0087, COG0088, COG0089, COG0090, COG0091,
COG0092, COG0093, COG0094, COG0096, COG0097, COG0098, COG0099, COG
0102, COG0103, COG0185, COG0197, COG0202, COG0231, COG0250, COG0256,
COG1093, COG1094, COG1095, COG1358, COG1471, COG1537, COG1631, COG
NATURE MICROBIOLOGY DOI: 10.1038/NMICROBIOL.2016.35 ARTICLES
NATURE MICROBIOLOGY |www.nature.com/naturemicrobiology 7
© 2016 Macmillan Publishers Limited. All rights reserved
1632, COG1727, COG1976, COG2007, COG2092, COG2097, COG2125, COG2139,
COG2147 and COG5257) and 162 CSCGs (annotated Pfam) as previously applied by
Rinke and colleagues53 (for details see Supplementary Table 1). The ratios of the
numbers of CSCGs present in the metagenome and the number of total archaeal
CSCGs were then used to estimate the bin size. No corrections were made for a
decreasing number of CSCGs or for biases caused by the clustering of CSCGs.
Metabolic pathway identication. The similarity of genes with conrmed
annotations was identied using the KEGG database. A match was counted if the
similarity resulted in an E-value <1 ×10
5
. All KO (KEGG Orthology) numbers were
mapped against KEGG pathway functional hierarchies. Similarly, all predicted
open reading frames were also identied using the COG database with an
E-value <1 ×10
5
. Certain pathways were reconstructed from the literature
and are discussed in the main text of this work. To determine the potential origin of
genes from the Peru Margin metagenome33 and metatranscriptome34, a best-
reciprocal BLASTN approach was used. Genes identied previously in the Peru
Margin were searched against the NCBI NR database and our data set. Genes in the
Peru Margin with best hits from our data set were considered to be
bathyarchaeotal genes.
Phylogenetic tree construction. To identify relevant gene homologues of
bathyarchaeotal genes, 113 complete archaeal genomes (available in January 2015 at
NCBI) were included in this analysis. In total, 252 archaeal genomes, including all
publicly available Bathyarchaeota genomic bins and some partial/uncultured
archaeal genomes, were included in the phylogenetic analysis wherever applicable
(Supplementary Table 15). As stated above, functional genes were annotated against
Pfam, COG, KEGG and NCBI NR. Archaeal CSCGs were used to infer the
phylogenetic afliation of inspected Bathyarchaeota genomic bins. Previously, 58
prokaryotic genes were considered to undergo no horizontal gene transfer events17.
A total of 53 of 58 archaeal CSCGs were found to be present in most of the archaeal
genomes, and in all six Bathyarchaeota genomic bins. Homologue sequences were
individually aligned with ClustalW54. For the phylogenetic analysis, deletions and
highly variable regions were removed using a 30% positional conservatory lter, and
alignments were concatenated for the nal alignment whenever required. Short
alignments containing fewer than 20 amino-acid residues were discarded. The
individual alignments of proteins were concatenated to a supermatrixalignment for
phylogenetic inference. To construct phylogenetic trees of the 16S rRNA gene,
aligned sequences were subjected to a maximum-likelihood analysis in FastTree55
using the generalized time-reversible model with the CAT approximation. For the
phylogenies of other functional genes, a maximum-likelihood-based approach with
FastTree was performed using the JonesTaylorThornton (JTT) model with the
CAT approximation.
Expression and purication of Ack. The gene coding for Ack was synthesized by
Generay Biotech and subsequently cloned into the vector pET28a between sites
NdeI/BamHI. E. coli BL21 (DE3) obtained from TransGen Biotech was used as the
expression strain. All chemicals were reagent grade and were purchased from Sigma.
Gene coding for Ack was overexpressed in E. coli strain BL21 (DE3) with 0.5 mM
isopropyl-β-D-thiogalactoside in a total volume of 400 ml. Cells were collected by
centrifugation at 5,000 r.p.m. for 5 min, resuspended in lysis buffer (20 mM Tris-
HCl pH 8.0, 300 mM NaCl, 5 mM imidazole, 10% glycerol) and then lysed by
sonication. The cell lysate was centrifuged at 8,000 r.p.m. for 45 min to discard the
debris. The remaining supernatant was applied to a Ni-NTA column. After washing
with the buffer (20 mM Tris-HCl, pH 8.0, 300 mM NaCl, 20 mM imidazole and
10% glycerol), the fusion protein was eluted with the same washing buffer and then
dialysed to storage buffer (20 mM Tris-HCl, pH 8.0, 300 mM NaCl, 50% glycerol).
An empty pET28a vector was used as control.
Enzyme assays. The activityof Ack was determined at 25 °C by (1) measuring the rate
of ADP and acetyl phosphate formation by couplingthe enzymes pyruvate kinase (PK)
and lactate dehydrogenase (LDH), and (2) monitoring the formation of acetate and
ATP using a hexokinase (HK)-glucose 6-phosphate dehydrogenase (G6PDH)
enzyme-linked assay. For reaction (1), the assay mixtures contained 100 mM Hepes
(pH 7.5), 0.25 mM NADH, 5 mM PEP, 2 U of PK, 2 U of LDH, 5 mM MgCl
2
,
1 mM ATP, 20 mM acetate K, 2.12 µg Ack. For reaction (2), the assay mixtures
contained 100 mM Tris (pH 8.5), 0.25 mM NADP, 5 mM glucose, 2 U HK, 2 U
G6PDH, 10 mM MgCl
2
, 1 mM ADP, 20 mM acetyl phosphate, 2.12 µg Ack. Kinetic
parameters were determined at different substrate concentrations and calculated by
tting the data of initial rates to the MichaelisMenten equation by nonlinear
regression. The measurements were performed a minimum of three times.
Accession codes. This sediment metagenome has been deposited at JGI IMG-MER
(https://img.jgi.doe.gov/cgi-bin/m/main.cgi) with ID no. 3300003332. The genomes
supporting the results have been deposited at DDBJ/ENA/GenBank under the
BioProjectID PRJNA312413 (accession numbers from LUCA00000000 to
LUCF00000000).
Received 10 December 2015; accepted 24 February 2016;
published 4 April 2016
References
1. Kallmeyer, J., Pockalny, R., Adhikari, R. R., Smith, D. C. & DHondt, S. Global
distribution of microbial abundance and biomass in subseaoor sediment. Proc.
Natl Acad. Sci. USA 109, 1621316216 (2012).
2. Hoehler, T. M. & Jorgensen, B. B. Microbial life under extreme energy limitation.
Nature Rev. Microbiol. 11, 8394 (2013).
3. DHondt, S. et al. Distributions of microbial activities in deep subseaoor
sediments. Science 306, 22162221 (2004).
4. Biddle, J. F. et al. Heterotrophic Archaea dominate sedimentary subsurface
ecosystems off Peru. Proc. Natl Acad. Sci. USA 103, 38463851 (2006).
5. Bowles, M. W., Mogollon, J. M., Kasten, S., Zabel, M. & Hinrichs, K. U. Global
rates of marine sulfate reduction and implications for sub-sea-oor metabolic
activities. Science 344, 889891 (2014).
6. Pedersen, K. et al. Numbers, biomass and cultivable diversity of microbial
populations relate to depth and borehole-specic conditions in groundwater
from depths of 4450 m in Olkiluoto, Finland. ISME J. 2, 760775 (2008).
7. Lever, M. A. et al. Acetogenesis in deep subseaoor sediments of the Juan de
Fuca Ridge Flank: a synthesis of geochemical, thermodynamic, and gene-based
evidence. Geomicrobiol. J. 27, 183211 (2010).
8. Lipp, J. S., Morono, Y., Inagaki, F. & Hinrichs, K. U. Signicant contribution
of Archaea to extant biomass in marine subsurface sediments. Nature 454,
991994 (2008).
9. Kubo, K. et al. Archaea of the Miscellaneous Crenarchaeotal Group are
abundant, diverse and widespread in marine sediments. ISME J. 6,
19491965 (2012).
10. Lloyd, K. G. et al. Predominant archaea in marine sediments degrade detrital
proteins. Nature 496, 215218 (2013).
11. Fry, J. C., Parkes, R. J., Cragg, B. A., Weightman, A. J. & Webster, G. Prokaryotic
biodiversity and activity in the deep subseaoor biosphere. FEMS Microbiol.
Ecol. 66, 181196 (2008).
12. Lloyd, K. G., May, M. K., Kevorkian, R. T. & Steen, A. D. Meta-analysis of
quantication methods shows that archaea and bacteria have similar abundances
in the subseaoor. Appl. Environ. Microbiol. 79, 77907799 (2013).
13. Meng, J. et al. Genetic and functional properties of uncultivated MCG archaea
assessed by metagenome and gene expression analyses. ISME J. 8,
650659 (2014).
14. Seyler, L. M., McGuinness, L. M. & Kerkhof, L. J. Crenarchaeal heterotrophy in
salt marsh sediments. ISME J. 8, 15341543 (2014).
15. Lazar, C. B. et al. Genomic evidence for distinct carbon substrate preferences and
ecological niches of Bathyarchaeota in estuarine sediments. Environ. Microbiol.
http://dx.doi.org/10.1111/1462-2920.13142 (2016).
16. Evans, P. N. et al. Methane metabolism in the archaeal phylum Bathyarchaeota
revealed by genome-centric metagenomics. Science 350, 434438 (2015).
17. Puigbo, P., Wolf, Y. I. & Koonin, E. V. Search for a Tree of Lifein the thicket of
the phylogenetic forest. J. Biol. 8, 59 (2009).
18. Schmidt, M. & Schonheit, P. Acetate formation in the photoheterotrophic
bacterium Chloroexus aurantiacus involves an archaeal type ADP-forming
acetyl-CoA synthetase isoenzyme I. FEMS Microbiol. Lett. 349,
171179 (2013).
19. Parizzi, L. P. et al. The genome sequence of Propionibacterium acidipropionici
provides insights into its biotechnological and industrial potential. BMC
Genomics 13, 562 (2012).
20. Rother, M. & Metcalf, W. W. Anaerobic growth of Methanosarcina acetivorans
C2A on carbon monoxide: an unusual way of life for a methanogenic archaeon.
Proc. Natl Acad. Sci. USA 101, 1692916934 (2004).
21. Fournier, G. P. & Gogarten, J. P. Evolution of acetoclastic methanogenesis in
Methanosarcina via horizontal gene transfer from cellulolytic Clostridia.
J. Bacteriol. 190, 11241127 (2008).
22. Drake, H. L., Kusel, K. & Matthies, C. Ecological consequences of the
phylogenetic and physiological diversities of acetogens. Antonie Van
Leeuwenhoek 81, 203213 (2002).
23. Henstra, A. M., Dijkema, C. & Stams, A. J. Archaeoglobus fulgidus couples CO
oxidation to sulfate reduction and acetogenesis with transient formate
accumulation. Environ. Microbiol. 9, 18361841 (2007).
24. Ettema, T. J. et al. Identication and functional verication of archaeal-type
phosphoenolpyruvate carboxylase, a missing link in archaeal central
carbohydrate metabolism. J. Bacteriol. 186, 77547762 (2004).
25. Mock, J., Wang, S., Huang, H., Kahnt, J. & Thauer, R. K. Evidence for a
hexaheteromeric methylenetetrahydrofolate reductase in Moorella
thermoacetica.J. Bacteriol. 196, 33033314 (2014).
26. Ragsdale, S. W. Enzymology of the WoodLjungdahl pathway of acetogenesis.
Ann. NY Acad. Sci. 1125, 129136 (2008).
27. Martin, W. F., Sousa, F. L. & Lane, N. Energy at lifes origin. Science 344,
10921093 (2014).
28. Martin, W. F., Neukirchen, S. & Sousa, F. L. Microbial Evolution under Extreme
Conditions 171184 (de Gruyter, 2015).
ARTICLES NATURE MICROBIOLOGY DOI: 10.1038/NMICROBIOL.2016.35
NATURE MICROBIOLOGY |www.nature.com/naturemicrobiology8
© 2016 Macmillan Publishers Limited. All rights reserved
29. Ueno, Y., Yamada, K., Yoshida, N., Maruyama, S. & Isozaki, Y. Evidence from
uid inclusions for microbial methanogenesis in the early Archaean era. Nature
440, 516519 (2006).
30. Rothman, D. H. et al. Methanogenic burst in the end-Permian carbon cycle.
Proc. Natl Acad. Sci. USA 111, 54625467 (2014).
31. Heuer, V. B., Pohlman, J. W., Torres, M. E., Elvert, M. & Hinrichs, K. U. The
stable carbon isotope biogeochemistry of acetate and other dissolved carbon
species in deep subseaoor sediments at the northern Cascadia margin.
Geochim. Cosmochim. Acta 73, 33233336 (2009).
32. Lever, M. A. Acetogenesis in the energy-starved deep biospherea paradox?
Front. Microbiol. 2, 284 (2011).
33. Biddle, J. F., Fitz-Gibbon, S., Schuster, S. C., Brenchley, J. E. & House, C. H.
Metagenomic signatures of the Peru Margin subseaoor biosphere show a
genetically distinct environment. Proc. Natl Acad. Sci. USA 105,
1058310588 (2008).
34. Orsi, W. D., Edgcomb, V. P., Christman, G. D. & Biddle, J. F. Gene expression in
the deep biosphere. Nature 499, 205208 (2013).
35. Meador, T. B. et al. The archaeal lipidome in estuarine sediment dominated by
members of the Miscellaneous Crenarchaeotal Group. Environ. Microbiol. 17,
24412458 (2014).
36. Drake, H. L., Gossner, A. S. & Daniel, S. L. Old acetogens, new light. Ann. NY
Acad. Sci. 1125, 100128 (2008).
37. Ovreas, L., Forney, L., Daae, F. L. & Torsvik, V. Distribution of bacterioplankton
in meromictic Lake Saelenvannet, as determined by denaturing gradient gel
electrophoresis of PCR-amplied gene fragments coding for 16S rRNA. Appl.
Environ. Microbiol. 63, 33673373 (1997).
38. Jorgensen, S. L. et al. Correlating microbial community proles with
geochemical data in highly stratied sediments from the Arctic Mid-Ocean
Ridge. Proc. Natl Acad. Sci. USA 109, E2846E2855 (2012).
39. Song, Z. Q. et al. Bacterial and archaeal diversities in Yunnan and Tibetan hot
springs, China. Environ. Microbiol. 15, 11601175 (2013).
40. Schloss, P. D. et al. Introducing mothur: open-source, platform-independent,
community-supported software for describing and comparing microbial
communities. Appl. Environ. Microbiol. 75, 75377541 (2009).
41. Pruesse, E. et al. SILVA: a comprehensive online resource for quality checked and
aligned ribosomal RNA sequence data compatible with ARB. Nucleic Acids Res.
35, 71887196 (2007).
42. Peng, Y., Leung, H. C., Yiu, S. M. & Chin, F. Y. IDBA-UD: a de novo assembler
for single-cell and metagenomic sequencing data with highly uneven depth.
Bioinformatics 28, 14201428 (2012).
43. Dick, G. J. et al. Community-wide analysis of microbial genome sequence
signatures. Genome Biol. 10, R85 (2009).
44. Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows
Wheeler transform. Bioinformatics 25, 17541760 (2009).
45. Wrighton, K. C. et al. Fermentation, hydrogen, and sulfur metabolism in
multiple uncultivated bacterial phyla. Science 337, 16611665 (2012).
46. Parks, D. H. et al. CheckM: assessing the quality of microbial genomes recovered
from isolates, single cells, and metagenomes. Genome Res. 25, 10431055 (2015).
47. Noguchi, H., Park, J. & Takagi, T. MetaGene: prokaryotic gene nding from
environmental genome shotgun sequences. Nucleic Acids Res. 34,
56235630 (2006).
48. Lagesen, K. et al. RNAmmer: consistent and rapid annotation of ribosomal RNA
genes. Nucleic Acids Res. 35, 31003108 (2007).
49. Rawlings, N. D., Waller, M., Barrett, A. J. & Bateman, A. MEROPS: the database
of proteolytic enzymes, their substrates and inhibitors. Nucleic Acids Res. 42,
D503D509 (2014).
50. Cantarel, B. L. et al. The Carbohydrate-Active EnZymes database (CAZy): an
expert resource for glycogenomics. Nucleic Acids Res. 37, D233D238 (2009).
51. Petersen, T. N., Brunak, S., von Heijne, G. & Nielsen, H. SignalP 4.0:
discriminating signal peptides from transmembrane regions. Nature Methods 8,
785786 (2011).
52. Cotter, D., Maer, A., Guda, C., Saunders, B. & Subramaniam, S. LMPD: LIPID
MAPS proteome database. Nucleic Acids Res. 34, D507D510 (2006).
53. Rinke, C. et al. Insights into the phylogeny and coding potential of microbial
dark matter. Nature 499, 431437 (2013).
54. Larkin, M. A. et al. Clustal W and Clustal X version 2.0. Bioinformatics 23,
29472948 (2007).
55. Price, M. N., Dehal, P. S. & Arkin, A. P. FastTree 2approximately maximum-
likelihood trees for large alignments. PLoS ONE 5, e9490 (2010).
Acknowledgements
The authors thank R. Thauer for critical reading of the manuscript, chief scientist
C. Vetriani for logistical support, as well as the of cers, crewand pilots of R/V Atlantis and
DSV Alvin for their expert help at sea. This work has been nancially supported by the
Natural Science Foundation of China (grant numbers 91228201, 31290232, 41525011 and
41506163), the China Ocean Mineral Resources R&D Association (grant DY125-15-T-04),
the Natural Science Foundation of Guangdong and Shenzhen of China (grant
numbers 2014A030310056 and JCY20140828163633985) and the US National Science
Foundation (grant numbers MCB-0456689 and MCB-0702677 to S.M.S.).
Author contributions
Y.H., M.L., S.M.S. and F.W. designed the experiment and analysis, and interpreted the data.
S.M.S. carried out sampling and preservation. V.P. prepared samples for nucleic acid
extractions and sequencing, and performed quantitative PCR. Y.H., M.L., X.F. and J.F.
performed the bioinformatics analyses. J.X. conducted the protein expression, purication
and enzyme assay. Y.H., M.L., S.M.S. and F.W. wrote the manuscript, in consultation with
all other authors.
Additional information
Supplementary information is available online.
Reprints and permissions information is
available online at www.nature.com/reprints. Correspondence and requestsfor materials should
be addressed to F.W.
Competing interests
The authors declare no competing nancial interests.
NATURE MICROBIOLOGY DOI: 10.1038/NMICROBIOL.2016.35 ARTICLES
NATURE MICROBIOLOGY |www.nature.com/naturemicrobiology 9
© 2016 Macmillan Publishers Limited. All rights reserved
... Ca. Bathyarchaeia is one of the ubiquitous class of archaea, found widely in freshwater (Xiang et al., 2017), and considered a core species of the microbiome in coastal , and marine sediments as well (He et al., 2016). This class is one of the most abundant groups of archaea on Earth, with an estimated global abundance ranging from 2.0 to 3.9 × 10 28 cells/ml. ...
... Genomic studies unveil that Ca. Bathyarchaeia primarily follows a heterotrophic lifestyle centered around acetyl-CoA, with the potential for acetogenesis (He et al., 2016), methane metabolism (Evans et al., 2015), and sulfur reduction . They also possess a diverse range of genes that encode carbohydrate-active enzymes , suggesting their ability to utilize various carbohydrates and lignin (Yu et al., 2018). ...
... It is noteworthy that not all microorganisms have the ability to directly degrade complex compounds, making the role of Ca. Bathyarchaeia in nutrient cycling is vital for the entire microbial community (He et al., 2016;Zhou et al., 2018). The potential of Ca. ...
Article
Full-text available
Most microorganisms resist pure cultivation under conventional laboratory conditions. One of the primary issues for this un-culturability is the absence of biologically produced growth-promoting factors in traditionally defined growth media. However, whether cultivating microbes by providing spent culture supernatant of pivotal microbes in the growth medium can be an effective approach to overcome this limitation is still an under-explored area of research. Here, we used the spent culture medium (SCM) method to isolate previously uncultivated marine bacteria and compared the efficiency of this method with the traditional cultivation (TC) method. In the SCM method, Ca. Bathyarchaeia-enriched supernatant (10%) was used along with recalcitrant organic substrates such as lignin, humic acid, and organic carbon mixture. Ca. Bathyarchaeia, a ubiquitous class of archaea, have the capacity to produce metabolites, making their spent culture supernatant a key source to recover new bacterial stains. Both cultivation methods resulted in the recovery of bacterial species from the phyla Pseudomonadota, Bacteroidota, Actinomycetota, and Bacillota. However, our SCM approach also led to the recovery of species from rarely cultivated groups, such as Planctomycetota, Deinococcota, and Balneolota. In terms of the isolation of new taxa, the SCM method resulted in the cultivation of 80 potential new strains, including one at the family, 16 at the genus, and 63 at the species level, with a novelty ratio of ~ 35% (80/219). In contrast, the TC method allowed the isolation of ~ 10% (19/171) novel strains at species level only. These findings suggest that the SCM approach improved the cultivation of novel and diverse bacteria.
... Bathyarchaeia, formerly the Miscellaneous Crenarchaeotal Group (MCG), is an archaeal class widespread in marine and freshwater sediments [1][2][3][4][5][6]. The estimated global abundance of Bathyarchaeia reaches up to 2.0-3.9 × 10 28 cells, representing one of the most abundant groups of archaea on Earth [7]. Genomic analyses suggest that Bathyarchaeia lead an acetyl-CoA-centered heterotrophic lifestyle with the potential for acetogenesis [7], methane metabolism [8], and sulfur reduction [9]. ...
... The estimated global abundance of Bathyarchaeia reaches up to 2.0-3.9 × 10 28 cells, representing one of the most abundant groups of archaea on Earth [7]. Genomic analyses suggest that Bathyarchaeia lead an acetyl-CoA-centered heterotrophic lifestyle with the potential for acetogenesis [7], methane metabolism [8], and sulfur reduction [9]. Bathyarchaeia also encompass a variety of genes encoding carbohydrate-active enzymes [10] and thus likely can unitize various carbohydrates and lignin [11]. ...
Article
Full-text available
Bathyarchaeia represent a class of archaea common and abundant in sedimentary ecosystems. Here we report 56 metagenome-assembled genomes of Bathyarchaeia viruses identified in metagenomes from different environments. Gene sharing network and phylogenomic analyses led to the proposal of four virus families, including viruses of the realms Duplodnaviria and Adnaviria, and archaea-specific spindle-shaped viruses. Genomic analyses uncovered diverse CRISPR elements in these viruses. Viruses of the proposed family “Fuxiviridae” harbor an atypical type IV-B CRISPR-Cas system and a Cas4 protein that might interfere with host immunity. Viruses of the family “Chiyouviridae” encode a Cas2-like endonuclease and two mini-CRISPR arrays, one with a repeat identical to that in the host CRISPR array, potentially allowing the virus to recruit the host CRISPR adaptation machinery to acquire spacers that could contribute to competition with other mobile genetic elements or to inhibit host defenses. These findings present an outline of the Bathyarchaeia virome and offer a glimpse into their counter-defense mechanisms.
... The archaeal genus Bathyarchaeia, formerly known as the Miscellaneous Crenarchaeotal Group (MCG) but now assigned to the phylum Bathyarchaeota based on the Genome Taxonomy Database (Rinke et al., 2021;Hou et al., 2022), has previously been detected in peatland ecosystems (Xiang et al., 2017;Truu et al., 2020;Prasitwuttisak et al., 2022). Several studies have suggested that Bathyarchaeia are organotrophic and are able to utilize lignin as an energy source for acetogenesis (Yu et al., 2018), leading to the degradation of a variety of organic carbon compounds such as acetate and complex hydrocarbons (He et al., 2016). Also, some members of Bathyarchaeia may be able to undergo anaerobic methane production (Evans et al., 2015). ...
... The archaeal genus Bathyarchaeia, formerly known as the Miscellaneous Crenarchaeotal Group (MCG) but now assigned to the phylum Bathyarchaeota based on the Genome Taxonomy Database (Rinke et al., 2021;Hou et al., 2022), has previously been detected in peatland ecosystems (Xiang et al., 2017;Truu et al., 2020;Prasitwuttisak et al., 2022). Several studies have suggested that Bathyarchaeia are organotrophic and are able to utilize lignin as an energy source for acetogenesis (Yu et al., 2018), leading to the degradation of a variety of organic carbon compounds such as acetate and complex hydrocarbons (He et al., 2016). Also, some members of Bathyarchaeia may be able to undergo anaerobic methane production (Evans et al., 2015). ...
Article
Full-text available
The conversion of tropical peat swamp forests to oil palm plantations has become a focal point in relation to global change. However, it is difficult to understand the ecological consequences of this conversion because little is known about how the microorganisms in these ecosystems respond to land-use conversion. Therefore, in this study, we assessed the microbial community structures of tropical peatland under two land uses, peat swamp forest and oil palm plantation (OP), to investigate how changes in local environmental conditions due to the conversion from forest to OP may have affected the microbial communities. For each land use, the microbial communities were assessed at three depths (0-5 cm, 20-25 cm, and 40-50 cm) using meta-16S amplicon analysis with Illumina Miseq. We found that the microbial communities under both land uses were dominated by anaerobes and fermenters, such as Acidobacteriota, Proteobacteria and Actinobacteria, which accounted for 80%-90% of the total abundance. There were strong similarities between the microbial communities in the 0-5 cm forest samples and the OP samples at all depths. However, the microbial communities in the 20-25 cm and 40-50 cm forest samples were different from the other samples. The differences in the deeper forest samples were likely related to water table and peat quality. CO 2 fluxes from the forest were significantly higher than from OP, with mean fluxes of 190 ± 66.92 and 57.86 ± 33.66 mg m −2 h −1 , respectively (p < 0.05). However, there were no differences in either CH 4 or N 2 O fluxes between the forest and OP. We detected nine microbial taxa that characterized the differences in the microbial communities between the forest and OP (p > 0.05); Bathyarchaeia, Dadabacteriales, Syntrophobacter, and Subgroup_13 were significantly more abundant in the forest, especially in the deeper peat layers, whereas Acidobacter, Bryobacter, 11-24, Leptospirillum, and WPS-2 were significantly more abundant in the OP (p > 0.05). This study helps improve our understanding of the differences in microbial community structures between peat swamp forests and OP systems. KEYWORDS microbial communities and structure, 16 rRNA, land-use change, peat swamp forest, oil palm plantation
... Interestingly, another acetogenesis pathway, the phosphate acetyltransferase (pta)-acetate kinase (ack) pathway, was also detected in small Archaea (e.g., Pacearchaeales and Woesearchaeales), Asgardarchaeota, and Thermoproteota. The pta-ack pathway is thought to be a bacterial-type acetogenesis pathway, although it has been found in a few Archaea 49,50 . Our discovery of widespread pta-ack pathway enzymes suggests that this so-called Bacteria-specific pathway is more widely distributed in Archaea than previously thought. ...
Article
Full-text available
Terrestrial geothermal springs are physicochemically diverse and host abundant populations of Archaea. However, the diversity, functionality, and geological influences of these Archaea are not well understood. Here we explore the genomic diversity of Archaea in 152 metagenomes from 48 geothermal springs in Tengchong, China, collected from 2016 to 2021. Our dataset is comprised of 2949 archaeal metagenome-assembled genomes spanning 12 phyla and 392 newly identified species, which increases the known species diversity of Archaea by ~48.6%. The structures and potential functions of the archaeal communities are strongly influenced by temperature and pH, with high-temperature acidic and alkaline springs favoring archaeal abundance over Bacteria. Genome-resolved metagenomics and metatranscriptomics provide insights into the potential ecological niches of these Archaea and their potential roles in carbon, sulfur, nitrogen, and hydrogen metabolism. Furthermore, our findings illustrate the interplay of competition and cooperation among Archaea in biogeochemical cycles, possibly arising from overlapping functional niches and metabolic handoffs. Taken together, our study expands the genomic diversity of Archaea inhabiting geothermal springs and provides a foundation for more incisive study of biogeochemical processes mediated by Archaea in geothermal ecosystems.
... They were previously placed in the Miscellaneous Crenarchaeotal Group (MCG) [4,5]. Bathyarchaeia has been found in various other environments, including sediments, volcanic mud, termite guts, bioreactors, and soils [6][7][8][9][10][11]. Bathyarchaeia is highly abundant in marine sediments, making them one of the most abundant groups of microorganisms on the earth [12,13]. However, previous studies on Bathyarchaeia have mainly focused on sediments, whereas the distribution of Bathyarchaeia in arable soils is not much studied. ...
Article
Full-text available
Bathyarchaeia (formerly Bathyarchaeota) is a group of highly abundant archaeal communities that play important roles in global biogeochemical cycling. Bathyarchaeia is predominantly found in sediments and hot springs. However, their presence in arable soils is relatively limited. In this study, we aimed to investigate the spatial distributions and diversity of Bathyarchaeia in paddy soils across eastern China, which is a major rice production region. The relative abundance of Bathyarchaeia among total archaea ranged from 3 to 68% in paddy soils, and Bathy-6 was the dominant subgroup among the Bathyarchaeia (70–80% of all sequences). Bathyarchaeia showed higher migration ability and wider niche width based on the neutral and null model simulations. Bathy-6 was primarily assembled by deterministic processes. Soil pH and C/N ratio were identified as key factors influencing the Bathyarchaeia composition, whereas C/N ratio and mean annual temperature influenced the relative abundance of Bathyarchaeia. Network analysis showed that specific Bathyarchaeia taxa occupied keystone positions in the archaeal community and co-occurred with some methanogenic archaea, including Methanosarcina and Methanobacteria, and ammonia-oxidizing archaea belonging to Nitrososphaeria. This study provides important insights into the biogeography and niche differentiation of Bathyarchaeia particularly in paddy soil ecosystems. Supplementary Information The online version contains supplementary material available at 10.1186/s40793-024-00555-8.
Article
Full-text available
Large-scale recovery of genomes from isolates, single cells, and metagenomic data has been made possible by advances in computational methods and substantial reductions in sequencing costs. While this increasing breadth of draft genomes is providing key information regarding the evolutionary and functional diversity of microbial life, it has become impractical to finish all available reference genomes. Making robust biological inferences from draft genomes requires accurate estimates of their completeness and contamination. Current methods for assessing genome quality are ad hoc and generally make use of a limited number of 'marker' genes conserved across all bacterial or archaeal genomes. Here we introduce CheckM, an automated method for assessing the quality of a genome using a broader set of marker genes specific to the position of a genome within a reference genome tree and information about the collocation of these genes. We demonstrate the effectiveness of CheckM using synthetic data and a wide range of isolate, single cell and metagenome derived genomes. CheckM is shown to provide accurate estimates of genome completeness and contamination, and to outperform existing approaches. Using CheckM, we identify a diverse range of errors currently impacting publicly available isolate genomes and demonstrate that genomes obtained from single cells and metagenomic data vary substantially in quality. In order to facilitate the use of draft genomes, we propose an objective measure of genome quality that can be used to select genomes suitable for specific gene- and genome-centric analyses of microbial communities. Published by Cold Spring Harbor Laboratory Press.
Article
Full-text available
Large-scale recovery of genomes from isolates, single cells, and metagenomic data has been made possible by advances in computational methods and substantial reductions in sequencing costs. While this increasing breadth of draft genomes is providing key information regarding the evolutionary and functional diversity of microbial life, it has become impractical to finish all available reference genomes. Making robust biological inferences from draft genomes requires accurate estimates of their completeness and contamination. Current methods for assessing genome quality are ad hoc and generally make use of a limited number of ‘marker’ genes conserved across all bacterial or archaeal genomes. Here we introduce CheckM, an automated method for assessing the quality of a genome using a broader set of marker genes specific to the position of a genome within a reference genome tree along with information about the collocation of these genes. We demonstrate the effectiveness of CheckM using synthetic data and a wide range of isolate, single cell and metagenome derived genomes. CheckM is shown to provide accurate estimates of genome completeness and contamination, and to outperform existing approaches. Using CheckM, we identify a diverse range of errors currently impacting publicly available isolate genomes and demonstrate that genomes obtained from single cells and metagenomic data vary substantially in quality. In order to facilitate the use of draft genomes, we propose an objective measure of genome quality that can be used to select genomes suitable for specific gene- and genome-centric analyses of microbial communities. CheckM is open source software available at http://ecogenomics.github.io/CheckM.
Article
Investigations of the biogeochemical roles of benthic Archaea in marine sediments are hampered by the scarcity of cultured representatives. In order to determine their metabolic capacity, we reconstructed the genomic content of four widespread uncultured benthic archaea recovered from estuary sediments at 48 to 95% completeness. Four genomic bins were found to belong to different subgroups of the former Miscellaneous Crenarcheota Group (MCG) now called Bathyarchaeota: MCG-6, MCG-1, MCG-7/17 and MCG-15. Metabolic predictions based on gene content of the different genome bins indicate that subgroup 6 has the ability to hydrolyse extracellular plant-derived carbohydrates, and that all four subgroups can degrade detrital proteins. Genes encoding enzymes involved in acetate production as well as in the reductive acetyl-CoA pathway were detected in all four genomes inferring that these archaea are organo-heterotrophic and autotrophic acetogens. Genes involved in nitrite reduction were detected in all Bathyarchaeota subgroups and indicate a potential for dissimilatory nitrite reduction to ammonium. Comparing the genome content of the different Bathyarchaeota subgroups indicated preferences for distinct types of carbohydrate substrates and implicitly, for different niches within the sedimentary environment.
Article
Methanogenic and methanotrophic archaea play important roles in the global flux of methane. Culture-independent approaches are providing deeper insight into the diversity and evolution of methane-metabolizing microorganisms, but, until now, no compelling evidence has existed for methane metabolism in archaea outside the phylum Euryarchaeota. We performed metagenomic sequencing of a deep aquifer, recovering two near-complete genomes belonging to the archaeal phylum Bathyarchaeota (formerly known as the Miscellaneous Crenarchaeotal Group). These genomes contain divergent homologs of the genes necessary for methane metabolism, including those that encode the methyl–coenzyme M reductase (MCR) complex. Additional non-euryarchaeotal MCR-encoding genes identified in a range of environments suggest that unrecognized archaeal lineages may also contribute to global methane cycling. These findings indicate that methane metabolism arose before the last common ancestor of the Euryarchaeota and Bathyarchaeota.
Article
The anoxic sediments of the White Oak River (WOR) estuary comprise a distinctive sulfate-methane transition zone (SMTZ) and natural enrichment of the archaea affiliated with the Miscellaneous Crenarchaeota Group (MCG). Archaeal biphytanes were generally depleted in 13C, with δ13C values being less than -35‰, indicative of production by active sedimentary archaeal populations. Multivariate analysis of the downcore distributions of 63 lipid biomarkers identified three major groups of lipids that were enriched in the surface, SMTZ, or subsurface depths. Intact polar lipids with phosphatidylglycerol headgroups and glycerol dibiphytanyl glycerol tetraethers containing one, two, or three cyclopentane rings were enriched at the base of the SMTZ and likely represent the accumulated product of a small but active ANME-1 community. The recently identified butanetriol dibiphytanyl glycerol tetraethers (BDGT), which increased relatively to other lipids with depth, were correlated with the relative abundance of MCG in archaeal 16S rRNA clone libraries, and were 13C-depleted throughout the depth profile, suggesting BDGT lipids as putative biomarkers of an MCG community that may either be autotrophic or feeding on 13C-depleted organic substrates transported by porewater.