ArticlePDF Available

First genomic insights into members of a candidate bacterial phylum responsible for wastewater bulking

PeerJ

January 2015
3(1):e740

DOI:10.7717/peerj.740

Source
PubMed

License
CC BY 4.0

Authors:

Yuji Sekiguchi

National Institute of Advanced Industrial Science and Technology

Donovan H Parks

The University of Queensland

Show all 6 authorsHide

Filamentous cells belonging to the candidate bacterial phylum KSB3 were previously identified as the causative agent of fatal filament overgrowth (bulking) in a high-rate industrial anaerobic wastewater treatment bioreactor. Here, we obtained near complete genomes from two KSB3 populations in the bioreactor, including the dominant bulking filament, using differential coverage binning of metagenomic data. Fluorescence in situ hybridization with 16S rRNA-targeted probes specific for the two populations confirmed that both are filamentous organisms. Genome-based metabolic reconstruction and microscopic observation of the KSB3 filaments in the presence of sugar gradients indicate that both filament types are Gram-negative, strictly anaerobic fermenters capable of non-flagellar based gliding motility, and have a strikingly large number of sensory and response regulator genes. We propose that the KSB3 filaments are highly sensitive to their surroundings and that cellular processes, including those causing bulking, are controlled by external stimuli. The obtained genomes lay the foundation for a more detailed understanding of environmental cues used by KSB3 filaments, which may lead to more robust treatment options to prevent bulking.

Features of the Modulibacteria KSB3 population genomes.

…

Phylogenetic structure of the Modulibacteria (KSB3) phylum based on comparative analysis of 16S rRNA gene sequences, and imaging of KSB3 cells. (A) Maximum-likelihood phylogenetic tree (RAxML) of public data (accession numbers shown) and the 16S rRNA sequence determined in this study for UASB14. Sequences from the bacterial phyla Nitrospirae, Tenericutes, and Chloroflexi were used to root the tree (not shown). Reproducible interior nodes are indicated as a black circle (>90% bootstrap support for neighbor-joining [NJ], maximum parsimony [MP], and maximum-likelihood [ML] inferences), open circle (>80% support); or open rectangle (>70% support). Nodes without symbols were not reproducible between trees. The scale bar represents 5% estimated sequence divergence. Class-level clades are bracketed to the right of the figure in black. The target ranges of KSB3-specific FISH (fluorescence in situ hybridization) probes used in this study are indicated by colored brackets with the colors corresponding to cell color in (B) and (D). (B) 16S rRNA-targeted FISH detection of UASB14 and UASB270 filaments in the UASB sludge. The abundant UASB14 filaments are labeled green and the low abundance UASB270 filaments are labeled red. (C) Total KSB3 filament abundance highlighted by a phylum-level FISH probe relative to (D) all cells present in the same field (phase-contrast image). Bars in (B-D) represent 10 µm.

…

Figures - uploaded by Gene W Tyson

Content may be subject to copyright.

Content uploaded by Gene W Tyson

Content may be subject to copyright.

Available via license: CC BY 4.0

Content may be subject to copyright.

Submitted 14 October 2014

Accepted 5 January 2015

Published 27 January 2015

Corresponding authors

Yuji Sekiguchi,

y.sekiguchi@aist.go.jp

Philip Hugenholtz,

p.hugenholtz@uq.edu.au

Academic editor

Kimberly Bishop-Lilly

Additional Information and

Declarations can be found on

page 19

DOI 10.7717/peerj.740

2015 Sekiguchi et al.

Distributed under

Creative Commons CC-BY 4.0

OPEN ACCESS

First genomic insights into members of a

candidate bacterial phylum responsible

for wastewater bulking

Yuji Sekiguchi1, Akiko Ohashi1, Donovan H. Parks2,

Toshihiro Yamauchi3, Gene W. Tyson2,4 and Philip Hugenholtz2,5

1Biomedical Research Institute, National Institute of Advanced Industrial Science and Technology (AIST),

Tsukuba, Ibaraki, Japan

2Australian Centre for Ecogenomics, School of Chemistry and Molecular Biosciences, The University of

Queensland, St. Lucia, Queensland, Australia

3Administrative Management Department, Kubota Kasui Corporation, Minato-ku, Tokyo, Japan

4Advanced Water Management Centre, The University of Queensland, St. Lucia, Queensland, Australia

5Institute for Molecular Bioscience, The University of Queensland, St. Lucia, Queensland, Australia

ABSTRACT

Filamentous cells belonging to the candidate bacterial phylum KSB3 were previously

identiﬁed as the causative agent of fatal ﬁlament overgrowth (bulking) in a high-rate

industrial anaerobic wastewater treatment bioreactor. Here, we obtained near

complete genomes from two KSB3 populations in the bioreactor, including the

dominant bulking ﬁlament, using diﬀerential coverage binning of metagenomic data.

Fluorescence in situ hybridization with 16S rRNA-targeted probes speciﬁc for the

two populations conﬁrmed that both are ﬁlamentous organisms. Genome-based

metabolic reconstruction and microscopic observation of the KSB3 ﬁlaments in the

presence of sugar gradients indicate that both ﬁlament types are Gram-negative,

strictly anaerobic fermenters capable of non-ﬂagellar based gliding motility, and

have a strikingly large number of sensory and response regulator genes. We propose

that the KSB3 ﬁlaments are highly sensitive to their surroundings and that cellular

processes, including those causing bulking, are controlled by external stimuli.

The obtained genomes lay the foundation for a more detailed understanding

of environmental cues used by KSB3 ﬁlaments, which may lead to more robust

treatment options to prevent bulking.

Subjects Biodiversity, Biotechnology, Genomics, Microbiology, Taxonomy

Keywords KSB3 phylum, Candidate phylum, Wastewater treatment, Anaerobic biotechnology,

Filamentous bulking, Metagenomics

INTRODUCTION

Anaerobic digestion is a major type of biological treatment extensively used around the

world (Ahring, 2003a) that is not only cost eﬀective for treating organic waste and wastewa-

ter, but also can frequently produce energy in the form of methane (biogas) (Angelidaki

et al., 2011). Over the last thirty years, a set of high rate anaerobic digestion reactor

conﬁgurations have been developed, of which the upﬂow anaerobic sludge blanket (UASB)

technology is the most successful and commercialized conﬁguration (Kleerebezem &

Macarie, 2003;van Lier, 2008). Despite the success of this technology, serious performance

How to cite this article Sekiguchi et al. (2015), First genomic insights into members of a candidate bacterial phylum responsible for

wastewater bulking. PeerJ 3:e740;DOI 10.7717/peerj.740

issues have emerged such as the sudden washout of granular sludge biomass due to over-

growth of ﬁlamentous bacteria (bulking), which can lead to complete loss of performance.

Bulking of anaerobic digestion systems can be caused by a variety of ﬁlamentous

microorganisms (HulshoﬀPol et al., 2004;Li et al., 2008;Yamada & Sekiguchi, 2009)

and a phylogenetically novel ﬁlament was previously reported to be the cause of bulking

in an industrial UASB reactor treating sugar manufacturing wastewater (Yamada et al.,

2007;Yamada et al., 2011). Small subunit ribosomal RNA (16S rRNA) gene-based analyses

of the bulking sludge (Yamada et al., 2007) revealed that the dominant ﬁlament type

belongs to candidate bacterial phylum KSB3, originally proposed by Tanner et al. (2000)

based on an environmental 16S rRNA gene clone sequence obtained from a sulfur-rich

marine sediment (Tanner et al., 2000). Fluorescence in situ hybridization (FISH) with

KSB3-speciﬁc 16S rRNA-directed probes revealed that the KSB3 ﬁlaments are localized at

the outer layer of healthy granules (Yamada et al., 2007) which become substantially thicker

during bulking. The study of ﬁlamentous KSB3 bacteria will undoubtedly contribute to

our understanding of and ability to prevent bulking in anaerobic wastewater treatment sys-

tems, but has been hampered by an inability to obtain a pure culture despite repeated and

long term isolation eﬀorts (Yamada et al., 2011). However, culture-independent molecular

and imaging methods are beginning to provide clues regarding the ecophysiology of these

organisms. This includes their ability to uptake simple carbohydrates, particularly maltose

and glucose, under anaerobic conditions and from these observations it was proposed that

high carbohydrate loading in the UASB reactor may trigger proliferation of KSB ﬁlament

populations (Yamada et al., 2011).

Here, we obtained near complete genomes from in situ populations of the dominant

bulking KSB3 ﬁlament type and a second moderately related low abundance KSB3

ﬁlament via diﬀerential coverage binning (Albertsen et al., 2013) using metagenomic data

previously reported from a full-scale UASB reactor (Soo et al., 2014). Diﬀerential coverage

binning groups together anonymous metagenomic fragments (contigs) belonging to the

same population based on the similarity of their sequencing coverage across multiple

related metagenomes (Albertsen et al., 2013). These genomes represent the ﬁrst genomic

information for candidate phylum KSB3 and provide insights into the metabolism of KSB3

ﬁlaments and their ability to cause bulking.

METHODS

Samples

Methanogenic sludge samples reported in a previous study (Soo et al., 2014) were used in

the present study for shotgun sequencing and ﬂuorescence in situ hybridization (FISH).

Brieﬂy, two sludge samples (A1 and A2) were taken from the system at diﬀerent sampling

dates (A1, 25th December, 2012; A2, 16th September, 2010), and sample A1 was further

separated into two parts (ﬂocculant sludge [F1] and granular sludges [G1]) by gravimetric

settlement (Soo et al., 2014) (Table S1). Each sample had been divided into two parts: one

used for obtaining DNA via bead-beating and phenol chloroform extraction and the other

ﬁxed in 4% paraformaldehyde for FISH (Soo et al., 2014).

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 2/24

Fluorescence in situ hybridization

KSB3-speciﬁc FISH probes were designed in ARB v5.5 (Ludwig et al., 2004) using 16S

rRNA genes identiﬁed in the KSB3 genomes and KSB3 16S rRNA gene sequences available

in the current Greengenes database (May 2013 version) (McDonald et al., 2012). In order to

maximize the speciﬁcity and ﬂuorescence intensity of the probes, helper probes were also

designed (Table S5). FISH was performed as described previously (Sekiguchi et al., 1999)

using the probes listed in Table S5 and all probes were hybridized overnight. Probes were

labeled with either Alexa488 or Cy3 ﬂuorophores, and probes with diﬀerent ﬂuorophores

were used together for dual-staining FISH. Images were captured using an epiﬂuorescence

microscope (Axioplan 2; Carl Zeiss) equipped with a cooled charge-coupled device (CCD)

camera (DP72; Olympus) and subsequently processed using imaging software (DP2-BSW,

version 2.2; Olympus). Super-imposed images were generated using Adobe Photoshop

(CS5.1; Adobe).

Metagenome sequencing

Previously sequenced paired-end and mate-pair metagenomes for samples A1, A2, F1, and

G1 (Soo et al., 2014) were supplemented with additional data generated in this study using

the same DNA. Paired-end Nextera libraries were prepared for each sample according

to the manufacturer’s instructions, quantiﬁed using the QuantIT kit (Molecular Probes)

and sequenced (2 ×250 bp paired end) on an Illumina MiSeq using the Reagent Kit v2

(Illumina) at the National Institute of Advanced Industrial Science and Technology, Japan

(AIST; Table S1). The extra metagenome sequencing generated 7.3 Gb, 13.1 Gb, 1.9 Gbp,

and 7.9 Gbp for A1, A2, F1, and G1 samples, respectively (yielding a total of 56.2 Gb for

all samples combined). For scaﬀolding, two additional large-insert mate-pair libraries

(∼3 kbp and ∼7.5 kbp) were constructed from sample A1 using the Mate Pair Library

Preparation Kit v2 (Illumina) and sequenced on Illumina MiSeq system (MiSeq Reagent

Kit v2) yielding 2.3 Gb and 3.0 Gb, respectively (for a total of 7.3 Gb from sample A1 when

combined with mate-pair data; Soo et al., 2014).

Community proﬁling

16S rRNA gene amplicon sequencing of all UASB sludge samples using the Illumina MiSeq

system was previously reported (Soo et al., 2014). Community composition was also

examined by extracting all 16S rRNA reads from the metagenome datasets using the closed

reference picking script in QIIME v1.6.0 (Caporaso et al., 2010b) with the Greengenes

database (de-replicated dataset at 97%, March 2013 McDonald et al., 2012) as reference

(otu picking method, uclust ref; similarity cutoﬀvalue, 0.95). All reads from paired end

sequencing were quality ﬁltered using a QIIME script (split library fastq.py) with the

following stringent parameters to ensure only high quality reads were included in the

analysis: read trimming with a Phred quality threshold of 17 (-q 17) and discarding reads

shorter than 50% of the input read length (-p 0.5), and then the quality ﬁltered single-end

reads were used for the closed reference picking to generate OTU tables.

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 3/24

Assembly and binning

Metagenome assembly and population genome binning followed the approach previously

described (Albertsen et al., 2013). A graphical illustration of the workﬂow is shown in

Fig. S1. Brieﬂy, paired-end metagenome reads in fastq format were merged with SeqPrep

(https://github.com/jstjohn/SeqPrep) using default settings and Illumina sequencing

adapters were removed. Unmerged reads were quality trimmed and ﬁltered using

Nesoni v0.112 (https://github.com/ Victorian-Bioinformatics-Consortium/nesoni) with

removing low quality bases from reads with a Phred quality threshold of 17, removing

homopolymers reads, and eliminating trimmed reads shorter than 30 bases (clip –quality

17 –homopolymers yes –length 30). The merged and trimmed reads from the four

metagenomes were co-assembled using SPAdes v2.5.0 with the following parameters suited

to a complex metagenomic assembly: –only-assembler -k 67 –sc. Reads from respective

samples were separately mapped to scaﬀolds using BWA v0.7.4 (Li & Durbin, 2010)

with the BWA-MEM algorithm using default parameters. Population genome binning

using diﬀerential coverage (Albertsen et al., 2013) was performed using GroopM v0.1

(Imelfort et al., 2014) with the initial core formation based on contigs/scaﬀolds longer than

1,500 bp. Manual reﬁnement of population genome bins, and subsequent recruitment of

contigs/scaﬀolds longer than 500 bp was performed using the GroopM tools.

Identiﬁcation of conserved marker genes

All contigs/scaﬀolds in each genome bin were translated into six reading frames, and

hmmsearch in HMMER3 (Eddy, 2011) was used to identify 111 single copy marker

genes conserved in most bacteria (Dupont et al., 2012), 83 phylogenetically-informative

marker genes (Soo et al., 2014), and the 38 marker genes proposed by PhyloSift (Darling

et al., 2014). To determine the completeness and contamination of each genome bin, the

distribution and number of the 111 conserved single copy marker set was determined

using CheckM (Parks et al., 2014) with default settings.

Reﬁnement of population genome bins

Scaﬀolding of metagenome contigs using the mate-pair data was performed with SSPACE

v2.0 (Boetzer et al., 2011). SSPACE was run with the following two sets of parameters: lower

stringency for minor population genomes with relatively low coverage (e.g., UASB270),

-k 2 (minimum number of links to compute scaﬀold) -a 0.7 (maximum link ratio

between two best contig pairs) -x 0 (no extention of the contigs using paired reads)

-p 1 (making .dot ﬁle for visualization) and higher stringency for major population

genomes (e.g. UASB14): -k 4 -a 0.7 -x 0 -p 1. The resulting dot ﬁles were used for

visualizing contig connections using Cytoscape v.2.8.1 (Shannon et al., 2003). In addition,

Cytoscape attribute ﬁles were generated with coverage, length, and bin number (bin name)

information for each contig/scaﬀold. Based on the coverage information and number

of connections between contig/scaﬀolds, external contig/scaﬀolds are manually added

to each bin. In addition, contig/scaﬀolds with a small number of connections to other

contig/scaﬀolds in their respective bins were excluded. Reﬁned sets of contig/scaﬀolds were

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 4/24

then scaﬀolded with SSPACE with the following low stringency parameters: -k 2 -a 0.7 -x 0

-p 1. For further reﬁnement, shotgun mate-pair reads were mapped to the newly generated

scaﬀolds using CLC genomic workbench v6.0 (CLC Bio) using default parameters with

the exception of a similarity fraction of 0.98 and exported in SAM format. The assembly

was visualized using Circos (Krzywinski et al., 2009) and used for manual inspection of the

assembly as previously described (Albertsen et al., 2013). Manual correction of misassembly

and mis-scaﬀolding was performed using the microbial genome ﬁnishing module in CLC

genomic workbench v6.0 (CLC Bio).

Genome tree

Finished bacterial and archaeal genomes were downloaded from IMG (release

4.1) (Markowitz et al., 2014), from which the 38 universally (Darling et al., 2014) or

83 single-copy proteins broadly conserved in bacteria were identiﬁed using HMM

searches (Soo et al., 2014). To evaluate the robustness of the protein trees (genome trees),

four diﬀerent outgroup taxon conﬁgurations (two data sets for 38 marker genes, two data

sets for 83 marker genes) were made (Table S3). Homologous proteins obtained from the

KSB3 and reference genomes in each taxon conﬁguration were aligned using hmmalign

in HMMER3, and subsequently concatenated. A mask was generated for the concatenated

alignment using Gblocks (Talavera & Castresana, 2007) with only conserved positions

found in more than half of the sequences considered. All tree topologies were tested for

robustness using the maximum likelihood methods from FastTree v2.1.7 (with default

parameters, JTT model, CAT approximation) (Price, Dehal & Arkin, 2009) and RAxML

v7.7.8 (JTT and Gamma models with rapid 100 times bootstrapping) (Stamatakis, 2006).

The PHYLIP SEQBOOT module (Felsenstein, 1989) was used to generate 100 resampled

alignments and FastTree was used to analyze the resampled alignments (-n 100). A script

(CompareToBootstrap.pl) included in the FastTree package was used to compare the

original tree to the resampled trees and generate bootstrap values. Generated trees were

imported into ARB (Ludwig et al., 2004), where they were rooted, left-hand ladderized

using the “beautify” tool and grouped into phylum-level clades. A representative tree (tree

no. 1; Table S4) was exported from ARB and visualized using iTOL (Letunic & Bork, 2011).

16S rRNA gene phylogeny

KSB3 related 16S rRNA genes were manually curated using the Greengenes database

(version May 2013; McDonald et al., 2012) in ARB (Ludwig et al., 2004). 16S rRNA genes

from binned population genomes were aligned with PyNAST (Caporaso et al., 2010a),

imported into ARB, and the alignments were manually corrected using the ARB EDIT

tool. Sets of taxa (>1,300 nt) were selected in ARB and their alignments were exported

applying Lane mask ﬁltering. One set of taxa included representatives across all recognized

bacterial phyla to determine the relative position of KSB3 in the bacterial domain (Fig. S6).

A second set of taxa included all KSB3 sequences to determine the relative position of

the two UASB ﬁlament genomes within the KSB3 phylum (Fig. 1A). Neighbor joining

trees were calculated from the masked alignments with LogDet distance estimation

using PAUP* 4.0 (Swoﬀord, 2003) with 100 bootstrap resamplings. Maximum likelihood

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 5/24

trees were calculated based on the masked alignments using RAxML v7.7.8 (GTR and

Gamma models +I) with rapid 100 time bootstrapping. Maximum parsimony trees were

calculated using PAUP* 4.0. A heuristic search was used with a random stepwise addition

sequence of 10 replicates and nearest-neighbor-interchange swapping. Bootstrap analyses

on the maximum parsimony trees were run with 100 times resampling for each best tree.

Generated trees were re-imported into ARB for visualization.

Genome analysis

The two assembled KSB3 population genomes were initially annotated with PROKKA

v1.7 using default settings (Seemann, 2014), and manually curated by comparison to

UniRef90 (Suzek et al., 2007), IMG (Integrated Microbial Genomes, ﬁnished genomes,

release 4.0) (Markowitz et al., 2014), COGs (Clusters of Orthologous Groups) (Tatusov

et al., 2000), PFAM (Punta et al., 2012), and KEGG (Kyoto Encyclopedia of Genes and

Genomes) (Aoki-Kinoshita & Kanehisa, 2007) databases. Bi-directional best-blast matches

were performed for proteins with matches to UniRef90 and IMG using a bit score threshold

of 300, and one-way BLASTP matches with a bit score of 60 (Castelle et al., 2013).

For COGs, RPS-BLAST against COG PSSMs from the CDD database (Marchler-Bauer

et al., 2013) was performed using an e-value cutoﬀof 0.01, with the top hit retained

for each protein domain. The amino acid sequences were also searched for conserved

motifs with PFAM (Punta et al., 2012) using HMMR3 (Eddy, 2011) and PfamScan

with default settings (with family noise cutoﬀ). Protein domain structure of some gene

products were additionally evaluated using InterProScan search (Quevillon et al., 2005).

For manual annotation of the KSB3 genomes, we ranked the resulting annotations as

follows: bi-directional best-blast matches with UniRef90 and IMG data; one-way matches

with UniRef90, IMG, and COGs; PFAM matches; hypothetical proteins (Castelle et al.,

2013). For comparison of gene sets with other genomes, we downloaded the full IMG

database (release 4.1) containing all genomes in IMG and their annotations (e.g. PFAMs

and COGs). In addition, a list of all ﬁnished bacterial genomes and associated metadata

(e.g. taxonomic aﬃliation and genome size) was obtained though IMG. Ribosomal RNA

copy number was estimated by determining the ratio of average genome coverage to 16S

rRNA gene coverage for each KSB3 genome calculated using BWA read mapping. CRISPR

loci were identiﬁed using CRT v1.2 (Bland et al., 2007). Presence/absence of some gene

sets related to cell envelope structure (Albertsen et al., 2013), complex bacterial lifestyle,

and adaptability to ﬂuctuating environmental conditions (‘social IQ’) (Sirota-Madi et

al., 2010) were evaluated based on IMG annotation for ﬁnished genomes and annotated

KSB3 genomes, and the resulting abundance matrix was visualized using R and ggplot2.

Orthologous proteins between the two KSB3 genomes were identiﬁed using pairwise

bi-directional best hit BLASTP searches. Glycoside hydrolases were identiﬁed using

the CAZy database (Lombard et al., 2014) (dbCAN HMMs v3.0, Yin et al., 2012) with

HMMER3 (default settings). Transmembrane proteins were predicted using TMHMM

Server v. 2.0 (Moller, Croning & Apweiler, 2001).

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 6/24

Gram-staining and gliding motility

Gram-staining for KSB3 ﬁlaments was performed based on the method of

Hucker, (Doetsch, 1981). Gliding motility of KSB3 ﬁlaments was evaluated using fresh

sludge samples examined under an epiﬂuorescence microscope (Axioplan 2; Carl Zeiss)

equipped with an automatic thermo-control system (Thermo Plate, MATS-55SFG-FT;

Tokai Hit). Fresh sludge samples were placed on glass slides, and a cover glass carefully

positioned over the sample to minimize exposure to air. To maintain anaerobic conditions

on the slides, reducing agents (Na2S and/or L-cysteine) were added to the samples. The

temperature of the microscopic stage was maintained at 37 ◦C, and time-lapse images

were recorded with a cooled CCD camera (DP72; Olympus) equipped with the imaging

software (DP2-BSW, version 2.2; Olympus). Glucose, maltose, ribose, mannose, galactose,

arabinose, raﬃnose, sucrose, xylose, fructose, lactate, ethanol, propionate, nitrate, nitrite

(ﬁnal concentration of 5–10 mM) and yeast extract (0.1%) were used as candidate

stimuli to induce motility. Each potential stimulant was mixed with the cells prior to

the observation, or placed at the edge of the glass cover creating a gradient as the stimulant

diﬀused into the sample.

RESULTS

KSB3 populations in the UASB system

Two UASB sludge samples taken two years apart (A1 and A2), and ﬂocular (F1) and

granular (G1) fractions derived from sample A1, reported previously (Soo et al., 2014),

were used in the present study. The UASB system had a history of periodic bulking

caused by KSB3 ﬁlaments (Yamada et al., 2007;Yamada et al., 2011). Inspection of 16S

rRNA gene amplicon community proﬁles of these samples (Soo et al., 2014) revealed two

KSB3 16S rRNA phylotypes accounting for 4.9 and 3.7% of total sequencing reads from

samples A1 and A2, respectively. The dominant phylotype, representing ∼94% of the

KSB3 reads, was identical to the previously reported bulking phylotype (clone YM-1,

AB218870;Yamada et al., 2007), and the minor phylotype, representing ∼6% of the KSB3

reads, was identical to a low abundance clone detected in the UASB reactor during normal

operation (clone SmB78ﬂ, AB266927;Narihiro et al., 2009). The internal transcribed

spacer (ITS) region of the bulking phylotype was sequenced to conﬁrm that it was the same

strain present in the bulking and normally operating UASB sludge (Fig. S2). Previously

reported metagenomes (Soo et al., 2014) and additional shotgun sequencing of samples

A1, F1, G1 and A2 were used for recovering high quality draft population genomes of the

two KSB3 phylotypes (Table S1). Based on detection of 16S rRNA genes in the shotgun

paired-end read datasets, the KSB3 phylotypes comprised up to 10 and 11% of the A1 and

A2 metagenomes respectively with the dominant KSB3 phylotype having approximately

10 fold higher abundance than the minor phylotype (Fig. S3), broadly consistent with the

amplicon results.

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 7/24

Recovery of KSB3 population genomes

The four metagenomes (59 Gb in total, Table S1) were co-assembled, generating 504,757

contigs/scaﬀolds (>500 bp) with a combined length of 906 Mb, an N50 of 3 kb and a

longest scaﬀold of 506 kb. Population genomes were recovered from the assembly by

exploiting variations in population abundance (coverage) between individual sample

metagenomes (diﬀerential coverage binning, Albertsen et al., 2013) using the automated

binning tool GroopM (Imelfort et al., 2014). The completeness and contamination of the

population genomes were estimated by detection of single copy marker gene sets widely

conserved in the domain Bacteria (Dupont et al., 2012). Thirty-nine bacterial population

genome bins were obtained with >65% completeness (>73/111 markers) and <10%

contamination (<11/111, marker genes with >1 copy in a population genome indicate

presumptive contamination with another organism). These genomes were reﬁned by

tracking mate-pair reads in network graphs to further improve the completeness and

reduce contamination of the bins, and to recruit repeat sequences, notably ribosomal

RNA operons, which can evade diﬀerential coverage binning if present in multiple copies

(Albertsen et al., 2013).

We identiﬁed 16S rRNA gene sequences in two reﬁned population genome bins

(UASB14 and UASB270, Fig. S4) that were identical to the amplicon sequences from

the dominant and minor KSB3 phylotypes, respectively (Table 1). Despite careful manual

curation of both genomes, their estimated completeness based on 111 conserved single

copy marker genes (Dupont et al., 2012) is only ∼93%, and both also have an inferred

∼6% contamination based on these markers. Inspection of the marker genes with no or

>1 hit, however, show a high degree of overlap between the genomes, suggesting that these

particular genes are either actually absent, duplicated, or laterally transferred based on

phylogenetic inference and gene neighborhood (Table S2). This may not be unexpected

given the phylogenetic novelty of the lineage. A revised estimate of completeness and

contamination based on a prediction that six of the 111 marker genes are absent and ﬁve

are duplicated is >98% and <2% respectively (Table 1). To estimate the number of rRNA

operons in each KSB3 genome, we compared average genomic coverage to 16S rRNA gene

coverage, which indicated that UASB14 and UASB270 have three and two rRNA operons

respectively (Fig. S5).

KSB3 phylogeny and morphology

The relative position of the two KSB3 genomes within the phylum was assessed by

comparative analysis of their 16S rRNA gene sequences with publicly available full-length

sequences. UASB14 and UASB270 represent two of several major lines of descent in the

KSB3 phylum (Fig. 1A). According to Greengenes classiﬁcation (McDonald et al., 2012),

UASB14 belongs to an unnamed class-level lineage and UASB270 is a member of class

MAT-CR-H3-D11 for which we propose the names Moduliﬂexia and Vecturitrichia,

respectively (Table 1;Supplemental Information 1). To conﬁrm the status of KSB3 as a

candidate phylum, as inferred by 16S rRNA comparative analyses (Yamada et al., 2007,

Fig. S6), we constructed phylogenetic trees based on a larger genomic sampling. Two sets

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 8/24

Table 1 Features of the Modulibacteria KSB3 population genomes.

Genome bin identiﬁer UASB14 UASB270

Candidatus name Moduliﬂexus ﬂocculans Vecturathrix granuli

Closest environmental 16S clone YM-1 (AB218870) SmB78ﬂ (AB266927)

No. of scaﬀolds 8 21

Total length (bp) 7,147,157 8,384,694

N50 1,183,318 597,372

GC (%) 50.6 47.2

Average coverage 278 38

Genome completenessa92.8% (103/111) 93.6% (104/111)

Revised genome completenessa98.1% (103/105) 99.0% (104/105)

Genome contaminationa5.4% (6/111) 6.3% (7/111)

Revised genome contaminationa0.0% (0/105) 1.9% (2/105)

Relative abundance in UASB metagenomes (%)b9.22 0.40

No. tRNA genes 54 43

rRNA genes found in genome 5S, 16S, 23S 5S, 16S, 23S

Inferred no. of rRNA operonsc3 2

No. CDS 5,989 7,048

No. CRISPR array 4 (125 repeats in total) 5 (550 repeats in total)

Coding density 84.6 % 84.3 %

Putative glycoside hydrolasesd

Cellulase 5 (0.1%) 14 (0.2%)

Amylase 19 (0.3%) 8 (0.1%)

Debranching enzyme 3 (0.1%) 2 (0.1%)

Amino-sugar-degrading enzyme 35 (0.6%) 45 (0.6%)

Oligosaccharide-degrading enzyme 43 (0.7%) 23 (0.3%)

Putative protease/peptidased

Protease 27 (0.5% of total ORFs) 28 (0.4% of total ORFs)

Peptidase 60 (1.0%) 78 (1.1%)

Putative environmental signaling system genes

Transmembrane sensore135 (2.3%) 114 (1.6%)

Response regulator containing CheY-like domainf131 (2.2%) 116 (1.6%)

Proposed class Moduliﬂexia Vecturatrichia

Proposed order Moduliﬂexales Vecturatrichales

Proposed family Moduliﬂexaceae Vecturatrichaceae

Notes.

aGenome completeness and contamination were estimated based on the presence/absence of a 111 single-copy gene set from Dupont et al. (2012). Revised genome

completeness and contamination were calculated based on a revised total of 105 single-copy genes estimated to be present in the KSB3 genomes (Table S2 ). Numbers in

parentheses indicats detected number of genes per total number of each gene set.

bRelative genome abundance for each KSB3 genome was determined based on 16S rRNA gene proﬁling using shotgun metagenome data (Table S1).

cNumber of rRNA operons in the KSB3 genomes were inferred based on relative coverage proﬁles of KSB3 16S rRNA genes and the genome averages (Fig. S2).

dCounts (% of total ORFs).

ePredicted number of transmembrane sensors based on the possession of a sensor domain (Galperin, 2004) and >1 transmembrane segments (Table S8).

fNumber of all two-domain response regulators containing CheY-like domains estimated from PSI-BLAST searches of domain-speciﬁc proﬁles against the protein set

described in Galperin (2004) (Table S8).

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 9/24

Figure 1 Phylogenetic structure of the Modulibacteria (KSB3) phylum based on comparative analysis of 16S rRNA gene sequences, and imaging

of KSB3 cells. (A) Maximum-likelihood phylogenetic tree (RAxML) of public data (accession numbers shown) and the 16S rRNA sequence

determined in this study for UASB14. Sequences from the bacterial phyla Nitrospirae, Tenericutes, and Chloroﬂexi were used to root the tree

(not shown). Reproducible interior nodes are indicated as a black circle (>90% bootstrap support for neighbor-joining [NJ], maximum parsimony

[MP], and maximum-likelihood [ML] inferences), open circle (>80% support); or open rectangle (>70% support). Nodes without symbols were

not reproducible between trees. The scale bar represents 5% estimated sequence divergence. Class-level clades are bracketed to the right of the ﬁgure

in black. The target ranges of KSB3-speciﬁc FISH (ﬂuorescence in situ hybridization) probes used in this study are indicated by colored brackets with

the colors corresponding to cell color in (B) and (D). (B) 16S rRNA-targeted FISH detection of UASB14 and UASB270 ﬁlaments in the UASB sludge.

The abundant UASB14 ﬁlaments are labeled green and the low abundance UASB270 ﬁlaments are labeled red. (C) Total KSB3 ﬁlament abundance

highlighted by a phylum-level FISH probe relative to (D) all cells present in the same ﬁeld (phase-contrast image). Bars in (B–D) represent 10 µm.

of marker genes broadly conserved in all domains of life (38 markers) (Darling et al., 2014)

or in Bacteria (83 markers) (Soo et al., 2014) were obtained from the KSB3 genomes and

up to 354 publicly available reference genomes (Markowitz et al., 2014). Each gene family

was independently aligned and ambiguous and/or non-informative positions removed,

and then the ﬁltered alignments were concatenated for maximum-likelihood inference.

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 10/24

Four sets of outgroup conﬁgurations were used including representatives of all major

genomically sampled bacterial phyla (Table S3). The two KSB3 genomes form a robustly

monophyletic group in all analyses, and did not reproducibly aﬃliate with any other phyla

(Fig. 2,Fig. S7,Table S4), consistent with the original proposal that KSB3 is a candidate

bacterial phylum (Yamada et al., 2007). The average amino acid identity (AAI) between

UASB14 and UASB270 is 60.3% (Fig. S8), and supports their assignment to separate classes

as it falls within the range of known class-level AAI values (44–61%, Konstantinidis &

Tiedje, 2005).

To conﬁrm the ﬁlamentous morphology and relative abundance of the KSB3

phylotypes, we designed 16S rRNA-targeted ﬂuorescence in situ hybridization (FISH)

probes speciﬁc at the phylum and class level and combined them with previously applied

KSB3-speciﬁc probes (Yamada et al., 2007) (Table S5). We detected only ﬁlamentous KSB3

morphotypes in the UASB sludge and these comprised the majority of observed ﬁlaments

in sample F1 (Figs. 1C–1D). The relative abundance of the two KSB3 phylotypes inferred

from both amplicon and metagenome data was also consistent with FISH analyses; that

is, ﬁlaments belonging to the class Moduliﬂexia (presumably mostly UASB14) greatly

outnumbered those belonging to the Venturitrichia (presumably mostly UASB270)

(Fig. 1B), noting that the two ﬁlaments were indistinguishable by light microscopy alone

(Figs. 1C–1D).

General features of the KSB3 genomes

Both KSB3 genomes are large by bacterial standards, >7 Mb (Fig. S9) and have median

GC content, ∼50% (Table 1). Since UASB14 and UASB270 are not close relatives, large

genome size may be a characteristic feature of the KSB3 phylum or at least of the two

classes that they represent (Fig. 1A). A total of 5,989 and 7,048 open reading frames

(ORFs) were identiﬁed in the UASB14 and UASB270 genomes, respectively (Table 1).

For both genomes, approximately two thirds of the ORFs had a predicted function and

the remaining third were hypotheticals. Reciprocal BLASTP best matches between the

predicted gene products of the two KSB3 genomes indicate a shared set of 3,296 orthologs,

representing approximately half of the gene inventories in each genome. Included in this

common set are conserved genes for translation, nucleotide transport and metabolism,

and construction of a diderm (Gram negative) cell envelope including lipopolysaccharide

synthesis (Fig. S10). We identiﬁed a full complement of rRNA and tRNA genes in UASB14,

but not UASB270, which were likely missed in the latter genome (Table 1;Table S6).

The UASB14 rRNA genes are estimated to be present as three nearly identical operons

(Fig. S5) collapsed into a single large repeat during the assembly process. A number of

large clustered regularly interspaced short palindromic repeats (CRISPR) were identiﬁed in

both genomes (Table 1). CRISPR, together with associated cas genes, constitute a recently

described defense mechanism against invading foreign DNAs and have been found in a

majority of bacterial genera and most Archaea (Sorek, Kunin & Hugenholtz, 2008). A COG

category analysis of the KSB3 genomes indicates that both have high relative proportions

of carbohydrate metabolism and transport (G) and signal transduction (T) relative to the

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 11/24

Figure 2 Maximum-likelihood phylogenetic inference of Modulibacteria (KSB3) population genomes among known bacterial phyla. The tree

was constructed using RAxML based on up to 38 marker genes (using taxon-outgroup conﬁguration Conﬁg 3, Table S3) and sequences were

collapsed at the phylum level except for classes in the Proteobacteria. Ranks are indicated by preﬁx; p (phylum), c (class). KSB3 genomes obtained

in this study are highlighted in red. Superphyla (Terrabacteria, Patescibacteria, Fibrobacteres-Chlorobi-Bacteroidetes [FCB], and Planctomycetes-

Verrucomicrobia-Chlamydiae [PVC]) are highlighted with color ranges. Taxa comprising cultivated representatives are shown in black; taxa with no

cultivated representatives are indicated by outline. Reproducible associations (>80% bootstrap values from 100 resamplings) are indicated by dots

on interior nodes. Alignments of homologous proteins from archaeal genomes were used to root the tree (not shown). The scale bar represents 10%

estimated sequence divergence.

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 12/24

bacterial average (Fig. 3). More detailed inferred metabolic properties of the two KSB3

genomes are described below.

Strictly fermentative metabolism

Both KSB3 representatives have an incomplete tri-carboxylic acid (TCA) cycle and

lack most electron-transport chain complexes including terminal oxidases, indicating

a strictly fermentative metabolism (Fig. 4). They encode, however, both superoxide

reductase and thioredoxin reductase, suggesting oxidative stress tolerance. Both genomes

have a large complement of transporters and enzymes for importing and degrading

complex and simple carbohydrates, which can then be fed into a complete glycolysis

(Embden-Meyerhof-Parnas) pathway (Fig. 4; COG category G in Fig. 3). Both ﬁlaments

also are likely capable of hydrolyzing polymers such as cellulose and starch via a range

of glycoside hydrolases (Table S7). They redundantly encode four diﬀerent enzymes for

converting pyruvate to acetyl–coenzyme A (acetyl-CoA), namely pyruvate dehydrogenase,

pyruvate-formate lyase, pyruvate ferredoxin oxidoreductase, and pyruvate-ﬂavodoxin

oxidoreductase. Both generate adenosine triphosphate (ATP) by converting acetyl-CoA to

acetate via two enzymes (acetate kinase and phosphate acetyltransferase) commonly found

in general fermentative anaerobes (Mai & Adams, 1996), in addition to glycolysis. They

may reoxidize NADH produced during glycolysis by converting pyruvate to D-lactate and

acetyl-CoA to ethanol (Fig. 4).

The KSB3 genomes possess a large complement of enzymes for conversion and

transport of amino acid and peptides (Fig. 4; COG category E in Fig. 3), including

numerous proteases and peptidases (Table 1). Peptide and amino acid degradation in

the KSB3 ﬁlaments may produce pyruvate, oxaloacetate, succinyl-CoA, and possibly

propionyl CoA (Fig. 4). Notably, a complete set of genes for the methylmalonyl CoA

pathway was identiﬁed in UASB270, suggesting a role in either amino acid degradation,

propionate oxidation and/or propionate formation as a fermentative end product in this

organism. Some fermentative anaerobes are known to produce hydrogen to scavenge

excess electrons generated during metabolism (Sieber, McInerney & Gunsalus, 2012).

In both KSB3 genomes, we identiﬁed several hydrogenase genes (Fig. 4). By examining

domain structure and gene neighbourhoods (Fig. S11), we predict that some of these genes

encode catalytic enzymes. This may permit them to engage in syntrophic interactions

with hydrogenotrophs, such as methanogens, in the sludge granules (Sieber, McInerney &

Gunsalus, 2012). However, based on FISH experiments highlighting KSB3 and archaeal

cells, we did not observe a close proximity between the two groups that would facilitate

syntrophy (data not shown). Some of the hydrogenase genes are located next to signal

transduction genes raising the possibility that they are involved in signal transduction and

chemotaxis (Fig. S11).

Sensory capabilities and motility

One of the most striking features of the KSB3 genomes is the presence of extensive

regulatory networks, including two-component signal transduction systems (Table 1;

Table S8). Signal transduction genes (COG category T) are among the highest represented

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 13/24

Figure 3 Relative representation of COG categories by predicted ORFs in the UASB14 and UASB270 genomes. Global averages and standard

deviation (bars) are shown for 2,279 publicly available ﬁnished bacterial and archaeal genomes (Markowitz et al., 2014). Statistically signiﬁcant

diﬀerences are indicated by percentile of scores for all the available ﬁnished bacterial and archaeal genomes.

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 14/24

Figure 4 Composite metabolic overview of the Modulibacteria (KSB3) genomes based on identiﬁed genes and pathways. Gray indicates elements

common to both genomes, while orange and green show elements speciﬁc to UASB14 and UASB270, respectively. Both ﬁlament types have the

genes necessary to produce acetate, ethanol, lactate, and hydrogen (and possibly propionate) as fermentative end products, likely generating energy

through the glycolytic Embden-Meyerhof-Parnas (EMP) pathway and the fermentation of amino acids and sugars. Abbreviations: ETF, electron

transfer ﬂavoprotein; Fd-ox and Fd-red, oxidized and reduced ferredoxin, respectively; UQ, ubiquinone.

categories in both genomes (Fig. 3). Two-component systems respond to a broad range of

extracellular and intracellular signals, and play a role in many cellular processes including

growth, motility, and the cell cycle (Galperin, 2004;Skerker et al., 2005;Kirby, 2009).

UASB14 and UASB270 encode 135 and 114 putative transmembrane sensor proteins likely

used for environmental signaling (Galperin, 2004), and 131 and 116 putative response

regulators containing CheY-like domains, respectively (Table S8). They each contain over

60 methyl-accepting proteins and numerous Che-like chemotaxis proteins (Table S8).

Even when compensating for their relatively large genome sizes, both KSB3 genomes

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 15/24

Figure 5 Number of protein domains inferred to be involved in environmental signaling for the two

Modulibacteria (KSB3) genomes and ﬁnished bacterial and archaeal genomes. Number of protein

domains inferred to be involved in environmental signaling (Table S8) as a function of genome size

for the two Modulibacteria (KSB3) genomes (in red) and 2,279 publicly available ﬁnished bacterial and

archaeal genomes (in blue). The KSB3 ﬁlaments have among the highest proportion of signaling domains,

only surpassed by members of the Myxobacteria (open blue circles), which are capable of fruiting body

formation by contact-mediated signaling.

possess high proportions of environmental sensory networks compared to other sequenced

bacterial and archaeal genomes (Fig. 5;Figs. S12 and S13;Table S9). The high represen-

tation of sensory components in the KSB3 genomes is on par with social Myxococcales

such as Sorangium cellulosum and Stigmatella aurantiaca, both of which exhibit complex,

self-organizing behavior in response to environmental stimuli (Huntley et al., 2011).

Sensory capabilities are an important component of a bacterium’s overall social

“intelligence” or social IQ (Ben-Jacob et al., 2004), a metric recently proposed based on

the abundance of two-component systems, transcription factors, defense mechanisms and

transport systems (Sirota-Madi et al., 2010). We determined that the KSB3 ﬁlaments have

among the highest social IQ scores of any sequenced bacterial and archaeal species to date,

scoring particularly well in the two-component and transport system categories (Fig. S14).

This suggests that the ﬁlaments are sensitive to their surroundings and capable of adaptable

behavior in response to changes in their local environment. Key to this adaptability is

motility. No genes for ﬂagella production were identiﬁed in either KSB3 genome, so to

determine if KSB3 bacteria are indeed motile, we observed ﬁlaments enriched from UASB

granules by wet mount microscopy under a range of conditions. KSB3-speciﬁc FISH of

samples taken in parallel conﬁrmed that the majority of ﬁlaments in these samples were

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 16/24

members of the KSB3 phylum (Fig. 1). Initially no motility was observed, therefore based

on the metabolic reconstruction of the KSB3 genomes, we added a range of compounds

(mostly simple sugars, see ‘Methods’ section) to the edge of the microscope slides to create

a gradient that could be sensed by the ﬁlaments to stimulate a motility response. We

observed gliding motility at rates of between 20 to 30 µm/min only when a glucose or

maltose gradient was applied under conditions mimicking the UASB reactor operation

(Movie S1). Both KSB3 genomes encode a number of the genes necessary for type IV

pili formation (pilB, pilC, pilG, pilT, pilV, and ﬂp pilus assembly protein) that may

enable gliding via extension and retraction (Jarrell & McBride, 2008). However, the full

gene complement for pili formation (Mauriello et al., 2010) was not detected and the

mechanism for KSB3 gliding motility remains to be determined.

DISCUSSION

Despite the biotechnological signiﬁcance of industrial-scale anaerobic digestion, our

understanding of the microbial ecology that underpins these processes is still rudimentary

because most microorganisms cannot be cultured and such systems are essentially man-

aged as “black boxes” (Ahring, 2003b;Rivi`

ere et al., 2009). Emerging culture-independent

molecular techniques such as diﬀerential coverage binning of metagenomic data, which

allows even low abundance population genomes to be recovered (Sharon et al., 2013;

Albertsen et al., 2013), are providing new opportunities to understand and optimize system

performance (Vanwonterghem et al., 2014).

Using this approach, we obtained the ﬁrst population genomes representing candidate

bacterial phylum KSB3 (Tanner et al., 2000;Yamada et al., 2007). One of these genomes,

UASB14, belongs to a high abundance ﬁlament (∼10% of the community; Table 1;Fig. 1)

previously reported to be responsible for bulking in an industrial UASB system treating

wastewater from sugar manufacture (Yamada et al., 2007). A second genome from the

same habitat, UASB270, represents a low abundance (<0.5%) ﬁlament only moderately

related to the ﬁrst, i.e., they represent diﬀerent classes within the KSB3 phylum (Fig. 1).

Metabolic reconstruction indicates that both ﬁlaments are primary fermenters of sugar

and amino acid-containing compounds in the system (Fig. 4), and both have a high “social

IQ” based in part on possession of extensive regulatory networks (Table 1;Tables S8 and

S9;Fig. S14). These ﬁndings support the hypothesis that KSB3 ﬁlaments are important

primary fermenters in healthy sludge granules (Yamada et al., 2011) and further suggest

that the ﬁlaments are sensitive to their surroundings and that their cellular processes,

such as growth, may be controlled by external signals. Whether these features can be

extrapolated to the whole KSB3 phylum, or simply reﬂect the specialized habitat from

which the genomes were obtained, remains to be determined. Environmental surveys

suggest that the phylum has a shallow ecological footprint, having been identiﬁed in mostly

anoxic saline habitats (Fig. 1A), which may indicate that a fermentative metabolism is

universal.

The inferred capacity of the ﬁlaments to detect physicochemical gradients in their

surroundings suggests that they should be motile. Apart from an incomplete gene

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 17/24

complement for Type IV pili, no motility mechanism could be identiﬁed. However,

microscopic observations indicated that the KSB3 ﬁlaments are capable of gliding motility

in response to applied sugar gradients (Movie S1). Gliding motility is thought to have

evolved independently in multiple bacterial lineages, and the molecular mechanisms

of gliding are only partially elucidated for a limited number of bacterial taxa (Jarrell

& McBride, 2008;Mignot & Kirby, 2008). This is the ﬁrst report of gliding motility

of organisms in UASB sludge granules, which have long been considered to have an

organization driven by growth and attachment rather than motility of cells (Liu et al.,

2003;HulshoﬀPol et al., 2004). An enhanced sensory system is also likely the key driver of

the bulking phenomenon; that is, changes in the UASB reactor such as increases in glucose

or maltose concentration trigger outgrowth of the KSB3 ﬁlaments (Yamada et al., 2011).

It may also explain why repeated attempts to cultivate KSB3 ﬁlaments have failed to date

(Yamada et al., 2011), because they require speciﬁc and possibly complex environmental

cues to stimulate growth in axenic culture.

The inference that the KSB3 ﬁlaments sense sugars and the observation of a gliding

motility response in the presence of a glucose or maltose gradient is consistent with the

previous observation of uptake of these sugars by KSB3 ﬁlaments (Yamada et al., 2011).

Plant operators began monitoring glucose concentration in the UASB reactor inﬂuent

using a simple urine test strip. No further bulking has occurred to date since keeping

inﬂuent glucose concentration uniformly low (<200 mg/L) via adjustment of retention

times in the acidiﬁcation pretreatment. A more detailed understanding of environmental

stimuli responsible for growth and bulking will be facilitated by the availability of the KSB3

genome sequences which may lead to genome-directed cultivation (Tyson et al., 2005) and

other treatment options for bulking.

We propose the names ‘Candidatus Moduliﬂexus ﬂocculans’ and ‘Candidatus

Vecturithrix granuli’ for the two KSB3 ﬁlament types represented by the UASB14 and

UASB270 genomes respectively, and the phylum name, Modulibacteria, and intermediate

rank names (Table 1;Supplemental Information 1).

CONCLUSIONS

In summary, this study adds novel genomic ‘foliage’ to the tree of life by reporting the near

complete genomes of two phylogenetically diverse members of candidate bacterial phylum

KSB3 obtained from an industrial UASB system. Genome-based metabolic reconstruction

and experimental observations provide clues to the roles of the KSB3 bacteria in the

treatment system including their ability to ferment sugars and chemotactically respond

to glucose and maltose gradients, laying the foundations for a detailed understanding of

their ecophysiology and role in wastewater bulking.

ACKNOWLEDGEMENTS

We thank Jason Steen, Ben Woodcroft, Mohamed F. Haroon, and Michael Imelfort from

the University of Queensland for assistance and advice on the bioinformatic analyses and

Taeko Yokoi from AIST for assistance with ITS and FISH experiments. We also thank

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 18/24

Satoshi Hanada from AIST and Bernhard Schink from the University of Konstanz for

etymological advice.

ADDITIONAL INFORMATION AND DECLARATIONS

Funding

The study was supported by the Biomedical Research Institute of the National Institute of

Advanced Industrial Science and Technology (AIST). Philip Hugenholtz and Donovan H.

Parks were supported by an Australian Research Council (ARC) Discovery Outstanding

Researcher Award (DORA) grant DP120103498. Gene W. Tyson was supported by an ARC

Queen Elizabeth II Fellowship, grant DP1093175. Donovan H. Parks was also supported

by the Natural Sciences and Engineering Research Council of Canada. The funders had no

role in study design, data collection and analysis, decision to publish, or preparation of the

manuscript.

Grant Disclosures

The following grant information was disclosed by the authors:

National Institute of Advanced Industrial Science and Technology.

Australian Research Council.

Discovery Outstanding Researcher Award: DP120103498.

ARC Queen Elizabeth II Fellowship: DP1093175.

Natural Sciences and Engineering Research Council of Canada.

Competing Interests

Yuji Sekiguchi and Akiko Ohashi are employees of the National Institute of Advanced

Industrial Science and Technology (AIST). Toshihiro Yamauchi is an employee of Kubota

Kasui Corporation.

Author Contributions

•Yuji Sekiguchi conceived and designed the experiments, performed the experiments,

analyzed the data, contributed reagents/materials/analysis tools, wrote the paper,

prepared ﬁgures and/or tables, reviewed drafts of the paper.

•Akiko Ohashi and Toshihiro Yamauchi performed the experiments, contributed

reagents/materials/analysis tools, reviewed drafts of the paper.

•Donovan H. Parks and Gene W. Tyson analyzed the data, contributed

reagents/materials/analysis tools, reviewed drafts of the paper.

•Philip Hugenholtz analyzed the data, contributed reagents/materials/analysis tools,

wrote the paper, prepared ﬁgures and/or tables, reviewed drafts of the paper.

DNA Deposition

The following information was supplied regarding the deposition of DNA sequences:

The genome sequences of ‘Candidatus Moduliﬂexus ﬂocculans’ and ‘Candidatus

Vecturithrix granuli’ have been deposited in DDBJ/EMBL/GenBank under the accession

numbers DF820455–DF820462 and DF820463–DF820483. The ITS sequences of the

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 19/24

two genotypes have also been deposited in the databases under the accession numbers

AB933567 and AB933568.

New Species Registration

The following information was supplied regarding the registration of a newly described

species:

Not applicable.

Supplemental Information

Supplemental information for this article can be found online at http://dx.doi.org/

10.7717/peerj.740#supplemental-information.

REFERENCES

Ahring BK. 2003a. Biomethanation I,Advances in biochemical engineering/biotechnology. Berlin:

Springer.

Ahring BK. 2003b. Perspectives for anaerobic digestion. In: Biomethanation I,Advances in

biochemical engineering/biotechnology. Berlin: Springer, 1–30.

Albertsen M, Hugenholtz P, Skarshewski A, Nielsen KL, Tyson GW, Nielsen PH. 2013. Genome

sequences of rare, uncultured bacteria obtained by diﬀerential coverage binning of multiple

metagenomes. Nature Biotechnology 31:533–538 DOI 10.1038/nbt.2579.

Angelidaki I, Karakashev D, Batstone DJ, Plugge CM, Stams AJM. 2011. Biomethanation and its

potential. Methods in Enzymology 494:327–351 DOI 10.1016/B978-0-12-385112-3.00016-0.

Aoki-Kinoshita KF, Kanehisa M. 2007. Gene annotation and pathway mapping in KEGG. Methods

in Molecular Biology 396:71–91.

Ben-Jacob E, Becker I, Shapira Y, Levine H. 2004. Bacterial linguistic communication and social

intelligence. Trends in Microbiology 12:366–372 DOI 10.1016/j.tim.2004.06.006.

Bland C, Ramsey TL, Sabree F, Lowe M, Brown K, Kyrpides NC, Hugenholtz P. 2007. CRISPR

Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced

palindromic repeats. BMC Bioinformatics 8:209 DOI 10.1186/1471-2105-8-209.

Boetzer M, Henkel CV, Jansen HJ, Butler D, Pirovano W. 2011. Scaﬀolding pre-assembled

contigs using SSPACE. Bioinformatics 27:578–579 DOI 10.1093/bioinformatics/btq683.

Caporaso JG, Bittinger K, Bushman FD, Desantis TZ, Andersen GL, Knight R. 2010a. PyNAST:

a ﬂexible tool for aligning sequences to a template alignment. Bioinformatics 26:266–267

DOI 10.1093/bioinformatics/btp636.

Caporaso JG, Kuczynski J, Stombaugh J, Bittinger K, Bushman FD, Costello EK, Fierer N,

Pe˜

na AG, Goodrich JK, Gordon JI, Huttley GA, Kelley ST, Knights D, Koenig JE, Ley RE,

Lozupone CA, McDonald D, Muegge BD, Pirrung M, Reeder J, Sevinsky JR, Turnbaugh PJ,

Walters WA, Widmann J, Yatsunenko T, Zaneveld J, Knight R. 2010b. QIIME allows

analysis of high-throughput community sequencing data. Nature Methods 7:335–336

DOI 10.1038/nmeth.f.303.

Castelle CJ, Hug LA, Wrighton KC, Thomas BC, Williams KH, Wu D, Tringe SG, Singer SW,

Eisen JA, Banﬁeld JF. 2013. Extraordinary phylogenetic diversity and metabolic versatility in

aquifer sediment. Nature Communications 4:Article 2120 DOI 10.1038/ncomms3120.

Darling AE, Jospin G, Lowe E, Matsen IV FA, Bik HM, Eisen JA. 2014. PhyloSift: phylogenetic

analysis of genomes and metagenomes. PeerJ 2:e243 DOI 10.7717/peerj.243.

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 20/24

Doetsch RN. 1981. Determinative methods of light microscopy. In: Gerhardt P, ed. Manual of

methods for general bacteriology. Washington, D.C.: American Society for Microbiology, 21–33.

Dupont CL, Rusch DB, Yooseph S, Lombardo M-J, Richter RA, Valas R, Novotny M,

Yee-Greenbaum J, Selengut JD, Haft DH, Halpern AL, Lasken RS, Nealson K, Friedman R,

Venter JC. 2012. Genomic insights to SAR86, an abundant and uncultivated marine bacterial

lineage. The ISME Journal 6:1186–1199 DOI 10.1038/ismej.2011.189.

Eddy SR. 2011. Accelerated proﬁle HMM searches. PLoS Computational Biology 7:e1002195

DOI 10.1371/journal.pcbi.1002195.

Felsenstein J. 1989. PHYLIP—Phylogeny inference package (version 3.2). Cladistics 5:164–166.

Galperin MY. 2004. Bacterial signal transduction network in a genomic perspective. Environmental

Microbiology 6:552–567 DOI 10.1111/j.1462-2920.2004.00633.x.

HulshoﬀPol LW, de Castro Lopes SI, Lettinga G, Lens PNL. 2004. Anaerobic sludge granulation.

Water Research 38:1376–1389 DOI 10.1016/j.watres.2003.12.002.

Huntley S, Hamann N, Wegener-Feldbr¨

ugge S, Treuner-Lange A, Kube M, Reinhardt R,

Klages S, M¨

uller R, Ronning CM, Nierman WC, Søgaard-Andersen L. 2011. Comparative

genomic analysis of fruiting body formation in Myxococcales. Molecular Biology and Evolution

28:1083–1097 DOI 10.1093/molbev/msq292.

Imelfort M, Parks DH, Woodcroft BJ, Dennis P, Hugenholtz P, Tyson GW. 2014. GroopM: an

automated tool for the recovery of population genomes from related metagenomes. PeerJ

2:e603 DOI 10.7717/peerj.603.

Jarrell KF, McBride MJ. 2008. The surprisingly diverse ways that prokaryotes move. Nature

Reviews Microbiology 6:466–476 DOI 10.1038/nrmicro1900.

Kirby JR. 2009. Chemotaxis-like regulatory systems: unique roles in diverse bacteria. Annual

Reviews in Microbiology 63:45–59 DOI 10.1146/annurev.micro.091208.073221.

Kleerebezem R, Macarie H. 2003. Treating industrial wastewater: anaerobic digestion comes of

age. Chemical Engineering 110:56–64.

Konstantinidis KT, Tiedje JM. 2005. Towards a genome-based taxonomy for prokaryotes. Journal

of Bacteriology 187:6258–6264 DOI 10.1128/JB.187.18.6258-6264.2005.

Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, Jones SJ, Marra MA. 2009.

Circos: an information aesthetic for comparative genomics. Genome Research 19:1639–1645

DOI 10.1101/gr.092759.109.

Letunic I, Bork P. 2011. Interactive Tree Of Life v2: online annotation and display of phylogenetic

trees made easy. Nucleic Acids Research 39:W475–W478 DOI 10.1093/nar/gkr201.

Li H, Durbin R. 2010. Fast and accurate long-read alignment with Burrows–Wheeler transform.

Bioinformatics 26:589–595 DOI 10.1093/bioinformatics/btp698.

Li J, Hu B, Zheng P, Qaisar M, Mei L. 2008. Filamentous granular sludge bulking in a laboratory

scale UASB reactor. Bioresource Technology 99:3431–3438 DOI 10.1016/j.biortech.2007.08.005.

Liu Y, Xu H-L, Yang S-F, Tay J-H. 2003. Mechanisms and models for anaerobic granulation in

upﬂow anaerobic sludge blanket reactor. Water Research 37:661–673

DOI 10.1016/S0043-1354(02)00351-2.

Lombard V, Ramulu HG, Drula E, Coutinho PM, Henrissat B. 2014. The carbohydrate-

active enzymes database (CAZy) in 2013. Nucleic Acids Research 42:D490–D495

DOI 10.1093/nar/gkt1178.

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 21/24

Ludwig W, Strunk O, Westram R, Richter L, Meier H, Yadhukumar, Buchner A, Lai T, Steppi S,

Jobb G, F¨

orster W, Brettske I, Gerber S, Ginhart AW, Gross O, Grumann S, Hermann S,

Jost R, K¨

onig A, Liss T, L¨

ussmann R, May M, NonhoﬀB, Reichel B, Strehlow R, Stamatakis A,

Stuckmann N, Vilbig A, Lenke M, Ludwig T, Bode A, Schleifer K-H. 2004. ARB: a software

environment for sequence data. Nucleic Acids Research 32:1363–1371 DOI 10.1093/nar/gkh293.

Mai X, Adams MW. 1996. Puriﬁcation and characterization of two reversible and ADP-dependent

acetyl coenzyme. A synthetases from the hyperthermophilic archaeon Pyrococcus furiosus.

Journal of Bacteriology 178:5897–5903.

Marchler-Bauer A, Zheng C, Chitsaz F, Derbyshire MK, Geer LY, Geer RC, Gonzales NR,

Gwadz M, Hurwitz DI, Lanczycki CJ, Lu F, Lu S, Marchler GH, Song JS, Thanki N,

Yamashita RA, Zhang D, Bryant SH. 2013. CDD: conserved domains and protein

three-dimensional structure. Nucleic Acids Research 41:D348–D352 DOI 10.1093/nar/gks1243.

Markowitz VM, Chen I-MA, Chu K, Szeto E, Palaniappan K, Pillay M, Ratner A, Huang J,

Pagani I, Tringe S, Huntemann M, Billis K, Varghese N, Tennessen K, Mavromatis K, Pati A,

Ivanova NN, Kyrpides NC. 2014. IMG/M 4 version of the integrated metagenome comparative

analysis system. Nucleic Acids Research 42:D568–D573 DOI 10.1093/nar/gkt919.

Mauriello EMF, Mignot T, Yang Z, Zusman DR. 2010. Gliding motility revisited: how do the

myxobacteria move without ﬂagella? Microbiology and Molecular Biology Reviews 74:229–249

DOI 10.1128/MMBR.00043-09.

McDonald D, Price MN, Goodrich J, Nawrocki EP, Desantis TZ, Probst A, Andersen GL,

Knight R, Hugenholtz P. 2012. An improved Greengenes taxonomy with explicit ranks for

ecological and evolutionary analyses of bacteria and archaea. The ISME Journal 6:610–618

DOI 10.1038/ismej.2011.139.

Mignot T, Kirby JR. 2008. Genetic circuitry controlling motility behaviors of Myxococcus xanthus.

BioEssays 30:733–743 DOI 10.1002/bies.20790.

Moller S, Croning M, Apweiler R. 2001. Evaluation of methods for the prediction of membrane

spanning regions. Bioinformatics 17:646–653 DOI 10.1093/bioinformatics/17.7.646.

Narihiro T, Terada T, Kikuchi K, Iguchi A, Ikeda M, Yamauchi T, Shiraishi K, Kamagata Y,

Nakamura K, Sekiguchi Y. 2009. Comparative analysis of bacterial and archaeal communities

in methanogenic sludge granules from upﬂow anaerobic sludge blanket reactors treating various

food-processing, high-strength organic wastewaters. Microbes and Environments 24:88–96

DOI 10.1264/jsme2.ME08561.

Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. 2014. CheckM: assessing the

quality of microbial genomes recovered from isolates, single cells, and metagenomes. PeerJ

PrePrints 2:e554v1 DOI 10.7287/peerj.preprints.554v1.

Price MN, Dehal PS, Arkin AP. 2009. FastTree: computing large minimum evolution trees

with proﬁles instead of a distance matrix. Molecular Biology and Evolution 26:1641–1650

DOI 10.1093/molbev/msp077.

Punta M, Coggill PC, Eberhardt RY, Mistry J, Tate J, Boursnell C, Pang N, Forslund K,

Ceric G, Clements J, Heger A, Holm L, Sonnhammer ELL, Eddy SR, Bateman A, Finn RD.

2012. The Pfam protein families database. Nucleic Acids Research 40:D290–D301

DOI 10.1093/nar/gkr1065.

Quevillon E, Silventoinen V, Pillai S, Harte N, Mulder N, Apweiler R, Lopez R. 2005.

InterProScan: protein domains identiﬁer. Nucleic Acids Research 33:W116–W120

DOI 10.1093/nar/gki442.

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 22/24

Rivi`

ere D, Desvignes V, Pelletier E, Chaussonnerie S, Guermazi S, Weissenbach J, Li T,

Camacho P, Sghir A. 2009. Towards the deﬁnition of a core of microorganisms involved in

anaerobic digestion of sludge. The ISME Journal 3:700–714 DOI 10.1038/ismej.2009.2.

Seemann T. 2014. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30:2068–2069

DOI 10.1093/bioinformatics/btu153.

Sekiguchi Y, Kamagata Y, Nakamura K, Ohashi A, Harada H. 1999. Fluorescence in situ

hybridization using 16S rRNA-targeted oligonucleotides reveals localization of methanogens

and selected uncultured bacteria in mesophilic and thermophilic sludge granules. Applied and

Environmental Microbiology 65:1280–1288.

Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B,

Ideker T. 2003. Cytoscape: a software environment for integrated models of biomolecular

interaction networks. Genome Research 13:2498–2504 DOI 10.1101/gr.1239303.

Sharon I, Morowitz MJ, Thomas BC, Costello EK, Relman DA, Banﬁeld JF. 2013. Time series

community genomics analysis reveals rapid shifts in bacterial species, strains, and phage during

infant gut colonization. Genome Research 23:111–120 DOI 10.1101/gr.142315.112.

Sieber JR, McInerney MJ, Gunsalus RP. 2012. Genomic insights into syntrophy: the

paradigm for anaerobic metabolic cooperation. Annual Review of Microbiology 66:429–452

DOI 10.1146/annurev-micro-090110-102844.

Sirota-Madi A, Olender T, Helman Y, Ingham C, Brainis I, Roth D, Hagi E, Brodsky L,

Leshkowitz D, Galatenko V, Nikolaev V, Mugasimangalam RC, Bransburg-Zabary S,

Gutnick DL, Lancet D, Ben-Jacob E. 2010. Genome sequence of the pattern forming

Paenibacillus vortex bacterium reveals potential for thriving in complex environments. BMC

Genomics 11:710 DOI 10.1186/1471-2164-11-710.

Skerker JM, Prasol MS, Perchuk BS, Biondi EG, Laub MT. 2005. Two-component signal

transduction pathways regulating growth and cell cycle progression in a bacterium: a

system-level analysis. PLoS Biology 3:e334 DOI 10.1371/journal.pbio.0030334.

Soo RM, Skennerton CT, Sekiguchi Y, Imelfort M, Paech SJ, Dennis PG, Steen JA, Parks DH,

Tyson GW, Hugenholtz P. 2014. An expanded genomic representation of the phylum

cyanobacteria. Genome Biology and Evolution 6:1031–1045 DOI 10.1093/gbe/evu073.

Sorek R, Kunin V, Hugenholtz P. 2008. CRISPR |a widespread system that provides acquired

resistance against phages in bacteria and archaea. Nature Reviews Microbiology 6:181–186

DOI 10.1038/nrmicro1793.

Stamatakis A. 2006. RAxML-VI-HPC: maximum likelihood-based phylogenetic

analyses with thousands of taxa and mixed models. Bioinformatics 22:2688–2690

DOI 10.1093/bioinformatics/btl446.

Suzek BE, Huang H, McGarvey P, Mazumder R, Wu CH. 2007. UniRef: comprehensive

and non-redundant UniProt reference clusters. Bioinformatics 23:1282–1288

DOI 10.1093/bioinformatics/btm098.

Swoﬀord DL. 2003. PAUP*: phylogenetic analysis using parsimony. version 4.0b10. Available at http:

//paup.csit.fsu.edu/index.html.

Talavera G, Castresana J. 2007. Improvement of phylogenies after removing divergent and

ambiguously aligned blocks from protein sequence alignments. Systematic Biology 56:564–577

DOI 10.1080/10635150701472164.

Tanner MA, Everett CL, Coleman WJ, Yang MM. 2000. Complex microbial communities

inhabiting sulﬁde-rich black mud from marine coastal environments. Biotechnology et alia

8:1–16.

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 23/24

Tatusov RL, Galperin MY, Natale DA, Koonin EV. 2000. The COG database: a tool for

genome-scale analysis of protein functions and evolution. Nucleic Acids Research 28:33–36

DOI 10.1093/nar/28.1.33.

Tyson GW, Lo I, Baker BJ, Allen EE, Hugenholtz P, Banﬁeld JF. 2005. Genome-directed

isolation of the key nitrogen ﬁxer Leptospirillum ferrodiazotrophum sp. nov. from an

acidophilic microbial community. Applied and Environmental Microbiology 71:6319–6324

DOI 10.1128/AEM.71.10.6319-6324.2005.

Van Lier JB. 2008. High-rate anaerobic wastewater treatment: diversifying from end-of-the-pipe

treatment to resource-oriented conversion techniques. Water Science and Technology

57:1137–1148 DOI 10.2166/wst.2008.040.

Vanwonterghem I, Jensen PD, Ho DP, Batstone DJ, Tyson GW. 2014. Linking microbial

community structure, interactions and function in anaerobic digesters using new molecular

techniques. Current Opinion in Biotechnology 27:55–64 DOI 10.1016/j.copbio.2013.11.004.

Yamada T, Kikuchi K, Yamauchi T, Shiraishi K, Ito T, Okabe S, Hiraishi A, Ohashi A, Harada H,

Kamagata Y, Nakamura K, Sekiguchi Y. 2011. Ecophysiology of uncultured ﬁlamentous

anaerobes belonging to the phylum KSB3 that cause bulking in methanogenic granular sludge.

Applied and Environmental Microbiology 77:2081–2087 DOI 10.1128/AEM.02475-10.

Yamada T, Sekiguchi Y. 2009. Cultivation of uncultured Chloroﬂexi subphyla: signiﬁcance and

ecophysiology of formerly uncultured Chloroﬂexi “subphylum I” with natural and biotechno-

logical relevance. Microbes and Environments 24:205–216 DOI 10.1264/jsme2.ME09151S.

Yamada T, Yamauchi T, Shiraishi K, Hugenholtz P, Ohashi A, Harada H, Kamagata Y,

Nakamura K, Sekiguchi Y. 2007. Characterization of ﬁlamentous bacteria, belonging to

candidate phylum KSB3, that are associated with bulking in methanogenic granular sludges.

The ISME Journal 1:246–255 DOI 10.1038/ismej.2007.28.

Yin Y, Mao X, Yang J, Chen X, Mao F, Xu Y. 2012. dbCAN: a web resource for automated

carbohydrate-active enzyme annotation. Nucleic Acids Research 40:W445–W451

DOI 10.1093/nar/gks479.

Sekiguchi et al. (2015), PeerJ, DOI 10.7717/peerj.740 24/24

Valid publication of names of two domains and seven kingdoms of prokaryotes

Article

Full-text available

Jan 2024
INT J SYST EVOL MICR

The International Code of Nomenclature of Prokaryotes (ICNP) now includes the categories domain and kingdom. For the purpose of the valid publication of their names under the ICNP, we consider here the two known domains, ‘ Bacteria ’ and ‘ Archaea ’, as well as a number of taxa suitable for the rank of kingdom, based on previous phylogenetic and taxonomic studies. It is proposed to subdivide the domain Bacteria into the kingdoms Bacillati , Fusobacteriati , Pseudomonadati and Thermotogati . This arrangement reflects contemporary phylogenetic hypotheses as well as previous taxonomic proposals based on cell wall structure, including ‘diderms’ vs. ‘monoderms’, Gracilicutes vs. Firmicutes , ‘ Negibacteria ’ vs. ‘ Unibacteria ’, ‘ Hydrobacteria ’ vs. ‘ Terrabacteria ’, and ‘ Hydrobacterida ’ vs. ‘ Terrabacterida ’. The domain Archaea is proposed to include the kingdoms Methanobacteriati , Nanobdellati and Thermoproteati , reflecting the previous division into ‘ Euryarchaeota ’, ‘DPANN superphylum’ and ‘TACK superphylum’.

The double-edged effects of La2O3 and CeO2 on the performance of high-rate anaerobic sludge bed reactors and microbial mechanisms

Article

Full-text available

Feb 2024

Industrial wastewaters, generated from such as abrasive and catalyst-producing processes, usually contain rare earth oxides: La 2 O 3 and CeO 2. When employing anaerobic methanogenesis technology to treat these industrial wastewaters, the impacts of La 2 O 3 and CeO 2 cannot be ignored. This study thoroughly investigated the long-term effects of La 2 O 3 or CeO 2 in high-rate anaerobic expanded sludge bed (EGSB) reactors. When compared to the control reactor, it was found that the addition of La 2 O 3 led to an 11% decrease in chemical oxygen demand (COD) removal efficiency and 38% decrease in methane production rate (MPR). Similarly, the introduction of CeO 2 resulted in a 9% in COD removal efficiency and a 17% decrease in MPR. Both La 2 O 3 and CeO 2 had positive influences on enhancing methanogenic activity of anaerobic granular sludge (AnGS). Through the results of quantitative polymerase chain reaction (qPCR) and high-throughput sequencing analysis, it was revealed that La 2 O 3 and CeO 2 contributed to a balanced microbial community by promoting the growth of methanogens and suppressing anaerobic propionate and butyrate-producing bacteria. Meanwhile, La 2 O 3 and CeO 2 stimulated the proliferation of electroactive species: Geobacter and Methanospirillum. However, the results of quantification and characterization of Extracellular Polymeric Substances (EPS) of AnGS showed both La 2 O 3 and CeO 2 reduced the presence of tryptophan substances in EPS and decreased the protein/polysaccharide ratio. This led to the disintegration of AnGS into smaller zoogloea complexes. The findings of this study offer valuable insights for early-warning and optimization of anaerobic methanogenesis treatment in dealing with industrial wastewater containing La 2 O 3 and CeO 2 .

A large-scale genomically predicted protein mass database enables rapid and broad-spectrum identification of bacterial and archaeal isolates by mass spectrometry

Article

Full-text available

Dec 2023
GENOME BIOL

MALDI-TOF MS-based microbial identification relies on reference spectral libraries, which limits the screening of diverse isolates, including uncultured lineages. We present a new strategy for broad-spectrum identification of bacterial and archaeal isolates by MALDI-TOF MS using a large-scale database of protein masses predicted from nearly 200,000 publicly available genomes. We verify the ability of the database to identify microorganisms at the species level and below, achieving correct identification for > 90% of measured spectra. We further demonstrate its utility by identifying uncultured strains from mouse feces with metagenomics, allowing the identification of new strains by customizing the database with metagenome-assembled genomes. Supplementary Information The online version contains supplementary material available at 10.1186/s13059-023-03096-4.

Microbial Motility at the Bottom of North America: Digital Holographic Microscopy and Genomic Motility Signatures in Badwater Spring, Death Valley National Park

Article

Full-text available

Jan 2023
ASTROBIOLOGY

Motility is widely distributed across the tree of life and can be recognized by microscopy regardless of phylogenetic affiliation, biochemical composition, or mechanism. Microscopy has thus been proposed as a potential tool for detection of biosignatures for extraterrestrial life; however, traditional light microscopy is poorly suited for this purpose, as it requires sample preparation, involves fragile moving parts, and has a limited volume of view. In this study, we deployed a field-portable digital holographic microscope (DHM) to explore microbial motility in Badwater Spring, a saline spring in Death Valley National Park, and complemented DHM imaging with 16S rRNA gene amplicon sequencing and shotgun metagenomics. The DHM identified diverse morphologies and distinguished run-reverse-flick and run-reverse types of flagellar motility. PICRUSt2- and literature-based predictions based on 16S rRNA gene amplicons were used to predict motility genotypes/phenotypes for 36.0-60.1% of identified taxa, with the predicted motile taxa being dominated by members of Burkholderiaceae and Spirochaetota. A shotgun metagenome confirmed the abundance of genes encoding flagellar motility, and a Ralstonia metagenome-assembled genome encoded a full flagellar gene cluster. This study demonstrates the potential of DHM for planetary life detection, presents the first microbial census of Badwater Spring and brine pool, and confirms the abundance of mobile microbial taxa in an extreme environment.

Draft Genome Sequence of Bacteroidales Strain TBC1, a Novel Isolate from a Methanogenic Wastewater Treatment System

Article

Full-text available

Oct 2015

We report here the draft genome sequence of Bacteroidales strain TBC1, isolated from a methanogenic wastewater treatment system. The draft genome has a size of 4,514,407 bp and a G+C content of 46.7%. The predicted genomic content provides the basis for characterizing the metabolism and ecological strategies of strain TBC1.

Peeling off the layers from microbial dark matter (MDM): recent advances, future challenges, and opportunities

Article

Full-text available

Feb 2024
CRIT REV MICROBIOL

Microbes represent the most common organisms on Earth; however, less than 2% of microbial species in the environment can undergo cultivation for study under laboratory conditions, and the rest of the enigmatic, microbial world remains mysterious, constituting a kind of "microbial dark matter" (MDM). In the last two decades, remarkable progress has been made in culture-dependent and culture-independent techniques. More recently, studies of MDM have relied on culture-independent techniques to recover genetic material through either unicellular genomics or shotgun metagenomics to construct single-amplified genomes (SAGs) and metagenome-assembled genomes (MAGs), respectively, which provide information about evolution and metabolism. Despite the remarkable progress made in the past decades, the functional diversity of MDM still remains uncharacterized. This review comprehensively summarizes the recently developed culture-dependent and culture-independent techniques for characterizing MDM, discussing major challenges, opportunities, and potential applications. These activities contribute to expanding our knowledge of the microbial world and have implications for various fields including Biotechnology, Bioprospecting, Functional genomics, Medicine, Evolutionary and Planetary biology. Overall, this review aims to peel off the layers from MDM, shed light on recent advancements, identify future challenges, and illuminate the exciting opportunities that lie ahead in unraveling the secrets of this intriguing microbial realm.

Microbiological insights into anaerobic phenol degradation mechanisms and bulking phenomenon in a mesophilic upflow anaerobic sludge blanket reactor in long-term operation

Article

Feb 2024
WATER RES

Candidatus List. Lists of names of prokaryotic Candidatus phyla

Article

May 2023
INT J SYST EVOL MICR

Exploring Key Factors in Anaerobic Syntrophic Interactions: Biomass Activity, Microbial Community, and Morphology

Article

Sep 2022
BIORESOURCE TECHNOL

The present work evaluated the impacts of microbial communities, biomass activity and sludge morphology on anaerobic syntrophic reactions. Experiments were conducted using mature floc sludge and granular sludge under different food/microbes ratios, and with different sludge types (floc sludge, concentrated floc sludge and granular sludge) and sludge morphology (granules, vortexed granules, and granules with different particle sizes). The results show that the intact granules achieved the most effective syntrophic reaction among all sludge types. The granule structure facilitated the enrichment of syntrophic acetate oxidation bacteria (g_Syner-01 and g_Mesotoga) and methanogens, which corresponds to their superior specific methanogenic activity and high production of communication compounds. Despite the high diffusion and substrate uptake capacities, the disintegrated granules had low H2 consumption rates, which led to poor syntrophic activities. The results underline the importance of sludge spatial structures in promoting excellent syntrophic activities and the development of diverse microbial communities.

Exploring Key Factors in Anaerobic Syntrophic Interactions: Biomass Activity, Microbial Community, and Morphology

Article

Jan 2022

CheckM: Assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes

Article

Full-text available

May 2015

Large-scale recovery of genomes from isolates, single cells, and metagenomic data has been made possible by advances in computational methods and substantial reductions in sequencing costs. While this increasing breadth of draft genomes is providing key information regarding the evolutionary and functional diversity of microbial life, it has become impractical to finish all available reference genomes. Making robust biological inferences from draft genomes requires accurate estimates of their completeness and contamination. Current methods for assessing genome quality are ad hoc and generally make use of a limited number of 'marker' genes conserved across all bacterial or archaeal genomes. Here we introduce CheckM, an automated method for assessing the quality of a genome using a broader set of marker genes specific to the position of a genome within a reference genome tree and information about the collocation of these genes. We demonstrate the effectiveness of CheckM using synthetic data and a wide range of isolate, single cell and metagenome derived genomes. CheckM is shown to provide accurate estimates of genome completeness and contamination, and to outperform existing approaches. Using CheckM, we identify a diverse range of errors currently impacting publicly available isolate genomes and demonstrate that genomes obtained from single cells and metagenomic data vary substantially in quality. In order to facilitate the use of draft genomes, we propose an objective measure of genome quality that can be used to select genomes suitable for specific gene- and genome-centric analyses of microbial communities. Published by Cold Spring Harbor Laboratory Press.

CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes

Article

Full-text available

Oct 2014

Donovan H Parks

Large-scale recovery of genomes from isolates, single cells, and metagenomic data has been made possible by advances in computational methods and substantial reductions in sequencing costs. While this increasing breadth of draft genomes is providing key information regarding the evolutionary and functional diversity of microbial life, it has become impractical to finish all available reference genomes. Making robust biological inferences from draft genomes requires accurate estimates of their completeness and contamination. Current methods for assessing genome quality are ad hoc and generally make use of a limited number of ‘marker’ genes conserved across all bacterial or archaeal genomes. Here we introduce CheckM, an automated method for assessing the quality of a genome using a broader set of marker genes specific to the position of a genome within a reference genome tree along with information about the collocation of these genes. We demonstrate the effectiveness of CheckM using synthetic data and a wide range of isolate, single cell and metagenome derived genomes. CheckM is shown to provide accurate estimates of genome completeness and contamination, and to outperform existing approaches. Using CheckM, we identify a diverse range of errors currently impacting publicly available isolate genomes and demonstrate that genomes obtained from single cells and metagenomic data vary substantially in quality. In order to facilitate the use of draft genomes, we propose an objective measure of genome quality that can be used to select genomes suitable for specific gene- and genome-centric analyses of microbial communities. CheckM is open source software available at http://ecogenomics.github.io/CheckM.

GroopM: An automated tool for the recovery of population genomes from related metagenomes

Article

Full-text available

Sep 2014

Metagenomic binning methods that leverage differential population abundances in microbial communities (differential coverage) are emerging as a complementary approach to conventional composition-based binning. Here we introduce GroopM, an automated binning tool that primarily uses differential coverage to obtain high fidelity population genomes from related metagenomes. We demonstrate the effectiveness of GroopM using synthetic and real-world metagenomes, and show that GroopM produces results comparable with more time consuming, labor-intensive methods.

An Expanded Genomic Representation of the Phylum Cyanobacteria

Article

Full-text available

May 2014

Molecular surveys of aphotic habitats have indicated the presence of major uncultured lineages phylogenetically classified as members of the Cyanobacteria. One of these lineages has recently been proposed as a non-photosynthetic sister phylum to the Cyanobacteria, the Melainabacteria, based on recovery of population genomes from human gut and groundwater samples. Here, we expand the phylogenomic representation of the Melainabacteria through sequencing of six diverse population genomes from gut and bioreactor samples supporting the inference that this lineage is non-photosynthetic, but not the assertion that they are strictly fermentative. We propose that the Melainabacteria is a class within the phylogenetically defined Cyanobacteria based on robust monophyly and shared ancestral traits with photosynthetic representatives. Our findings are consistent with theories that photosynthesis occurred late in the Cyanobacteria and involved extensive lateral gene transfer, and extends the recognised functionality of members of this phylum.

Biomethanation I

Book

Mar 2003
ADV BIOCHEM ENG BIOT

Anaerobic digestion is a major field for the treatment of waste and wastewater. Lately the focus has been on the quality of the effluent setting new demands for pathogen removal and for successful removal of unwanted chemicals during the anaerobic process. The two volumes on Biomethanation are devoted to presenting the state of art within the science and application of anaerobic digestion. They describe the basic microbiolgical knowledge of importance for understanding the processes of anaerobic bioreactors along with the newest molecular techniques for examining these systems. In addition, the applications for treatment of waste and wastewaters are presented along with the latest knowledge on process control and regulation of anaerobic bioprocesses. Together these two volumes give an overview of a growing area, which previously has never been presented in such a comprehensive way.

PHYLIP-phylogeny inference package (Version 3.2)

Article

Jan 2002

J. Felsenstein

Genomic insights to SAR86, an abundant and uncultivated marine bacterial lineage

Article

Jan 2012

C.L. Dupont

Interactive Tree of Life v2: online annotation and display of phylogenetic trees made easy

Article

Jan 2011

Treating industrial wastewater: Anaerobic digestion comes of age

Article

Apr 2003
CHEM ENG-NEW YORK

Various aspects of anaerobic digestion systems, which offers advantages over conventionally applied aerobic processes for removing organic pollutants from water-based streams, are discussed. Anaerobic treatment is in principle an energy-generating process through the production of byproduct methane-rich biogas. The use of the methane for energy generation elsewhere at the plant allows for conservation of more than 90% of the caloric value of the organic substrate being treated. By comparison, during aerobic treatment, most of the calorific value of the organic substrate id dissipated as non-recoverable heat. And, aerobic bioreactors require significant amount of energy for aeration.

PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods). Version 4.0b10

Book

Jan 2002

David L. Swofford

— We studied sequence variation in 16S rDNA in 204 individuals from 37 populations of the land snail Candidula unifasciata (Poiret 1801) across the core species range in France, Switzerland, and Germany. Phylogeographic, nested clade, and coalescence analyses were used to elucidate the species evolutionary history. The study revealed the presence of two major evolutionary lineages that evolved in separate refuges in southeast France as result of previous fragmentation during the Pleistocene. Applying a recent extension of the nested clade analysis (Templeton 2001), we inferred that range expansions along river valleys in independent corridors to the north led eventually to a secondary contact zone of the major clades around the Geneva Basin. There is evidence supporting the idea that the formation of the secondary contact zone and the colonization of Germany might be postglacial events. The phylogeographic history inferred for C. unifasciata differs from general biogeographic patterns of postglacial colonization previously identified for other taxa, and it might represent a common model for species with restricted dispersal.

First genomic insights into members of a candidate bacterial phylum responsible for wastewater bulking

Abstract and Figures

Recommended publications

Kinesin motor density and dynamics in gliding microtubule motility

Envelope structure of Synechococcus sp. WH8113, a nonflagellated swimming cyanobacterium

Denitrification by a novel halophilic fermentative bacterium

Bifurcation of Velocity Distributions in Cooperative Transport of Filaments by Fast and Slow Motors