PreprintPDF Available

Abstract and Figures

Databases of genes and enzymes involved in hydrocarbon degradation have been previously reported. However, these databases specialize on only a specific group of hydrocarbons and/or are constructed partly based on enzyme sequences with putative functions indicated by in silico research, with no experimental evidence. Here, we present a curated database of Hydrocarbon Aerobic Degradation Enzymes and Genes (HADEG) containing proteins and genes involved in alkane, alkene, aromatic, and plastic aerobic degradation and biosurfactant production based solely on experimental evidence, which are present in bacteria, and fungi. HADEG includes 259 proteins for petroleum hydrocarbon degradation, 160 for plastic degradation, and 32 for biosurfactant production. This database will help identify and predict hydrocarbon degradation genes/pathways and biosurfactant production in genomes. Graphical abstract Data summary The HADEG database repository is https://github.com/jarojasva/HADEG . All Supplementary Material file is available on: https://figshare.com/articles/dataset/Supplementary_Material_HADEG/20752642 .
Content may be subject to copyright.
1
HADEG: A Curated Database of Hydrocarbon
1
Aerobic Degradation Enzymes and Genes
2
3
Jorge Rojas-Vargas1*, Hugo G. Castelán-Sánchez 2, Liliana Pardo-López1*
4
1Departamento de Microbiología Molecular, Instituto de Biotecnología, UNAM, Av.
5
Universidad #2001, Col. Chamilpa, 62210 Cuernavaca, Morelos, México
6
2Programa de Investigadoras e Investigadores por México. Grupo de Genómica y Dinámica
7
Evolutiva de Microorganismos Emergentes. Consejo Nacional de Ciencia y Tecnología. Av.
8
Insurgentes Sur 1582, Crédito Constructor, Benito Juárez, CP 03940, Ciudad de México,
9
México.
10
* Correspondence:
11
Liliana Pardo-López, liliana.pardo@ibt.unam.mx
12
Jorge Rojas-Vargas, jorge.rojas@ibt.unam.mx
13
ABSTRACT
14
Databases of genes and enzymes involved in natural and synthetic hydrocarbon degradation
15
have been previously reported. However, these databases specialize on only a specific group
16
of hydrocarbons and/or are constructed partly based on enzyme sequences with putative
17
functions indicated by in silico research, with no experimental evidence. Here, we present a
18
curated database of Hydrocarbon Aerobic Degradation Enzymes and Genes (HADEG)
19
containing enzymes and genes involved in alkane, alkene, aromatic, and polymer aerobic
20
degradation and biosurfactant production based solely on experimental evidence. HADEG
21
includes 259 enzymes for petroleum hydrocarbon degradation, 160 for polymer degradation,
22
and 32 for biosurfactant production. This database will help identify and predict hydrocarbon
23
degradation genes/pathways and biosurfactant production in genomes.
24
Keywords
25
Curated database, hydrocarbon degradation, polymer degradation, biosurfactant production.
26
Impact statement
27
There is currently no database that integrates enzymes and genes with experimental evidence
28
related to the degradation of different hydrocarbon groups (alkanes, alkenes, aromatics,
29
polymers) or the production of biosurfactants. HADEG is an effort to assemble a single
30
database of enzymes and gene sequences of the aforementioned groups. This database will
31
be helpful for more fully predicting the potential degradation of microorganism genomes (or
32
metagenomes), thus increasing knowledge about the biology of microbes that metabolize
33
pollutants. Our findings from HADEG also shed light on the identification of a set of possible
34
biomarkers for targeted searching of promising hydrocarbon-degrading microorganisms.
35
Data summary
36
The HADEG database repository is https://github.com/jarojasva/HADEG.
37
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted September 1, 2022. ; https://doi.org/10.1101/2022.08.30.505856doi: bioRxiv preprint
2
1 Introduction
38
Crude oil spills and plastic pollution of anthropogenic origin are two major concerns in the
39
context of environmental equilibrium and conservation (Bhattacharjee & Dutta, 2022; Horton,
40
2022). Bioremediation technologies remain one of the most promising options for crude oil
41
and plastic cleanup, based on microorganisms that can use hydrocarbons (HCs) and polymers
42
as carbon sources. Interest in bioremediation has led to the identification and characterization
43
of some enzymes involved in degradation processes. Among the features of these enzymes,
44
studies have described their activity conditions, substrates, and products, in addition to their
45
protein structures and active sites in some cases (Brzeszcz & Kaszycki, 2018; Kaushal et al.,
46
2021; Pathak et al., 2022; Xu et al., 2018). Enzymes for biosurfactant production have also
47
been studied; these microbial surface-active compounds increase the bioavailability of HCs
48
and therefore promote higher levels of HC degradation (Banat, 1995).
49
Advances in sequencing technologies have allowed the nucleotide sequences of
50
hydrocarbon-degrading genes and, thus, the amino acid sequences of the encoded proteins
51
to be obtained. Some of these sequences can be found in existing tailored data repositories
52
and can be used to identify and predict the degradation potential of microbial genomes and
53
metagenomes. Regarding hydrocarbon degradation, the AromaDeg database is focused on
54
the aerobic degradation of aromatic compounds. It was constructed based on a phylogenomic
55
approach considering different oxygenases with known and unknown aromatic substrates
56
(Duarte et al., 2014). A more recent database is that of the Calgary approach to ANnoTating
57
HYDrocarbon degradation genes (CANT-HYD), which contains genes involved in aerobic and
58
anaerobic aliphatic and aromatic hydrocarbon degradation pathways. CANT-HYD uses
59
Hidden Markov Models (HMMs) for the annotation of six monophyletic clades of genes related
60
to aliphatic aerobic degradation (alkB, almA, ladA, bmo, CYP153, prm), six related to
61
aromatics (dmp, dsz, MAH, ndo, tmo, tom), and eight related to anaerobic pathways (Khot et
62
al., 2021). The workflow derives 37 HMMs from experimental and in silico-inferred functional
63
enzymes and is designed for the identification and annotation of marker genes, not including
64
some pathways of hydrocarbon degradation.
65
For polymers, the Plastics Microbial Biodegradation Database (PMBD) (http://pmbd.genome-
66
mining.cn/home/) is an online resource containing 79 experimentally predicted and ~8000 in
67
silico-predicted sequences of enzymes from the literature and the UniProt database (Gan &
68
Zhang, 2019). The Plastics-Active Enzymes Database (PAZy) is an inventory of enzymes that
69
act on only four synthetic fossil fuel-based polymers and three polymers mainly found in
70
renewable resources (Buchholz et al., 2022). Additionally, PlasticDB (Gambarini et al., 2022)
71
includes 174 experimentally tested enzymes (consulted on July 15, 2022) related to the
72
degradation of seven natural and twenty synthetic polymers. Regarding biosurfactants, the
73
only reported database for these compounds is BioSurfDB (www.biosurfdb.org) (Oliveira et
74
al., 2015), which includes experimental and predicted enzymes.
75
Here, we introduce the manually curated database of Hydrocarbon Aerobic Degradation
76
Enzymes and Genes (HADEG), which is a public repository containing sequences of
77
experimentally characterized enzymes and genes to be used for annotation purposes. HADEG
78
groups these molecules according to their substrates (in the case of alkane, alkene, aromatic,
79
and polymer aerobic degradation) and products (in the case of biosurfactant production). In
80
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted September 1, 2022. ; https://doi.org/10.1101/2022.08.30.505856doi: bioRxiv preprint
3
this work, we also briefly reviewed the enzymes and pathways involved in petroleum
81
hydrocarbon and polymer degradation.
82
2 Materials and methods
83
2.1 Selection of reference sequences
84
For enzymes involved in alkane, alkene, and aromatic aerobic degradation, we compiled and
85
classified metabolic pathways from the primary literature and two experimentally validated
86
curated databases, MetaCyc (Krieger et al., 2004) and KEGG (Ogata et al., 1999). We
87
searched the sequences of enzymes and genes involved in the aforementioned metabolic
88
pathways in the well-characterized UniProtKB/Swiss-Prot database (Boutet et al., 2007), the
89
NCBI portal, and in the literature if sequences were not available in any database
90
(Supplementary Tables 1, 2, and 3).
91
To obtain polymer degradation enzymes, we selected nonredundant protein sequences from
92
PlasticDB (Gambarini et al., 2022). To obtain biosurfactant production enzymes, we used the
93
UniProt platform with different biosurfactants as keywords, and the corresponding proteins
94
included in the UniProtKB/Swiss-Prot database were collected. In constructing our polymer
95
and biosurfactant database, we added some new proteins from a search of the primary
96
literature (Supplementary Tables 4 and 5). The nucleotide sequences of all proteins were
97
retrieved from the European Nucleotide Archive (ENA) platform (Leinonen et al., 2011) or the
98
literature.
99
2.2 Protein annotation and orthology
100
We used eggNOG-mapper (Huerta-Cepas et al., 2019) online version 2.1.9, run on July 23
101
(2022), to annotate our protein database, identify orthologs, and study their taxonomic
102
distribution at the phylum level. Protein sequences without orthologs were considered seed
103
sequences (Supplementary Table 6). We used InterProScan v.5.51-85.0 (Jones et al., 2014)
104
for protein domain inference with the default parameters to identify the shared protein domains
105
predicted with Pfam-33.1 (Supplementary Table 7).
106
2.3 Searching for hydrocarbon degradation enzymes in genomes using
107
HADEG
108
We selected 60 genomes (Supplementary Table 8) of recognized HC- and plastic-degrading
109
microorganisms and biosurfactant producers for validation purposes. The protein FASTA files
110
were retrieved from the NCBI web portal (consulted on July 1, 2022). BLAST software
111
(McGinnis & Madden, 2004) was used to predict enzymes by applying the following cutoff
112
values: 50 for the percentage of identity, 1x10-60 for the e-value, ±10% of the variation in the
113
query sequences, and the higher bit score, using the protein FASTA files from our database.
114
3 Results and discussion
115
3.1 Hydrocarbon aerobic degradation
116
HCs are relatively stable molecules but can serve as a source of carbon and energy for any
117
organism able to activate them. This activation by the aerobic insertion of oxygen or the
118
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted September 1, 2022. ; https://doi.org/10.1101/2022.08.30.505856doi: bioRxiv preprint
4
anaerobic addition of succinate distinguishes hydrocarbon-degrading organisms from their
119
counterparts (Prince et al., 2010). In HADEG, we included enzymes related to aerobic HC
120
degradation, such as alkanes, alkenes, aromatics, and polymers (Supplementary Tables 1-4).
121
3.1.1 Alkanes
122
Alkanes are the principal components of crude oil. Four microbial pathways of alkane
123
degradation have been identified (Fig. 1) (Ji et al., 2013; Moreno & Rojo, 2019; Wentzel et al.,
124
2007). All of them begin with the activation of the alkane chain through its oxidation to a
125
primary or secondary alcohol, as observed in terminal (TO), biterminal (BO), and subterminal
126
oxidation (SO), or to an n-alkyl hydroperoxide via the Finnerty pathway (FP). Among the
127
activation enzymes included in HADEG, we report the extensively studied alkane
128
monooxygenase (alkB), cytochrome P153 (CYP153), and P450 (CYP450), flavin-binding
129
protein monooxygenase (almA), long-chain alkane monooxygenase (ladA), and terminal
130
alkane-hydroxylase (alkM). The rubredoxins (rubA, rubB), which play a crucial role in electron
131
transfer to alkB, are also included. Some other included enzymes belong to a particular
132
pathway, such as Baeyer-Villiger monooxygenase (BVMO) in SO and alkyl hydroperoxide
133
reductase (ahpCF) in FP. Enzymes with specific substrates, such as methane
134
monooxygenase (pmoA), propane 2-monooxygenase (prm), and butane monooxygenase
135
hydroxylase (bmo), are also included in HADEG.
136
Some other enzymes included in HADEG are alcohol and aldehyde dehydrogenases (adh,
137
ald), cyclohexanone 1,2-monooxygenase (CHMO), alkanesulfonate monooxygenase (ssuD),
138
the helix-turn-helix transcriptional regulator of alkB (alkS), and enzymes related to the uptake
139
of alkanes, such as an outer membrane protein (alkL), an outer membrane lipoprotein (blc),
140
long-chain fatty acid transport enzyme (fadL), and a methyl-accepting chemotaxis protein
141
(alkN or mcp).
142
3.1.2 Alkenes
143
Alkenes are transformed into corresponding epoxides by an alkene monooxygenase.
144
Epoxides are somewhat toxic to cells but can be processed using epoxide hydrolases, such
145
as the enzyme epoxyalkane CoM transferase, and their products ultimately enter the central
146
metabolism. HADEG includes the alkene monooxygenase complex (amoABCD) identified in
147
Nocardia corallina B-276, isoprene monooxygenase (isoABCDEHI) from Rhodococcus sp.
148
AD45, the propene monooxygenase system (xamoABCDE) and epoxyalkane CoM system
149
(xecADC) from Xanthobacter autotrophicus Py2, the epoxyalkane CoM (etnE) from
150
Mycolicibacterium rhodesiae, and the dehydrogenase mpdBC from M. austroafricanum.
151
3.1.3 Aromatics
152
Aromatic compounds are commonly known as recalcitrant pollutants because they show
153
relatively low susceptibility to microbial degradation (Ghosal et al., 2016). Some also represent
154
mutagenic and carcinogenic risks to human health. In contrast to alkanes and alkenes,
155
aromatic compound degradation requires complex metabolic machinery. In general, aerobic
156
organisms take advantage of the availability of molecular oxygen and use oxygenases that
157
introduce hydroxyl groups and cleave aromatic rings. These pathways usually involved a few
158
central intermediates, most commonly including catechol, protocatechuate, and gentisate
159
(Phale et al., 2020).
160
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted September 1, 2022. ; https://doi.org/10.1101/2022.08.30.505856doi: bioRxiv preprint
5
The goal of enzymatic degradation is the activation and cleavage of aromatic rings to obtain
161
products that can subsequently be integrated into central pathways. Our research included
162
pathways with characterized enzymes involved in the metabolic processes of poly- and
163
monoaromatic degradation. Regarding polyaromatic degradation pathways, HADEG includes
164
enzymes involved in the degradation of anthracene, biphenyl, dibenzothiophene, fluorene,
165
naphthalene, and phenanthrene. Pyrene degradation involves pyrene dioxygenases, but none
166
of these enzymes had been experimentally characterized when this manuscript was written,
167
although their activity has been inferred in Pseudomonas sp. Jpyr-1 and Mycolicibacterium
168
sp. 16F. Nevertheless, cytochrome P450 monooxygenase (CYP450), which is involved in
169
alkane degradation, seems to participate in pyrene degradation in Roseobacter (Zhou et al.,
170
2020). Regarding monoaromatic degradation pathways, we included enzymes involved in the
171
degradation of benzene, toluene, xylene (BTX), anthranilate, benzoate, 2-nitrobenzoate,
172
catechol (ortho- and meta-cleavage pathways), gentisate, phenol, phenylacetate, phthalate,
173
protocatechuate (ortho- and meta-cleavage pathways), salicylate, styrene, and terephthalate.
174
3.1.4 Polymers
175
Enzymes involved in the degradation of polymers were also added to the HADEG database,
176
since polymers are important environmental contaminants, and it has been reported that some
177
microorganisms can degrade them (Kaushal et al., 2021). Polymers can be classified
178
according to their origin into natural or synthetic polymers. Natural polymers, also known as
179
biopolymers, are derived from plants, animals, and microorganisms, are degradable and
180
environmentally sensitive, and have been studied for potential medical and engineering
181
applications (Kumbar et al., 2014; Shrivastava, 2018; Swain & Jawaid, 2019; Wagner et al.,
182
2020); (Garcia-Gareta, 2019). Some natural polymers are essential for life, such as
183
polysaccharides (e.g., cellulose, pectin, chitin, and alginic acid), polypeptides (e.g., proteins),
184
and polynucleotides (e.g., DNA). On the other hand, synthetic or man-made polymers are
185
obtained in the laboratory (Saldívar-Guerra & Vivaldo-Lima, 2013), and they can have an
186
organic or inorganic backbone (Zainudin et al., 2020). They are commonly derived from
187
petroleum (e.g., polyethylene terephthalate-PET, polyurethane-PU, polyvinyl chloride-PVC,
188
polystyrene-PS, and polypropylene-PP) and have many applications (Shrivastava, 2018).
189
Most of them are very difficult to degrade due to inherent characteristics such as high
190
molecular weights, crystallinity, and hydrophobicity (Amobonye et al., 2021; Geyer, 2020). In
191
this context, microorganisms with the capacity for polymer degradation remain one of the most
192
promising options for the reduction or recycling of synthetic polymers with long lifetimes
193
(Muriel-Millán et al., 2021).
194
The polymer degradation process begins with the superficial deterioration and subsequent
195
cleavage of polymers into intermediates, which can be used as energy and carbon sources.
196
The depolymerization process mainly involves the hydrolytic cleavage of glycosidic, ester, and
197
peptide bonds, resulting in constituent oligomers or monomers (Amobonye et al., 2021).
198
Implementing physicochemical pretreatments such as the application of chemical additives or
199
UV irradiation can facilitate microbial polymer degradation (Ru et al., 2020).
200
Among the polymer enzymes included in HADEG, there are some hydrolases for PET
201
degradation, such as the recently well-characterized polyester hydrolase of
202
Halopseudomonas aestusnigri VXGO4 (6SCD PDB), one hydrolase from a marine
203
metagenome (Hajighasemi et al., 2018), a lacasse of Rhodococcus ruber C208, and some
204
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted September 1, 2022. ; https://doi.org/10.1101/2022.08.30.505856doi: bioRxiv preprint
6
cutinases. These enzymes also include PVA dehydrogenase (pvaA, pvaB) for polyvinyl
205
alcohol (PVA) degradation (Shimao, 2001); exo-cleaving and endo-cleaving rubber
206
dioxygenases (roxA, roxB) for natural rubber degradation from Xanthomonas sp. 35Y;
207
polyurethanases (pueA, pueB) for PU degradation from Pseudomonas choloraphis and
208
Pseudomonas protegens Pf-5; manganese peroxidase 1 (mnp1) for polyethylene degradation
209
from Phanerodontia chrysosporium; and a cutinase for poly(epsilon-caprolactone) (PCL) and
210
poly(1,3-propylene adipate) (PPA) degradation from Kineococcus radiotolerans DSM 14245.
211
3.2 Biosurfactant production
212
The first step in HC degradation is accessing the molecules. Short-chain alkanes of less than
213
9 carbons are soluble enough to be transported into cells (Moreno & Rojo, 2017; Wang &
214
Shao, 2013) Alkanes of greater length require cell adhesion to crude oil droplets, as is the
215
case in Acinetobacter, which uses adhesion protein filaments (fimbriae) or the action of
216
biosurfactants (Baldi et al., 1999). Biosurfactants increase the bioavailability of HCs (Patowary
217
et al., 2016), and it is speculated that once they are excreted, they cannot enter cells (Wang
218
& Shao, 2013). Some microorganisms capable of degrading HCs secrete biosurfactants of a
219
different nature, such as lipoproteins or rhamnolipids, to emulsify HCs. Biosurfactants have
220
also been associated with viral factors (Sharma et al., 2021).
221
The HADEG database includes the enzymes involved in the production of fengycin (ffp),
222
plipastatin (ppsABCDE), surfactin (srf), iturin (itu), and mycosubtilin (mycABC) in Bacillus
223
subtilis. Additionally, enzymes related to rhamnolipids (rhl) from Pseudomonas aeruginosa;
224
emulsan from Acinetobacter lwoffii RAG-1 (wzb, wzc); arthrofactin (arfABC) from
225
Pseudomonas sp. MIS38; amphisin (amsY) from Pseudomonas sp. DSS73; serrawetin
226
(pswP) in Serratia marcescens; and lactonic sophorose lipid (sble), and sophorolipids (at) from
227
Starmerella bombicola are also included. Two fungal biosurfactants are also included;
228
mannosylerythritol lipids (emt1) from Ustilago maydis and hydrophobin (hfb1, hfb2) from
229
Hypocrea jecorina.
230
3.3 Taxonomic distribution of orthologs in the HADEG database
231
Among the proteins present in HADEG, a search for orthologs was carried out to determine
232
their distribution in different microorganisms and their taxonomic assignment. We used
233
eggNOG-mapper to identify 19,749 orthologs of 445 protein sequences among the 451
234
sequences included in HADEG (Supplementary Table 6). The six proteins without identified
235
orthologs were the PBAT esterase of an uncultured bacterium (accession no. A0A1C9T7G6),
236
the PVA hydrolase of Sphingomonas sp. (accession no. Q588Z2, oph), the butane
237
monooxygenase hydroxylase gamma subunit of Thauera butanivorans (a.n. Q8KQE7, bmoZ),
238
the fluoren-9-ol dehydrogenase of Terrabacter sp. DBF63 (a.n. Q93UV4, flnB), the oxidized
239
PVA hydrolase of Pseudomonas sp. (a.n. Q9LCQ7, pvaB), and the anthranilate-CoA ligase of
240
Rhodococcus erythropolis (a.n. S6BVH5, rauF).
241
The taxonomic analysis of orthologs at the phylum level revealed that Proteobacteria was the
242
most abundant phylum related to HC degradation pathways and biosurfactant production
243
(forest-green color in Fig. 2A). Many studies have confirmed the degradation capacities of
244
species in genera such as Alcanivorax and Marinobacter (hydrocarbonoclastics),
245
Acinetobacter (versatile HC degraders), and ubiquitous Pseudomonas. Biosurfactants such
246
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted September 1, 2022. ; https://doi.org/10.1101/2022.08.30.505856doi: bioRxiv preprint
7
as amphisin, arthrobactin, emulsan and serrawetin have only been described in
247
Proteobacteria (Soberón-Chávez, 2010), and rhamnolipid production mechanisms have been
248
well described in Pseudomonas (Soberón-Chávez et al., 2021) (Fig. 2B). Actinobacteria was
249
the second most abundant phylum related to HC degradation pathways (goldenrod color in
250
Fig. 2), and it includes the hydrocarbon-degrading genera Rhodococcus, Mycobacterium,
251
Norcadia, Gordonia, and Streptomyces, among others. Firmicutes was the third most
252
abundant phylum related to HC pathways and the second most abundant related to
253
biosurfactant production (Bacillus, Geobacillus, and Paenibacillus) (royal blue color in Fig. 2),
254
within which surfactin and iturin production by Bacillus genus members has been well studied
255
(Soberón-Chávez, 2010). In the Bacteria domain, the fourth most abundant phylum was
256
Bacteroidetes (Aequorivita, Algoriphagus, Cytophaga), particularly related to alkene
257
degradation (orange color in Fig. 2). Ascomycota was the most abundant phylum from the
258
fungal domain and was mainly associated with alkane and polymer degradation and
259
biosurfactant production (brick color in Fig. 2). Fungi have received interest in relation to
260
bioremediation applications because they do not rely solely on soluble organic compounds for
261
nutrition; they also show diverse enzymatic mechanisms and distinct mechanisms of
262
physiological adaptability to environmental conditions (Prenafeta-Boldú et al., 2018).
263
3.4 HADEG Pfams network
264
A network analysis was performed to determine whether the HADEG proteins shared
265
domains, which may indicate that a protein could participate in the degradation of more than
266
one type of HC (Fig. 3). Identifying the shared and nonshared protein domains between
267
hydrocarbon degradation groups will be helpful to establish a set of biomarkers for the targeted
268
searching of potential hydrocarbon-degrading microorganisms. A total of 166 Pfam domains
269
were identified in HADEG (Supplementary Table 7). Network analyses reveal that the polymer
270
and aromatic degradation groups contained the largest numbers of protein domains, at 75 and
271
48, respectively. These results are congruent with the diversity and complexity of aromatic
272
and polymeric chemical structures. On the other hand, we predicted 30 Pfams among
273
biosurfactants, 28 among alkanes, and 18 among alkenes.
274
Among the protein domains without connectivity or nonshared protein domains, the alkane set
275
included 16 Pfams such as the related to rubredoxin proteins (PF00301 and PF18113) and a
276
flavin-binding monooxygenase-like protein (PF00743) in almA and BVMO. The alkene set
277
included four Pfams with no connectivity: cobalamin-independent synthase (PF01717) in etnE,
278
a glutathione S-transferase C-terminal domain (PF17171 and PF17172) in isoI, and a pyridine
279
nucleotide-disulfide oxidoreductase dimerization domain (PF02852) in xecC. The aromatic set
280
included 57 nonshared Pfams like the ring-hydroxylating alpha and beta subunits (PF00848
281
and PF00866) present in aromatic-ring dioxygenase systems, the dioxygenase superfamily
282
(PF00903), and the catechol dioxygenase N-terminus (PF04444). The polymer set included
283
42 nonshared protein domains like the RTX calcium-binding nonapeptide repeat 4 copies
284
(PF00353) present in the polyester polyurethanases pueA and pueB, the alpha/beta hydrolase
285
family (PF12695) involved in PET degradation, the esterase PHB depolymerase (PF10503)
286
involved in PHA/PHB degradation and a cutinase (PF01083) that acts on carboxylic ester
287
bonds. Finally, the biosurfactant set included 26 nonshared Pfams like a condensation domain
288
(PF00668) present in Bacillus biosurfactants synthesis proteins (iturin, mycosubtilin, surfactin,
289
and plipastatin), the 4-phosphopantetheinyl transferase superfamily (PF01648) in ffp, pswP
290
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted September 1, 2022. ; https://doi.org/10.1101/2022.08.30.505856doi: bioRxiv preprint
8
and sfp, and helicase domains (PF00270 and PF00271) in rhlB and (PF13641) in rhlC, both
291
involved in rhamnolipids production.
292
Concerning the protein domains that were connected in the network or shared, only the
293
aldehyde dehydrogenase family (PF00171) participates in the degradation of the four HC
294
groups. This domain is responsible for the oxidation of aldehydes, which is considered a
295
detoxification reaction (Pfam: Family: Aldedh (PF00171), n.d.). The largest set of shared
296
domains was found between alkenes and aromatics, which shared 19 Pfams. These domains
297
included the Rieske [2Fe-2S] domain (PF00355), which is involved in the electron transfer
298
chain. PF00355 was present in enzymes such as andA, benA, doxAB, ndoAB, tmoC, todB,
299
bedB, isoC, and xamoC. The alkane, alkene and aromatic groups shared seven Pfam
300
domains, including the oxidoreductase NAD-binding and FAD-binding domains (PF00175 and
301
PF00970), present in prmB, isoF, xamoF, benC, and tmoF; the methane/phenol/toluene
302
hydroxylase domains (PF02332), found in bmoX, xamoA, and tmoA; and the MmoB/DmpM
303
family (PF02406), found in prmD, xamoD, and tmoD. The alkane and aromatic groups shared
304
a fatty acid desaturase domain (PF00487) that catalyzes the insertion of a double bond at the
305
delta position in fatty acids (Pfam: Family: FA_desaturase (PF00487), n.d.). This domain is
306
present in alkane monooxygenases (alkB), alkane hydroxylases (alkMa, alkMb), and xylene
307
monooxygenase subunit 1 (xylM).
308
3.5 Hydrocarbon degradation enzymes in different genomes
309
One way to use this database could be to identify hydrocarbon degradation genes in different
310
genomes based on homology through BLAST searches, although other methods can be used
311
for this purpose. To this end, a BLAST comparison was conducted using the amino acid
312
sequences of HADEG against 60 genomes with previously proven hydrocarbon degradation
313
and/or biosurfactant production activity. The results of this comparison showed that 53 of 60
314
genomes presented experimentally verified hydrocarbon degradation genes; thus, the data
315
were consistent with previous reports (Fig. 4). Among the remaining 7 genomes, three
316
exhibited experimental evidence hits with a percentage of identity between 44% and 49%,
317
which was below the cutoff (50%). This was the case for the BVMO gene of Thalassolituus
318
oleivorans MIL-1 involved in the SO pathway, pegAC of Sphingopyxis macrogoltabida 203
319
involved in polyethylene glycol degradation, and rhlABC of Burkholderia thailandensis E264
320
involved in rhamnolipid synthesis. Although genomes (11, 28, 42, and 50) showed some hits
321
against the HADEG database, they did not show matches with any genes or substrates with
322
previously reported experimental results. These genomes likely include other potential
323
pathways that are not covered in the HADEG database.
324
Additionally, it was possible to identify unreported hydrocarbon degradation and biosurfactant
325
production capabilities in 59 genomes with HADEG. Many of these genomes included genes
326
for the degradation of aliphatic and aromatic HCs and polymers and the production of multiple
327
biosurfactants on the same chromosome. The occurrence of these numerous pathways might
328
be due to microbial environmental adaptations and be widely observed within the bacterial
329
world (Brzeszcz & Kaszycki, 2018). However, empirical evidence indicates that microbes show
330
metabolic pathway preferences (Brzeszcz & Kaszycki, 2018). The concordance with
331
experimental evidence and the predicted pathway repertoire show that the HADEG database
332
is a reliable resource for identifying and predicting hydrocarbon degradation pathways and
333
biosurfactant production in genomes.
334
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted September 1, 2022. ; https://doi.org/10.1101/2022.08.30.505856doi: bioRxiv preprint
9
4 Conclusions
335
The HADEG database is a bioinformatic resource that can be used to identify potential
336
microorganisms for developing HC-bioremediation methods. To the best of our knowledge,
337
HADEG, which contains 451 experimentally verified proteins, is the first database to integrate
338
different types of HCs (alkane, alkenes, aromatics, and polymers) and biosurfactants in a
339
single resource. Through ortholog analysis, we found that the HADEG proteins are distributed
340
in bacteria and fungi and that they are most highly represented in Proteobacteria. Additionally,
341
Pfam analysis showed that the domains of the HC degradation proteins correspond primarily
342
to polymers and aromatics. Most domains are not shared among HC groups, indicating that
343
their functions are HC-degradation-specific and that they can probably be used as biomarkers
344
for the identification of microbes with degradation abilities. Finally, we show that HADEG is a
345
reliable tool for annotating HC degradation and biosurfactant production proteins.
346
Author contributions
347
JR-V conceived the initial study. JR-V and HGC-S designed the experiments and analyzed
348
the data. JR-V, HGC-S, and LP-L contributed to the interpretation and discussion of the
349
results. LP-L provided financial support. All authors contributed to the writing of the manuscript
350
and approved the final version.
351
Funding
352
Jorge Rojas-Vargas is a doctoral student from Programa de Doctorado en Ciencias
353
Bioquímicas, Universidad Nacional Autónoma de México (UNAM), and received fellowship
354
965003 from the Consejo Nacional de Ciencia y Tecnología (CONACYT). Liliana Pardo-Lopez
355
received financial support from PASPA DGAPA-UNAM for a sabbatical year.
356
Acknowledgments
357
We thank the Unidad de Secuenciación Masiva y Bioinformática del Instituto de Biotecnología-
358
UNAM for giving us access to its computer cluster.
359
Conflict of interest
360
The authors declare that there are no conflicts of interest.
361
References
362
Amobonye, A., Bhagwat, P., Singh, S., & Pillai, S. (2021). Plastic biodegradation: Frontline
363
microbes and their enzymes. The Science of the Total Environment, 759, 143536.
364
Baldi, F., Ivos evi, N., Minacci, A., Pepi, M., Fani, R., Svetlic i, V., & Z
uti, V. (1999).
365
Adhesion of Acinetobacter venetianus to diesel fuel droplets studied with in situ
366
electrochemical and molecular probes. Applied and Environmental Microbiology,
367
65(5), 20412048.
368
Banat, I. M. (1995). Biosurfactants production and possible uses in microbial enhanced oil
369
recovery and oil pollution remediation: A review. Bioresource Technology, 51(1), 1
370
12.
371
Bhattacharjee, S., & Dutta, T. (2022). Chapter 1 - An overview of oil pollution and oil-spilling
372
incidents. In P. Das, S. Manna, & J. K. Pandey (Eds.), Advances in Oil-Water
373
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted September 1, 2022. ; https://doi.org/10.1101/2022.08.30.505856doi: bioRxiv preprint
10
Separation (pp. 315). Elsevier.
374
Boutet, E., Lieberherr, D., Tognolli, M., Schneider, M., & Bairoch, A. (2007).
375
UniProtKB/Swiss-Prot. In D. Edwards (Ed.), Plant Bioinformatics: Methods and
376
Protocols (pp. 89112). Humana Press.
377
Brzeszcz, J., & Kaszycki, P. (2018). Aerobic bacteria degrading both n-alkanes and aromatic
378
hydrocarbons: an undervalued strategy for metabolic diversity and flexibility.
379
Biodegradation, 29(4), 359407.
380
Buchholz, P. C. F., Feuerriegel, G., Zhang, H., Perez-Garcia, P., Nover, L.-L., Chow, J.,
381
Streit, W. R., & Pleiss, J. (2022). Plastics degradation by hydrolytic enzymes: The
382
plastics-active enzymes database-PAZy. Proteins, 90(7), 14431456.
383
Duarte, M., Jauregui, R., Vilchez-Vargas, R., Junca, H., & Pieper, D. H. (2014). AromaDeg,
384
a novel database for phylogenomics of aerobic bacterial degradation of aromatics.
385
Database: The Journal of Biological Databases and Curation, 2014, bau118.
386
Gambarini, V., Pantos, O., Kingsbury, J. M., Weaver, L., Handley, K. M., & Lear, G. (2022).
387
PlasticDB: a database of microorganisms and proteins linked to plastic
388
biodegradation. Database: The Journal of Biological Databases and Curation, 2022.
389
https://doi.org/10.1093/database/baac008
390
Gan, Z., & Zhang, H. (2019). PMBD: a Comprehensive Plastics Microbial Biodegradation
391
Database. Database: The Journal of Biological Databases and Curation, 2019.
392
https://doi.org/10.1093/database/baz119
393
Garcia-Gareta, E. (2019). Biomaterials for Skin Repair and Regeneration. Elsevier Science.
394
Geyer, R. (2020). Chapter 2 - Production, use, and fate of synthetic polymers. In T. M.
395
Letcher (Ed.), Plastic Waste and Recycling (pp. 1332). Academic Press.
396
Ghosal, D., Ghosh, S., Dutta, T. K., & Ahn, Y. (2016). Current State of Knowledge in
397
Microbial Degradation of Polycyclic Aromatic Hydrocarbons (PAHs): A Review.
398
Frontiers in Microbiology, 7, 1369.
399
Hajighasemi, M., Tchigvintsev, A., Nocek, B., Flick, R., Popovic, A., Hai, T., Khusnutdinova,
400
A. N., Brown, G., Xu, X., Cui, H., Anstett, J., Chernikova, T. N., Brüls, T., Le Paslier,
401
D., Yakimov, M. M., Joachimiak, A., Golyshina, O. V., Savchenko, A., Golyshin, P.
402
N., … Yakunin, A. F. (2018). Screening and Characterization of Novel Polyesterases
403
from Environmental Metagenomes with High Hydrolytic Activity against Synthetic
404
Polyesters. Environmental Science & Technology, 52(21), 1238812401.
405
Horton, A. A. (2022). Plastic pollution: When do we know enough? Journal of Hazardous
406
Materials, 422, 126885.
407
Huerta-Cepas, J., Szklarczyk, D., Heller, D., Hernández-Plaza, A., Forslund, S. K., Cook, H.,
408
Mende, D. R., Letunic, I., Rattei, T., Jensen, L. J., von Mering, C., & Bork, P. (2019).
409
eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology
410
resource based on 5090 organisms and 2502 viruses. Nucleic Acids Research,
411
47(D1), D309D314.
412
Ji, Y., Mao, G., Wang, Y., & Bartlam, M. (2013). Structural insights into diversity and n-
413
alkane biodegradation mechanisms of alkane hydroxylases. Frontiers in
414
Microbiology, 4, 58.
415
Jones, P., Binns, D., Chang, H.-Y., Fraser, M., Li, W., McAnulla, C., McWilliam, H., Maslen,
416
J., Mitchell, A., Nuka, G., Pesseat, S., Quinn, A. F., Sangrador-Vegas, A.,
417
Scheremetjew, M., Yong, S.-Y., Lopez, R., & Hunter, S. (2014). InterProScan 5:
418
genome-scale protein function classification. Bioinformatics , 30(9), 12361240.
419
Kaushal, J., Khatri, M., & Arya, S. K. (2021). Recent insight into enzymatic degradation of
420
plastics prevalent in the environment: A mini - review. Cleaner Engineering and
421
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted September 1, 2022. ; https://doi.org/10.1101/2022.08.30.505856doi: bioRxiv preprint
11
Technology, 2, 100083.
422
Khot, V., Zorz, J., Gittins, D. A., Chakraborty, A., Bell, E., Bautista, M. A., Paquette, A. J.,
423
Hawley, A. K., Novotnik, B., Hubert, C. R. J., Strous, M., & Bhatnagar, S. (2021).
424
CANT-HYD: A Curated Database of Phylogeny-Derived Hidden Markov Models for
425
Annotation of Marker Genes Involved in Hydrocarbon Degradation. Frontiers in
426
Microbiology, 12, 764058.
427
Krieger, C. J., Zhang, P., Mueller, L. A., Wang, A., Paley, S., Arnaud, M., Pick, J., Rhee, S.
428
Y., & Karp, P. D. (2004). MetaCyc: a multiorganism database of metabolic pathways
429
and enzymes. Nucleic Acids Research, 32(Database issue), D438-42.
430
Kumbar, S., Laurencin, C., & Deng, M. (2014). Natural and Synthetic Biomedical Polymers.
431
Elsevier Science.
432
Leinonen, R., Akhtar, R., Birney, E., Bower, L., Cerdeno-Tárraga, A., Cheng, Y., Cleland, I.,
433
Faruque, N., Goodgame, N., Gibson, R., Hoad, G., Jang, M., Pakseresht, N.,
434
Plaister, S., Radhakrishnan, R., Reddy, K., Sobhany, S., Ten Hoopen, P., Vaughan,
435
R., … Cochrane, G. (2011). The European Nucleotide Archive. Nucleic Acids
436
Research, 39(Database issue), D28-31.
437
McGinnis, S., & Madden, T. L. (2004). BLAST: at the core of a powerful and diverse set of
438
sequence analysis tools. Nucleic Acids Research, 32(Web Server issue), W20-5.
439
Moreno, R., & Rojo, F. (2017). Enzymes for aerobic degradation of alkanes in bacteria. In
440
Aerobic Utilization of Hydrocarbons, Oils and Lipids (pp. 125). Springer International
441
Publishing.
442
Moreno, R., & Rojo, F. (2019). Enzymes for Aerobic Degradation of Alkanes in Bacteria. In
443
F. Rojo (Ed.), Aerobic Utilization of Hydrocarbons, Oils, and Lipids (pp. 117142).
444
Springer International Publishing.
445
Muriel-Millán, L. F., Millán-López, S., & Pardo-López, L. (2021). Biotechnological
446
applications of marine bacteria in bioremediation of environments polluted with
447
hydrocarbons and plastics. Applied Microbiology and Biotechnology, 105(19), 7171
448
7185.
449
Ogata, H., Goto, S., Sato, K., Fujibuchi, W., Bono, H., & Kanehisa, M. (1999). KEGG: Kyoto
450
Encyclopedia of Genes and Genomes. Nucleic Acids Research, 27(1), 2934.
451
Oliveira, J. S., Araújo, W., Lopes Sales, A. I., Brito Guerra, A. de, Silva Araújo, S. C. da, de
452
Vasconcelos, A. T. R., Agnez-Lima, L. F., & Freitas, A. T. (2015). BioSurfDB:
453
knowledge and algorithms to support biosurfactants and biodegradation studies.
454
Database: The Journal of Biological Databases and Curation, 2015.
455
https://doi.org/10.1093/database/bav033
456
Pathak, U., Jhunjhunwala, A., Singh, S., Bajaj, N., & Mandal, T. (2022). Chapter 19 -
457
Potentiality of enzymes as a green tool in degradation of petroleum hydrocarbons. In
458
P. Das, S. Manna, & J. K. Pandey (Eds.), Advances in Oil-Water Separation (pp.
459
337351). Elsevier.
460
Patowary, K., Patowary, R., Kalita, M. C., & Deka, S. (2016). Development of an Efficient
461
Bacterial Consortium for the Potential Remediation of Hydrocarbons from
462
Contaminated Sites. Frontiers in Microbiology, 7, 1092.
463
Pfam: Family: Aldedh (PF00171). (n.d.). Retrieved July 31, 2022, from
464
http://pfam.xfam.org/family/pf00171
465
Pfam: Family: FA_desaturase (PF00487). (n.d.). Retrieved July 31, 2022, from
466
http://pfam.xfam.org/family/PF00487
467
Phale, P. S., Malhotra, H., & Shah, B. A. (2020). Chapter One - Degradation strategies and
468
associated regulatory mechanisms/features for aromatic compound metabolism in
469
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted September 1, 2022. ; https://doi.org/10.1101/2022.08.30.505856doi: bioRxiv preprint
12
bacteria. In G. M. Gadd & S. Sariaslani (Eds.), Advances in Applied Microbiology
470
(Vol. 112, pp. 165). Academic Press.
471
Prenafeta-Boldú, F. X., de Hoog, G. S., & Summerbell, R. C. (2018). Fungal Communities in
472
Hydrocarbon Degradation. In T. J. McGenity (Ed.), Microbial Communities Utilizing
473
Hydrocarbons and Lipids: Members, Metagenomics and Ecophysiology (pp. 136).
474
Springer International Publishing.
475
Prince, R. C., Gramain, A., & McGenity, T. J. (2010). Prokaryotic Hydrocarbon Degraders. In
476
K. N. Timmis (Ed.), Handbook of Hydrocarbon and Lipid Microbiology (pp. 1669
477
1692). Springer Berlin Heidelberg.
478
Ru, J., Huo, Y., & Yang, Y. (2020). Microbial Degradation and Valorization of Plastic Wastes.
479
Frontiers in Microbiology, 11, 442.
480
Saldívar-Guerra, E., & Vivaldo-Lima, E. (2013). Introduction to polymers and polymer types.
481
In Handbook of Polymer Synthesis, Characterization, and Processing (pp. 114).
482
John Wiley & Sons, Inc.
483
Sharma, J., Sundar, D., & Srivastava, P. (2021). Biosurfactants: Potential Agents for
484
Controlling Cellular Communication, Motility, and Antagonism. Frontiers in Molecular
485
Biosciences, 8, 727070.
486
Shimao, M. (2001). Biodegradation of plastics. Current Opinion in Biotechnology, 12(3),
487
242247.
488
Shrivastava, A. (2018). Introduction to Plastics Engineering. Elsevier Science.
489
Soberón-Chávez, G. (2010). Biosurfactants: From Genes to Applications. Springer Berlin
490
Heidelberg.
491
Soberón-Chávez, G., González-Valdez, A., Soto-Aceves, M. P., & Cocotl-Yañez, M. (2021).
492
Rhamnolipids produced by Pseudomonas: from molecular genetics to the market.
493
Microbial Biotechnology, 14(1), 136146.
494
Swain, S. K., & Jawaid, M. (2019). Nanostructured Polymer Composites for Biomedical
495
Applications. Elsevier Science.
496
Wagner, W., Sakiyama-Elbert, S., Yaszemski, M., & Zhang, G. (2020). Biomaterials Science:
497
An Introduction to Materials in Medicine. Elsevier Science & Technology.
498
Wang, W., & Shao, Z. (2013). Enzymes and genes involved in aerobic alkane degradation.
499
Frontiers in Microbiology, 4, 116.
500
Wentzel, A., Ellingsen, T. E., Kotlar, H.-K., Zotchev, S. B., & Throne-Holst, M. (2007).
501
Bacterial metabolism of long-chain n-alkanes. Applied Microbiology and
502
Biotechnology, 76(6), 12091221.
503
Xu, X., Liu, W., Tian, S., Wang, W., Qi, Q., Jiang, P., Gao, X., Li, F., Li, H., & Yu, H. (2018).
504
Petroleum Hydrocarbon-Degrading Bacteria for the Remediation of Oil Pollution
505
Under Aerobic Conditions: A Perspective Analysis. Frontiers in Microbiology, 9, 2885.
506
Zainudin, B. H., Wong, T. W., & Hamdan, H. (2020). Chapter 8 - Pectin as oral colon-specific
507
nano- and microparticulate drug carriers. In M. A. A. AlMaadeed, D. Ponnamma, &
508
M. A. Carignano (Eds.), Polymer Science and Innovative Applications (pp. 257286).
509
Elsevier.
510
Zhou, H., Zhang, S., Xie, J., Wei, H., Hu, Z., & Wang, H. (2020). Pyrene biodegradation and
511
its potential pathway involving Roseobacter clade bacteria. International
512
Biodeterioration & Biodegradation, 150, 104961.
513
514
515
516
517
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted September 1, 2022. ; https://doi.org/10.1101/2022.08.30.505856doi: bioRxiv preprint
13
518
Fig. 1. Alkane aerobic degradation pathways. A. Terminal oxidation (black color). B. Biterminal
519
oxidation (blue). C. Subterminal oxidation (green). D. Finnerty pathway (red).
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted September 1, 2022. ; https://doi.org/10.1101/2022.08.30.505856doi: bioRxiv preprint
14
A
536
537
B
538
539
Fig. 2. Taxonomic distribution of enzyme orthologs obtained with eggNOG-Mapper. A.
540
Hydrocarbon groups. B. Degradation pathways. A_alkane groups, B_alkenes, C_aromatics,
541
D_polymers, E_biosrufactants.
542
543
544
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted September 1, 2022. ; https://doi.org/10.1101/2022.08.30.505856doi: bioRxiv preprint
15
545
546
547
Fig. 3. HADEG Pfam network. Hydrocarbon degradation groups and biosurfactant production
548
are represented by squared: alkanes (red), alkenes (magenta), aromatics (yellow), polymers
549
(orange), and biosurfactants (blue). Pfams are represented by circles, and the color
550
convention is as described above. The shared protein domains among groups are shown on
551
green scale. The list of the Pfams is shown in Supplementary Table 7.
552
553
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted September 1, 2022. ; https://doi.org/10.1101/2022.08.30.505856doi: bioRxiv preprint
16
554
Fig. 4. BLAST annotation of genomes with the HADEG database and experimentally validated
555
activities. The corresponding hydrocarbon groups are used to organize both the genomes (x-
556
axis) and the HADEG pathways (y-axis). Bubble indicates hits above the confidence threshold.
557
The hits with experimental evidence are shown in black, and the predicted hits are shown in
558
gray. The number of hits in the genome is indicated by the size of the bubble. Supplementary
559
Table 8 contains a complete list of isolated genomes and their accession numbers.
560
561
562
563
564
565
566
567
568
569
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted September 1, 2022. ; https://doi.org/10.1101/2022.08.30.505856doi: bioRxiv preprint
17
A
570
571
572
B
573
574
575
576
Supplementary Fig. 1. Orthologous proteins obtained with eggNOG-mapper. A. Hydrocarbon
577
groups. B. Degradation pathways. A_alkane groups, B_alkenes, C_aromatics, D_polymers,
578
E_biosrufactants.
579
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted September 1, 2022. ; https://doi.org/10.1101/2022.08.30.505856doi: bioRxiv preprint
ResearchGate has not been able to resolve any citations for this publication.
Article
Full-text available
A growing accumulation of plastic wastes has become a severe environmental and social issue. It is urgent to develop innovative approaches for the disposal of plastic wastes. In recent years, reports on biodegradation of synthetic plastics by microorganisms or enzymes have sprung up, and these offer a possibility to develop biological treatment technology for plastic wastes. In this review, we have comprehensively summarized the microorganisms and enzymes that are able to degrade a variety of generally used synthetic plastics, such as polyethylene (PE), polystyrene (PS), polypropylene (PP), polyvinyl chloride (PVC), polyurethane (PUR), and polyethylene terephthalate (PET). In addition, we have highlighted the microbial metabolic pathways for plastic depolymerization products and the current attempts toward utilization of such products as feedstocks for microbial production of chemicals with high value. Taken together, these findings will contribute to building a conception of bio-upcycling plastic wastes by connecting the biodegradation of plastic wastes to the biosynthesis of valuable chemicals in microorganisms. Last, but not least, we have discussed the challenges toward microbial degradation and valorization of plastic wastes.
Chapter
Full-text available
Hydrocarbons have been part of the biosphere for millions of years, and a diverse group of prokaryotes has evolved to use them as a source of carbon and energy. To date, all the formally defined genera are eubacterial, in 7 of the 24 major phyla currently recognized (Tree of Life, 2009); principally in the Actinobacteria, the Bacteroidetes, the Firmicutes, and the Proteobacteria. Some Cyanobacteria have been shown to degrade hydrocarbons on a limited scale, but whether this is of any ecological significance remains to be seen – it is likely that all aerobic organisms show some basal metabolism of hydrocarbons by nonspecific oxygenases, and similar “universal” metabolism may occur in anaerobes. More recently, some thermophilic, acidophilic methanotrophs from the phylum Verrucomicrobia have also been isolated (Chapter 26, Vol. 3, Part 1); we may expect more diversity as research proceeds. There have been publications indicating that halophilic archaea are significant hydrocarbon-degraders in some environments, but these have not yet been rigorously, or formally, described.
Article
Full-text available
Alkanes are major constituents of crude oil. They are also present at low concentrations in diverse non-contaminated because many living organisms produce them as chemo-attractants or as protecting agents against water loss. Alkane degradation is a widespread phenomenon in nature. The numerous microorganisms, both prokaryotic and eukaryotic, capable of utilizing alkanes as a carbon and energy source, have been isolated and characterized. This review summarizes the current knowledge of how bacteria metabolize alkanes aerobically, with a particular emphasis on the oxidation of long-chain alkanes, including factors that are responsible for chemotaxis to alkanes, transport across cell membrane of alkanes, the regulation of alkane degradation gene and initial oxidation.
Article
Plastic polymers with different properties have been developed in the last 150 years to replace materials such as wood, glass and metals across various applications. Nevertheless, the distinct properties which make plastic desirable for our daily use also threaten our planet’s sustainability. Plastics are resilient, non-reactive and most importantly, non-biodegradable. Hence, there has been an exponential increase in plastic waste generation, which has since been recognised as a global environmental threat. Plastic wastes have adversely affected life on earth, primarily through their undesirable accumulation in landfills, leaching into the soil, increased greenhouse gas emission, etc. Even more damaging is their impact on the aquatic ecosystems as they cause entanglement, ingestion and intestinal blockage in aquatic animals. Furthermore, plastics, especially in the microplastic form, have also been found to interfere with chemical interaction between marine organisms, to cause intrinsic toxicity by leaching, and by absorbing persistent organic contaminants as well as pathogens. The current methods for eliminating these wastes (incineration, landfilling, and recycling) come at massive costs, are unsustainable, and put more burden on our environment. Thus, recent focus has been placed more on the potential of biological systems to degrade synthetic plastics. In this regard, some insects, bacteria and fungi have been shown to ingest these polymers and convert them into environmentally friendly carbon compounds. Hence, in the light of recent literature, this review emphasises the multifaceted roles played by microorganisms in this process. The current understanding of the roles played by actinomycetes, algae, bacteria, fungi and their enzymes in enhancing the degradation of synthetic plastics are reviewed, with special focus on their modes of action and probable enzymatic mechanisms. Besides, key areas for further exploration, such as the manipulation of microorganisms through molecular cloning, modification of enzymatic characteristics and metabolic pathway design, are also highlighted.
Chapter
Polymers and specifically polysaccharides receive widespread application in drug delivery system design. Apart from being abundant, biodegradable, biocompatible, low cost, and most importantly nontoxic, polysaccharides can be bioactive with anticancer, antiinfective, antiinflammatory, antioxidative, and antidiabetic attributes. Pectin is a cell wall polysaccharide constituting repeating units of α-1,4-linked-d-galacturonic acid. It can be isolated from citrus peel and apple pomace at low costs. Pectin is commonly categorized as high methoxyl pectin or low methoxyl pectin, based on the degree of esterification >50% and <50% respectively. Pectin is biodegradable, biocompatible, and safe for human consumption. It is a dietary fiber that plays a role in preventing colon cancer and demonstrates apoptotic, cell cycle arrest, and galectin-3 inhibitory activities. Pharmaceutically, pectin has been widely applied as a matrix or coat material in drug delivery systems, specifically in oral colon-specific carrier development. This chapter reviews the anticancer activities of pectin and its application in oral colon-specific drug delivery for the treatment/prevention of colon cancer and related ailments.
Article
Microbial mineralization plays a significant role in the removal of polycyclic aromatic hydrocarbons (PAHs) from polluted environments. Bacteria affiliated to the Roseobacter clade are ubiquitous and abundant in various environments, including PAH-polluted areas. However, very little is known about the PAH degradation mechanism utilized by the Roseobacter clade. In this study, eight bacterial strains belonging to the Roseobacter clade were isolated from sediments collected from the estuary of the Pearl River. Degradation of pyrene and two other typical PAHs (phenanthrene and benzo[a]pyrene) was studied in the eight isolated strains of Roseobacter clade bacteria (RCB) and three other type strains. The results revealed that all strains had low PAH-degrading efficiency when PAHs were used as the sole source of carbon. However, upon supplementation with an alternative carbon source, the degradation was greatly stimulated. This implies that RCB degrade PAHs via a co-metabolism pathway. A putative pyrene degradation pathway in RCB was re-constructed based on genomic analysis. pahE, a functional marker gene for PAH degradation, was detected in the genomes of RCB. PAH ring-hydroxylating dioxygenase alpha subunit domain family and beta subunit domain family and PAH degradation-related Rieske [2Fe–2S] domain were also identified in the test strains. These findings provide fundamental evidence that RCB are capable of degrading PAHs through a versatile metabolic pathway.
Chapter
The present chapter reviews and discusses recent advances in the ecophysiology, phylogeny, and biotechnological applications of fungi with respect to their ability to degrade hydrocarbons. There is a very wide fungal biodiversity with diverse enzymatic mechanisms that transform different hydrocarbon chemical structures, from short chain aliphatics to heavy weight polycyclic aromatics. Alkanes and alkylbenzenes are generally metabolized as the sole source of carbon and energy via specialized metabolic pathways that start with the substrate oxidation through cytochrome P450 monoxygenases. Unsaturated alkenes and alkynes, as well as alicyclics, are more recalcitrant to fungal degradation and are often converted to partly oxidized metabolites. Aromatic hydrocarbons ranging from the single benzene ring to the high-molecular-weight polycyclics are generally degraded via one or more of three independent enzymatic systems. The intracellular P450 monooxygenases that detoxify harmful chemicals are universally present in the microsomes of eukaryotic cells, while lignin-degrading fungi specifically produce extracellular peroxidases and laccases that biodegrade aromatic hydrocarbons. Laccases are not exclusively active in lignin biodegradation: other functions have been reported for these enzymes in nonligninolytic fungi. The low functional specificity and high redox potential of peroxidases and laccases enables the oxidation of a broad range of aromatic hydrocarbons and other recalcitrant contaminants. Such co-incidental biodegradation processes often result in partially degraded compounds that do not support fungal growth and that might be more toxic than the parent substrates. Relevant hydrocarbonoclastic fungal strains deposited in culture collections have been identified and their phylogenies revised and reassessed when necessary. The capacity to assimilate hydrocarbons in fungi may have evolved in the context of biotrophic interactions in environments that are rich in naturally biosynthesized alkanes and volatile alkylbenzenes. The ability to utilize hydrocarbons seems to correlate with virulence toward humans, as seen in phylogenetically unrelated genera of hydrocarbonoclastic fungi, e.g., Scedosporium (Microascales) and Exophiala-Cladophialophora (Chaetothyriales). Applied research on hydrocarbonoclastic fungi includes studies dedicated to preventing biodeterioration as well as on potential use of the same enzymatic capabilities for bioremediation purposes. Fungal contamination of fuels is a long-standing problem that has acquired new dimensions as new biofuel blends have emerged. Recent improvements in phylogenetic understanding of fungal biodeteriogens may provide enhanced biocontrol opportunities. In work related to restoration of ecosystems, the ability of hydrocarbonoclastic fungi to form extended mycelial networks, in combination with the broad capabilities of their catabolic enzymes, makes these fungi well suited for the bioremediation of hydrocarbon-polluted soils. However, some cases of unsatisfactory biodegradation efficiency in operations conducted at field scale and cases in which toxic intermediates were generated have turned research efforts towards synergistic biodegradation processes mediated by complex microbial populations (i.e., fungal-bacterial mixtures). The assimilatory biodegradation of volatile alkanes and alkylbenzenes by certain fungal species makes them ideal candidates for the biofiltration of air polluted with these compounds. However, the potential correlation between hydrocarbon utilization and capacity for human infection must be taken into account in the design of biofiltration systems in order to prevent unintended production of biohazardous conditions. Ongoing research is focusing on the precise delimitation of genetic mechanisms that underlie these two apparently converging ecological traits.
Book
Biosurfactants, tensio-active compounds produced by living cells, are now gaining increasing interest due to their potential applications in many different industrial areas in which to date almost exclusively synthetic surfactants have been used. Their unique structures and characteristics are just starting to be appreciated. In addition, biosurfactants are considered to be environmentally “friendly,” relatively non-toxic and biodegradable. This Microbiology Monographs volume deals with the most recent advances in the field of microbial biosurfactants, such as rhamnolipids, serrawettins, trehalolipids, mannosylerythritol lipids, sophorolipids, surfactin and other lipopeptides. Each chapter reviews the characteristics of an individual biosurfactant including the physicochemical properties, the chemical structures, the role in the physiology of the producing microbes, the biosynthetic pathways, the genetic regulation, and the potential biotechnological applications.
Chapter
Polymers are very large molecules, or macromolecules, formed by the union of many smaller molecules. This chapter gives an introduction to polymers and polymer types. First, it talks about mechanical and rheological properties of different types of polymers, polymer states, and molecular weight distribution. Next, the chapter reviews the different criteria for the classification of polymers. The classifications are based on structure, mechanism of polymerization, and chain topology. Then, the chapter discusses three naming systems of polymers. They are conventional nomenclature based on source or structure, IUPAC structure-based nomenclature, and trade and common names and abbreviations.