ArticlePDF AvailableLiterature Review

Emerging Trends in Genomic Approaches for Microbial Bioprospecting

Authors:

Abstract

Abstract Microorganisms constitute two out of the three domains of life on earth. They exhibit vast biodiversity and metabolic versatility. This enables the microorganisms to inhabit and thrive in even the most extreme environmental conditions, making them all pervading. The magnitude of biodiversity observed among microorganisms substantially supersedes that exhibited by the eukaryotes. These characteristics make the microbial world a very lucrative and inexhaustible resource for prospecting novel bioactive molecules. Despite their vast potential, over 99% of the microbial world still remains to be explored. The primary reason for this is that the culture-dependent methods used in the laboratories are grossly insufficient, as they support the growth of under 1% of the microorganisms found in nature. This limitation necessitated the development of techniques to circumvent culture dependency and gain access to the outstanding majority of the microorganisms. The development of culture-independent techniques has essentially reshaped the study of microbial diversity and community dynamics. Application of genomic and metagenomic approaches is contributing substantially towards characterization of the real microbial diversity. The amenability of these techniques to high throughput has opened the doors to explore the vast number of "uncultivable" microbial forms in substantially lesser time. The present article provides an update on the recent technological advances and emerging trends in exploring microbial community.
Review Article
Emerging Trends in Genomic Approaches
for Microbial Bioprospecting
K.B. Akondi and V.V. Lakshmi
Abstract
Microorganisms constitute two out of the three domains of life on earth. They exhibit vast biodiversity and
metabolic versatility. This enables the microorganisms to inhabit and thrive in even the most extreme envi-
ronmental conditions, making them all pervading. The magnitude of biodiversity observed among micro-
organisms substantially supersedes that exhibited by the eukaryotes. These characteristics make the microbial
world a very lucrative and inexhaustible resource for prospecting novel bioactive molecules. Despite their vast
potential, over 99% of the microbial world still remains to be explored. The primary reason for this is that the
culture-dependent methods used in the laboratories are grossly insufficient, as they support the growth of under
1% of the microorganisms found in nature. This limitation necessitated the development of techniques to
circumvent culture dependency and gain access to the outstanding majority of the microorganisms. The de-
velopment of culture-independent techniques has essentially reshaped the study of microbial diversity and
community dynamics. Application of genomic and metagenomic approaches is contributing substantially to-
wards characterization of the real microbial diversity. The amenability of these techniques to high throughput
has opened the doors to explore the vast number of ‘‘uncultivable’’ microbial forms in substantially lesser time.
The present article provides an update on the recent technological advances and emerging trends in exploring
microbial community.
Introduction
Microorganisms occupy every habitat on earth and
determine the biogeochemistry of the planet. For more
than a century, the identification and characterization of micro-
organisms has been carried out exclusively via traditional/
axenic culturing techniques. These studies contributed sub-
stantially towards the establishment and development of
public healthcare practices. They also revealed the key roles
that microorganisms play in geochemical cycles and biore-
mediation, thus offering insights into the vast potential of the
microbial world. As rich sources of novel bioactive molecules,
the microorganisms continue to attract substantial interest for
their pharmaceutical and therapeutic applications ( Joint et al.,
2010; Van Hamme et al., 2003; Widada et al., 2002). Despite
providing considerable insights into the microbial world,
culture-dependent techniques have some major limitations.
They show a considerable bias towards organisms that are
best adapted to laboratory conditions. Microorganisms co-
exist as mixed communities in nature. The composition and
dynamics of each community are determined by biotic and
abiotic factors. Environmental factors such as pH, tempera-
ture, and salinity play a vital role in determining the type of
microflora. Numerous studies have found that the cultivable
microorganisms constitute only a fraction of the overall
community diversity. For example, the microorganisms ini-
tially identified to play a predominant role in bioremediation
via culture techniques were later realized to contribute mar-
ginally. Further, when the gene pools of various xenobiotics
degrading microbial communities were compared, the ‘un-
cultivable organisms’ were found to constitute a significant
fraction of total diversity (Ferrer et al., 2009; Handelsman,
2004, 2005). Vast disparities were also apparent between the
number and range of cultivable microorganisms and the
overall estimate of microbial diversity in marine biopros-
pecting studies. Characterizing the biodiversity of marine
microflora has been especially challenging due to the com-
plexity and dynamic nature of the marine environments ( Joint
et al., 2010). These and many other studies have revealed that
preferential growth of specific microorganisms under labo-
ratory culturing conditions gave a distorted/erroneous pic-
tures of the constitution of microbial communities (Eyers
et al., 2004; Malik et al., 2008; Torsvik et al., 2002). Hence,
investigations employing traditional culturing approaches
alone are becoming unacceptable for bioprospecting and
cannot be relied upon to determine the community
Department of Applied Microbiology, Sri Padmavati Women’s University, Tirupati, India.
OMICS A Journal of Integrative Biology
Volume 17, Number 2, 2013
ªMary Ann Liebert, Inc.
DOI: 10.1089/omi.2012.0082
61
composition accurately either qualitatively or quantitatively
(Daniel, 2005; Malik et al., 2008; Torsvik et al., 2002).
To address this issue, novel culturing techniques to simu-
late the natural habitat of various organisms in laboratory are
being developed and optimized to enhance microbial growth
(Widada et al., 2002). In addition, culture-independent meth-
ods (CIMs) employing current molecular approaches, are also
significantly contributing towards improving our under-
standing of the biodiversity of microorganisms (Malik et al.,
2008; Stenuit et al., 2008). The various strategies being adopted
for studying bacterial bioprospecting and biodiversity are
summarized in Figure 1. Collectively, these approaches serve
as powerful tools to tap into the biotechnological applications
of the vast resource of uncultured microorganisms.
Emergence of Culture-Independent Methods
The most significant surge in application of molecular tools
to study the microbial ecology and biodiversity began in 1995.
Analysis of specific cell constituents such as nucleic acids,
proteins, and lipids extracted directly from the environment
samples allow simultaneous study of both the cultivable and
noncultivable microorganisms (Greene and Voordouw, 2003;
Neufeld et al., 2004). Application of these strategies revealed a
startling diversity of the ‘uncultivable’ microbiota. Their
abundance renewed the excitement towards exploring these
unknown microorganisms. The initial CIMs were based on
PCR sequencing of clones from 5S rRNA cDNA library, which
were found to be unreliable. Analyzing the 1500 bp long 16S
FIG. 1. Current approaches in investigating biodiversity and bioprospection of microorganism. Microorganisms from
environmental samples can be isolated using traditional plate culture methods, but only a limited number of microorganisms
are able to adapt to the culture conditions. Culture enrichment techniques enhance the possibility of isolating organisms, if
selective conditions are provided to encourage growth of desired organisms. The high throughput methods allow simulta-
neous testing of a wide range of conditions and allow isolation of novel organisms of biological interest. The culture-
independent approaches are based on DNA isolation and generation of genomic or metagenomic libraries to explore the vast
diversity of uncultivable, as well as cultivable, microflora. These libraries can be further screened for novel bioactive mol-
ecules and functional characteristics.
62 AKONDI AND LAKSHMI
rRNA gene, however, was much more successful. This is be-
cause these sequences are ubiquitously distributed and show
high levels of conservation even among evolutionarily distant
species. 16S rRNA typing is widely applied for taxonomic
grouping of bacteria. The well-accepted criteria for defining a
new bacterial species is a less than 97% similarity in the 16S
rRNA gene sequence compared to the closest known bacterial
species (Lane et al., 1985; Nocker et al., 2007; Pace et al., 1985).
16S rRNA typing also allows comparison of microbial distri-
bution among different samples, in addition to quantifying
the relative abundance of each taxonomic group. The ro-
bustness and versatility of this technique makes it a good tool
to carry out phylogenetic inferences and gain insights into the
metabolic diversity of microorganisms. This technique has
provided the most convincing evidence of the vastness of
microbial diversity (Handelsman, 2004; Ludwig, 2010; Sanz
and Kochling, 2007).
In spite of the above advantages, relying solely on 16S
rRNA gene sequences for phylogenetic analyses does have
some limitations. First, the differentiation of closely-related
bacterial species can become ambiguous at times, since the
16S rRNA genes are highly conserved (Gu
¨rtler and Stanisich,
1996; Kolbert and Persing, 1999). The presence of multiple 16S
rRNA gene copies among different organisms (varying from 1
to 15), heterogeneity due to paralogous copies, occurrence of
lateral gene transfer in some cases can also result in incon-
clusive phylogenity (Klappenbach et al., 2001). To address
some of these challenges, Multi-Locus Sequence Analysis
(MLSA) has been developed. This technique in principle is
similar to Multi-Locus Sequence Typing (MLST), which is
widely used for bacterial typing in epidemiological studies
(Enright and Spratt, 1999; Platonov et al., 2000). MLSA utilizes
ubiquitous housekeeping and other functional genes present
as single copies, to complement 16SrRNA data. Some of the
genes targeted for MLSA genotyping approach include 23S
rRNA,RpoB, gyrB, and pheS. recA, (DNA repair protein) dnak,
atpD EF-Tu, EF-G hsp70, and hsp60, in addition to 16SrRNA
sequence (Lee and Cote, 2006; Ludwig, 2010). Additional
specific functional genes like pmoA, mxaF, and nod are helpful
when analyzing environmental samples to deduce correlation
between the extent of diversity and the metabolic function
(Horz et al., 2001). To strike a balance between the acceptable
identification power and time/cost for strain typing, internal
fragments (450–500 bases length) of multiple housekeeping/
functional genes (about 7–8 genes) are commonly used in the
laboratories (Ludwig, 2010). Genetic fingerprinting tech-
niques based on the separation of the phylogenetic markers
provide a reliable and specific profiling pattern for a given
microbial community. The application potential, advantages,
and limitations of the various finger printing techniques
available have been extensively reviewed earlier (Cardenas
and Tiedje, 2008; Fromin et al., 2002; Gabor et al., 2007; Nocker
et al., 2007). The application of these approaches has signifi-
cantly widened the horizon of phylogenetic and taxonomical
characterization of uncultured bacteria from diverse envi-
ronments (Ferrer et al., 2009).
The ability to conduct genome-wide analysis of large
communities of microbial phyla in the post-genomic era has
substantiated the complete dependence of the biosphere on
the metabolic activities of microorganisms. The total number
of characterized bacterial species to-date is limited to around
only 6000 (Rappe and Giovannoni, 2003; Vinuesa,2010).
However, the genome representation per g of soil is estimated
to be about 4000–7000 and total prokaryotic cell diversity
predicted to be an incredible 4–6 ·10
30
!( Joint et al, 2010;
Kaeberlein et al, 2002).The advancements in r-RNA based
phylogenetic approaches are currently allowing monitoring
of 50–200 microbial forms/g (Pontes et al., 2007; Malik et al.,
2008).Even at this stage, the current Gene Bank entries of 16S
RNA genes from uncultured prokaryotes significantly out-
numbering (over twice) those identified via culturing tech-
niques. Several new divisions of bacteria with little or no
affiliation to the known organisms have been characterized
from the fraction of uncultured organisms using CMI (Hallam
et al., 2006; Kunin et al., 2008b). Thus, the bulk of the micro-
biota still remains as a vast untapped resource for application
(Handelsman, 2004, 2005; Lovely, 2003; Vinuesa,2010). Sev-
eral studies have further revealed presence of huge genomic
diversity even within a single bacterial species (Malik et al.,
2008; Mira et al., 2010; Nocker et al., 2007; Tettelin et al., 2008).
Advances in sequencing technologies and subsequent re-
duction in cost are permitting complete genome sequencing of
several bacterial strains of individual species. Comparative
analysis of data from genomes of multiple strains/species of a
single bacterium is collectively termed as a pan-genome. The
pan-genomic repertoire is larger in magnitude by many or-
ders than any single genome. It comprises mainly of a ‘‘core
genome’’ containing the genes which are present in all char-
acterized strains. Other ‘‘dispensable genome’’ sets contain
genes that are present in two or more strains or genes that are
unique to a specific strain (Mira et al., 2010; Tettelin et al.,
2008). Thus, CIMs have become indispensable tools for in-
vestigating bacterial genetic diversity, population structures,
and understanding their ecological role in various habitats.
DNA Microarrays
Microarray technology is yet another important taxo-
nomical and functional tool that is widely used for genome
and proteome analysis of mixed microbial communities. The
property of a single-stranded DNA or RNA molecule to hy-
bridize with complementary probe molecules attached to a
solid support forms the basic principle of microarrays.
Readouts are detected as signals given off by the fluorescent
dyes incorporated in the sample (Zhou, 2003). The microarray
slides/chips are prepared by spotting an ex situ synthesized
probe or by directly assembling the probe in situ. In the latter
approach, thousands of probes are spotted on a single slide
using photolithographic masks and electrochemical reactions
(e.g., ‘‘Gene Chip’’ arrays). The microchip types differ based
on immobilization technology used, length, and nature of the
probes, as well labeling of the targets. Factors such as probe
density, specificity, sensitivity, quantification, and cost decide
the technique selected for a study. Compared to traditional
nucleic acid hybridization techniques, microarrays provide a
rapid and sensitive detection system capable of detecting upto
a single mismatch (Gao et al., 2007; Liu and Zhu. 2005; Zhou,
2003).
Three major classes of environmental microarray formats
are used for bioprospecting microbial community. These are
the community genome arrays (CGA), the phylogenetic
oligonucleotide arrays (POA), and functional gene arrays
(FGA) (Chandler et al., 2006; Rhee et al., 2004; Wu et al., 2001,
2004, 2006). Combination of microarrays types increases the
MICROBIAL BIOPROSPECTING 63
versatility to probe complex microbial communities. For ex-
ample, comprehensive FGA or Geochips that also have sev-
eral phylogenetic probes, facilitated study of microbial
communities dynamics in situ (He et al., 2007; Wu et al., 2001).
Coupling of whole community RNA amplification (WCRA)
with CGA allowed monitoring of the functional activities of
microbial communities in environments contaminated with
organic solvents, hydrocarbons, and uranium (Gao et al.,
2007; Wu et al., 2006). Another prominent step forward in
application of DNA arrays to community analysis is the use of
probes produced directly from environmental DNA without
any cultivation steps. Such metagenomic arrays (MGA) hold
potential for high-throughput screening and have been suc-
cessfully applied to characterize microbial community of a
groundwater microcosm and other natural environments
(Gentry et al., 2006; Sebat et al., 2003). These strategies reveal
many direct linkages between biogeochemical processes and
functional activities among microbial communities of the
environments. Thus, microarrays present the advantage of
miniaturization, for simultaneous gene function analysis in
real time (Chandler et al., 2006; Rhee et al., 2004).
Real-Time PCR
Real-time PCR (rt-PCR) is a robust and powerful tool
widely employed in microbial ecology for profiling and bio-
prospecting of environmental samples. This technique allows
real-time quantification of amplicons during or at end of each
cycle using fluorescent markers. In early exponential phase of
PCR, the amplification of targets is directly proportional to the
initial template concentration. This allows reliable quantifi-
cation, as the observed intensity of fluorescence is directly
proportional to the product concentration. rt-PCR is found to
be 100- to 10,000-fold more sensitive than the microarray-
based methods, with quantification limit of 1–2 genome
copies (Eyers et al., 2004; Inglis and Kalischuk, 2004). Further,
its real-time monitoring ability avoids all the time-consuming
post-PCR quantification steps. The high analytical sensitivity
for identification of specific genes in complex DNA mixtures
also makes rt- PCR highly suitable for analysis of environ-
mental samples (Powell et al., 2006; Ritalahti et al., 2006). Its
amenability to high-throughput allows scale up for genomic
and metagenomic level analyses. Community profiling of
challenging samples such as hydrocarbon-contaminated
Antarctic soil has been achieved using this approach. In this
study, real time changes in gene expression and influence of
biotic and abiotic factors were successfully recorded across
both spatial and temporal levels (Ritalahti et al., 2006).
Numerous variants of PCR have been used for DNA am-
plification from complex environmental samples to address
microbial communities profiling from diverse facets including
biodegradation of organic contaminants (Gabor et al., 2007;
Gilbride et al., 2006; Stenuit et al., 2006, 2008). These tech-
niques adopted in conjunction with real time analysis increase
their versatility and allow quantification. Multiplex rt-PCR
technique is one such variant that utilizes multiple primer sets
within a single PCR mix. As multiple genes are targeted at
once, several amplicons are formed simultaneously. Each
primer set is labeled with distinct fluorescent dyes having
nonoverlapping excitation ranges. The resulting spectra per-
mit independent detection of target amplification rates with
good correlation. Thus, careful design of primers sets, coupled
with use of optimized annealing conditions can provide a lot
of additional information in a single test run. This helps to
conserve the sample material and avoids well-to-well varia-
tion. Another important benefit of multiplex rt-PCR is the ease
with which normalization can be carried out to increase the
reliability of the results. To achieve this, reference genes (like
house-keeping genes) are amplified along with target genes as
internal controls to get more accurate quantification of target
genes. Normalization also permits reliable comparison be-
tween results obtained from different experiments.
Multiplex PCR has been widely used to save time and
resources in several bioprospecting and bioremediation
studies. Some of these include detection of mono- or dioxy-
genase enzymes attacking polycyclic aromatic hydrocarbon
(PAH) (Baldwin et al., 2003; Dionisi et al., 2004; Gilbride
et al., 2006; Harms et al., 2003; Wilson et al., 1999). Targeting
gene encoding Rieseke iron sulfate center (which is common
in dioxygenase enzyme) using generic primers and rt- PCR
could track the population shifts of PAH degrading micro-
organisms (Ce
´bron et al., 2008; Chandain et al., 2006). This
approach was also used to characterize microbial heteroge-
neity and functionality of the Anaeromyxobacter community at
the Oak Ridge IFC uranium-contaminated sub-surface envi-
ronment (Thomas et al., 2009). In another study, good cor-
relation was observed between data obtained from multiplex
rt-PCR and dot blot hybridization (validation test), as well as
C
14
-mineralization (direct indicator) for naphthalene degra-
dation by Proteobacteria sp. showing validation of the former
rt-PCR results (Nyyssonen et al., 2006). These studies illus-
trate the use of rt-PCR and its variants as powerful and re-
liable tools for assessing bioremediation potential in an
environment, as well as for bioprospecting of novel enzymes
without isolation/cultivation of bacteria. Thus speed, sensi-
tivity, accuracy, and amenability to robotic automation
makes rt- PCR occupy a prominent position among the mo-
lecular tools (Cardenas and Tiedje, 2008; Lerat et al., 2005;
Powell et al., 2006).
PCR-Independent Amplification Techniques
New PCR-independent amplification techniques are
emerging as popular tools to access genomic information
from very low abundance microbial sources that otherwise
remain inaccessible This approach avoids generic biases as-
sociated with the PCR-dependent methods such as artifacts/
errors resulting from PCR or skewing due to unequal am-
plification. Whole genome amplification using multiple
displacement amplification (MDA) techniques exhibited re-
markably uniform amplification across the genomic targets in
comparison to PCR-based whole genome amplification. As
MDA used ø29 DNA polymerase and random exonuclease
resistant primers, there was no need for thermal cycling
(Abulencia et al., 2006; Binga et al., 2008; Dean et al., 2002).
This method generated larger sized products with lower error
frequency and was amicable to single cell genome sequencing
as well. Combination of MDA and CGA technique was suc-
cessful in analyzing oligotrophic microbial communities in
groundwater contaminated with uranium and other metals
(Wu et al., 2006). T7 polymerase-based linear amplification
approach using fusion primers has been utilized for mRNA-
based metatranscriptome analyses (Gao et al., 2007). These
and other emerging techniques overcome the various
64 AKONDI AND LAKSHMI
limitations of PCR and open vistas to explore low density/
uncultivable organisms present in a microbial community.
Innovations in Culturing Techniques
In an effort to enhance cultivability of more microbial types
in laboratory, novel culturing techniques are being actively
developed. These techniques are based on mimicking the
natural habitat in which the microorganisms of interest grow
and thrive. Techniques such as dilution-to-extinction, cultur-
ing in arrays, diffusion chambers, and micro-droplet encap-
sulation are being successfully applied to significantly
improve the cultivability of as-yet uncultivable marine or-
ganisms in low-nutrient media (Connon and Giovannoni,
2002; Dionisi et al., 2012; Kaeberlein et al., 2002; Nicholas et al.,
2008; Zengler et al., 2002, 2005). Providing a nutrient-poor
media increased the recovery percentage of the cultured
forms by several orders of magnitude compared to what was
achieved earlier using nutrient-rich media. The studies re-
sulted in isolation of several previously uncultured marine
bacteria and bacterio-planktons ( Joint et al., 2010; Penesyan
et al., 2009). Further, the novel cultivation methods signifi-
cantly increased the proportion of recovered microorganisms
from marine environments from 10% to 25%, just in the last
decade (Connon and Giovannoni, 2002; Lee et al., 2010).
Optimizing the physical parameters such as temperature, pH,
and pressure has permitted the isolation of several ex-
tremophiles in laboratory that produce novel cold/heat
adaptive enzymes (Cowan et al., 2005; Dionisi et al., 2012).
More recently, the availability of information regarding the
cell attachment characteristics, cellular signaling pathways,
and alternate electron acceptor requirements is also aiding in
further optimization of the culture parameters ( Joint et al.,
2010; Lee et al., 2009; Maldonado et al., 2005). Second gener-
ation automated high throughput systems like isolation chips
(ichips) and micro-petridishes available, allow enhanced
cultivability. They have few hundreds to million growth
compartments that can be inoculated even with single cells.
Integration of these novel culturing techniques with fluores-
cence microscopy allows high-throughput screening of mul-
tiple cell arrays on a large scale (Lee et al., 2010). Another
approach uses gel micro-droplets to encapsulate single cells
for large-scale parallel microbial cultivation under low-
nutrient flux conditions (Keller and Zengler, 2004). As the
micro-colonies are formed in agarose, its porous nature fa-
cilitates easy diffusion of nutrients, signaling molecules, and
waste metabolites. Growth within these microcapsules is de-
tected by flow cytometry (Toledo et al., 2002; Zengler et al.,
2002, 2005). The advantage of the micro-droplet and microbial
trap methods is that the beads formed are easier to handle due
to being physically distinct and much larger in size than
bacterial cells. The studies of Zengler and colleagues (2005)
efficiently employed micro-droplet and encapsulation proce-
dure to enrich actinomycetes by allowing their efficient colo-
nization. The study identified several new clades of marine
actinomycetes, which could not be detected previously in the
environmental gene library. These approaches have sub-
stantially widened the scope of microbial bioprospecting
(Ingham et al., 2007, Nicholas et al., 2010; Sprenkels et al.,
2007). Although novel culturing techniques allow isolation of
several novel organisms, many of them are found to undergo
only limited number of divisions in the laboratory (Doinisi
et al., 2012; Joint et al., 2010). Adaptability/domestication of
the recovered strain to laboratory conditions are a bottleneck
that needs to be addressed for the successful large scale cul-
tivation. Co-culturing with helper organisms and/or identi-
fying signal peptides has been found to aid culturing of
hitherto uncultivable strains (Lewis et al., 2010; Nicholas et al.,
2008).
The Metagenomic Shift
All the genomic approaches described thus far focus on
deciphering the complete genetic complement of a single or-
ganism. However, microorganisms exist in nature as com-
munities of varying complexity. The role of an individual
organism in an ecosystem is dictated by the composition of its
surrounding microbial community. Metagenomics focuses on
microbial community profiling and transcends the limitations
of studying individual organisms. The term ‘‘metagenomics’’
was coined by Handelsman and associates in 1998 and is also
known as community genomics, ecogenomics, or environ-
mental genomics. In metagenomics, genome sequences from
an entire community of organisms inhabiting a common en-
vironment are sampled. The process requires no prior sepa-
ration of organisms from their habitat or maintenance of the
microorganisms as pure or mixed cultures in laboratory.
Metagenomic strategies also circumvent several limitations
occurring due to direct DNA cloning. The approach mini-
mizes improper representations of the microbial community
as observed while screening a finite number of clones (Cowan
et al., 2005). As nucleic acids are extracted directly from the
environmental sample, the genomic data obtained includes
both characterized and novel microbial forms of the com-
munity. In principle, any environment is amenable to meta-
genomic analysis, provided good quality nucleic acids can be
extracted from the sample material. The advancements in
sequencing technology are now catering for massive scale
sequencing of the vast metagenomic repertoire.
The strength of metagenomics lies in its potential for ser-
endipitous discovery as the approach by-passes the major
limitations of classical approaches in microbiology (Binga
et al., 2008; Cowan et al., 2005; Ferrer et al., 2009). This field
area has gained a lot of significance in the last decade by
becoming the center of focus for several studies. Numerous
metagenomics projects have been initiated for analysis of
microbial communities in diverse environments, including
oceans, soils, thermal vents, hot springs, and the human
micro-biome (Sebat et al., 2003; Hugenholtz and Tyson, 2008;
Langer et al., 2006). Metagenomics studies can be conducted
on three different scales. Small-scale studies selectively inves-
tigate a function of interest in a given microbial community.
These projects are mainly initiated by a single investigator or
laboratory. Middle-scale projects are collaborative efforts em-
ploying multidisciplinary approaches to thoroughly investi-
gate the community of interest. Large-scale projects, on the
other hand, are global initiatives undertaken to understand
select microbial communities in depth and to obtain detailed
profiles (Ferrer et al., 2009; Langer et al., 2006).
The design of metagenomics projects is crucial, as the re-
sults obtained largely depend on the design strength of the
study. The design process is broadly categorized into pre-
sequencing, sampling, data generation, sequence processing,
gene prediction, annotation, and data analysis stages (Kunin
MICROBIAL BIOPROSPECTING 65
et al., 2008a, b). Prior knowledge about the dominant popu-
lation in the community of interest is helpful for optimizing
the design selection, as well as during subsequent interpre-
tation of the results. A crucial factor that needs to be ascer-
tained right at the beginning itself is the sequence coverage
requirement for the proposed study. This is because, unlike
complete genome sequencing studies where the genome size
is already known, metagenomic analyses do not have a fixed
end point. Determining the quantum of genome to be se-
quenced is especially challenging when studying diverse
microbial communities as the abundance of different organ-
isms is not uniform within the community. Further, the gene
coding densities also vary significantly among different spe-
cies. In view of these issues, to ensure a decent representation
of the genomes of the community, coverage of 6 ·to 8 ·fold
is taken as the standard (Dinsdale et al., 2008; Ferrer et al.,
2009; Kunin et al., 2008a). Certain studies have shown that
even extremely low coverage of <0.01 ·was sufficient to de-
tect genetic gradients in case of dominant population of
stratified hyper saline mat community (Goldberg et al., 2006;
Kunin et al., 2008a, b). Ultimately, the objectives of the study
guide the decisions made on coverage parameters, with an
average genome sizes reaching over 100 Mb for moderate
metagenomics libraries. The Sanger (dye terminator) se-
quencing technique has been employed as the method of
choice for obtaining metagenomic sequence data for many
years. However in recent times, pyrosequencing is also
gaining wider applicability. The advantages of this method
over the Sanger method is that it requires no prior cloning,
which avoids cloning bias. Further, the sequencing cost per
base is much lower in pyrosequencing, thus permitting se-
quencing of large repertoires at affordable cost (Edwards
et al., 2006; Shendure and Ji, 2008). The major limitation of
pyrosequencing is its short average read length. Due to this,
the approach relies heavily on similarity searches against
reference databases as gene calling or assemblies are usually
not feasible (Wommack et al., 2008). Combining both the se-
quencing technologies together has also being explored for
producing high-quality draft assemblies (Cardenas and
Tiedje, 2008; Gabor et al., 2007; Goldberg et al., 2006).
The metagenomics data sets obtained are annotated by
mapping the genes and gene fragments into families. This
provides an estimate of their relative representation. Assign-
ing roles to the proteins encoded by the sequenced genes is the
next crucial step. The two principal strategies currently used
for gene prediction are evidence-based gene calling method
and ‘‘ab initio’approach (Kunin et al., 2008a; Raes et al., 2007).
Gene prediction is generally followed by functional annota-
tion, which is similar to that in genomic annotation. To en-
hance the function assignment of the sequenced data,
gene-centric trends can also be adopted in metagenomics
(Tringe and Rubin, 2005). The annotated data is further sup-
plemented with collateral nonsequence data (i.e., ‘‘metada-
ta’’). For environmental studies, this information generally
includes geographical data such as global positioning system
coordinates, depth/height from where samples are collected,
and environmental parameters such as pH, temperature, and
salinity of the site (Kunin et al., 2008a, b; Urich et al., 2008;
Venter et al., 2004). It is crucial to record the meta information
during the initial sampling process itself, as re-sampling is not
always directly comparable for analysis. The metadata sig-
nificantly aids in interpreting the sequence data and is par-
ticularly useful during comparative analysis of temporal or
spatial distribution. As mentioned earlier, genome closure is
not possible for most metagenomes due to the vast and un-
known size. Hence, finishing becomes a viable option only for
those data sets pertaining to dominant populations within the
metagenome (Hallam et al., 2006). Some of the notable studies
that obtained complete or near-complete draft-level coverage
of dominant genome assemblies include biofilms in acid mine
drainage, activated sludge, and hypersaline environments
(Hugenholtz and Tyson, 2008; Kunin et al., 2008b).
Metagenomics is being extensively applied to explore bio-
synthetic diversity of microorganisms from varied environ-
ments. The studies have led to identification of new genes,
novel biosynthetic pathways, and bioremediation mecha-
nisms of important xenobiotic compounds (Baldwin et al.,
2003; Cowan et al., 2005; Dionisi et al., 2012; Lewis et al., 2010).
Light-driven proton pump mediated by proteorhodopsin was
first identified in bacterioplanktons from environmental DNA
samples using the metagenomics approach (Harms et al.,
2003; Robertson and Steer, 2004; Sanz and Kochling, 2007). A
more recent discovery pertains to the implication of Archaea
as one of the main ammonia oxidizers. Using bioinformatics
tools, an ammonia mono-oxygenase gene was initially iden-
tified next to the gene encoding small subunit ribosomal RNA
and their role was later confirmed experimentally both in
marine and terrestrial ecosystems (Baldwin et al., 2003; Daniel
2005; Schneiker et al., 2006). Co-localization studies combin-
ing FISH and digital image analysis are providing compara-
tive analysis of temporal or spatial information in structured
ecosystems in metagenome analysis (Handelsman, 2005;
Malik 2008; Sanz and Kochling, 2007; Wagner et al., 2006).
Intracellular fluorescent biosensors or whole-cell-based bio-
sensors are also being widely applied for environmental
sensing and detecting bioremediation of specific contami-
nants (Bhattacharyyaa et al., 2005; Williamson et al., 2005).
These studies show the presence of extensive metabolic in-
teractions and high interdependency between members of the
community. Further, they provide means for assessing the
‘true’ biodiversity of the microbial communities by cir-
cumventing the limitations imposed by the low cultivability
of the microorganisms (Cowan et al., 2005; Daniel, 2005;
Dinsdale et al., 2008; Ferrer et al., 2009). Metagenomics as-
semblages of microorganisms are also providing answers to
several fundamental questions in microbial ecology by aiding
in assessing the diversity and functionality of microorganism
objectively and quantitatively in situ.
Constructing metagenomic libraries from complex envi-
ronmental sample though is conceptually simple; it can be
technically very challenging. Co-extraction of inhibitory
substances such as humic acids, organic matter, and clay
particles can significantly interfere in the amplification step
(Gabor et al., 2007; Pontes et al., 2007). As discussed earlier,
determining the minimum size of the metagenomic library is a
major challenge. The high sequencing cost associated with a
large metagenome repertoire from complex environments can
still be prohibitive (Malik et al., 2008; Stenuit et al., 2006, 2008;
Wommack et al., 2008). In addition, improvements in the
three aspects namely, resolution, classification of short me-
tagenomic fragments, and better means for robust functional
assignment and verification can further improve the appli-
cation potential of metagenomics (Hendelsman, 2004, 2005;
Kunin et al., 2008a, b; Lewis et al., 2010). Thus, metagenomics
66 AKONDI AND LAKSHMI
has emerged as a valuable tool with applications in diverse
fields including medicine, alternative energy, environmental
remediation, biotechnology, agriculture, biodefense, and fo-
rensics (Cowan et al., 2005; Dinsdale et al., 2008; Dionisi et al.,
2012).
Integration of ‘‘Omic’’ Approaches for Microbial
Bioprospecting
Elucidating the phylogenetic diversity of microorganisms
is only the first step in understanding the biodiversity. It is
vital to determine the function of each gene at protein or
metabolic levels in addition to gathering sequencing data.
Around 30%–40% of the fully sequenced genomes from var-
ious organisms contain genes with no assigned function, as
they are unrelated to any known gene. Thus, the greater
challenge lies in assigning functions at the biochemical level to
the newly identified genes. Bridging this void is the focus of
the rapidly emerging discipline of functional genomics and
proteomics (Ferrer et al., 2009; Langer et al., 2006; Valenzuela
et al., 2006). Three different types of function-driven ap-
proaches are widely recognized. These are the phenotypic
detection of gene activity, heterologous complementation of
host strains, and induced gene expression (Ferrer et al., 2009).
At the genomic level, these functional approaches have led to
discovery of several new enzymes, antibiotics, and other
biomolecules with therapeutic and biotechnological applica-
tions (Eyers et al., 2004; Langer et al., 2006; Lee et al., 2010). As
function-driven approaches typically involve low-throughput
screens based on visual detection, there were initial difficul-
ties in adopting them to a metagenomic scale. However, au-
tomated colony picking, pipetting robotics, use of microtiter
plates, availability of sensitive activity assays, targeted bio-
molecules screening, and informatics assisted data manage-
ment have aided in automation of the functional screens,
thereby making them viable for application at metagenomic
levels (Dionisi et al., 2012; Gentry et al., 2006; Kunin et al.,
2008a). The capacity for high-throughput is vital as the
number of ‘‘hits’’ obtained during metagenome screening is
typically very low (<2 out of 10,000 clones screened). Other
high throughput techniques like microarrays, real time anal-
ysis of expressed gene, and PCR independent analysis
already discussed above are also employed in functional
metagenome analyses. As an alternative to elaborate func-
tional screens, the induced gene expression approach uses
diverse strategies to enrich/select community genomes with
desired traits through promoter activity rather than via phe-
notypic expression (Lorenz and Eck, 2005). Substrate-induced
gene expression (SIGEX) and its variants utilizing promoter-
trap gfp-expression vector, in combination with fluorescence-
activated cell sorting, have been highly successful in liquid
cultures for efficient, large scale selection of clones (Uchiyama
and Miyazaki, 2010; Yun and Ryu, 2005).
Metagenomic libraries have thus become good sources for
bioprospecting of novel biocatalyst and biomolecules, even
from yet to be cultivable organisms. Combination of the meta-
genomic approaches with heterologous expression systems aid
in substantial utilization of the microbial biodiversity for
human welfare. However, a single gene can generate a
number of distinguishable functional entities at protein level
as a result of differential splicing (Benndorf et al., 2007). The
entire sets of metabolites produced by cellular proteins in
response to various environmental stimuli are examined in
metabolomics. As all the metabolites present in a system are
targeted, there is little scope for bias related to the metabolites
examined. Thus, in addition to gene profiling, global protein
and metabolite profiling is crucial, especially while investi-
gating mechanisms of complex pathways such as bioreme-
diation (Malik et al., 2008; Stenuit et al., 2008). This makes
comprehensive characterization of the physiological func-
tions and elucidating the ecological roles of the novel organ-
isms a challenging task. Progressive integration of various
high-throughput approaches is particularly fruitful in the
field of environmental microbiology. They offer advantage of
miniaturization, automation, massive parallelization of time
consuming steps, along with ‘‘real-time’’ analysis (Dinsdale
et al., 2008; Paul et al., 2005; Zhao and Poh, 2008).
Achieving synergy between the emerging –omic ap-
proaches in the long run can offer comprehensive character-
ization of the biological and ecological function of microbial
communities in an environment. Integration of complemen-
tary ‘‘omics’’ techniques is envisaged to provide greater in-
sight into genome structure, micro-heterogeneity, lateral gene
transfer, and nutrient cycling among the members of micro-
bial community of a region. This ‘systems biology’ outlook in
a way holds promise to provide the holistic overview required
to understand the microbial ecosystems and resolve several
fundamental questions in microbial ecology and evolution.
Author Disclosure Statement
No competing financial interests exist.
References
Abulencia CB, Wyborski DL, Garcia JA, et al. (2006). Environ-
mental whole-genome amplification to access microbial pop-
ulations in contaminated sediments. Appl Environ Microbiol
72, 3291–3301.
Baldwin BR, Nakatsu CH, and Nies L. (2003). Detection and
enumeration of aromatic oxygen-ase genes by multiplex and
real-time PCR. Appl Environ Microbiol 69, 3350–3358.
Benndorf D, Balcke GU, Harms H, and Von Bergen M. (2007).
Functional metaproteome analysis of protein extracts from
contaminated soil and groundwater. ISME J 1, 224–234.
Bhattacharyyaa J, Reada D, Amosc S, Dooleyc S, Killhama K,
and Pato-Na GI. (2005). Biosensor-based diagnostics of con-
taminated groundwater: Assessment and remediation strat-
egy. Environ Pollution 134, 485–492.
Binga EK, Lasken RS, and Neufeld JD. (2008). Something from
nothing: The impact of multiple displacement amplification on
microbial ecology. ISME J 2, 233–241.
Cardenas E, and Tiedje JM. (2008). New tools for discovering
and characterizing microbial diversity. Curr Opin Biotechnol
19, 544–549.
Ce
´bron A, Norini MP, Beguiristain T, and Leyval C. (2008). Real-
time PCR quantification of PAHring hydroxylating dioxy-
genase (PAH-RHD [alpha]) genes from Gram positive and
Gram negative bacteria in soil and sediment samples. J Mi-
crobioll Methods 73, 148–159.
Chadhain SMN, Norman RS, Pesce KV, Kukor JJ, and Zylstra GJ.
(2006). Microbial dioxygenase gene population shifts during
polycyclic aromatic hydrocarbon biodegradation. Appl En-
viron Microbiol 72, 4078–4087.
Chandler DP, Jarrell AE, Roden ER, et al. (2006). Suspension
array analysis of 16S rRNA from Fe
+2
and SO
42-
reducing
MICROBIAL BIOPROSPECTING 67
bacteria in uranium contaminated sediments undergoing
bioremediation. Appl Environ Microbiol 72, 4672–4687.
Connon SA, and Giovannoni SJ. (2002). High-throughput
methods for culturing microorganisms in very-low-nutrient
media yield diverse new marine isolates. Appl Environ Mi-
crobiol 68, 3878–3885.
Cowan D, Meyer Q, Stafford W, Muyanga S, Cameron R, and
Wittwer P. (2005). Metagenomic gene discovery: Past, present
and future. Trends Biotechnol 23, 321–329.
Daniel R. (2005). The metagenomics of soil. Nature Rev Micro-
biol 3, 470–478.
Dean FB, Hosono S, Fang L, et al., (2002). Comprehensive
human genome amplification using multiple displacement
amplification. Proc Natl Acad Sci USA 99, 5261–5266.
Dinsdale EA, Edwards RA, Hall D, et al.(2008). Functional
metagenomic profiling of nine biomes. Nature 452, 629–633.
Dionisi H, Chewning C, Morgan K, Menn F, Easter J, and Sayler
G. (2004). Abundance of dioxygenase genes similar to Ral-
stonia sp. Strain U2 nagAc is correlated with naphthalene
concentrations in coal tar-contaminated freshwater sediments.
Appl Environ Microbiol 70,3988–3995.
Dionisi H, Lozada M, Nelda M, and Olivera L. (2012). Biopro-
spection of marine micro-organisms: Biotechnological appli-
cations and methods. Revista Argentina Microbiol 44, 49–60.
Edwards RA, Rodriguez-Brito B, Wegley L, et al. (2006). Using
pyrosequencing to shed light on deep mine microbial ecology.
BMC Genom 7, 57–70.
Enright MC, and Spratt BG. (1999). Multilocus sequence typing.
Trends Microbiol 7, 482–487.
Eyers L, George I, Schuler L, Stenuit B, Agathos SN, and Fan-
troussi S. (2004). Environmental genomics: Exploring the un-
mined richness of microbes to degrade xenobiotics. Appl
Microbiol Biotechnol 66, 123–130.
Ferrer M, Beloqui A, Timmis KN, and Golyshin PN. (2009).
Metagenomics for mining new genetic resources of microbial
communities. J Mol Microbiol Biotechnol 16, 109–123.
Fromin NJ, Hamelin S, Tarnawski D, et al. (2002). Statistical
analysis of denaturing gel electrophoresis (DGE) fingerprint-
ing patterns. Environ Microbiol 4, 634–643.
Gabor E, Liebeton K, Niehaus F, Eck J, and Lorenz P. (2007).
Updating the metagenomics toolbox. Biotechnol J 2, 201–206.
Gao H, Yang ZK, Gentry TJ, Wu L, Schadt CW, and Zhou J.
(2007). Microarray-based analysis of microbial community
RNAs by whole-community RNA amplification. Appl En-
viron Microbiol 73,563–571.
Gentry TJ, Wickham GS, Schadt CW, He Z, and Zhou J. (2006).
Microarray applications in microbial ecology research. Microb
Ecol 52,159–175.
Gilbride KA, Lee DY, and Beaudette LA. (2006). Molecular
techniques in wastewater: Understanding microbial commu-
nities, detecting pathogens, and real-time process control.
J Microbiol Methods 66, 1–20.
Goldberg SM, Johnson J, Busam D, et al. (2006). A Sanger/
pyrosequencing hybrid approach for the generation of high-
quality draft assemblies of marine microbial genomes. Proc
Natl Acad Sci USA 3, 11240–11245.
Greene AE, and Voordouw G. (2003). Analysis of environmental
microbial community by reverse sample genome probing.
J Microbiol Methods 53, 211–219.
Gu
¨rtler VI, and Stanisich VA. (1996). New approaches to typing
and identification of bacteria using the 16S-23S r DNA spacer
region. Microbiology 142, 3–16.
Hallam SJ, Konstantinidis KT, Putnam N, et al. (2006). Genomic
analysis of the uncultivated marine crenarchaeote Cen-
archaeum symbiosum. Proc Natl Acad Sci USA 103, 18296–
18301.
Handelsman J, Rondon MR, Brady SF, Clardy J, and Goodman
RM. (1998). Molecular biological access to the chemistry of
unknown soil microbes: A new frontier for natural products.
Chem Biol 5, 245–249.
Handelsman J. (2004). Metagenomics: Application of genomics
to uncultured microorganisms. Microbiol Mol Biol Rev 68,
669–685.
Handelsman J. (2005). Sorting out metagenomes. Nature Bio-
technol 23, 38–39.
Harms G, Layton AC, Dionisi HM, et al. (2003). Real-time PCR
quantification of nitrifying bacteria in a municipal waste water
treatment plant. Environ Sci Technol 37, 343–351.
He Z, Gentry TJ, Schadt CW, et al. (2007). GeoChip: A com-
prehensive microarray for investigating biogeo-chemical,
ecological and environmental processes. ISME J 1, 67–77.
Horz H, Yimga MT, and Liesack W. (2001). Detection of me-
thanotroph diversity on roots of submerged rice plants by
molecular retrieval of pmoA, mmoX, mxaF, and 16S rRNA
and ribosomal DNA, including pmoA-based terminal restric-
tion fragment length polymorphism prowling. Appl Environ
Microbiol 67, 4177–4185.
Hugenholtz P, and Tyson GW. (2008). Metagenomics. Nature
455, 481–483.
Ingham CJ, Sprenkels A, Bomer J, et al. (2007). The micro-petri
dish, a million-well growth chip for the culture and high-
throughput screening of microorganisms. Proc Natl Acad Sci
USA 104, 17–22.
Inglis GD, and Kalischuk LD. (2004). Direct quantification of
Campylobacter jejuni and Campylobacter lanienae in faeces of
cattle by real-time quantitative PCR. Appl Environ Microbiol
70, 2296–2306.
Joint I, Muhling M, and Querellou J. (2010). Culturing marine
bacteria. An essential prerequisite for biodiscovery. Microbiol
Biotechnol 3, 564–575.
Kaeberlein T, Lewis K, and Epstein SS. (2002). Isolating ‘‘un-
cultivable’’ microorganisms in pure culture in a simulated
natural environment. Science 296, 1127–1129.
Keller M, and Zengler K. (2004). Tapping into microbial diver-
sity. Nature Rev Microbiol 2, 141–150.
Klappenbach JA, Saxman PR, Cole JR, and Schmidt TM. (2001).
Rrndb: The ribosomal RNA operon copy number database.
Nucleic Acids Res 29, 181–184.
Kolbert CP, and Persing DH. (1999). Ribosomal DNA sequenc-
ing as a tool for identification of bacterial pathogens. Curr
Opin Microbiol 2,299–305.
Kunin V, Copeland A, Lapidus A, Konstantinos M, and Hu-
genholtz P. (2008a). Bioinformatician’s guide to metage-
nomics. Microbiol Mol Biol Rev 72, 557–578.
Kunin V, Raes J, Harris K, et al. (2008b). 75Millimeter-scale ge-
netic gradients and community-level molecular convergence
in a hypersaline microbial mat. Mol Syst Biol 4, 198–204.
Lane DJ, Pace B, Olsen GJ, Stahl DA, Sogin ML, and. Pace NR.
(1985). Rapid determination of 16S ribosomal RNA sequences for
phylogenetic analyses. Proc Natl Acad Sci USA 82, 6955–6959.
Langer M, Gabor EM, Liebeton K, et al., (2006). Metagenomics:
An inexhaustible access to nature’s diversity. Biotechnol J 1,
815–821.
Lee HY, and Cote JC. (2006). Phylogenetic analysis of c-
proteobacteria inferred from nucleotide sequence comparisons
of the house-keeping genes adk,aroE and gdh: Comparisons
with phylogeny inferred from 16S rRNA gene sequences.
J Gen Appl Microbiol 52, 147–158.
68 AKONDI AND LAKSHMI
Lee HS, Kwon KK, Kang SG, Cha SS, Kim SJ, and Lee JH. (2010).
Approaches for novel enzyme discovery from marine envi-
ronments. Curr Opin Biotechnol 21, 353–357.
Lee AK, Lewis DM, and Ashman PJ. (2009). Microbial flocculation,
a potentially low-cost harvesting technique for marine micro-
algae for production of biodiesel. J Appl Phycol 21, 559–567.
Lerat S, England LS, Vincent ML, et al. (2005). Real-time poly-
merase chain reaction quantification of the transgenes for
roundup ready corn and roundup ready soybean in soil
samples. J Agricult Food Chem 53, 1337–1342.
Lewis K, Epstein S, D’onofrio A, and Ling LL. (2010). Un-
cultured microorganisms as a source of secondary metabo-
lites. J Antibiot 63, 1–9.
Liu WT, and Zhu L. (2005). Environmental microbiology-on-a-
chip and its future impacts. Trends Biotechnol 23, 174–179.
Lorenz P, and Eck J. (2005) Metagenomics and industrial ap-
plications. Nature Rev Microbiol 3, 510–516.
Lovely DR. (2003). Cleaning up with genomics: Applying molec-
ular biology to bioremediation. Nature Rev Microbiol 1, 35–44.
Ludwig W. (2010). Molecular phylogeny of microorganisms is
rRNA still a useful marker? In: Molecular Phylogeny of Micro-
organisms. Oren A, and Papke RT, eds. Caister Academic
Press, Norfolk, U.K., 65–85.
Maldonado L, Stach J, Pathom-Aree W, Ward A, Bull A, and
Goodfellow M. (2005). Diversity of cultivable Actinobacteria in
geographically widespread marine sediments. Antonie van
Leeuwenhoek 87, 11–18.
Malik Seidu B, Michael Megharaj M, and Naidu R. (2008). The use
of molecular techniques to characterize the microbial commu-
nities in contaminated soil and water. Environ Intl 34, 265–276.
Mira A, Martı
´n-Cuadrado AB, D’auria G, and Rodrı
´guez-Valera
F. (2010). The bacterial pan-genome: A new paradigm in mi-
crobiology. Intl Microbiol 13, 45–57.
Neufeld JD, Yu Z, Lam W, and Mohnw W. (2004). Serial analysis
of ribosomal sequence tags (SARST): A high-throughput
method for profiling complex microbial communities. Environ
Microbiol 6, 131–144.
Nichols D, Lewis K, Orjala J, et al. (2008). Short peptide induces
an ‘‘uncultivable’’ microorganism to grow in vitro. Appl En-
viron Microbiol 74, 4889–4897.
Nichols D, Cahoon N, Trakhtenberg EM, et al. (2010). Use of
ichip for high-throughput in situ cultivation of ‘‘uncultivable’’
microbial species. Appl Environ Microbiol 76, 2445–2450.
Nocker A, Burn M, and Camper AK. (2007). Genotypic microbial
community profiling: A critical technical review. Microbial
Ecol 54, 276–289.
Nyyssonen M, Piskonen R, and Itavaara M. (2006). A targeted
real-time PCR assay for studying naphthalene degradation in
the environment. Microb Ecol 52, 533–543.
Pace NR, Stahl DA, Lane DJ, and Olsen GJ. (1985). Analyzing
natural microbial populations by rRNA sequences. ASM
News 51, 4–12.
Paul D, Pandey G, Pandey J, and Jain RK. (2005). Accessing
microbial diversity for bioremediation and environmental
restoration. Trends Biotechnol 23, 135–142.
Penesyan A, Marshall-Jones Z, Holmstrom C, Kjelleberg S, and
Egan S. (2009). Antimicrobial activity observed among cul-
tured marine epiphytic bacteria reflects their potential as a
source of new drugs. FEMS Microbiol Ecol 69, 113–124.
Platonov AE, Shipulin GA, and Platonova OV. (2000). Multi-
locus sequence typing: A new method and the first results in
the genotyping of bacteria. Russ J Genet 36,481–487.
Powell SM, Ferguson SH, Bowman JP, and Snape I. (2006). Using
real-time PCR to assess changes in the hydrocarbon-degrading
microbial community in antarctic soil during bioremediation.
Microb Ecol 52, 523–532.
Pontes DS, Lima BCI, Chartone SE, and Maral Nascimento AM.
(2007). Molecular approaches: Advantages and artifacts in
assessing bacterial diversity. J Indust Microbiol Biotechnol 34,
463–473.
Raes J, Foerstner KU, and Bork P. (2007). Get the most out of
your metagenome: Computational analysis of environmental
sequence data. Curr Opin Microbiol 10, 490–498.
Rappe MS, and Giovannoni SJ. (2003). The uncultured microbial
majority. Ann Rev Microbiol 57, 369–394.
Rhee SK, Liu X, Wu L, Chong SC, Wan X, and Zhou J. (2004).
Detection of genes involved in biodegradation and biotrans-
formation in microbial communities by using 50- mer oligo-
nucleotide microarrays. Appl Environ Microbiol 70, 4303–4317.
Ritalahti KM, Amos BK, Sung Y, Wu Q, Koenigsberg SS, and
Loffler FE. (2006). Quantitative PCR targeting 16S rRNA and
reductive dehalogenase genes simultaneously monitors multiple
Dehalococcoides strains. Appl Environ Microbiol 72, 2765–2774.
Robertson LA, and Steer BA. (2004). Recent progress in bioca-
talyst discovery and optimization. Curr Opin Chem Biol 8,
141–149.
Sanz JL, and Kochling T. (2007). Molecular biology techniques
used in waste water treatment: An overview. Process Biochem
42, 119–133.
Schneiker S, Dos Santos V, Bartels D, et al. (2006). Genome se-
quence of the ubiquitous hydrocarbon-degrading marine
bacterium Alcanivorax borkumensis. Nature Biotechnol 24, 997–
1004.
Sebat JL, Colwell FS, and Crawford RL. (2003). Metagenomic
profiling: Microarray analysis of an environmental genomic
library. Appl Environ Microbiol 69, 4927–4934.
Shendure J, and Ji H. (2008). Next-generation DNA sequencing.
Nature Biotechnol 26, 1135–1114.
Sprenkels CJA, Bomer J, Molenaar D, et al. (2007). The micro-
petri dish, a million-well growth chip for the culture and high-
throughput screening of microorganisms. Proc Natl Acad Sci
USA 104, 18217–18222.
Stenuit B, Eyers L, Rozenberg R, Habib-Jiwan JL, and Agathos
SN. (2006). Aerobic growth of Escherichia coli with 2,4,6-
trinitrotoluene (TNT) as the sole nitrogen source and evidence
of TNT denitration by whole cells and cell-free extracts. Appl
Environ Microbiol 72, 7945–7948.
Stenuit B, Eyers L, Schuler L, Agathos SN, and George I. (2008).
Emerging high-throughput approaches to analyze bioreme-
diation of sites contaminated with hazardous and/or recalci-
trant wastes. Biotechnol Adv 26, 561–575.
Tettelin H, Riley D, Cattuto C, and Medini D. (2008). Com-
parative genomics: The bacterial pan-genome. Curr Opin
Microbiol 11, 472–477.
Thomas SH, Padilla-Crespo E, Jardine PM, Sanford RA, and
Loffler FE. (2009). Diversity and distribution of Anaeromyx-
obacter strains in a uranium-contaminated subsurface envi-
ronment with a non-uniform groundwater flow. Appl Environ
Microbiol 75, 3679–3687.
Toledo KG, Rappe M, Elkins J, Mathur EJ, Short JM, and Keller
M. (2002). Cultivating the uncultured. Proc Natl Acad Sci USA
99, 15681–15686.
Torsvik V, Ovreas L, and Thingstad TF. (2002). Prokaryotic
diversity-magnitude, dynamics, and controlling factors. Sci-
ence 296, 1064–1066.
Tringe SG, and Rubin EM. (2005). Metagenomics: DNA
sequencing of environmental samples. Nature Rev Genet 6,
805–814.
MICROBIAL BIOPROSPECTING 69
Urich T, Lanzen A, Qi J, Huson DH, Schleper C, and Schuster
SC. (2008). Simultaneous assessment of soil microbial com-
munity structure and function through analysis of the meta-
transcriptome. PloS ONE 3, e2527.
Uchiyama T, and Miyazaki K. (2010). Product-induced gene
expression, a product-responsive reporter assay used to screen
metagenomics libraries for enzyme-encoding genes. Appl
Environ Microbiol 76, 7029–7035.
Valenzuela L, Chian BS, Orell A, et al. (2006). Genomics, meta-
genomics and proteomics in biomining microorganisms. Bio-
technol Adv 24, 197–211.
Van Hamme JD, Singh A, and Ward OP. (2003). Recent advances
in petroleum microbiology. Microbiol Mol Biol Rev 67, 503–549.
Venter JC, Remington K, Heidelberg JF, et al. (2004). Environ-
mental genome shotgun sequencing of the Sargasso sea. Sci-
ence 304, 66–74.
Vinuesa P. (2010). Multilocus sequence analysis and bacterial
species phylogeny estimation. In: Molecular Phylogeny of Mi-
croorganisms. Oren A, and Papke RT. eds. Academic Press.
Wagner M, Nielsen PH, Loy A, Nielsen JL, and Daims H. (2006).
Linking microbial community structure with function: Fluor-
escence in situ hybridization-micro-autoradiography and iso-
tope arrays. Curr Opin Biotechnol 17, 83–91.
Widada J, Nojiri H, and Omori T. (2002). Recent developments
in molecular techniques for identification and monitoring of
xenobiotic-degrading bacteria and their catabolic genes in
bioremediation. Appl Environ Microbiol 60, 45–59.
Wilson VL, Tatford BC, Yin X, Rajki SC, Walsh MM, and Larock
P. (1999). Species specific detection of hydrocarbon-utilizing
bacteria. J Microbiol Methods 39, 59–78.
Williamson LL, Borlee BR, Schloss PD, Guan C, Allen HK, and
Handelsman J. (2005). Intracellular screen to identify meta-
genomic clones that induce or inhibit a quorum-sensing bio-
sensor. Appl Environ Microbiol 71, 6335–6344.
Wommack KE, Bhavsar J, and Ravel J. (2008). Metagenomics:
Read length matters. Appl Environ Microbiol 74, 1453–1463.
Wu L, Liu X, Schadt CW, and Zhou J. (2006). Microarray-based
analysis of subnanogram quantities of microbial community
DNAs by using whole-community genome amplification.
Appl Environ Microbiol 72, 4931–4941.
Wu L, Thompson DK, Liu X, et al. (2004). Development and
evaluation of microarray-based whole-genome hybridization
for detection of microorganisms within the context of envi-
ronmental applications. Environ Sci Technol 38, 6775–6782.
Wu L, Thompson DK, Li G, Hurt RA, Tiedje JM, and Zhou J.
(2001). Development and evaluation of functional gene arrays
for detection of selected genes in the environment. Appl En-
viron Microbiol 67, 5780–5790.
Yun J, and Ryu S. (2005). Screening for novel enzymes from
metagenome and SIGEX, as a way to improve it. Microb Cell
Factories 4, 8.
Zengler K, Toledo G, Rappe M, Elkins J, Mathur EJ, Short JM,
and Keller M. (2002). Cultivating the uncultured. Proc Natl
Acad Sci USA 99, 15681–15686.
Zengler K, Walcher M, Clark G, et al. (2005). High throughput
cultivation of microorganisms using microcapsules. Methods
Enzymol 397, 124–130.
Zhao B, and Poh CL. (2008). Insights into environmental biore-
mediation by microorganisms through functional genomics
and proteomics. Proteomics 8,874–881.
Zhou J. (2003). Microarrays for bacterial detection and microbial
community analysis. Curr Opin Microbiol 6, 288–294.
Address correspondence to:
Prof. V.V. Lakshmi
Department of Applied Microbiology
Sri Padmavati Women’s University
Tirupati 517502
India
E-mail: vedula_lak28@yahoo.co.in
70 AKONDI AND LAKSHMI
... Genetic differences can affect the characteristics of E. coli, especially in relation to the medical field. Therefore, molecular analysis based on genotypic traits is important to identify and characterize, study the evolution and epidemiology of the pathogenicity of a bacterium [10]. One of the molecular techniques used in this study is to use Enterobacterial Repetitive Intergenic Consensus-PCR (ERIC-PCR). ...
... ERIC-PCR is a DNA amplification method using ERIC sequences. The ERIC sequence is a short sequence (126 bp) with a conserved repeat area and a noncoding area, i.e., a sequence that is not encoded into protein [10] and is usually found in bacteria belonging to the family Enterobacteriaceae. This technique is used because it is simple, fast, and discriminatory [11]. ...
Article
Full-text available
BACKGROUND: Bali is a favorite tourism destination in the world. As a major tourist destination, the incidence of illness that afflicts tourists greatly affects the image of tourism. Diarrhea is a health problem that is most often experienced and is a major obstacle for foreign tourists when traveling, especially to Bali. Escherichia coli (E. coli) bacteria cause diarrhea more often than viruses in some developing countries. Genetic differences can affect the characteristics of E. coli, especially in relation to the medical field. AIM: We would like to assess the genetic diversity of the different pathogenic E. coli from various clinical isolates including those from traveler’s diarrhea in Bali, Indonesia. MATERIALS AND METHODS: One of the molecular techniques used in this study is to use enterobacterial repetitive intergenic consensus-polymerase chain reaction (ERIC-PCR). The sample in this study was the feces of foreign tourists with traveler’s diarrhea in Bali. This study carried out research procedures in the form of Isolation of E. coli genome DNA from culture, amplification of E. coli 16S rRNA encoding genes, sequencing of E. coli 16S rRNA encoding genes, phylogenetic tree construction, and then analysis of E. coli genetic diversity with ERIC-PCR sequences. RESULTS: The results showed that the ERIC-PCR method was more discriminatory than other methods to analyze the genetic diversity of E. coli from fecal samples of patients with traveler’s diarrhea. It was found that clonal variability based on the genetic similarity of all sample E. coli isolates varied from 0% to 100%. CONCLUSIONS: This shows that the source of transmission and the strains of E. coli that cause it comes from diverse populations.
... Not only for lipid production but it can also be applied to any bioproduct production using microbial sources. This approach is also applicable for plant and animal sources explaining the biochemical process in detail (Akondi & Lakshmi, 2013). These possibility studies could suggest all the researchers to frame the detailed skeleton for their in vitro studies and pilot-scale studies, involving their compound of interest. ...
Article
Full-text available
Yarrowia lipolytica is used as a model in this study to screen the potential candidates for inflating the innate lipid content of the cell. This study focuses on reducing the lipid degradation that occurs by the β-oxidation process and discursively increasing the innate lipid content. Acyl-CoA oxidase-1, the primary and initial enzyme involved in the lipid degradation pathway, was selected as a target and blocked using various lipid analogous compounds. The blocking study was carried out using molecular docking and dynamic studies using computation tools. The largest active site pocket located around the Phe-394 amino acid of the target protein is taken as a site for docking. The molecular docking was performed for the selected compounds (citric acid, Finsolv, lactic acid, oxalic acid, Tween-80 and Triton X-100) and the docking results were compared with the outcome of the standard molecule (octadecatrienoic acid). Citric acid, Finsolv, Tween-80 and Triton X-100 were found to be the potential candidates for blocking the target molecule in the static condition using docking studies, revealing a minimum binding energy requirement than the standard molecule. They were further taken for a dynamics study using GROMACS software. The RMSD, RMSF, number of hydrogen bond interactions and radius of gyration of the complex molecules were studied in a dynamic approach for 100 ns. Citric acid has been found to be the potential hit compound to block acyl-CoA oxidase-1 enzyme with its maximum hydrogen interaction and minimum fluctuations. It also revealed out the minimum total energy requirement for the complex formation.
... Regardless of the complication of reservoir ecosystem, advanced molecular, genomics, transcriptomics, metabolomics, and above all next-generation sequencing techniques are being applied to explore the in situ microbial diversity, function, and distribution. The abovementioned sophisticated community analysis tools are used to decipher the role of microorganisms in petroleum bioprospecting, bioremediation of oil contaminated sites, and in studying microbial activities for enhanced oil recovery (Akondi and Lakshmi 2013;Hu et al. 2016;Shekhar et al. 2020). Microbial communities in oil field and reservoir sites have been explored by culture-based and omics-mediated culture-free approach. ...
Article
Full-text available
Hydrocarbon is a primary source of energy in the current urbanized society. Considering the increasing demand, worldwide oil productions are declining due to maturity of oil fields and because of difficulty in discovering new oil fields to substitute the exploited ones. To meet current and future energy demands, further exploitation of oil resources is highly required. Microorganisms inhabiting in these areas exhibit highly diverse catabolic activities to degrade, transform, or accumulate various hydrocarbons. Enrichment of hydrocarbon-utilizing bacteria in oil basin is caused by continuous long duration and low molecular weight hydrocarbon microseepage which plays a very important role as an indicator for petroleum prospecting. The important microbial metabolic processes in most of the oil reservoir are sulfate reduction, fermentation, acetogenesis, methanogenesis, NO3⁻ reduction, and Fe (III) and Mn (IV) reduction. The microorganisms residing in these sites have critical control on petroleum composition, recovery, and production methods. Physical characteristics of heavy oil are altered by microbial biotransformation and biosurfactant production. Considering oil to be one of the most vital energy resources, it is important to have a comprehensive understanding of petroleum microbiology. This manuscript reviews the recent research work referring to the diversity of bacteria in oil field and reservoir sites and their applications for enhancing oil transformation in the target reservoir and geomicrobial prospecting scope for petroleum exploration.
... This includes cis-genic hybrids, where replicons are swapped between species [75] and more ambitious attempts to engineer synthetic nitrogen fixation in nonlegumes [76]. Inspired by Akondi and Lakshmi [77]. Figure 2. Here, we describe a set of practices that correspond to the four stages of invasion that will prevent microbial inocula from becoming invasive. ...
Article
The appeal of using microbial inoculants to mediate plant traits and productivity in managed ecosystems has increased over the past decade, because microbes represent an alternative to fertilizers, pesticides, and direct genetic modification of plants. Using microbes bypasses many societal and environmental concerns because microbial products are considered a more sustainable and benign technology. In our desire to harness the power of plant–microbial symbioses, are we ignoring the possibility of precipitating microbial invasions, potentially setting ourselves up for a microbial Jurassic Park? Here, we outline potential negative consequences of microbial invasions and describe a set of practices (Testing, Regulation, Engineering, and Eradication, TREE) based on the four stages of invasion to prevent microbial inoculants from becoming invasive. We aim to stimulate discussion about best practices to proactively prevent microbial invasions.
Conference Paper
Full-text available
Colletotrichum es el agente causal de la antracnosis en los cultivos de maracuyá generando importantes pérdidas económicas. Los microorganismos antagonistas representan una alternativa eficiente y biosegura al manejo químico de la antracnosis. Por ello, el objetivo de este trabajo fue aislar, caracterizar fenotípicamente y evaluar la actividad antagonista contra Colletotrichum gloesporoides ((Penz.) Penz. & Sacc) in vitro de 8 cepas de levaduras (M1-M8), obtenidas del frutoplano y filoplano del maracuyá, con el fin de contribuir al desarrollo de productos biocontroladores para el sector frutícola. Las evaluaciones fisiológicas mostraron que los morfotipos (M2, M5 y M7) fueron potenciales antagonistas con óptimo crecimiento en diferentes condiciones: pH 3-5, 30°C, 1-5% NaCl y asimilaron diferentes fuentes de carbono. Estos morfotipos presentaron baja actividad enzimática sobre polímeros vegetales. Además, de acuerdo a las pruebas fisiológicas y de inhibición, la cepa M2 es agente biocontrolador promisorio contra C. gloesporoides. Los análisis moleculares indican que esta cepa corresponde a Meyerozyma caribbica Kurtzman & M. Suzuki. Las cepas de esta especie han sido identificadas como potenciales biocontroladoras para la protección pre y poscosecha de frutales como mangos, aguacates, entre otros, siendo inocuas para el consumidor y seguras para el medio ambiente. Se recomienda continuar las investigaciones en M. caribbica para potenciar su uso comercial y así facilitar la exportación de frutas a mercados especializados en el exterior.
Chapter
The silkworm gut microbiome is very complex, dynamic and heterogeneous in nature. The beneficial gut microbiome plays essential roles in nutrient acquisition, digestion, absorption, growth, development, immunity response and environmental adaptation. Therefore it is essential to understand the dynamics of the gut microbial members and their interactions with the host insects. However, conventional culture-based methods used in the laboratories are insufficient, as they support the growth of less than 1% of microorganisms. Due to current progress in genomics and sequencing methodologies, microbial community studies using culture independent molecular procedures have begun a new epoch of insect microbial ecology. Recent developments in molecular “Omic techniques” such as metagenomics, and metatranscriptomics has gained a lot of momentum in silkworm microbiology and these techniques have been recently utilized to understand microbial community structure, services and their interactions with silkworms at the micro-scale. Omic techniques have revealed phenomenal microbial diversity in silkworm gut samples. The diversity and distribution of the gut microbiome is significantly influenced by larval developmental stages, gender, feeding, pesticides, environmental perturbations and pathogenesis. In this chapter, different high throughput sequencing technologies (molecular “Omic techniques”) and their limitations will be addressed. The present chapter provides an update on the recent technological advances and emerging trends in exploring gut microbial communities in silkworms towards translation research particularly to understand microbiome functions.
Chapter
Microorganisms are omnipresent and exhibit vast biodiversity and metabolic versatility. Such attributes allow microorganisms to flourish and pervade even in extreme environmental conditions. As observed, majority of microbes present in diverse environments are not culturable using standard laboratory and pure culture practices. Metagenomics is concerned with sampling and analysis of DNA of all species in a given microbial community and used to evade the limitations of culturing microbes by facilitating direct DNA extraction, sequencing and selection of binning methods for enriching microbial communities. In this chapter, structural and functional approaches of metagenomics will be discussed along with recent trends and applications such as those offered by bioinformatics tools and online databases with respect to next generation sequencing based metagenomics. Furthermore, insights to ‘Meta-Omics’ approaches- metatranscriptomics, metaproteomics and metabolomics to obtain information that complements the metagenomic data will be viewed.
Article
Full-text available
To investigate the extent of genetic stratification in structured microbial communities, we compared the metagenomes of 10 successive layers of a phylogenetically complex hypersaline mat from Guerrero Negro, Mexico. We found pronounced millimeter-scale genetic gradients that were consistent with the physicochemical profile of the mat. Despite these gradients, all layers displayed near-identical and acid-shifted isoelectric point profiles due to a molecular convergence of amino-acid usage, indicating that hypersalinity enforces an overriding selective pressure on the mat community.
Article
Full-text available
Low-biomass samples from nitrate and heavy metal contaminated soils yield DNA amounts that have limited use for direct, native analysis and screening. Multiple displacement amplification (MDA) using ?29 DNA polymerase was used to amplify whole genomes from environmental, contaminated, subsurface sediments. By first amplifying the genomic DNA (gDNA), biodiversity analysis and gDNA library construction of microbes found in contaminated soils were made possible. The MDA method was validated by analyzing amplified genome coverage from approximately five Escherichia coli cells, resulting in 99.2 percent genome coverage. The method was further validated by confirming overall representative species coverage and also an amplification bias when amplifying from a mix of eight known bacterial strains. We extracted DNA from samples with extremely low cell densities from a U.S. Department of Energy contaminated site. After amplification, small subunit rRNA analysis revealed relatively even distribution of species across several major phyla. Clone libraries were constructed from the amplified gDNA, and a small subset of clones was used for shotgun sequencing. BLAST analysis of the library clone sequences showed that 64.9 percent of the sequences had significant similarities to known proteins, and ''clusters of orthologous groups'' (COG) analysis revealed that more than half of the sequences from each library contained sequence similarity to known proteins. The libraries can be readily screened for native genes or any target of interest. Whole-genome amplification of metagenomic DNA from very minute microbial sources, while introducing an amplification bias, will allow access to genomic information that was not previously accessible.
Article
We designed a real-time PCR assay able to recognize dioxygenase large-subunit gene sequences with more than 90% similarity to the Ralstonia sp. strain U2 nagAc gene (nagAc-like gene sequences) in order to study the importance of organisms carrying these genes in the biodegradation of naphthalene. Sequencing of PCR products indicated that this real-time PCR assay was specific and able to detect a variety of nagAc-like gene sequences. One to 100 ng of contaminated-sediment total DNA in 25-mul reaction mixtures produced an amplification efficiency of 0.97 without evident PCR inhibition. The assay was applied to surficial freshwater sediment samples obtained in or in close proximity to a coal tar-contaminated Superfund site. Naphthalene concentrations in the analyzed samples varied between 0.18 and 106 mg/kg of dry weight sediment. The assay for nagAc-like sequences indicated the presence of (4.1 +/- 0.7) X 10(3) to (2.9 +/- 0.3) X 10(5) copies of nagAc-like dioxygenase genes per mug of DNA extracted from sediment samples. These values corresponded to (1.2 +/- 0.6) X 10(5) to (5.4 +/- 0.4) X 10(7) Copies of this target per g of dry weight sediment when losses of DNA during extraction were taken into account. There was a positive correlation between naphthalene concentrations and nagAc-like gene copies per microgram of DNA (r = 0.89) and per gram of dry weight sediment (r = 0.77). These results provide evidence of the ecological significance of organisms carrying nagAc-like genes in the biodegradation of naphthalene.
Article
Comparative characterization (molecular typing) of isolates within a bacterial species is one of the major problems in microbiology and epidemiology. However, it is rather difficult to correlate data obtained in various laboratories, because traditional, including molecular, methods employed in typing pathogenic microorganisms are difficult to standardize. In 1998, Maiden et al. proposed multilocus sequence typing (MLST); through which alleles of several housekeeping genes are directly assessed by nucleotide sequencing, each unique allele combination determining a sequence type of a strain. The advantages of this approach are that the culturing of pathogenic microorganisms could be excluded, as their gene fragments are amplified directly from biological samples, and that the sequencing data are unambiguous, easy to standardize, and electronically portable. The latter makes it possible to generate an expandable global database for each species at an Internet site, in order to use it for the purposes of genotyping pathogenic bacteria (and other infectious agents). MLST protocols have been elaborated for Neisseria meningitidis, Streptococcus pneumoniae, and Helicobacter pylori; those for Streptococcus pyogenes, Staphylococcus aureus, and Haemophilus influenzae are now being developed. Basic principles and the first results of MLST have been reviewed, including data on the distribution and microevolution of N. meningitidis clones causing epidemic meningococcal infection, the relative recombination and mutation rates in the N. meningitidis genome, the identification of antibiotic-resistant S. pneumoniae clones causing severe systemic infection, the grouping of H. pylori isolates from various geographic regions, etc.
Article
Although the applicability of small subunit ribosomal RNA (16S rRNA) sequences for bacterial classification is now well accepted, the general use of these molecules has been hindered by the technical difficulty of obtaining their sequences. A protocol is described for rapidly generating large blocks of 16S rRNA sequence data without isolation of the 16S rRNA or cloning of its gene. The 16S rRNA in bulk cellular RNA preparations is selectively targeted for dideoxynucleotide-terminated sequencing by using reverse transcriptase and synthetic oligodeoxynucleotide primers complementary to universally conserved 16S rRNA sequences. Three particularly useful priming sites, which provide access to the three major 16S rRNA structural domains, routinely yield 800-1000 nucleotides of 16S rRNA sequence. The method is evaluated with respect to accuracy, sensitivity to modified nucleotides in the template RNA, and phylogenetic usefulness, by examination of several 16S rRNAs whose gene sequences are known. The relative simplicity of this approach should facilitate a rapid expansion of the 16S rRNA sequence collection available for phylogenetic analyses.