ArticlePDF Available

PDTD: A web-accessible protein database for drug target identification

Authors:

Abstract and Figures

Target identification is important for modern drug discovery. With the advances in the development of molecular docking, potential binding proteins may be discovered by docking a small molecule to a repository of proteins with three-dimensional (3D) structures. To complete this task, a reverse docking program and a drug target database with 3D structures are necessary. To this end, we have developed a web server tool, TarFisDock (Target Fishing Docking) http://www.dddc.ac.cn/tarfisdock, which has been used widely by others. Recently, we have constructed a protein target database, Potential Drug Target Database (PDTD), and have integrated PDTD with TarFisDock. This combination aims to assist target identification and validation. PDTD is a web-accessible protein database for in silico target identification. It currently contains >1100 protein entries with 3D structures presented in the Protein Data Bank. The data are extracted from the literatures and several online databases such as TTD, DrugBank and Thomson Pharma. The database covers diverse information of >830 known or potential drug targets, including protein and active sites structures in both PDB and mol2 formats, related diseases, biological functions as well as associated regulating (signaling) pathways. Each target is categorized by both nosology and biochemical function. PDTD supports keyword search function, such as PDB ID, target name, and disease name. Data set generated by PDTD can be viewed with the plug-in of molecular visualization tools and also can be downloaded freely. Remarkably, PDTD is specially designed for target identification. In conjunction with TarFisDock, PDTD can be used to identify binding proteins for small molecules. The results can be downloaded in the form of mol2 file with the binding pose of the probe compound and a list of potential binding targets according to their ranking scores. PDTD serves as a comprehensive and unique repository of drug targets. Integrated with TarFisDock, PDTD is a useful resource to identify binding proteins for active compounds or existing drugs. Its potential applications include in silico drug target identification, virtual screening, and the discovery of the secondary effects of an old drug (i.e. new pharmacological usage) or an existing target (i.e. new pharmacological or toxic relevance), thus it may be a valuable platform for the pharmaceutical researchers. PDTD is available online at http://www.dddc.ac.cn/pdtd/.
Content may be subject to copyright.
BioMed Central
Page 1 of 7
(page number not for citation purposes)
BMC Bioinformatics
Open Access
Database
PDTD: a web-accessible protein database for drug target
identification
Zhenting Gao1,3, Honglin Li*1,2, Hailei Zhang2, Xiaofeng Liu1, Ling Kang2,
Xiaomin Luo1, Weiliang Zhu1, Kaixian Chen1, Xicheng Wang*2 and
Hualiang Jiang*1,3
Address: 1Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of
Sciences, Shanghai 201203, China, 2Department of Engineering Mechanics, State Key Laboratory of Structural Analysis for Industrial Equipment,
Dalian University of Technology, Dalian 116023, China and 3School of Pharmacy, East China University of Science and Technology, Shanghai
200237, China
Email: Zhenting Gao - zhentg@mail.shcnc.ac.cn; Honglin Li* - hlli@mail.shcnc.ac.cn; Hailei Zhang - hailei@nus.edu.sg;
Xiaofeng Liu - xxffliu@gmail.com; Ling Kang - klxju@163.com; Xiaomin Luo - xmluo@mail.shcnc.ac.cn;
Weiliang Zhu - wlzhu@mail.shcnc.ac.cn; Kaixian Chen - kxchen@mail.shcnc.ac.cn; Xicheng Wang* - guixum@dlut.edu.cn;
Hualiang Jiang* - hljiang@mail.shcnc.ac.cn
* Corresponding authors
Abstract
Background: Target identification is important for modern drug discovery. With the advances in the development of
molecular docking, potential binding proteins may be discovered by docking a small molecule to a repository of proteins
with three-dimensional (3D) structures. To complete this task, a reverse docking program and a drug target database
with 3D structures are necessary. To this end, we have developed a web server tool, TarFisDock (Target Fishing Docking)
http://www.dddc.ac.cn/tarfisdock, which has been used widely by others. Recently, we have constructed a protein target
database, Potential Drug Target Database (PDTD), and have integrated PDTD with TarFisDock. This combination aims
to assist target identification and validation.
Description: PDTD is a web-accessible protein database for in silico target identification. It currently contains >1100
protein entries with 3D structures presented in the Protein Data Bank. The data are extracted from the literatures and
several online databases such as TTD, DrugBank and Thomson Pharma. The database covers diverse information of >830
known or potential drug targets, including protein and active sites structures in both PDB and mol2 formats, related
diseases, biological functions as well as associated regulating (signaling) pathways. Each target is categorized by both
nosology and biochemical function. PDTD supports keyword search function, such as PDB ID, target name, and disease
name. Data set generated by PDTD can be viewed with the plug-in of molecular visualization tools and also can be
downloaded freely. Remarkably, PDTD is specially designed for target identification. In conjunction with TarFisDock,
PDTD can be used to identify binding proteins for small molecules. The results can be downloaded in the form of mol2
file with the binding pose of the probe compound and a list of potential binding targets according to their ranking scores.
Conclusion: PDTD serves as a comprehensive and unique repository of drug targets. Integrated with TarFisDock,
PDTD is a useful resource to identify binding proteins for active compounds or existing drugs. Its potential applications
include in silico drug target identification, virtual screening, and the discovery of the secondary effects of an old drug (i.e.
new pharmacological usage) or an existing target (i.e. new pharmacological or toxic relevance), thus it may be a valuable
platform for the pharmaceutical researchers. PDTD is available online at http://www.dddc.ac.cn/pdtd/.
Published: 19 February 2008
BMC Bioinformatics 2008, 9:104 doi:10.1186/1471-2105-9-104
Received: 14 August 2007
Accepted: 19 February 2008
This article is available from: http://www.biomedcentral.com/1471-2105/9/104
© 2008 Gao et al; licensee BioMed Central Ltd.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0),
which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
BMC Bioinformatics 2008, 9:104 http://www.biomedcentral.com/1471-2105/9/104
Page 2 of 7
(page number not for citation purposes)
Background
Until 2000, only ~500 drug targets had been reported [1],
among which only 120 drug targets are actually marketed
[2]. The completion of human genome and numerous
pathogen genomes suggests that there are 30,000 to
40,000 genes and at least the same number of proteins,
and many of these proteins are potential targets for drug
discovery. It has been estimated that there are more than
2,000 potential drug targets with at least one drug candi-
date in clinical trial [2,3]. This is a reservoir for drug dis-
covery and target identification. However, how to
extensively utilize this source is a challenge. Expressing all
these proteins and screening compounds against the cor-
responding models constructed based on the proteins is
extremely unpractical, because it is intolerably expensive
and time-consuming. Recent promising advancement in
docking-based virtual screening has demonstrated the
efficiency of this approach in discovering lead (active)
compounds [4,5]. On the other hand, reverse (or inverse)
docking approaches have become promising computa-
tional tools to find the probable target proteins for active
compounds, natural products or old drugs [6-10]. Both
these two researches need the information of target pro-
teins, in particular the information of structures and active
sites. However, such information of most drug targets is
dispersedly deposited in the literatures or other databases
like Protein Data Bank (PDB). Therefore, it is in dire need
of a database containing comprehensive information of
the potential target proteins.
Recently, some notable efforts have been made to par-
tially satisfy this requirement. The Therapeutic Target
Database (TTD) is one such example [11], which provides
information about the known therapeutic targets, disease
conditions and the corresponding drugs. DrugBank is a
bioinformatics/cheminformatics resource that combines
detailed drug data with comprehensive drug target infor-
mation [12]. A number of ligand-protein interaction data-
bases have also emerged including LigBase [13], PDBsite
[14], SitesBase [15], MSDsite [16], PDB-Ligand [17] and
AffinDB [18]. Unfortunately, these databases were not
specifically designed for discovering new leads by using
virtual screening approaches and new targets by using
reverse docking. They also cannot be used to figure out
specific pharmaceutical information related to the sec-
ondary effects of an old drug (i.e. new pharmacological
usage) or an existing target (i.e. new pharmacological or
toxic relevance). Ideally, a target database may provide
not only abundant information about the potential target
proteins such as 3D structures, binding (active) sites, bio-
logical (pharmacological) functions, related diseases, but
also appropriate computational tools to mine the infor-
mation about targets. Herein, we present a web-accessible
protein database, PDTD (Potential Drug Target Database).
Integrated with our reverse docking server, TarFisDock [8],
PDTD is a valuable platform for target identification.
Construction and content
Fundamentally, PDTD has dual functions of querying
drug target information and identifying the potential
binding proteins of an active compound or an existing
drug by using reverse docking approach. Accordingly,
PDTD contains two sub-databases types, one is the struc-
tural sub-database and another is the informatics sub-
database. All data are associated with a relational database
implemented using MySQL and can be queried through
web interface. Through three computational engines,
search engine, visualization engine and TarFisDock, users
can implement interactive query and computation with
the PDTD (Figure 1). The structural sub-database stores
each protein in both PDB format and mol2 format with
Amber charges; sequence and active site information were
also included in the structural sub-database. The infor-
matics database stores the data of target categories, related
disease information, biological functions and associated
regulating (signaling) pathways. PDTD currently contains
>1100 entries covering the information of >800 known
drug targets.
The target proteins in PDTD were selected from scientific
literatures [1,19-21] and several online databases such as
TTD [11], DrugBank [12] and Thomson Pharma [22].
Since PDTD is designed to search the probable binding
proteins for new active compounds or existing drugs by
using reverse docking, it only contains the proteins with
known 3D structures determined experimentally by the X-
ray crystallographic or NMR methods. The coordinates of
proteins were isolated from the PDB. Since not all PDB
structures are of equal quality, a protein structure is
selected according to the following criteria when it has
several redundant records in PDB: (i) select the structure
without mutation and missing residues around the active
site; (ii) select the structure with high resolution; (iii)
select the structure complexed with ligand. For each
selected protein in PDTD, amino acid residues within 6.5
Å around the bound ligand were used to define the bind-
ing (active) site. A PDB entry could contain data on a
number of binding sites. If so, a separate entry was gener-
ated in the PDTD to accommodate each of the sites.
HETATM records in PDB files were used to define the lig-
ands. PDB and mol2 files of each protein were also stored
in the structural sub-database. All kinds of structures for a
drug target can be visualized using the "Jmol" JAVA applet
[23].
Most of drug targets in PDTD have been collected with
single structure (709 cases). Since our reverse docking pro-
gram, TarFisDock, has not considered the flexibility of
proteins, PDTD contains some redundancy for the flexible
BMC Bioinformatics 2008, 9:104 http://www.biomedcentral.com/1471-2105/9/104
Page 3 of 7
(page number not for citation purposes)
proteins. For example, HIV-1 protease has 27 entries and
dihydrofolate reductase (DHFR) has 14 entries in the
database. The redundancy of each target is shown in Fig-
ure 2a.
According to therapeutic areas, the drug targets may be
categorized into 14 types (Figure 2b). It is convenient for
users to custom a special list when they predict potential
binding targets for small molecules using TarFisDock.
Among targets having explicit therapeutic functions, the
targets related neoplastic disease are most populated, fol-
lowing are hormones and hormones antagonists related
targets. Targets related to viral infections are also major
contributors in PDTD. The distribution of biochemical
classification is shown in Figure 2c, indicating that PDTD
mainly consists of enzymes, receptors, ion channels,
transporters, nuclear receptors, binding protein, structural
proteins, signaling proteins, factors, regulators and hor-
mones. The targets which can not be assigned into any of
these biochemical classes are grouped into an "unknown"
class. The selected drug targets are enriched in enzymes
(80.2%). G-protein coupled receptors (GPCR) and other
receptors which account for most drug targets seldom
have crystal structures, resulting that the ratio of receptor
targets in PDTD is only 4.2%.
Utility and Discussion
Web interface: query, download and exploration
PDTD is supported with a friendly designed web interface
so that users can easily query the target information, and
retrieve, visualize or download the distributions of the
drug target files as they desire (Figure 3). PDTD has been
designed to provide fast and easy access to target informa-
tion. The popular MySQL backend was chosen as database
server. Using the scripting language PHP, special care was
taken to generate a clearly structured layout which enables
fast and easy navigation.
All the data can be accessed and retrieved directly via the
web browser, PDTD consists of a classification table and a
keyword search box. The user can search a drug target
manually from the classification table, or automatically
by using the keyword search function, such as PDB ID, tar-
get name, or disease. Every target has its own result page
containing comprehensive information including PDB
ID, target name, target category, related disease, its struc-
ture, and active site. The PDTD was carefully annotated
according to information found in the PDB, UniProt [24],
KEGG [25] and Enzyme Structures Database [26]. PDTD
also provides hyperlinks to other databases like TTD and
DrugBank, which allow easy navigation for more infor-
mation about target structure, source and function (See
Links to other databases). The related structures for each
PDTD system architectureFigure 1
PDTD system architecture. The system is implemented in MySQL and PHP script, user can freely access the database at
http://www.dddc.ac.cn/pdtd/.
BMC Bioinformatics 2008, 9:104 http://www.biomedcentral.com/1471-2105/9/104
Page 4 of 7
(page number not for citation purposes)
Functional and biochemical classifications of PDTD protein entriesFigure 2
Functional and biochemical classifications of PDTD protein entries. (A) Database redundancy. Each bar represents
the number of targets that have the same amounts of copies in the PDTD. Break is applied to the y axes. (B) Distribution of
drug targets according to their therapeutic areas. (C) Distribution of drug targets according to their biochemical criteria, which
include enzymes, receptors, ion channels, transporters, nuclear receptors, binding protein, structural proteins, signaling pro-
tein, and factors, regulators and hormones.
BMC Bioinformatics 2008, 9:104 http://www.biomedcentral.com/1471-2105/9/104
Page 5 of 7
(page number not for citation purposes)
Screen shots of the PDTDFigure 3
Screen shots of the PDTD. A screen shot of the PDTD showing several possible view of information describing the drug
target. Not all fields are shown.
BMC Bioinformatics 2008, 9:104 http://www.biomedcentral.com/1471-2105/9/104
Page 6 of 7
(page number not for citation purposes)
drug target can be downloaded freely from the detailed
page by clicking on "MORE" button. Furthermore, user
can download the classified target structures and all target
files from the "Download" page.
Also, users can customize the list of drug targets in which
they want to perform reverse docking process to predict
the potential binding targets for any small molecule,
which to our knowledge is not provided by other public
websites. Consequently, TarFisDock will output the list of
results with binding poses of molecules against each tar-
gets, along with corresponding information, disease,
annotation and links to other databases, which are also
presented (See the Applications below).
Applications
In bringing together the reverse docking server TarFisDock
[8], PDTD has been widely used to identify binding pro-
teins for small molecule. The binding proteins for several
molecules have been verified through bioassay and crystal
structure determination for target-ligand complexes [9].
In general, one drug molecule may interact with several
targets including targets associated with side effect (toxic-
ity). TarFisDock provides multiple options for selecting
protein targets. These clues are useful for further experi-
mental test in discovering new pharmacological efficacy
or toxicity for an existing drug. In general, combining with
PDTD, TarFisDock web sever is a convenient tool for
"fishing" the target proteins of small molecules, the user
just inputs the structure of querying compound and cus-
tomizes a target list from PDTD (a list of all the targets is
recommended). The results can be downloaded in the
form of mol2 file with the binding pose of each com-
pound and a list of potential binding targets according to
their ranking scores.
In addition, benchmark searches for the old version of
PDTD (698 entries) were performed using TarFisDock
taking vitamin E, an anti-oxidant, and 4H-tamoxifen, an
anti-cancer agent as probes [8]. In this study, similar
benchmark searches for the current version of PDTD
(1186 entries) have been carried out. For vitamin E, eight
(12 entries) of the twelve experimentally verified targets
fall into the top 10% candidates picked up from the PDTD
by the TarFisDock program. For 4H-tamoxifen, five (14
entries) of the eleven experimentally confirmed targets
appear amongst the top 10% of the TarFisDock predicted
candidates. In addition, the PDTD was searched by the
TarFisDock using N-trans-caffeoyltyramine (compound
1), an active natural product discovered by anti-H. pylori
screening in our lab, as a probe in the previous research
[9]. Homology search revealed that, among the fifteen
candidates discovered by reverse docking, diami-
nopimelate decarboxylase (DC) and peptide deformylase
(PDF) are possible binding proteins of compound 1.
Enzymatic assay demonstrated compound 1 and its deriv-
ative compound 2 are the potent inhibitors against the H.
pylori PDF (HpPDF) with IC50 values of 10.8 and 1.25 µM,
respectively. X-ray crystal structures of HpPDF and the
complexes of HpPDF with 1 and 2 were determined, indi-
cating that these two inhibitors bind well with the HpPDF
binding pocket [9]. To exemplify the applications of
PDTD combining with TarFisDock, the brief results of
these three benchmark examples have been uploaded to
the PDTD homepage under the "Benchmark" option.
Links to other databases
General links are given to related drug and target informa-
tion with other databases [11,12]. Each data in PDTD is
linked to the Protein Data Bank, DrugBank, there are also
hypertext links to UniProt [24], Kegg [25] and Enzyme
Structures Database [26] for further structural and func-
tional information.
Conclusion
In summary, PDTD is a comprehensive, web-accessible
database of drug targets, which focuses on those drug tar-
gets with known 3D-structures. By far, PDTD has collected
>1100 entries covering >800 known and potential drug tar-
gets from the Protein Data Bank. PDB structure, mol2 file
and active site information of each drug target were
extracted from the crystal structure, and all the information
can be viewed with molecular visualization tools and can
be downloaded freely by users. Drug targets of PDTD were
categorized by two criteria: therapeutic areas and biochem-
ical criteria. Each target was carefully annotated by brows-
ing several databases, such as DrugBank, TTD, and UniProt.
All these information were stored in informatics sub-data-
base, which was associated to structural sub-database with
a relational database. Users can also use our reverse docking
program to search PDTD for finding the possible binding
protein(s) of a small molecule.
One drug molecule may interact with several targets includ-
ing targets associated with side effect (toxicity). By search-
ing PDTD, TarFisDock may provide multiple options of the
binding proteins for a small molecule. These clues are use-
ful for further experimental test in discovering new targets
and new pharmacological efficacy or toxicity for an existing
drug. Thus, combining with TarFisDock, PDTD is a good
web-accessible protein database for identifying drug targets
and for discovering new usages of old drugs [27,28]. The
user just inputs the structure of querying compound and
customizes a target list from PDTD (a list of all the targets
is recommended), TarFisDock may provide possible bind-
ing proteins of the compound. The results can be down-
loaded in the form of mol2 file with the binding pose of
each compound and a list of potential binding targets
according to their ranking scores.
BMC Bioinformatics 2008, 9:104 http://www.biomedcentral.com/1471-2105/9/104
Page 7 of 7
(page number not for citation purposes)
PDTD will be updated continuously. We intend to classify
the drug targets according more completely to their biolog-
ical functions, which will be achieved by integrating and/or
linking PDTD with other bioinformatics databases. For
example, links can be directed to the databases of SOURCE
[29] and Gene Ontology [30] for more descriptions of
functional annotations, ontologies, and gene expression
data.
Availability and requirements
PDTD is freely available for academic user at http://
www.dddc.ac.cn/pdtd. To download the files of PDTD,
users must complete a simple registration process and agree
not to republish the data without explicit permission. Users
are invited to contact us through the 'Contact' link and to
participate in the user forum at http://www.dddc.ac.cn/tar
fisdock/forum/.
Authors' contributions
ZG developed the web interface, designed the relational
database scheme, and integrated the database-PDTD with
the reverse docking program-TarFisDock. HL developed the
reverse docking program, participated in the design of web
interface, and contributed to writing the manuscript. HZ,
XL, and LK were major data contributors of the current sys-
tem. XL, WZ and KC provided comments and suggestions
about the features of the database. XW and HJ conceived
the idea of the database, provided direction for its develop-
ment and revised the subsequent drafts of this manuscript.
All authors read and approved the final manuscript.
Acknowledgements
We thank all the colleagues in Drug Discovery and Design Center for their
contributions in literature searching and necessary dealing with files. The
work was partly supported by the Special Fund for Major State Basic
Research Project (grant 2002CB512802), the National Natural Science Foun-
dation of China (grants 20721003 and 10572033), and the 863 Hi-Tech Pro-
gram of China (grant 2007AA02Z304). HL was also sponsored by the
Shanghai Postdoctoral Scientific Program.
References
1. Drews J: Drug discovery: a historical perspective. Science 2000,
287(5460):1960-1964.
2. Hopkins AL, Groom CR: The druggable genome. Nat Rev Drug Dis-
cov 2002, 1(9):727-730.
3. Russ AP, Lampel S: The druggable genome: an update. Drug Dis-
cov Today 2005, 10(23-24):1607-1610.
4. Kitchen DB, Decornez H, Furr JR, Bajorath J: Docking and scoring
in virtual screening for drug discovery: methods and applica-
tions. Nat Rev Drug Discov 2004, 3(11):935-949.
5. Sperandio O, Miteva MA, Delfaud F, Villoutreix BO: Receptor-based
computational screening of compound databases: the main
docking-scoring engines. Curr Protein Pept Sci 2006, 7(5):369-393.
6. Chen YZ, Zhi DG: Ligand-protein inverse docking and its poten-
tial use in the computer search of protein targets of a small
molecule. Proteins 2001, 43(2):217-226.
7. Paul N, Kellenberger E, Bret G, Muller P, Rognan D: Recovering the
true targets of specific ligands by virtual screening of the pro-
tein data bank. Proteins 2004, 54(4):671-680.
8. Li H, Gao Z, Kang L, Zhang H, Yang K, Yu K, Luo X, Zhu W, Chen K,
Shen J, Wang X, Jiang H: TarFisDock: a web server for identifying
drug targets with docking approach. Nucleic Acids Res 2006,
34(Web Server issue):W219-24.
9. Cai J, Han C, Hu T, Zhang J, Wu D, Wang F, Liu Y, Ding J, Chen K, Yue
J, Shen X, Jiang H: Peptide deformylase is a potential target for
anti-Helicobacter pylori drugs: reverse docking, enzymatic
assay, and X-ray crystallography validation. Protein Sci 2006,
15(9):2071-2081.
10. Muller P, Lena G, Boilard E, Bezzine S, Lambeau G, Guichard G, Rognan
D: In silico-guided target identification of a scaffold-focused
library: 1,3,5-triazepan-2,6-diones as novel phospholipase A2
inhibitors. J Med Chem 2006, 49(23):6768-6778.
11. Chen X, Ji ZL, Chen YZ: TTD: Therapeutic Target Database.
Nucleic Acids Res 2002, 30(1):412-415.
12. Wishart DS, Knox C, Guo AC, Shrivastava S, Hassanali M, Stothard P,
Chang Z, Woolsey J: DrugBank: a comprehensive resource for in
silico drug discovery and exploration. Nucleic Acids Res 2006,
34(Database issue):D668-72.
13. Stuart AC, Ilyin VA, Sali A: LigBase: a database of families of
aligned ligand binding sites in known protein sequences and
structures. Bioinformatics 2002, 18(1):200-201.
14. Ivanisenko VA, Pintus SS, Grigorovich DA, Kolchanov NA: PDBSite:
a database of the 3D structure of protein functional sites.
Nucleic Acids Res 2005, 33(Database issue):D183-7.
15. Gold ND, Jackson RM: SitesBase: a database for structure-based
protein-ligand binding site comparisons. Nucleic Acids Res 2006,
34(Database issue):D231-4.
16. Velankar S, McNeil P, Mittard-Runte V, Suarez A, Barrell D, Apweiler
R, Henrick K: E-MSD: an integrated data resource for bioinfor-
matics. Nucleic Acids Res 2005, 33(Database issue):D262-5.
17. Shin JM, Cho DH: PDB-Ligand: a ligand database based on PDB
for the automated and customized classification of ligand-
binding structures. Nucleic Acids Res 2005, 33(Database
issue):D238-41.
18. Block P, Sotriffer CA, Dramburg I, Klebe G: AffinDB: a freely acces-
sible database of affinities for protein-ligand complexes from
the PDB. Nucleic Acids Res 2006, 34(Database issue):D522-6.
19. Bonday ZQ, Dhanasekaran S, Rangarajan PN, Padmanaban G: Import
of host delta-aminolevulinate dehydratase into the malarial
parasite: identification of a new drug target. Nat Med 2000,
6(8):898-903.
20. Gibbs JB: Mechanism-based target identification and drug dis-
covery in cancer research. Science 2000, 287(5460):1969-1973.
21. Hardman JG, Limbird LE, Gilman AG: Goodman and Gilman's The
Pharmacological Basis of Therapeutics. 10th edition. New York
, McGraw-Hill ; 2001.
22. Thomson Pharma [http://www.thomson-pharma.com]
23. Jmol: an open-source Java viewer for chemical structures in
3D [http://www.jmol.org/]
24. Apweiler R, Bairoch A, Wu CH, Barker WC, Boeckmann B, Ferro S,
Gasteiger E, Huang H, Lopez R, Magrane M, Martin MJ, Natale DA,
O'Donovan C, Redaschi N, Yeh LS: UniProt: the Universal Protein
knowledgebase. Nucleic Acids Res 2004, 32(Database
issue):D115-9.
25. Kanehisa M, Goto S, Kawashima S, Okuno Y, Hattori M: The KEGG
resource for deciphering the genome. Nucleic Acids Res 2004,
32(Database issue):D277-80.
26. Enzyme Structures Database [http://www.ebi.ac.uk/thornton-
srv/databases/enzymes/]
27. O'Connor KA, Roth BL: Finding new tricks for old drugs: an effi-
cient route for public-sector drug discovery. Nat Rev Drug Discov
2005, 4(12):1005-1014.
28. Chong CR, Sullivan DJ Jr.: New uses for old drugs. Nature 2007,
448(7154):645-646.
29. Diehn M, Sherlock G, Binkley G, Jin H, Matese JC, Hernandez-Boussard
T, Rees CA, Cherry JM, Botstein D, Brown PO, Alizadeh AA:
SOURCE: a unified genomic resource of functional annota-
tions, ontologies, and gene expression data. Nucleic Acids Res
2003, 31(1):219-223.
30. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis
AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L,
Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM,
Sherlock G: Gene ontology: tool for the unification of biology.
The Gene Ontology Consortium. Nat Genet 2000, 25(1):25-29.
... The imidazo [1,2-c] quinazolines (4a-h) and (5a-b), when underwent the oxidative dehydrogenation conditions, i.e., potassium permanganate and silica gel, using acetonitrile at room temperature separately, afforded the respective targeted products (6a-i) in high yields [38]. A similar study was carried out by Claudi et al. following exhaustive dehydrogenation [39]. ...
... Triazoloquinazoline constitutes a pharmacologically interesting class of compounds showing a diverse range of biological profiles. This class of compounds and their derivatives have shown prominent biological activities such as anti-hypertonic activity [31], antirheumatic and antianaphylactic activity [21], anti-hypertensive [32], neuro-stimulating activity [21], anti-inflammatory [33], antiviral [34], anti-fungal [35], anti-microbial [35], anti-bacterial [36], anti-oxidant [37], anti-convulsant [38], adenosine receptor antagonists [39], and significant cytotoxic activities [40,41]. In summary, triazoloquinazoline is an important class of organic compounds that has drawn attention to its potential as a pharmacologically active agent. ...
... The condensation of hydrazine derivative 30 with orthoesters produced compound 34 which was modified using a pinacol ester derivative 35 to afford the product 36 [48]. Different researchers employed the intermediate 31 to get various substituted triazoloquinazoline derivatives (37,38,39,40) under different reaction conditions [12,38,[49][50][51] ...
Book
Full-text available
This book investigates and identifies novel therapeutic compounds for the treatment of a range of illnesses. Heterocyclic compounds are a significant class of substances with biological activity. Among them, quinazoline has attracted a lot of interest because of its important biological properties. Numerous compounds with quinazoline moiety have been shown to exhibit a wide range of therapeutic properties, including antioxidant, antifungal, antiviral, antidiabetic, anticancer, anti-inflammatory, and antibacterial activities. This book presents a comprehensive overview of quinazoline and its derivatives. The chapters address recent advances in the synthesis of several different heterocyclic compounds, the use of computational studies for finding new active quinazoline derivatives, the biological activities of quinazoline, and much more.
... Currently, it has 9283 entries of binding site from 3678 proteins and 5608 ligands. In addition to that, potential drug target database (PDTD) is an alternative web with accessible target database for small molecules target identification [39]. The target proteins of PDTD are collected from both scientific studies and other databases containing therapeutic target database (TTD), Drug Bank, and Thomson Pharma. ...
Chapter
Full-text available
Quinazoline derivatives have shown promising pharmacological activities against various diseases, including cancer, inflammation, and cardiovascular disorders. Computational studies have become an important tool in the discovery and optimization of new quinazoline derivatives. In this chapter, the importance and application of computational studies in finding new active quinazoline derivatives were discussed. The various computational techniques, such as molecular docking, molecular dynamics simulations, quantum mechanics calculations, and machine learning algorithms, which have been used to predict the biological activities and optimize the structures of quinazoline derivatives, were described. Examples of successful applications of computational studies in the discovery of new quinazoline derivatives with improved pharmacological activities were added. Overall, computational studies have proven to be valuable in the development of new quinazoline derivatives and have the potential to accelerate the drug discovery process.
Chapter
Large datasets are frequently gathered, stored and analysed in the big data age with the goal of guiding biological discoveries and validating hypotheses. There is no question that the introduction of new technologies and open data initiatives has significantly enhanced the volume and diversity of data. The whole drug development process uses big data, from identifying targets and mechanisms of action to finding new leads and therapeutic candidates. With the intention of giving readers a broad overview of the computing resources and databases accessible, these approaches are shown and explored. We believe that big data leveraging should prioritize personalized care and be cost-effective. On the basis of their synergy, we suggest combining information technologies with (chemo) informatics tools to accomplish this.
Chapter
The pipeline of drug discovery consists of a number of processes; drug–target interaction determination is one of the salient steps among them. Computational prediction of drug–target interactions can facilitate in reducing the search space of experimental wet lab-based verifications steps, thus considerably reducing time and other resources dedicated to the drug discovery pipeline. While machine learning-based methods are more widespread for drug–target interaction prediction, network-centric methods are also evolving. In this chapter, we focus on the process of the drug–target interaction prediction from the perspective of using machine learning algorithms and the various stages involved for developing an accurate predictor.Key wordsDrug–target interactions SMOTE Feature engineering Molecular descriptors Genomic space
Article
Full-text available
Traditional Chinese medicine (TCM) is characterized by multi-components, multiple targets, and complex mechanisms of action and therefore has significant advantages in treating diseases. However, the clinical application of TCM prescriptions is limited due to the difficulty in elucidating the effective substances and the lack of current scientific evidence on the mechanisms of action. In recent years, the development of network pharmacology based on drug systems research has provided a new approach for understanding the complex systems represented by TCM. The determination of drug targets is the core of TCM network pharmacology research. Over the past years, many web tools for drug targets with various features have been developed to facilitate target prediction, significantly promoting drug discovery. Therefore, this review introduces the widely used web tools for compound-target interaction prediction databases and web resources in TCM pharmacology research, and it compares and analyzes each web tool based on their basic properties, including the underlying theory, algorithms, datasets, and search results. Finally, we present the remaining challenges for the promising future of compound-target interaction prediction in TCM pharmacology research. This work may guide researchers in choosing web tools for target prediction and may also help develop more TCM tools based on these existing resources.
Article
Drug discovery and development is a time-consuming and costly procedure that necessitates a substantial effort. Drug repurposing has been suggested as a method for developing medicines that takes less time than developing brand new medications and will be less expensive. Also known as drug repositioning or re-profiling, this strategy has been in use from the time of serendipitous drug discoveries to the modern computer aided drug designing and use of computational chemistry. In the light of the COVID-19 pandemic too, drug repurposing emerged as a ray of hope in the dearth of available medicines. Data availability by electronic recording, libraries, and improvements in computational techniques offer a vital substrate for systemic evaluation of repurposing candidates. In the not-too-distant future, it could be possible to create a global research archive for us to access, thus accelerating the process of drug development and repurposing. This review aims to present the evolution, benefits and drawbacks including current approaches, key players and the legal and regulatory hurdles in the field of drug repurposing. The vast quantities of available data secured in multiple drug databases, assisting in drug repurposing is also discussed.
Chapter
Completely revised and updated, the 2nd edition of The Handbook of Medicinal Chemistry draws together contributions from authoritative practitioners to provide a comprehensive overview of the field as well as insight into the latest trends and research. An ideal companion for students in medicinal chemistry, drug discovery and drug development, while also communicating core principles, the book places the discipline within the context of the burgeoning platform of new modalities now available to drug discovery. The book also highlights the role chemistry has to play in wider target validation and translational technologies. This is a carefully curated compilation of writing from global experts using their broad experience of medicinal chemistry, project leadership and drug discovery and development from an industry, academic and charity perspective to provide unparalleled insight into the field.
Article
Full-text available
Genomic sequencing has made it clear that a large fraction of the genes specifying the core biological functions are shared by all eukaryotes. Knowledge of the biological role of such shared proteins in one organism can often be transferred to other organisms. The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing. To this end, three independent ontologies accessible on the World-Wide Web (http://www.geneontology.org) are being constructed: biological process, molecular function and cellular component.
Article
Full-text available
A number of proteins and nucleic acids have been explored as therapeutic targets. These targets are subjects of interest in different areas of biomedical and pharmaceutical research and in the development and evaluation of bioinformatics, molecular modeling, computer-aided drug design and analytical tools. A publicly accessible database that provides comprehensive information about these targets is therefore helpful to the relevant communities. The Therapeutic Target Database (TTD) is designed to provide information about the known therapeutic protein and nucleic acid targets described in the literature, the targeted disease conditions, the pathway information and the corresponding drugs/ligands directed at each of these targets. Cross-links to other databases are also introduced to facilitate the access of information about the sequence, 3D structure, function, nomenclature, drug/ligand binding properties, drug usage and effects, and related literature for each target. This database can be accessed at http://xin.cz3.nus.edu.sg/group/ttd/ttd.asp and it currently contains entries for 433 targets covering 125 disease conditions along with 809 drugs/ligands directed at each of these targets. Each entry can be retrieved through multiple methods including target name, disease name, drug/ligand name, drug/ligand function and drug therapeutic classification.
Article
Cancer as a disease in the human population is becoming a larger health problem, and the medicines used as treatments have clear limitations. In the past 20 years, there has been a tremendous increase in our knowledge of the molecular mechanisms and pathophysiology of human cancer. Many of these mechanisms have been exploited as new targets for drug development in the hope that they will have greater antitumor activity with less toxicity to the patient than is seen with currently used medicines. The fruition of these efforts in the clinic is just now being realized with a few encouraging results.
Article
Driven by chemistry but increasingly guided by pharmacology and the clinical sciences, drug research has contributed more to the progress of medicine during the past century than any other scientific factor. The advent of molecular biology and, in particular, of genomic sciences is having a deep impact on drug discovery. Recombinant proteins and monoclonal antibodies have greatly enriched our therapeutic armamentarium. Genome sciences, combined with bioinformatic tools, allow us to dissect the genetic basis of multifactorial diseases and to determine the most suitable points of attack for future medicines, thereby increasing the number of treatment options. The dramatic increase in the complexity of drug research is enforcing changes in the institutional basis of this interdisciplinary endeavor. The biotech industry is establishing itself as the discovery arm of the pharmaceutical industry. In bridging the gap between academia and large pharmaceutical companies, the biotech firms have been effective instruments of technology transfer.
Article
The parasite Plasmodium berghei imports the enzyme delta-aminolevulinate dehydratase (ALAD), and perhaps the subsequent enzymes of the pathway from the host red blood cell to sustain heme synthesis. Here we have studied the mechanism of this import. A 65-kDa protein on the P. berghei membrane specifically bound to mouse red blood cell ALAD, and a 93-amino-acid fragment (ALAD-DeltaNC) of the host erythrocyte ALAD was able to compete with the full-length enzyme for binding to the P. berghei membrane. ALAD-DeltaNC was taken up by the infected red blood cell when added to a culture of P. falciparum and this led to a substantial decrease in ALAD protein and enzyme activity and, subsequently, heme synthesis in the parasite, resulting in its death.
Article
Ligand-protein docking has been developed and used in facilitating new drug discoveries. In this approach, docking single or multiple small molecules to a receptor site is attempted to find putative ligands. A number of studies have shown that docking algorithms are capable of finding ligands and binding conformations at a receptor site close to experimentally determined structures. These algorithms are expected to be equally applicable to the identification of multiple proteins to which a small molecule can bind or weakly bind. We introduce a ligand-protein inverse-docking approach for finding potential protein targets of a small molecule by the computer-automated docking search of a protein cavity database. This database is developed from protein structures in the Protein Data Bank (PDB). Docking is conducted with a procedure involving multiple-conformer shape-matching alignment of a molecule to a cavity followed by molecular-mechanics torsion optimization and energy minimization on both the molecule and the protein residues at the binding region. Scoring is conducted by the evaluation of molecular-mechanics energy and, when applicable, by the further analysis of binding competitiveness against other ligands that bind to the same receptor site in at least one PDB entry. Testing results on two therapeutic agents, 4H-tamoxifen and vitamin E, showed that 50% of the computer-identified potential protein targets were implicated or confirmed by experiments. The application of this approach may facilitate the prediction of unknown and secondary therapeutic target proteins and those related to the side effects and toxicity of a drug or drug candidate. Proteins 2001;43:217-226.
Article
A database comprising all ligand-binding sites of known structure aligned with all related protein sequences and structures is described. Currently, the database contains approximately 50000 ligand-binding sites for small molecules found in the Protein Data Bank (PDB). The structure–structure alignments are obtained by the Combinatorial Extension (CE) program (Shindyalov and Bourne, Protein Eng. , 11, 739–747, 1998) and sequence–structure alignments are extracted from the ModBase database of comparative protein structure models for all known protein sequences (Sanchez et al. , Nucleic Acids Res. , 28, 250–253, 2000). It is possible to search for binding sites in LigBase by a variety of criteria. LigBase reports summarize ligand data including relevant structural information from the PDB file, such as ligand type and size, and contain links to all related protein sequences in the TrEMBL database. Residues in the binding sites are graphically depicted for comparison with other structurally defined family members. LigBase provides a resource for the analysis of families of related binding sites. Availability: LigBase is accessible on the web at http://guitar.rockefeller.edu/ligbase. Contact: ash@guitar.rockefeller.edu; sali@rockefeller.edu *To whom correspondence should be addressed