PDTD: A web-accessible protein database for drug target identification

February 2008
BMC Bioinformatics 9(1):104

DOI:10.1186/1471-2105-9-104

Source
PubMed

License
CC BY 2.0

Authors:

Honglin Li

Chinese Academy of Sciences

Xiaofeng Liu

East China University of Science and Technology

BioMed Central

Page 1 of 7

(page number not for citation purposes)

BMC Bioinformatics

Open Access

Database

PDTD: a web-accessible protein database for drug target identification

Zhenting Gao1,3, Honglin Li*1,2, Hailei Zhang2, Xiaofeng Liu1, Ling Kang2,

Xiaomin Luo1, Weiliang Zhu1, Kaixian Chen1, Xicheng Wang*2 and

Hualiang Jiang*1,3

Address: 1Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of

Sciences, Shanghai 201203, China, 2Department of Engineering Mechanics, State Key Laboratory of Structural Analysis for Industrial Equipment,

Dalian University of Technology, Dalian 116023, China and 3School of Pharmacy, East China University of Science and Technology, Shanghai

200237, China

Email: Zhenting Gao - zhentg@mail.shcnc.ac.cn; Honglin Li* - hlli@mail.shcnc.ac.cn; Hailei Zhang - hailei@nus.edu.sg;

Xiaofeng Liu - xxffliu@gmail.com; Ling Kang - klxju@163.com; Xiaomin Luo - xmluo@mail.shcnc.ac.cn;

Weiliang Zhu - wlzhu@mail.shcnc.ac.cn; Kaixian Chen - kxchen@mail.shcnc.ac.cn; Xicheng Wang* - guixum@dlut.edu.cn;

Hualiang Jiang* - hljiang@mail.shcnc.ac.cn

* Corresponding authors

Abstract

Background: Target identification is important for modern drug discovery. With the advances in the development of

molecular docking, potential binding proteins may be discovered by docking a small molecule to a repository of proteins

with three-dimensional (3D) structures. To complete this task, a reverse docking program and a drug target database

with 3D structures are necessary. To this end, we have developed a web server tool, TarFisDock (Target Fishing Docking)

http://www.dddc.ac.cn/tarfisdock, which has been used widely by others. Recently, we have constructed a protein target

database, Potential Drug Target Database (PDTD), and have integrated PDTD with TarFisDock. This combination aims

to assist target identification and validation.

Description: PDTD is a web-accessible protein database for in silico target identification. It currently contains >1100 protein entries with 3D structures deposited in the Protein Data Bank. The data were extracted from the literature and from several online databases such as TTD, DrugBank and Thomson Pharma. The database covers diverse information on >830 known or potential drug targets, including protein and active-site structures in both PDB and mol2 formats, related diseases, biological functions, and associated regulating (signaling) pathways. Each target is categorized by both nosology and biochemical function. PDTD supports keyword searches by PDB ID, target name, and disease name. Data sets generated by PDTD can be viewed with molecular visualization plug-ins and can be downloaded freely. Notably, PDTD is specifically designed for target identification. In conjunction with TarFisDock, PDTD can be used to identify binding proteins for small molecules. The results can be downloaded as a mol2 file containing the binding pose of the probe compound, together with a list of potential binding targets ranked by score.

Conclusion: PDTD serves as a comprehensive and unique repository of drug targets. Integrated with TarFisDock, PDTD is a useful resource for identifying binding proteins for active compounds or existing drugs. Its potential applications include in silico drug target identification, virtual screening, and the discovery of secondary effects of an old drug (i.e. new pharmacological usage) or of an existing target (i.e. new pharmacological or toxic relevance). It may therefore be a valuable platform for pharmaceutical researchers. PDTD is available online at http://www.dddc.ac.cn/pdtd/.

Published: 19 February 2008

BMC Bioinformatics 2008, 9:104 doi:10.1186/1471-2105-9-104

Received: 14 August 2007

Accepted: 19 February 2008

This article is available from: http://www.biomedcentral.com/1471-2105/9/104

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0),

which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Background

Until 2000, only ~500 drug targets had been reported [1], of which only 120 are the targets of marketed drugs [2]. The completion of the human genome and of numerous pathogen genomes suggests that there are 30,000 to 40,000 genes and at least the same number of proteins, many of which are potential targets for drug discovery. It has been estimated that there are more than 2,000 potential drug targets with at least one drug candidate in clinical trials [2,3]. This is a reservoir for drug discovery and target identification; the challenge is how to exploit it fully. Expressing all these proteins and screening compounds against models built from them is impractical because it is prohibitively expensive and time-consuming. Recent advances in docking-based virtual screening have demonstrated the efficiency of this approach in discovering lead (active) compounds [4,5]. Conversely, reverse (or inverse) docking approaches have become promising computational tools for finding the probable target proteins of active compounds, natural products or old drugs [6-10]. Both approaches require information about the target proteins, in particular their structures and active sites. However, for most drug targets this information is scattered across the literature and databases such as the Protein Data Bank (PDB). A database containing comprehensive information on potential target proteins is therefore urgently needed.

Recently, some notable efforts have been made to partially satisfy this requirement. The Therapeutic Target Database (TTD) is one such example [11]; it provides information about known therapeutic targets, disease conditions and the corresponding drugs. DrugBank is a bioinformatics/cheminformatics resource that combines detailed drug data with comprehensive drug target information [12]. A number of ligand-protein interaction databases have also emerged, including LigBase [13], PDBSite [14], SitesBase [15], MSDsite [16], PDB-Ligand [17] and AffinDB [18]. Unfortunately, these databases were not specifically designed for discovering new leads by virtual screening or new targets by reverse docking. Nor can they be used to work out specific pharmaceutical information related to the secondary effects of an old drug (i.e. new pharmacological usage) or of an existing target (i.e. new pharmacological or toxic relevance). Ideally, a target database should provide not only abundant information about the potential target proteins, such as 3D structures, binding (active) sites, biological (pharmacological) functions and related diseases, but also appropriate computational tools for mining that information. Herein, we present a web-accessible protein database, PDTD (Potential Drug Target Database). Integrated with our reverse docking server, TarFisDock [8], PDTD is a valuable platform for target identification.

Construction and content

Fundamentally, PDTD has two functions: querying drug target information, and identifying the potential binding proteins of an active compound or an existing drug by reverse docking. Accordingly, PDTD contains two types of sub-database: a structural sub-database and an informatics sub-database. All data are held in a relational database implemented in MySQL and can be queried through the web interface. Through three computational engines (the search engine, the visualization engine and TarFisDock), users can run interactive queries and computations against PDTD (Figure 1). The structural sub-database stores each protein in both PDB format and mol2 format with Amber charges, together with sequence and active-site information. The informatics sub-database stores target categories, related disease information, biological functions and associated regulating (signaling) pathways. PDTD currently contains >1100 entries covering >800 known drug targets.
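The split into a structural and an informatics sub-database joined through a relational key can be illustrated with a small sketch. The paper does not publish its schema, so every table and column name below is hypothetical, the single row is placeholder data, and SQLite (from the Python standard library) stands in for the MySQL server that PDTD actually uses.

```python
import sqlite3

# Hypothetical schema: table and column names are illustrative only,
# not the schema used by PDTD. SQLite is a stand-in for MySQL.
SCHEMA = """
CREATE TABLE structure_entry (          -- structural sub-database
    entry_id    INTEGER PRIMARY KEY,
    pdb_id      TEXT NOT NULL,
    pdb_file    TEXT,                   -- path to the PDB-format file
    mol2_file   TEXT,                   -- path to the mol2 file with charges
    active_site TEXT                    -- residues defining the binding site
);
CREATE TABLE target_info (              -- informatics sub-database
    entry_id          INTEGER REFERENCES structure_entry(entry_id),
    target_name       TEXT NOT NULL,
    therapeutic_area  TEXT,
    biochemical_class TEXT,
    disease           TEXT,
    pathway           TEXT
);
"""


def build_demo_db():
    """Create an in-memory database holding one placeholder entry."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.execute(
        "INSERT INTO structure_entry VALUES (?, ?, ?, ?, ?)",
        (1, "0AAA", "0aaa.pdb", "0aaa.mol2", "A:25,A:27"),
    )
    conn.execute(
        "INSERT INTO target_info VALUES (?, ?, ?, ?, ?, ?)",
        (1, "example protease", "viral infections", "enzyme",
         "example disease", None),
    )
    conn.commit()
    return conn


def entry_with_annotation(conn, pdb_id):
    """Join structural and annotation data for one PDB ID (None if absent)."""
    return conn.execute(
        "SELECT s.pdb_id, s.mol2_file, t.target_name, t.biochemical_class "
        "FROM structure_entry AS s JOIN target_info AS t USING (entry_id) "
        "WHERE s.pdb_id = ?",
        (pdb_id,),
    ).fetchone()
```

Keeping coordinates and annotation in separate tables means a target's annotation can be revised without touching the structure files that the docking engine reads.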

The target proteins in PDTD were selected from the scientific literature [1,19-21] and several online databases such as TTD [11], DrugBank [12] and Thomson Pharma [22]. Since PDTD is designed to search for the probable binding proteins of new active compounds or existing drugs by reverse docking, it contains only proteins with known 3D structures determined experimentally by X-ray crystallography or NMR. The coordinates of the proteins were extracted from the PDB. Since not all PDB structures are of equal quality, when a protein has several redundant records in the PDB a structure is selected according to the following criteria: (i) select the structure without mutations or missing residues around the active site; (ii) select the structure with high resolution; (iii) select the structure complexed with a ligand. For each selected protein in PDTD, amino acid residues within 6.5 Å of the bound ligand were used to define the binding (active) site. A PDB entry may contain data on several binding sites; if so, a separate entry was generated in PDTD for each site. HETATM records in the PDB files were used to define the ligands. PDB and mol2 files of each protein were also stored in the structural sub-database. All structures for a drug target can be visualized using the "Jmol" Java applet [23].
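The binding-site definition above (residues within 6.5 Å of a ligand taken from HETATM records) can be sketched in a few lines. This is a minimal illustration, not the authors' code. It assumes that a residue qualifies when any of its atoms lies within the cutoff of any ligand atom, and that water (HOH) HETATM records are ignored; neither detail is stated in the paper. The helper `pdb_line` and the demo coordinates are hypothetical.

```python
import math

CUTOFF_ANGSTROM = 6.5  # distance threshold quoted in the text


def pdb_line(record, serial, name, res, chain, resseq, x, y, z):
    """Build a fixed-column PDB coordinate line (demo helper only)."""
    return (f"{record:<6}{serial:>5} {name:<4} {res:>3} {chain}{resseq:>4}"
            f"    {x:8.3f}{y:8.3f}{z:8.3f}")


def parse_atoms(pdb_text):
    """Split coordinates into protein atoms (ATOM) and ligand atoms (HETATM)."""
    protein, ligand = [], []
    for line in pdb_text.splitlines():
        record = line[0:6].strip()
        if record not in ("ATOM", "HETATM"):
            continue
        res_name = line[17:20].strip()
        residue = (line[21], int(line[22:26]), res_name)
        xyz = (float(line[30:38]), float(line[38:46]), float(line[46:54]))
        if record == "ATOM":
            protein.append((residue, xyz))
        elif res_name != "HOH":  # assumption: waters are not ligands
            ligand.append(xyz)
    return protein, ligand


def binding_site_residues(pdb_text, cutoff=CUTOFF_ANGSTROM):
    """Return sorted (chain, residue number, residue name) near the ligand."""
    protein, ligand = parse_atoms(pdb_text)
    site = {
        residue
        for residue, xyz in protein
        if any(math.dist(xyz, lig_xyz) <= cutoff for lig_xyz in ligand)
    }
    return sorted(site)


# Hypothetical coordinates: two residues near the ligand, one far away.
DEMO_PDB = "\n".join([
    pdb_line("ATOM", 1, "CA", "ALA", "A", 10, 0.0, 0.0, 0.0),
    pdb_line("ATOM", 2, "CA", "GLY", "A", 11, 5.0, 0.0, 0.0),
    pdb_line("ATOM", 3, "CA", "SER", "A", 50, 20.0, 0.0, 0.0),
    pdb_line("HETATM", 4, "C1", "LIG", "A", 900, 1.0, 0.0, 0.0),
    pdb_line("HETATM", 5, "O", "HOH", "A", 901, 20.0, 1.0, 0.0),
])
```

Calling `binding_site_residues(DEMO_PDB)` keeps ALA 10 and GLY 11 and drops SER 50, whose only nearby HETATM atom is a water.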

Most drug targets in PDTD are represented by a single structure (709 cases). Because our reverse docking program, TarFisDock, does not account for protein flexibility, PDTD contains some redundancy for flexible

proteins. For example, HIV-1 protease has 27 entries and dihydrofolate reductase (DHFR) has 14 entries in the database. The redundancy of each target is shown in Figure 2a.
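The redundancy distribution plotted in Figure 2a is a histogram of how many targets have a given number of entries. A minimal sketch of that tally is shown below. The counts for HIV-1 protease and DHFR come from the text; the two single-entry targets are hypothetical placeholders.

```python
from collections import Counter


def redundancy_histogram(entry_targets):
    """Map 'number of entries per target' to 'number of such targets'."""
    copies_per_target = Counter(entry_targets)   # target -> entry count
    return Counter(copies_per_target.values())   # entry count -> target count


# One list item per database entry, labelled with its target name.
DEMO_ENTRIES = (
    ["HIV-1 protease"] * 27
    + ["DHFR"] * 14
    + ["hypothetical target A", "hypothetical target B"]
)
```

Here `redundancy_histogram(DEMO_ENTRIES)` reports two targets with a single entry, one with 14 entries and one with 27.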

According to therapeutic area, the drug targets are categorized into 14 types (Figure 2b). This makes it convenient for users to build a customized target list when they predict potential binding targets for small molecules using TarFisDock. Among targets with explicit therapeutic functions, those related to neoplastic disease are the most numerous, followed by targets related to hormones and hormone antagonists. Targets related to viral infections are also major contributors to PDTD. The distribution of biochemical classes is shown in Figure 2c, indicating that PDTD mainly consists of enzymes, receptors, ion channels, transporters, nuclear receptors, binding proteins, structural proteins, signaling proteins, and factors, regulators and hormones. Targets that cannot be assigned to any of these biochemical classes are grouped into an "unknown" class. The selected drug targets are enriched in enzymes (80.2%). G-protein coupled receptors (GPCRs) and other receptors, which account for most drug targets, seldom have crystal structures; as a result, receptors make up only 4.2% of the targets in PDTD.

Utility and Discussion

Web interface: query, download and exploration

PDTD has a user-friendly web interface through which users can easily query target information and retrieve, visualize or download drug target files as they wish (Figure 3). PDTD has been designed to provide fast and easy access to target information. The popular MySQL backend was chosen as the database server. The pages are generated with the scripting language PHP, and special care was taken to produce a clearly structured layout that enables fast and easy navigation.

Figure 1. PDTD system architecture. The system is implemented in MySQL and PHP; users can freely access the database at http://www.dddc.ac.cn/pdtd/.

Figure 2. Functional and biochemical classifications of PDTD protein entries. (A) Database redundancy. Each bar represents the number of targets that have the same number of copies in PDTD. A break is applied to the y axis. (B) Distribution of drug targets according to their therapeutic areas. (C) Distribution of drug targets according to their biochemical classes, which include enzymes, receptors, ion channels, transporters, nuclear receptors, binding proteins, structural proteins, signaling proteins, and factors, regulators and hormones.

Figure 3. Screen shots of the PDTD. A screen shot of the PDTD showing several possible views of the information describing a drug target. Not all fields are shown.

All the data can be accessed and retrieved directly via a web browser. The PDTD interface consists of a classification table and a keyword search box. The user can browse for a drug target manually in the classification table, or search automatically by keyword, such as PDB ID, target name, or disease. Every target has its own result page containing comprehensive information, including PDB ID, target name, target category, related disease, structure, and active site. PDTD was carefully annotated according to information found in the PDB, UniProt [24], KEGG [25] and the Enzyme Structures Database [26]. PDTD also provides hyperlinks to other databases such as TTD and DrugBank, which allow easy navigation to more information about target structure, source and function (see Links to other databases). The related structures for each drug target can be downloaded freely from the detail page by clicking the "MORE" button. Furthermore, users can download the classified target structures and all target files from the "Download" page.
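The keyword search over PDB ID, target name and disease described in this section can be sketched as a case-insensitive substring match. The paper does not say how the PHP/MySQL search is actually implemented, so the matching rule, the `TargetRecord` type and the placeholder PDB IDs below are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TargetRecord:
    pdb_id: str
    target_name: str
    disease: str


def keyword_search(records, keyword):
    """Return records whose PDB ID, name or disease contains the keyword."""
    needle = keyword.strip().lower()
    if not needle:
        return []  # an empty query matches nothing rather than everything
    return [
        record
        for record in records
        if needle in record.pdb_id.lower()
        or needle in record.target_name.lower()
        or needle in record.disease.lower()
    ]


# Placeholder records: the PDB IDs are not real identifiers.
DEMO_RECORDS = [
    TargetRecord("0AAA", "HIV-1 protease", "viral infection"),
    TargetRecord("0BBB", "dihydrofolate reductase", "neoplastic disease"),
]
```

For example, `keyword_search(DEMO_RECORDS, "protease")` returns only the first record, and a query of `"0b"` matches the second record by its identifier.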

Users can also customize the list of drug targets against which reverse docking is performed to predict the potential binding targets of any small molecule, a feature that to our knowledge is not provided by other public websites. TarFisDock then outputs the list of results with the binding pose of the molecule against each target, along with the corresponding disease information, annotation and links to other databases (see Applications below).

Applications

Together with the reverse docking server TarFisDock [8], PDTD has been widely used to identify binding proteins for small molecules. The binding proteins of several molecules have been verified through bioassays and crystal structure determination of target-ligand complexes [9]. In general, one drug molecule may interact with several targets, including targets associated with side effects (toxicity). TarFisDock provides multiple candidate binding proteins. These clues are useful for further experimental tests to discover new pharmacological efficacy or toxicity of an existing drug. Combined with PDTD, the TarFisDock web server is a convenient tool for "fishing" the target proteins of small molecules: the user simply inputs the structure of the query compound and customizes a target list from PDTD (a list of all the targets is recommended). The results can be downloaded as a mol2 file containing the binding pose of each compound, together with a list of potential binding targets ranked by score.
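Because the docked pose is delivered as a mol2 file, a reader may want to pull the atom coordinates out of it. The sketch below reads the `@<TRIPOS>ATOM` section of a Tripos mol2 file. It is a minimal reader written for illustration, and the two-atom molecule in `DEMO_MOL2` is hypothetical, not actual TarFisDock output.

```python
def read_mol2_atoms(mol2_text):
    """Return name, coordinates, atom type and charge for each mol2 atom."""
    atoms = []
    in_atom_section = False
    for line in mol2_text.splitlines():
        stripped = line.strip()
        if stripped.startswith("@<TRIPOS>"):
            # Every record-type indicator starts a new section.
            in_atom_section = stripped == "@<TRIPOS>ATOM"
            continue
        if not in_atom_section or not stripped:
            continue
        fields = stripped.split()
        atoms.append({
            "name": fields[1],
            "xyz": (float(fields[2]), float(fields[3]), float(fields[4])),
            "type": fields[5],
            # The charge column is optional in the mol2 format.
            "charge": float(fields[8]) if len(fields) > 8 else None,
        })
    return atoms


# Hypothetical two-atom molecule used only to exercise the reader.
DEMO_MOL2 = """@<TRIPOS>MOLECULE
probe
 2 1 1
SMALL
USER_CHARGES

@<TRIPOS>ATOM
      1 C1          0.0000    0.0000    0.0000 C.3     1 LIG   -0.1000
      2 O1          1.4000    0.0000    0.0000 O.3     1 LIG   -0.4000
@<TRIPOS>BOND
     1     1     2 1
"""
```

`read_mol2_atoms(DEMO_MOL2)` returns two atom dictionaries; the bond section is skipped because it is outside the atom block.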

In addition, benchmark searches of the old version of PDTD (698 entries) were performed with TarFisDock using vitamin E, an anti-oxidant, and 4H-tamoxifen, an anti-cancer agent, as probes [8]. In this study, similar benchmark searches of the current version of PDTD (1186 entries) have been carried out. For vitamin E, eight (12 entries) of the twelve experimentally verified targets fall into the top 10% of candidates picked from PDTD by TarFisDock. For 4H-tamoxifen, five (14 entries) of the eleven experimentally confirmed targets appear among the top 10% of the candidates predicted by TarFisDock. PDTD was also searched with TarFisDock in previous research [9], using as a probe N-trans-caffeoyltyramine (compound 1), an active natural product discovered by anti-H. pylori screening in our lab. A homology search revealed that, among the fifteen candidates discovered by reverse docking, diaminopimelate decarboxylase (DC) and peptide deformylase (PDF) are possible binding proteins of compound 1. Enzymatic assays demonstrated that compound 1 and its derivative, compound 2, are potent inhibitors of H. pylori PDF (HpPDF), with IC50 values of 10.8 and 1.25 µM, respectively. X-ray crystal structures of HpPDF and of its complexes with 1 and 2 were determined, indicating that these two inhibitors bind well in the HpPDF binding pocket [9]. To exemplify the applications of PDTD combined with TarFisDock, brief results of these three benchmark examples have been uploaded to the PDTD homepage under the "Benchmark" option.
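The benchmark criterion used above (how many verified targets appear in the top 10% of ranked entries) can be expressed as a short function. This sketch assumes that a lower score means a better rank, as for an interaction energy, and that the cutoff is rounded up to a whole entry; the paper specifies neither convention. The entry names, scores and target labels are hypothetical.

```python
def targets_in_top_percent(scores, entry_to_target, known_targets,
                           top_percent=10):
    """Return known targets with at least one entry in the top-ranked slice.

    scores: entry id -> docking score (assumed: lower is better).
    entry_to_target: entry id -> target name (several entries may share one).
    """
    ranked = sorted(scores, key=scores.get)          # best score first
    # Integer ceiling of len(ranked) * top_percent / 100.
    cutoff = -(-len(ranked) * top_percent // 100)
    top_targets = {entry_to_target[entry] for entry in ranked[:cutoff]}
    return sorted(top_targets & set(known_targets))


# Hypothetical data: 20 entries, entry e0 has the best (lowest) score.
DEMO_SCORES = {f"e{i}": float(i) - 50.0 for i in range(20)}
DEMO_ENTRY_TO_TARGET = {f"e{i}": f"T{i}" for i in range(20)}
DEMO_KNOWN = {"T1", "T7"}
```

With these 20 entries the top 10% is the best two entries (e0 and e1), so only `T1` is counted as recovered. Because the comparison is made on target names rather than entry ids, a target with several redundant entries counts once.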

Links to other databases

General links are provided to related drug and target information in other databases [11,12]. Each entry in PDTD is linked to the Protein Data Bank and DrugBank; there are also hypertext links to UniProt [24], KEGG [25] and the Enzyme Structures Database [26] for further structural and functional information.

Conclusion

In summary, PDTD is a comprehensive, web-accessible database of drug targets that focuses on targets with known 3D structures. So far, PDTD has collected >1100 entries covering >800 known and potential drug targets from the Protein Data Bank. The PDB structure, mol2 file and active-site information of each drug target were extracted from the crystal structure; all of this information can be viewed with molecular visualization tools and downloaded freely by users. The drug targets in PDTD were categorized by two criteria: therapeutic area and biochemical class. Each target was carefully annotated by consulting several databases, such as DrugBank, TTD, and UniProt. All this information is stored in the informatics sub-database, which is linked to the structural sub-database through a relational database. Users can also use our reverse docking program to search PDTD for the possible binding protein(s) of a small molecule.

One drug molecule may interact with several targets, including targets associated with side effects (toxicity). By searching PDTD, TarFisDock may provide multiple candidate binding proteins for a small molecule. These clues are useful for further experimental tests to discover new targets and new pharmacological efficacy or toxicity of an existing drug. Thus, combined with TarFisDock, PDTD is a useful web-accessible protein database for identifying drug targets and for discovering new uses of old drugs [27,28]. The user simply inputs the structure of the query compound and customizes a target list from PDTD (a list of all the targets is recommended), and TarFisDock then suggests possible binding proteins of the compound. The results can be downloaded as a mol2 file containing the binding pose of each compound, together with a list of potential binding targets ranked by score.

PDTD will be updated continuously. We intend to classify the drug targets more completely according to their biological functions, which will be achieved by integrating and/or linking PDTD with other bioinformatics databases. For example, links can be directed to SOURCE [29] and Gene Ontology [30] for further functional annotations, ontologies, and gene expression data.

Availability and requirements

PDTD is freely available to academic users at http://www.dddc.ac.cn/pdtd. To download PDTD files, users must complete a simple registration process and agree not to republish the data without explicit permission. Users are invited to contact us through the 'Contact' link and to participate in the user forum at http://www.dddc.ac.cn/tarfisdock/forum/.

Authors' contributions

ZG developed the web interface, designed the relational database schema, and integrated the PDTD database with the reverse docking program TarFisDock. HL developed the reverse docking program, participated in the design of the web interface, and contributed to writing the manuscript. HZ, XL, and LK were major data contributors to the current system. XL, WZ and KC provided comments and suggestions about the features of the database. XW and HJ conceived the idea of the database, provided direction for its development and revised the subsequent drafts of this manuscript. All authors read and approved the final manuscript.

Acknowledgements

We thank all our colleagues in the Drug Discovery and Design Center for their contributions to literature searching and file processing. The work was partly supported by the Special Fund for Major State Basic Research Project (grant 2002CB512802), the National Natural Science Foundation of China (grants 20721003 and 10572033), and the 863 Hi-Tech Program of China (grant 2007AA02Z304). HL was also sponsored by the Shanghai Postdoctoral Scientific Program.

References

1. Drews J: Drug discovery: a historical perspective. Science 2000, 287(5460):1960-1964.

2. Hopkins AL, Groom CR: The druggable genome. Nat Rev Drug Discov 2002, 1(9):727-730.

3. Russ AP, Lampel S: The druggable genome: an update. Drug Discov Today 2005, 10(23-24):1607-1610.

4. Kitchen DB, Decornez H, Furr JR, Bajorath J: Docking and scoring in virtual screening for drug discovery: methods and applications. Nat Rev Drug Discov 2004, 3(11):935-949.

5. Sperandio O, Miteva MA, Delfaud F, Villoutreix BO: Receptor-based computational screening of compound databases: the main docking-scoring engines. Curr Protein Pept Sci 2006, 7(5):369-393.

6. Chen YZ, Zhi DG: Ligand-protein inverse docking and its potential use in the computer search of protein targets of a small molecule. Proteins 2001, 43(2):217-226.

7. Paul N, Kellenberger E, Bret G, Muller P, Rognan D: Recovering the true targets of specific ligands by virtual screening of the protein data bank. Proteins 2004, 54(4):671-680.

8. Li H, Gao Z, Kang L, Zhang H, Yang K, Yu K, Luo X, Zhu W, Chen K, Shen J, Wang X, Jiang H: TarFisDock: a web server for identifying drug targets with docking approach. Nucleic Acids Res 2006, 34(Web Server issue):W219-24.

9. Cai J, Han C, Hu T, Zhang J, Wu D, Wang F, Liu Y, Ding J, Chen K, Yue J, Shen X, Jiang H: Peptide deformylase is a potential target for anti-Helicobacter pylori drugs: reverse docking, enzymatic assay, and X-ray crystallography validation. Protein Sci 2006, 15(9):2071-2081.

10. Muller P, Lena G, Boilard E, Bezzine S, Lambeau G, Guichard G, Rognan D: In silico-guided target identification of a scaffold-focused library: 1,3,5-triazepan-2,6-diones as novel phospholipase A2 inhibitors. J Med Chem 2006, 49(23):6768-6778.

11. Chen X, Ji ZL, Chen YZ: TTD: Therapeutic Target Database. Nucleic Acids Res 2002, 30(1):412-415.

12. Wishart DS, Knox C, Guo AC, Shrivastava S, Hassanali M, Stothard P, Chang Z, Woolsey J: DrugBank: a comprehensive resource for in silico drug discovery and exploration. Nucleic Acids Res 2006, 34(Database issue):D668-72.

13. Stuart AC, Ilyin VA, Sali A: LigBase: a database of families of aligned ligand binding sites in known protein sequences and structures. Bioinformatics 2002, 18(1):200-201.

14. Ivanisenko VA, Pintus SS, Grigorovich DA, Kolchanov NA: PDBSite: a database of the 3D structure of protein functional sites. Nucleic Acids Res 2005, 33(Database issue):D183-7.

15. Gold ND, Jackson RM: SitesBase: a database for structure-based protein-ligand binding site comparisons. Nucleic Acids Res 2006, 34(Database issue):D231-4.

16. Velankar S, McNeil P, Mittard-Runte V, Suarez A, Barrell D, Apweiler R, Henrick K: E-MSD: an integrated data resource for bioinformatics. Nucleic Acids Res 2005, 33(Database issue):D262-5.

17. Shin JM, Cho DH: PDB-Ligand: a ligand database based on PDB for the automated and customized classification of ligand-binding structures. Nucleic Acids Res 2005, 33(Database issue):D238-41.

18. Block P, Sotriffer CA, Dramburg I, Klebe G: AffinDB: a freely accessible database of affinities for protein-ligand complexes from the PDB. Nucleic Acids Res 2006, 34(Database issue):D522-6.

19. Bonday ZQ, Dhanasekaran S, Rangarajan PN, Padmanaban G: Import of host delta-aminolevulinate dehydratase into the malarial parasite: identification of a new drug target. Nat Med 2000, 6(8):898-903.

20. Gibbs JB: Mechanism-based target identification and drug discovery in cancer research. Science 2000, 287(5460):1969-1973.

21. Hardman JG, Limbird LE, Gilman AG: Goodman and Gilman's The Pharmacological Basis of Therapeutics. 10th edition. New York: McGraw-Hill; 2001.

22. Thomson Pharma [http://www.thomson-pharma.com]

23. Jmol: an open-source Java viewer for chemical structures in 3D [http://www.jmol.org/]

24. Apweiler R, Bairoch A, Wu CH, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, Magrane M, Martin MJ, Natale DA, O'Donovan C, Redaschi N, Yeh LS: UniProt: the Universal Protein knowledgebase. Nucleic Acids Res 2004, 32(Database issue):D115-9.

25. Kanehisa M, Goto S, Kawashima S, Okuno Y, Hattori M: The KEGG resource for deciphering the genome. Nucleic Acids Res 2004, 32(Database issue):D277-80.

26. Enzyme Structures Database [http://www.ebi.ac.uk/thornton-srv/databases/enzymes/]

27. O'Connor KA, Roth BL: Finding new tricks for old drugs: an efficient route for public-sector drug discovery. Nat Rev Drug Discov 2005, 4(12):1005-1014.

28. Chong CR, Sullivan DJ Jr.: New uses for old drugs. Nature 2007, 448(7154):645-646.

29. Diehn M, Sherlock G, Binkley G, Jin H, Matese JC, Hernandez-Boussard T, Rees CA, Cherry JM, Botstein D, Brown PO, Alizadeh AA: SOURCE: a unified genomic resource of functional annotations, ontologies, and gene expression data. Nucleic Acids Res 2003, 31(1):219-223.

30. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 2000, 25(1):25-29.


