ArticlePDF Available

Improving Women's Health Through Modernization of Our Bioinformatics Infrastructure

Authors:

Abstract and Figures

Our nationwide bioinformatics infrastructure used to detect important sex differences associated with medical product use is antiquated. The Food and Drug Administration (FDA) has embarked on an ambitious bioinformatics modernization effort that will improve our ability to assess the safety and effectiveness of new medical products. This, in turn, will improve our ability to detect important sex differences.
Content may be subject to copyright.
Improving Women’s Health Through
Modernization of Our Bioinformatics
Infrastructure
A Oliva
1
, E Pinnow
2
, R Levin
1
and K Uhl
2
Our nationwide bioinformatics infrastructure used
to detect important sex differences associated with
medical product use is antiquated. The Food and Drug
Administration (FDA) has embarked on an ambitious
bioinformatics modernization effort that will improve our
ability to assess the safety and effectiveness of new medical
products. This, in turn, will improve our ability to detect
important sex differences.
THE BIOINFORMATICS INFRASTRUCTURE PROBLEM
Bioinformatics is the design and development of computer-
based technology to support the life sciences.
1
At the FDA,
bioinformatics means having modern computer systems to
efficiently and effectively manage information regarding
FDA-regulated medical products. Ensuring the efficient
exchange of medical product information among the FDA
and its stakeholders is critical to the FDAs mission to protect
and promote public health (see Figure 1).
Through its Critical Path Initiative, the FDA is seeking to
facilitate a national effort to modernize the scientific process
through which a potential human drug, biological product,
or medical device is transformed from a discovery or ‘proof
of concept’ into a marketed medical product.
2
The FDA has
identified bioinformatics as an area where improvements can
increase efficiency, predictability, and productivity in the
development process. Effective bioinformatics modernization
requires an understanding of the existing problems and
addressing them.
To understand the current inefficiencies in our bioinfor-
matics infrastructure, one need only consider a time before
we developed our modern financial information infrastruc-
ture. Back then, we lacked a standard currency, and wealth
was maintained in a variety of different forms (e.g., gold,
silver, beads). Banks were few and wealth was commonly
stored under a mattress or hidden away under floorboards.
Money was counted by hand and transported in sacks. It is
easy to imagine the inherent inefficiencies of such a system.
Today, the US financial industry has a standard currency
(US dollars), has improved access to money and transaction
methods (e.g., banks, automatic teller machines, credit
cards), and has developed user-friendly interfaces to financial
information (e.g., automated money counting machines,
calculators). Our modern financial infrastructure allows
money to travel electronically both quickly and securely
wherever needed, and financial information is accessible
virtually anywhere. This modern financial infrastructure has
enabled modern financial management tools that were
previously unthinkable. For example, the Federal Reserve
now has tools to monitor billions of financial transactions
almost in real time to assess the health of the economy and to
make better monetary policy decisions. For the individual
user, some banks now provide account balances through
instant messages to an account holder’s cell phone.
EDUCATION
nature publishing group
Figure 1 The regulatory product information supply chain. Arrows
represent points at which product information is exchanged. The regulated
industry gives the FDA certain information about a product (e.g., clinical trial
data). The FDA evaluates that information and, if the product is approved, in
turn, makes certain product information available to the public (e.g., drug
labeling). The public in turn provides product information to the Agency
(e.g., adverse event reports), which the FDA processes, providing updates to
the regulated industry (e.g., recall) and the public if needed (e.g., new
labeling, warning box). PIE, Product Information Exchange.
1
Office of Critical Path Programs, Food and Drug Administration, Rockville, Maryland, USA;
2
Office of Women’s Health, Food and Drug Administration, Rockville,
Maryland, USA. Correspondence: A Oliva (armando.oliva@fda.hhs.gov)
Published online 7 November 2007. doi:10.1038/sj.clpt.6100437
192 VOLUME 83 NUMBER 1 | JANUARY 2008 | www.nature.com/cpt
Our bioinformatics infrastructure today is comparable to
the financial infrastructure of years past. If one compared
clinical study data to money, we have no widely implemented
standard currency for study data. Furthermore, companies
store their study data locally and in different formats,
limiting access and making it extremely difficult to exchange
data quickly and securely between authorized entities. The
available tools to analyze study data are outdated and difficult
to use, making the assessment of a medical product’s safety
and effectiveness time-consuming. The resulting inefficien-
cies make it difficult for the FDA to carry out its mission to
ensure that medical products are safe and effective. Unlike the
Federal Reserve’s ability to leverage our modern financial
information infrastructure to inform monetary policy, the
lack of a modern bioinformatics infrastructure hinders the
FDAs ability to make better informed health policy decisions
about new medical products.
We must maximize the usefulness of the information we
collect from clinical studies that are submitted to the FDA to
support marketing approval. To do this, the FDA and its
regulated industry must undergo a major transformation.
By implementing a standard currency for study data,
improving access to the data, and developing user-friendly
tools (i.e., interfaces) that can help us efficiently convert data
into knowledge, we can lower development costs, shorten the
time to market for new products, and enhance our ability to
communicate information about their safe and effective use,
thus promoting the public health. The FDA recognizes its
leadership role in this transformation.
THE SOLUTION
A modern bioinformatics environment requires enhancing
three key information management domains: (1) informa-
tion standards, (2) information access, and (3) interface or
tools that can efficiently analyze information. Standards,
access, and interface tools combine to influence the way we
process information and improve regulatory decision making
(see Figure 2).
Standards
By using data standards, we establish a common language for
managing health-related information. Data standards enable
both people and computers to communicate effectively and
efficiently. Although all communication with the agency must
be in Standard English, we must use a standard language that
computer systems can understand to automate the processing
of information. We must also use standard terminology:
formalandconsistentdenitionsaswellasexamplessothatwe
can effectively share concepts and ideas. Computer systems have
their own language with precise terminologies and dictionaries
(i.e., semantics) and grammar (i.e., syntax). Using precise,
standard machine-readable language is a necessary component
of an effective bioinformatics environment.
Access
Improving access means making it easy to send, receive, and
share information. Eliminating paper and exchanging informa-
tion electronically greatly improve access. Improved access also
means being able to quickly locate specific information contained
within electronic files and documents. It is not sufficient to easily
access a review of a new marketing application; we also need
quick access to detailed information within a review, such as
information about an unusual adverse event.
Interface
The third key information management domain is interface.
We must develop user-friendly interfaces or tools for
analyzing information that efficiently and effectively help us
convert information into knowledge. We must ensure that
users are properly trained to use these tools. Increasingly, the
modern interface tools to information are software programs
on a computer.
Improvements in all three key information management
domains are needed to enhance decision making and
improve the public health. Creation of a national modern
bioinformatics infrastructure is not something industry and
other relevant stakeholders can achieve without Federal
leadership—the government has already taken the lead by
launching its broad health information technology (Health
IT) effort.
3
In its Critical Path Initiative, the FDA identified
harnessing bioinformatics
4
as a key area needing the most
work. By harnessing bioinformatics to modernize our own
information management environment, the FDA will help
pave the road for its stakeholders to do the same.
It is the FDAs goal to achieve a modern bioinformatics
environment that enhances its decision-making capability.
Ideally, all information flowing through the regulatory
product information supply chain is standardized, both
information and knowledge are accessible electronically, and
the tools for analyzing information are both effective and
user friendly.
EXAMPLE: TRACKING WOMEN IN CLINICAL TRIALS
The inclusion of women in clinical trials is necessary to
identify sex-related safety and efficacy differences. A 2001
Figure 2 Three information management domains.
CLINICAL PHARMACOLOGY & THERAPEUTICS | VOLUME 83 NUMBER 1 | JANUARY 2008 193
EDUCATION
report by the Government Accountability Office found that 8
out of 10 drugs withdrawn since 1 January 1997 posed greater
health risks for women.
5
Congress expressed concern that the
FDA had paid insufficient attention to gender-based research,
mandating the FDA to pilot a database system that collects
demographic information.
6
Tracking the participation of women in clinical trials is a
good example of an existing inefficiency in our bioinfor-
matics infrastructure with a direct impact on women’s health.
We use this information to determine if an adequate number
of women participated in a trial (or a group of trials) to begin
to assess whether an important sex difference exists.
Surprisingly, there is no widely used standard to record the
sex of a subject in a clinical trial. Some call this important
variable ‘‘sex. Others call it gender, even though it is the
biological sex that is recorded. Some clinical trials record the
sex as ‘male’ or ‘female, others as ‘M’ or ‘‘F,’ still others as
‘0’ or ‘1, or ‘1’’ or ‘‘2’ (leaving the analyzer of the data
sometimes wondering which number corresponds to which
sex!). If one wished to write a computer program to count
the number of women that participated in 10 trials of a new
drug, a manual, time-consuming process is needed. One has
to first see how the sex information is recorded in each trial,
what terms were used to describe each sex, and then write a
computer program to analyze those specific 10 trials. Once the
analysis is complete, one discards the program because the next
10 trials will use different standards to document sex, and
another custom program will be needed. Without a standard
way of describing the sex of a subject, our ability to develop
reusable and computer-automated counting techniques (i.e,a
modern and efficient interface, analogous to money-counting
machines in the financial world) is substantially impaired.
The Clinical Data Interchange Standards Consortium
(CDISC)
7
is a non-profit organization that has developed
standards (called the Study Data Tabulation Model, or
SDTM) for the exchange of clinical trial data. They
recommend that all clinical trials always record sex informa-
tion in a variable called ‘SEX’’ and identify the sex of each
subject as M, F, or U (unknown). The FDA views the SDTM
as a positive step forward in standardizing clinical trial data
and recommends its use.
8
CDISC-formatted clinical trial data
are easier to analyze because they support the development of
reusable computer programs (interfaces) that will greatly
facilitate tracking the participation of women in clinical
trials, for example.
Implementation of the SDTM industry-wide is still in its
infancy. A recent FDA study revealed a significant learning
curve in using the standard correctly.
9
But, with time and
experience, we anticipate that wider and more consistent
adoption of the SDTM will yield efficiencies that will make
tracking the participation of women in clinical trials a
relatively trivial computational problem. The same data
standardization strategy will allow ease in tracking any
particular subpopulation in clinical trials (e.g., pediatrics,
elderly), so that clinical trial data become more useful even
beyond womens health.
This, of course, is just the first step. With standardized
electronic clinical trial data, to include adverse event and
laboratory data, modern computerized analytic tools can be
developed (prototypes of which are currently under evalua-
tion and limited use at the FDA) to easily understand the
effects of medical products in women and to efficiently
identify important sex and gender differences associated with
their use.
A common limitation of clinical trials is that they do not
enroll sufficient members of an important subgroup such as
women, often making it difficult to identify important sex
differences by analyzing a single trial. The availability of
standardized trial data makes it much easier to use powerful
statistical analytic techniques across many trials to increase
our ability to detect important subgroup differences, without
needing to conduct difficult, expensive, and time-consuming
data conversion each time.
NEXT STEPS
A modern bioinformatics infrastructure for clinical trial data
means that all data are available electronically (improved
access) and are standardized and that modern analytic tools
(interfaces) are available to quickly and efficiently analyze the
data to create useful knowledge about medical products. The
key is more widespread adoption of clinical trial data
standards industry-wide. The FDA is collaborating with
CDISC and Health Level 7 (HL7) (an American Standards
Institute (ANSI)-accredited standards development organiza-
tion working in the health-care arena)
10
to create a data
exchange standard that is completely harmonized with
existing HL7 standards currently being used in exchanging
Electronic Health Record (EHR) information. Although also
in their infancy, EHRs provide an exciting opportunity to
monitor the effects of medical products on a much larger
scale, and throughout the entire life cycle of a medical
product, from early development through widespread post-
marketing use. The CDISC-HL7 standard, once developed,
will support seamless integration of medical product
information wherever and whenever it is collected. The
ultimate goal is having high-quality standardized data about
medical products accessible electronically, exchanged quickly
and securely, and analyzed using modern analytic tools to
improve the public health.
Modernization of our bioinformatics infrastructure is
costly, complex, and time-consuming. Nonetheless, it is a
necessary step to improve the benefit–risk assessments of
medical products, which in turn will enhance our ability to
identify important sex differences associated with their use.
These bioinformatics modernization principles, when applied
to all types of non-clinical and clinical research data, e.g.,
pharmacokinetics, pharmacodynamics, genomics, biomar-
kers, create tremendous opportunities to improve our
understanding of the effects of the medical products that
we all use, so that we can truly personalize health care in the
future. No longer should we be satisfied with the simple
knowledge that a medical product is safe and effective in
194 VOLUME 83 NUMBER 1 | JANUARY 2008 | www.nature.com/cpt
EDUCATION
humans. We must strive to answer the more important
question: is this product safe and effective for this individual,
based on her important genetic and phenotypic markers?
Modernizing our bioinformatics infrastructure will move us
one giant step closer to answering that question.
CONFLICT OF INTEREST
The authors declared no conflict of interest.
& 2008 American Society for Clinical Pharmacology and Therapeutics
1. Lacroix, Z. & Critchlow, T. Bioinformatics: Managing Scientific Data.
(Morgan Kaufman Publishers: San Francisco, 2003).
2. US Food and Drug Administration. FDA’s Critical Path Initiative /http://
www.fda.gov/oc/initiatives/criticalpath/S. Accessed 28 August 2007.
3. Health and Human Services. Health information technology
/http://www.hhs.gov/healthit/S. Accessed 28 August 2007.
4. US Food and Drug Administration. Critical path opportunities list and
report. Topic 3: Harnessing Bioinformatics, R-15 /http://www.fda.gov/
oc/initiatives/criticalpath/reports/opp_report.pdfS. Accessed 28 August
2007 (2006).
5. US General Accounting Office. Drug safety: most drugs withdrawn in
recent years had greater health risks for women /http://www.gao.gov/
new.items/d01286r.pdfS. Accessed 28 August 2007.
6. 107th Congress (2001–2002) 1st Session (FY2002). Senate Committee
Report (S. Rep. 107–41) /http://frwebgate.access.gpo.gov/cgi-bin/
getdoc.cgi?dbname=107_cong_reports&docid=f:sr041.pdfS, 146.
Accessed 28 August 2007.
7. Clinical Data Interchange Standards Consortium /http://
www.cdisc.orgS. Accessed 28 August 2007.
8. US Food and Drug Administration. Study Data Specifications, Version 1.3
/http://www.fda.gov/cder/regulatory/ersr/Studydata.pdfS. Accessed
28 August 2007.
9. Pinnow, E. Evaluation of NDAs submitted to the FDA in CDISC SDTM
format: feasibility of using demographic domain data to track the
inclusion of women in trials /http://www.cdisc.org/publications/
interchange2006/session8/EllenPinnowCDISC2006Pinnow.pdfS.
Accessed 28 August 2007.
10. Health Level Seven /http://www.hl7.orgS. Accessed 28 August
2007.
CLINICAL PHARMACOLOGY & THERAPEUTICS | VOLUME 83 NUMBER 1 | JANUARY 2008 195
EDUCATION
... Improving the accessibility of new and existing data for review and for incorporation into meta-analyses is needed [227]. AE and SAE data stratified by sex should be included in the primary or secondary manuscript and/or available in an appendix to the primary manuscript. ...
... Other novel approaches to improve data availability in women include enhancing the bioinformatics infrastructure to standardize electronic clinical data (Clinical Data Interchange Standards Consortium http://www.cdisc.org) to allow tracking of recruitment, analyzing data across clinical trials that alone do not have sufficient numbers of women for meaningful analyses, and analyzing of adverse events. Recently, the FDA has embarked on an ambitious bioinformatics modernization effort that includes an effort to harmonize with the standards being used in exchange of Electronic Health Records, which may improve our ability to detect important sex differences [227]. ...
Article
Full-text available
In 2001, the Institute of Medicine’s (IOM) report, “Exploring the Biological Contributions to Human Health: Does Sex Matter?” advocated for better understanding of the differences in human diseases between the sexes, with translation of these differences into clinical practice. Sex differences are well documented in the prevalence of cardiovascular (CV) risk factors, the clinical manifestation and incidence of cardiovascular disease (CVD), and the impact of risk factors on outcomes. There are also physiologic and psychosocial factors unique to women that may affect CVD risk, such as issues related to reproduction. The Society for Women’s Health Research (SWHR) CV Network compiled an inventory of sex-specific strategies and methods for the study of women and CV health and disease across the lifespan. References for methods and strategy details are provided to gather and evaluate this information. Some items comprise robust measures; others are in development. To address female-specific CV health and disease in population, physiology, and clinical trial research, data should be collected on reproductive history, psychosocial variables, and other factors that disproportionately affect CVD in women. Variables related to reproductive health include the following: age of menarche, menstrual cycle regularity, hormone levels, oral contraceptive use, pregnancy history/complications, polycystic ovary syndrome (PCOS) components, menopause age, and use and type of menopausal hormone therapy. Other factors that differentially affect women’s CV risk include diabetes mellitus, autoimmune inflammatory disease, and autonomic vasomotor control. Sex differences in aging as well as psychosocial variables such as depression and stress should also be considered. Women are frequently not included/enrolled in mixed-sex CVD studies; when they are included, information on these variables is generally not collected. These omissions limit the ability to determine the role of sex-specific contributors to CV health and disease. Lack of sex-specific knowledge contributes to the CVD health disparities that women face. The purpose of this review is to encourage investigators to consider ways to increase the usefulness of physiological and psychosocial data obtained from clinical populations, in an effort to improve the understanding of sex differences in clinical CVD research and health-care delivery for women and men.
... 20 The Food and Drug Administration recently embarked on a bioinformatics modernization effort that will help to track female participation in clinical trials and assess the safety and effectiveness of new drugs to detect sex differences, if any. 21 Notably, 13% of the studies on which the CVD prevention guidelines for women were based had all-male study populations. Although there are no compelling reasons to believe that the treatments studied in these trials would necessarily behave differently in women than in men, sex-specific differences in the pathophysiology of CVD, in baseline characteristics, and in response to treatment might affect direct translation of these results from men to women. ...
Article
The 2007 American Heart Association guidelines for cardiovascular disease prevention in women drew heavily on results from randomized clinical trials; however, representation of women in trials of cardiovascular disease prevention has not been systematically assessed. We abstracted 156 randomized clinical trials cited by the 2007 women's prevention guidelines to determine female representation over time and by clinical indication, prevention type, location of trial conduct, and funding source. Both women and men were represented in 135 of 156 (86.5%) trials; 20 trials enrolled only men; 1 enrolled only women. Among all trials, the proportion of women increased significantly over time, from 9% in 1970 to 41% in 2006. Considering only trials that enrolled both women and men, female enrollment was 18% in 1970 and increased to 34% in 2006. Female representation was higher in international versus United States-only trials (32.7% versus 26.7%) and primary versus secondary prevention trials (42.6% versus 26.6%). Female enrollment was comparable in government/foundation-funded versus industry-funded trials (31.9% versus 31.5%). Representation of women was highest among trials in hypertension (44%), diabetes (40%), and stroke (38%) and lowest for heart failure (29%), coronary artery disease (25%), and hyperlipidemia (28%). By contrast, women accounted for 53% of all individuals with hypertension, 50% with diabetes, 51% with heart failure, 49% with hyperlipidemia, and 46% with coronary artery disease. Sex-specific results were discussed in only 31% of primary trial publications. Enrollment of women in randomized clinical trials has increased over time but remains low relative to their overall representation in disease populations. Efforts are needed to reach a level of representation that is adequate to ensure evidence-based sex-specific recommendations.
Article
Differences in pathophysiology, diagnosis, and treatment of women with cardiovascular disease compared with men has become a major focus during the past decade. Nevertheless, little attention has focused on improving the quality of healthcare in women compared with other areas of cardiovascular medicine. To address this deficit, Duke University Medical Center convened a national Duke Clinical Research Institute (DCRI) Think Tank meeting, including basic science and clinical researchers, payers, legislators, clinical experts, government regulators, and members of the pharmaceutical and device industries. This report provides an overview of the discussions and proposed solutions. Discussion concentrated on the development of strategies to improve the quality of health care for women with heart disease. Key components to improve quality care include: (1) enhance the quantity and quality of evidence-based medicine to guide care in women through improvements in trial design, enrollment and retention of women subjects, results analysis and reporting, and better incentives to perform research in women; (2) provide incentives to develop better data in women through mandating changes in the drug and device development and approval processes; (3) incorporate specific recommendations for women into guidelines when data are sufficient; and (4) apply proven sex-based differences in risk stratification, diagnostic testing, and drug usage and dosing in clinical care. Examples of possible strategies are included. The above approach represents a necessary, but not sufficient, platform to improve the overall quality of healthcare in women with cardiovascular disease.
Article
Life science data integration and interoperability is one of the most challenging problems facing bioinformatics today. In the current age of the life sciences, investigators have to interpret many types of information from a variety of sources: lab instruments, public databases, gene expression profiles, raw sequence traces, single nucleotide polymorphisms, chemical screening data, proteomic data, putative metabolic pathway models, and many others. Unfortunately, scientists are not currently able to easily identify and access this information because of the variety of semantics, interfaces, and data formats used by the underlying data sources. Bioinformatics: Managing Scientific Data tackles this challenge head-on by discussing the current approaches and variety of systems available to help bioinformaticians with this increasingly complex issue. The heart of the book lies in the collaboration efforts of eight distinct bioinformatics teams that describe their own unique approaches to data integration and interoperability. Each system receives its own chapter where the lead contributors provide precious insight into the specific problems being addressed by the system, why the particular architecture was chosen, and details on the systems strengths and weaknesses. In closing, the editors provide important criteria for evaluating these systems that bioinformatics professionals will find valuable.
Article
Health information technology in Europe is expanding across the continent to link health care providers, patients, and insurance companies. Most European health care is at least partially publicly funded while much health IT infrastructure is privately provided.
Critical path opportunities list and report Topic 3: Harnessing Bioinformatics, R-15
  • Us
  • Drug Food
  • Administration
US Food and Drug Administration. Critical path opportunities list and report. Topic 3: Harnessing Bioinformatics, R-15 /http://www.fda.gov/ oc/initiatives/criticalpath/reports/opp_report.pdfS. Accessed 28 August 2007 (2006).
Evaluation of NDAs submitted to the FDA in CDISC SDTM format: feasibility of using demographic domain data to track the inclusion of women in trials
  • E Pinnow
Pinnow, E. Evaluation of NDAs submitted to the FDA in CDISC SDTM format: feasibility of using demographic domain data to track the inclusion of women in trials /http://www.cdisc.org/publications/ interchange2006/session8/EllenPinnowCDISC2006Pinnow.pdfS. Accessed 28 August 2007.
  • Z Lacroix
  • T Critchlow
Lacroix, Z. & Critchlow, T. Bioinformatics: Managing Scientific Data. (Morgan Kaufman Publishers: San Francisco, 2003).
  • Human Health
  • Services
Health and Human Services. Health information technology /http://www.hhs.gov/healthit/S. Accessed 28 August 2007.
Drug safety: most drugs withdrawn in recent years had greater health risks for women
US General Accounting Office. Drug safety: most drugs withdrawn in recent years had greater health risks for women /http://www.gao.gov/ new.items/d01286r.pdfS. Accessed 28 August 2007.