Académique Documents
Professionnel Documents
Culture Documents
1
moment to moment in response to tens of time and with which it associates and reacts.
thousands of intra- and extracellular envi- Studies to explore protein structure and
ronmental signals. A proteins chemistry and activities, known as proteomics, will be the
behavior are specified by the gene sequence focus of much research for decades to come
and by the number and identities of other and will help elucidate the molecular basis of
proteins made in the same cell at the same health and disease.
2
A Major MilestoneAchieving the
Working Draft Human Genome Sequence
In February 2001, HGP and sequence data, comprising overlap-
Celera Genomics scientists pub- ping fragments totaling 3.9 billion
lished the long-awaited details of bases and providing sevenfold
the working-draft DNA sequence. coverage (sequenced seven times) of
Although the draft is filled with the human genome. Over 30% is
mysteries, the first panoramic view high-quality, finished sequence, with
of the human genetic landscape has eight- to tenfold coverage, 99.99%
revealed a wealth of information and accuracy, and few gaps.
some early surprises. Papers describ-
ing research observations in the journals Nature High-Quality Version
(Feb. 15, 2001) andScience (Feb. 16, 2001) are
freely accessible (see www.ornl.gov/hgmis/ Expected in 2003
project/journals/journals.html). The entire working draft will be finished to
Although clearly not a Holy Grail or high quality by 2003. Coincidentally, that year also
Rosetta Stone for deciphering all of biology will be the 50th anniversary of Watson and Cricks
two early metaphors commonly used to publication of DNA structure that launched the era
describe the coveted prizethe sequence is a of molecular genetics (see www.nature.com/
magnificent and unprecedented resource that genomics/human/watson-crick). Much will
will serve as a basis for research and discovery remain to be deciphered even then. Some
throughout this century and beyond. It will highlights follow from Nature, Science, and The
have diverse practical applications and a pro- Wellcome Trust philanthropy (an HGP funder).
found impact upon how we view ourselves and
our place in the tapestry of life.
One insight already gleaned from the
sequence is that, even on the molecular level, Bioinformatics Boom:
we are more than the sum of our 35,000 or so
genes. Surprisingly, this newly estimated number Managing the Data
of genes is only one-third as great as previously Massive quantities of genomic data and high-
thought and only twice as many as those of a throughput technologies are now enabling studies
tiny transparent worm, although the numbers on a vastly larger scale than ever before, for
may be revised as more computational and example, in monitoring and comparing the activity
experimental analyses are performed. At once of tens of thousands of genes simultaneously in
humbled and intrigued by this finding, scientists cancerous and noncancerous tissue. Advanced
suggest that the genetic key to human com- computational tools and interdisciplinary experts
are needed to capture, represent, store, integrate,
plexity lies not in the number of genes but in
distribute, and analyze all the data.
how gene parts are used to build different
products in a process called alternative splicing. Bioinformatics is the term coined for the new field
Other sources of added complexity are the that merges biology, computer science, and
thousands of chemical modifications made to information technology to manage and analyze
proteins and the repertoire of regulatory the data, with the ultimate goal of understanding
mechanisms controlling these processes. and modeling living systems. Computing and
information demands will continue to rise with the
The draft encompasses 90% of the human
explosive torrent of data from large-scale studies
genomes euchromatic portion, which contains at the molecular, cellular, and whole organism
the most genes. In constructing the working levels (tour of the public DNA sequence database
draft, the 16 genome sequencing centers GenBank: www.ncbi.nlm.nih.gov/Tour/).
produced over 22.1 billion bases of raw
3
What Does the Working Draft Tell Us?
By the Numbers
4
Humans share most of the same protein The ratio of germline (sperm or egg cell)
families with worms, flies, and plants, but the mutations is 2:1 in males vs females.
number of gene family members has Researchers point to several reasons for the
expanded in humans, especially in proteins higher mutation rate in the male germline,
involved in development and immunity. including the greater number of cell divisions
required for sperm formation than for eggs.
The human genome has a much greater
portion (50%) of repeat sequences than the Applications, Future Challenges
mustard weed (11%), the worm (7%), and
the fly (3%). Deriving meaningful knowledge from the
DNA sequence will define research through the
Although humans appear to have stopped coming decades to inform our understanding of
accumulating repetitive DNA over 50 million biological systems. This enormous task will
years ago, there seems to be no such decline require the expertise and creativity of tens of
in rodents. This may account for some of the thousands of scientists from varied disciplines in
fundamental differences between hominids both the public and private sectors worldwide.
and rodents, although estimates of gene The draft sequence already is having an
numbers are similar in both species. Scien- impact on finding genes associated with disease.
tists have proposed many theories to explain Genes have been pinpointed and associated with
evolutionary contrasts between humans and numerous diseases and disorders including breast
other organisms, including those of life span, cancer, muscle disease, deafness, and blindness.
litter sizes, inbreeding, and genetic drift. Additionally, finding the DNA sequences underly-
ing such common diseases as cardiovascular
Variations and Mutations disease, diabetes, arthritis, and cancers is being
Scientists have identified about 1.4 million aided by the human SNP maps generated in the
locations where single-base DNA differences HGP in cooperation with the private sector. These
(SNPs, see Sequence Variation, p. 11) occur genes and SNPs provide focused targets for the
in humans. This information promises to development of effective new therapies.
revolutionize the processes of finding chro- One of the greatest impacts of having the
mosomal locations for disease-associated sequence may well be in enabling an entirely
sequences and tracing human history. new approach to biological research. In the past,
researchers studied one or a few genes at a time.
With whole-genome sequences and new auto-
More on the Working Draft mated, high-throughput technologies, they can
Papers from Science and Nature approach questions systematically and on a
www.ornl.gov/hgmis/project/journals/ grand scale. They can study all the genes in a
journals.html genome, for example, or all the gene products
Sequence Access Sites in a particular tissue or organ or tumor, or how
www.ornl.gov/hgmis/project/journals/ tens of thousands of genes and proteins work
sequencesites.html together in interconnected networks to orches-
trate the chemistry of life.
5
After the HGP, the Next Steps . . .
6
Medicine and the New Genetics:
Gene Testing, Pharmacogenomics, Gene Therapy
DNA underlies every aspect of colon cancers. Although they have
our health, both in function and limitations, these tests sometimes
dysfunction. Obtaining a are used to make risk estimates
detailed picture of how in presymptomatic individuals
genes and other DNA with a family history of the
sequences function together disorder. One potential benefit
and interact with environmental to using these gene tests is that
factors ultimately will lead to the they could provide information that
discovery of pathways involved in helps physicians and patients manage
normal processes and in disease the disease or condition more effec-
pathogenesis. Such knowledge will tively. Regular colonoscopies for those
have a profound impact on the way having mutations associated with
disorders are diagnosed, treated, colon cancer, for instance, could
and prevented and will bring about prevent thousands of deaths each year.
revolutionary changes in clinical and Some scientific limitations are that
public health practice. Some of these the tests may not detect every muta-
transformative developments are tion associated with a particular condi-
described below. tion (many are as yet undiscovered),
and the ones they do detect may
present different risks to different
Gene Tests people and populations. Another
DNA-based tests are among the important consideration in gene testing
first commercial medical applications is the lack of effective treatments or
of the new genetic discoveries. Gene preventive measures for many diseases
tests can be used to diagnose disease, and conditions now being diagnosed or
confirm a diagnosis, provide prognostic predicted.
information about the course of disease, con- Revealing information about risk of future
firm the existence of a disease in asymptomatic disease can have significant emotional and
individuals, and, with varying degrees of psychological effects as well. Moreover, the
accuracy, predict the risk of future disease in absence of privacy and antidiscrimination legal
healthy individuals or their progeny. protections can lead to discrimination in
Currently, several hundred genetic tests employment or insurance or other misuse of
are in clinical use, with many more under personal genetic information. Additionally,
development, and their numbers and varieties because genetic tests reveal information about
are expected to increase rapidly over the next individuals and their families, test results can
decade. Most current tests detect mutations affect family dynamics. Results also can pose
associated with rare genetic disorders that risks for population groups if they lead to
follow Mendelian inheritance patterns. These group stigmatization.
include myotonic and Duchenne muscular Other issues related to gene tests include
dystrophies, cystic fibrosis, neurofibromatosis their effective introduction into clinical practice,
type 1, sickle cell anemia, and Huntingtons the regulation of laboratory quality assurance,
disease. the availability of testing for rare diseases, and
Recently, tests have been developed to the education of healthcare providers and
detect mutations for a handful of more com- patients about correct interpretation and
plex conditions such as breast, ovarian, and attendant risks.
7
side effects than many current medicines.
Pharmacogenomics: Moving Ideally, the new genomic drugs could be given
Away from One-Size-Fits-All earlier in the disease process. As knowledge
becomes available to select patients most likely
Therapeutics to benefit from a potential drug,
Within the next decade, researchers will pharmacogenomics will speed the design of
begin to correlate DNA variants with individual clinical trials to bring the drugs to market
responses to medical treatments, identify particu- sooner.
lar subgroups of patients, and develop drugs
customized for those populations. The disci- Gene Therapy, Enhancement
pline that blends pharmacology with genomic
The potential for using genes themselves
capabilities is called pharmacogenomics.
to treat disease or enhance particular traits has
More than 100,000 people die each year
captured the imagination of the public and the
as the result of adverse responses to medications
biomedical community. This largely experimen-
that are beneficial to others. Another 2.2 mil-
tal fieldgene transfer or gene therapyholds
lion experience serious reactions, while potential for treating or even curing such
others fail to respond at all. DNA variants in genetic and acquired diseases as cancers and
genes involved in drug metabolism, particularly AIDS by using normal genes to supplement or
the cytochrome P450 multigene family, are the replace defective genes or bolster a normal
focus of much current research in this area. function such as immunity.
Enzymes encoded by these genes are respon- More than 500 clinical gene-therapy trials
sible for metabolizing most drugs used today, involving about 3500 patients have been
including many for treating psychiatric, neuro- identified worldwide (June 2001). The vast
logical, and cardiovascular diseases. Enzyme majority (78%) take place in the United States,
function affects patients responses to both the followed by Europe (18%). Most trials focus on
drug and the dose. Future advances will enable various types of cancer, although monogenic,
rapid testing to determine the patients geno- infectious, vascular, and other multigenic
type and drastically reduce hospitalization diseases are being studied as well. Protocols
resulting from adverse reactions. generally are aimed at establishing the
Genomic data and technologies safety of gene-delivery procedures rather
also are expected to make drug devel- than effectiveness, and no cures as yet
opment faster, cheaper, and more can be attributed to these trials.
effective. Most drugs today are based on Gene transfer still faces many scien-
about 500 molecular targets; genomic tific obstacles before it can become a
knowledge of the genes involved in practical approach for treating
diseases, disease pathways, and disease. According to the American
drug-response sites will lead to Society of Human Genetics State-
the discovery of thousands of new ment on Gene Therapy, effective
targets. New drugs, aimed at progress will be achieved only
specific sites in the body and at through continued rigorous
particular biochemical events research on the most fundamental
leading to mechanisms underlying gene
disease, probably will cause fewer delivery and gene expression in
animals.
8
Other Anticipated Benefits of Genetic Research
9
Societal Concerns Arising from the New Genetics
Since its inception, the Human Genome Uncertainties associated with gene tests
Project has dedicated funds toward studying for susceptibilities and complex conditions
the ethical, legal, and social issues (ELSI) sur- (e.g., heart disease, diabetes, and Alzheimers
rounding the availability of the new data and disease). Should testing be performed when no
capabilities. Examples of such issues follow. treatment is available or when interpretation is
unsure? Should children be tested for suscepti-
Privacy and confidentiality of genetic
bility to adult-onset diseases?
information. Who owns and controls genetic
information? Is genetic privacy different from Conceptual and philosophical implications
medical privacy? regarding human responsibility, free will vs
genetic determinism, and concepts of health
Fairness in the use of genetic information
and disease. Do our genes influence our behav-
by insurers, employers, courts, schools,
ior, and can we control it? What is considered
adoption agencies, and the military, among
acceptable diversity? Where is the line drawn
others. Who should have access to personal
between medical treatment and enhancement?
genetic information, and how will it be used?
Psychological impact, stigmatization, and Health and environmental issues concern-
discrimination due to an individuals genetic ing genetically modified (GM) foods and
differences. How does personal genetic informa- microbes. Are GM foods and other products safe
tion affect self-identity and societys perceptions? for humans and the environment? How will
these technologies affect developing nations
Reproductive issues including adequate and dependence on industrialized nations?
informed consent and the use of genetic
information in reproductive decision making. Commercialization of products including
Do healthcare personnel properly counsel property rights (patents, copyrights, and
parents about risks and limitations? What are trade secrets) and accessibility of data and
the larger societal issues raised by new repro- materials. Will patenting DNA sequences limit
ductive technologies? their accessibility and development into useful
products?
Clinical issues including the education
about capabilities, limitations, and social
risksof doctors and other health-service
providers, people identified with genetic
conditions, and the general public; and the
implementation of standards and quality
control measures. How do we prepare health
professionals for the new genetics? How do we
prepare the public to make informed choices?
How will genetic tests be evaluated and regu-
lated for accuracy, reliability, and usefulness?
(Currently, there is little regulation at the
federal level.) How do we as a society balance
current scientific limitations and social risk with
long-term benefits?
Fairness in access to advanced genomic
technologies. Who will benefit? Will there be
major worldwide inequities?
10
Human Genome Project Goals 19982003
Human DNA Sequencing clones and sequences for human and model-
The HGPs continued emphasis is on obtaining organism genes. Other functional-genomics goals
by 2003 a complete and highly accurate reference include studies into gene expression and control
sequence (1 error in 10,000 bases) that is largely and the development of experimental and compu-
continuous across each human chromosome. tational methods for understanding gene function.
Scientists believe that knowing this sequence is Comparative Genomics
critically important for understanding human The functions of human genes and other
biology and for applications to other fields. DNA regions often are revealed by studying their
A working draft of the sequence was parallels in nonhumans. HGP researchers have
completed 18 months ahead of schedule, in June obtained complete genomic sequences for the
2000. The achievement has provided scientists bacterium Escherichia coli, the yeast Saccharomyces
worldwide with a road map to an estimated 90% cerevisiae, the fruit fly Drosophila melanogaster, and
of genes on every chromosome. Although the draft the roundworm Caenorhabditis elegans. Sequenc-
contains gaps and errors and does not yet meet ing continues on the laboratory mouse. The
the criterion of 1 error in 10,000 bases outlined availability of complete genome sequences gener-
above, it provides a valuable scaffold for generating ated both inside and outside the HGP is driving a
a high-quality reference genome sequence. HGP major breakthrough in fundamental biology as
scientists make human DNA sequence available scientists compare entire genomes to gain new
broadly, rapidly, and free of charge via the Web. insights into evolutionary, biochemical, genetic,
Sequencing Technology metabolic, and physiological pathways.
Although current sequencing capacity is far Ethical, Legal, and Social Implications (ELSI)
greater than at the inception of the HGP, further Rapid advances in the science of genetics and
incremental progress in sequencing technologies, its applications present new and complex ethical
efficiency, and cost-reduction are needed. For and policy issues for individuals and society. ELSI
future sequencing applications, planners emphasize programs that identify and address these implica-
the importance of supporting novel technologies tions have been an integral part of the U.S. HGP
that may be 5 to 10 years in development. since its inception. These programs have resulted
Sequence Variation in a body of work that promotes education and helps
Although more than 99% of human DNA guide the conduct of genetic research and the
sequences are the same across the population, development of related medical and public policies.
variations in DNA sequence can have a major Bioinformatics and Computational Biology
impact on how humans respond to disease; to such Continued investment in current and new
environmental insults as bacteria, viruses, toxins, databases and analytical tools is critical to the
and chemicals; and to drugs and other therapies. success of the HGP and to the future usefulness of
Methods are being developed to detect different the data it produces. Databases must adapt to the
types of variation, particularly the most common evolving needs of the scientific community and
type called single-nucleotide polymorphisms (SNPs), must allow queries to be answered easily. Planners
which occur about once every 100 to 300 bases. suggest developing a human genome database,
Scientists believe SNP maps will help them identify analogous to model organism databases, that will
the multiple genes associated with such complex link to phenotypic information. Also needed are
diseases as cancer, diabetes, vascular disease, and databases and analytical tools for studying the
some forms of mental illness. These associations are expanding body of gene-expression and functional
difficult to establish with conventional gene- data, for modeling complex biological networks
hunting methods because a single altered gene and interactions, and for collecting and analyzing
may make only a small contribution to disease risk. sequence-variation data.
Functional Genomics Training
Efficient interpretation of the functions of Future genome scientists will require training in
human genes and other DNA sequences requires interdisciplinary areas including biology, computer
that strategies be developed to enable large-scale science, engineering, mathematics, physics, and
investigations across whole genomes. A first priority chemistry. Also, scientists with management skills will
is to generate complete sets of full-length cDNA be needed for leading large data-production efforts.
11
For More Information
12