Vous êtes sur la page 1sur 12

Genomics and Its Impact

on Medicine and Society


A 2001 Primer
U.S. Department of Energy Human Genome Program: www.ornl.gov/hgmis

The Basics DNA in the human genome is arranged


into 24 distinct chromosomesphysically
Cells are the fundamental working units separate molecules that range in length from
of every living system. All the instructions needed about 50 million to 250 million base pairs. A
to direct their activities are contained within few types of major chromosomal abnormalities,
the chemical DNA (deoxyribonucleic acid). including missing or extra copies or gross
DNA from all organisms is made up of breaks and rejoinings (translocations), can be
the same chemical and physical components. detected by microscopic examination. Most
The DNA sequence is the particular side-by- changes in DNA, however, are more subtle and
side arrangement of bases along the DNA require a closer analysis of the DNA molecule
strand (e.g., ATTCCGGA). This order spells out to find perhaps single-base differences.
the exact instructions required to create a Each chromosome contains many genes,
particular organism with its own unique traits. the basic physical and functional units of
The genome is an organisms complete heredity. Genes are specific sequences of bases
set of DNA. Genomes vary widely in size: the that encode instructions on how to make
smallest known genome for a free-living organ- proteins. Genes comprise only about 2% of the
ism (a bacterium) contains about 600,000 human genome; the remainder consists of
DNA base pairs, while human and mouse noncoding regions, whose functions may
genomes have some 3 billion. Except for mature include providing chromosomal structural
red blood cells, all human cells contain a com- integrity and regulating where, when, and in
plete genome. what quantity proteins are made. The human
genome is estimated to contain
30,000 to 40,000 genes.
Although genes get a lot of
attention, its the proteins that
perform most life functions and
even make up the majority of
cellular structures. Proteins are
large, complex molecules made
up of smaller subunits called
amino acids. Chemical properties
that distinguish the 20 different
amino acids cause the protein
chains to fold up into specific
three-dimensional structures that
define their particular functions in
the cell.
The constellation of all
proteins in a cell is called its
proteome. Unlike the relatively
unchanging genome, the
dynamic proteome changes from

1
moment to moment in response to tens of time and with which it associates and reacts.
thousands of intra- and extracellular envi- Studies to explore protein structure and
ronmental signals. A proteins chemistry and activities, known as proteomics, will be the
behavior are specified by the gene sequence focus of much research for decades to come
and by the number and identities of other and will help elucidate the molecular basis of
proteins made in the same cell at the same health and disease.

The Human Genome Project


A Little Bit of History
Though surprising to many, the Human
Genome Project (HGP) traces its roots to an initia-
. . . and Exciting Progress
tive in the U.S. Department of Energy (DOE). Since Although the HGP originally was planned
1945, DOE and its predecessor agencies have been to last 15 years, rapid technological advances
charged by Congress to develop new energy and worldwide participation have accelerated
resources and technologies and to pursue a deeper the expected completion date to 2003. In
understanding of potential health and environ- June 2000, scientists announced biologys most
mental risks posed by their production and use. stunning achievement: the generation of a
Such studies have since provided the scientific basis working draft sequence of the entire human
for individual risk assessments, for example, of genome. In addition to serving as a scaffold for
nuclear medicine technologies. the finished version, the draft provides a road
In 1986, DOE took a bold step in announc- map to an estimated 90% of genes on every
ing its Human Genome Initiative, convinced that chromosome and already has enabled gene
DOEs missions would be well served by a refer- hunters to pinpoint genes associated with more
ence human genome sequence. Shortly there- than 30 disorders.
after, DOE and the National Institutes of Health HGP resources have spurred a boom in
developed a plan for a joint HGP that officially spin-off sequencing programs on the human
began in 1990. and other genomes in both the private and
public sectors. To stimulate further research, all
Ambitious Goals . . . data generated in the public sector are made
available rapidly and free of charge via the Web.
From the outset, the HGPs ultimate goal
has been to generate a high-quality reference
sequence for the entire human genome and
identify all human genes. Other important goals
HGP Spinoff Projects
are to sequence the genomes of model organ- Microbial Genome Project
isms to help interpret human DNA, enhance www.sc.doe.gov/ober/microbial.html
computational resources to support future www.ornl.gov/microbialgenomes
research and commercial applications, and Microbial Cell Project
explore gene function through mouse-human microbialcellproject.org
comparisons. Potential applications are numer- Genomes to Life
ous and include customized medicines, doegenomestolife.org
improved agriculture products, new energy Environmental Genome Project
resources, and tools for environmental cleanup. www.niehs.nih.gov/envgenom/home.htm
The HGP also aims to train future scientists, Cancer Genome Anatomy Project
study human variation, and address critical www.ncbi.nlm.nih.gov/ncicgap
societal issues arising from the increased avail-
SNP Consortium
ability of personal human genome data and
snp.cshl.org
related analytical technologies.

2
A Major MilestoneAchieving the
Working Draft Human Genome Sequence
In February 2001, HGP and sequence data, comprising overlap-
Celera Genomics scientists pub- ping fragments totaling 3.9 billion
lished the long-awaited details of bases and providing sevenfold
the working-draft DNA sequence. coverage (sequenced seven times) of
Although the draft is filled with the human genome. Over 30% is
mysteries, the first panoramic view high-quality, finished sequence, with
of the human genetic landscape has eight- to tenfold coverage, 99.99%
revealed a wealth of information and accuracy, and few gaps.
some early surprises. Papers describ-
ing research observations in the journals Nature High-Quality Version
(Feb. 15, 2001) andScience (Feb. 16, 2001) are
freely accessible (see www.ornl.gov/hgmis/ Expected in 2003
project/journals/journals.html). The entire working draft will be finished to
Although clearly not a Holy Grail or high quality by 2003. Coincidentally, that year also
Rosetta Stone for deciphering all of biology will be the 50th anniversary of Watson and Cricks
two early metaphors commonly used to publication of DNA structure that launched the era
describe the coveted prizethe sequence is a of molecular genetics (see www.nature.com/
magnificent and unprecedented resource that genomics/human/watson-crick). Much will
will serve as a basis for research and discovery remain to be deciphered even then. Some
throughout this century and beyond. It will highlights follow from Nature, Science, and The
have diverse practical applications and a pro- Wellcome Trust philanthropy (an HGP funder).
found impact upon how we view ourselves and
our place in the tapestry of life.
One insight already gleaned from the
sequence is that, even on the molecular level, Bioinformatics Boom:
we are more than the sum of our 35,000 or so
genes. Surprisingly, this newly estimated number Managing the Data
of genes is only one-third as great as previously Massive quantities of genomic data and high-
thought and only twice as many as those of a throughput technologies are now enabling studies
tiny transparent worm, although the numbers on a vastly larger scale than ever before, for
may be revised as more computational and example, in monitoring and comparing the activity
experimental analyses are performed. At once of tens of thousands of genes simultaneously in
humbled and intrigued by this finding, scientists cancerous and noncancerous tissue. Advanced
suggest that the genetic key to human com- computational tools and interdisciplinary experts
are needed to capture, represent, store, integrate,
plexity lies not in the number of genes but in
distribute, and analyze all the data.
how gene parts are used to build different
products in a process called alternative splicing. Bioinformatics is the term coined for the new field
Other sources of added complexity are the that merges biology, computer science, and
thousands of chemical modifications made to information technology to manage and analyze
proteins and the repertoire of regulatory the data, with the ultimate goal of understanding
mechanisms controlling these processes. and modeling living systems. Computing and
information demands will continue to rise with the
The draft encompasses 90% of the human
explosive torrent of data from large-scale studies
genomes euchromatic portion, which contains at the molecular, cellular, and whole organism
the most genes. In constructing the working levels (tour of the public DNA sequence database
draft, the 16 genome sequencing centers GenBank: www.ncbi.nlm.nih.gov/Tour/).
produced over 22.1 billion bases of raw

3
What Does the Working Draft Tell Us?
By the Numbers

The human genome contains 3164.7 million How Its Arranged


chemical nucleotide bases (A, C, T, and G). The human genomes gene-dense urban
The average gene consists of 3000 bases, but centers are predominantly composed of the
sizes vary greatly, with the largest known human DNA building blocks G and C.
gene being dystrophin at 2.4 million bases. In contrast, the gene-poor deserts are rich
The total number of genes is estimated at in the DNA building blocks A and T. GC- and
30,000 to 40,000, much lower than previous AT-rich regions usually can be seen through a
estimates of 80,000 to 140,000 that had been microscope as light and dark bands on the
based on extrapolations from gene-rich areas chromosomes.
as opposed to a composite of gene-rich and Genes appear to be concentrated in random
gene-poor areas. areas along the genome, with vast expanses
The order of almost all (99.9%) nucleotide of noncoding DNA between.
bases is exactly the same in all people. Stretches of up to 30,000 C and G bases
The functions are unknown for more than repeating over and over often occur adja-
50% of discovered genes. cent to gene-rich areas, forming a barrier
between the genes and the junk DNA.
The Wheat from the Chaff These CpG islands are believed to help
regulate gene activity.
About 2% of the genome encodes instruc-
tions for the synthesis of proteins. Chromosome 1 has the most genes (2968),
and the Y chromosome has the fewest (231).
Repeated sequences that do not code for
proteins (junk DNA) make up at least 50%
How the Human Genome Compares with
of the human genome.
Those of Other Organisms
Repetitive sequences are thought to have no Unlike the humans seemingly random
direct functions, but they shed light on chro- distribution of gene-rich areas, many other
mosome structure and dynamics. Over time, organisms genomes are more uniform, with
these repeats reshape the genome by rearrang- genes evenly spaced throughout.
ing it, thereby creating entirely new genes or
modifying and reshuffling existing genes. Humans have on average three times as
many kinds of proteins as the fly or worm
During the past 50 million years, a dramatic because of mRNA transcript alternative
decrease seems to have occurred in the rate of splicing and chemical modifications to the
accumulation of these repeats. proteins. This process can yield different
protein products from the same gene.

4
Humans share most of the same protein The ratio of germline (sperm or egg cell)
families with worms, flies, and plants, but the mutations is 2:1 in males vs females.
number of gene family members has Researchers point to several reasons for the
expanded in humans, especially in proteins higher mutation rate in the male germline,
involved in development and immunity. including the greater number of cell divisions
required for sperm formation than for eggs.
The human genome has a much greater
portion (50%) of repeat sequences than the Applications, Future Challenges
mustard weed (11%), the worm (7%), and
the fly (3%). Deriving meaningful knowledge from the
DNA sequence will define research through the
Although humans appear to have stopped coming decades to inform our understanding of
accumulating repetitive DNA over 50 million biological systems. This enormous task will
years ago, there seems to be no such decline require the expertise and creativity of tens of
in rodents. This may account for some of the thousands of scientists from varied disciplines in
fundamental differences between hominids both the public and private sectors worldwide.
and rodents, although estimates of gene The draft sequence already is having an
numbers are similar in both species. Scien- impact on finding genes associated with disease.
tists have proposed many theories to explain Genes have been pinpointed and associated with
evolutionary contrasts between humans and numerous diseases and disorders including breast
other organisms, including those of life span, cancer, muscle disease, deafness, and blindness.
litter sizes, inbreeding, and genetic drift. Additionally, finding the DNA sequences underly-
ing such common diseases as cardiovascular
Variations and Mutations disease, diabetes, arthritis, and cancers is being
Scientists have identified about 1.4 million aided by the human SNP maps generated in the
locations where single-base DNA differences HGP in cooperation with the private sector. These
(SNPs, see Sequence Variation, p. 11) occur genes and SNPs provide focused targets for the
in humans. This information promises to development of effective new therapies.
revolutionize the processes of finding chro- One of the greatest impacts of having the
mosomal locations for disease-associated sequence may well be in enabling an entirely
sequences and tracing human history. new approach to biological research. In the past,
researchers studied one or a few genes at a time.
With whole-genome sequences and new auto-
More on the Working Draft mated, high-throughput technologies, they can
Papers from Science and Nature approach questions systematically and on a
www.ornl.gov/hgmis/project/journals/ grand scale. They can study all the genes in a
journals.html genome, for example, or all the gene products
Sequence Access Sites in a particular tissue or organ or tumor, or how
www.ornl.gov/hgmis/project/journals/ tens of thousands of genes and proteins work
sequencesites.html together in interconnected networks to orches-
trate the chemistry of life.

5
After the HGP, the Next Steps . . .

The words of Winston Churchill, spoken in Proteomicsthe study of protein expression


1942 after 3 years of war, capture well the HGP and functioncan bring researchers closer
era: Now this is not the end. It is not even the than gene-expression studies to whats
beginning of the end. But it is, perhaps, the actually happening in the cell.
end of the beginning.
The avalanche of genome data grows Structural genomics initiatives are being
daily. The new challenge will be to use this vast launched worldwide to generate the 3-D
reservoir of data to explore how DNA and structures of one or more proteins from each
proteins work with each other and the environ- protein family, thus offering clues to their
ment to create complex, dynamic living sys- function and providing biological targets for
tems. Systematic studies of function on a grand drug design.
scalefunctional genomicswill be the focus Knockout studies are one experimental
of biological explorations in this century and method for understanding the function of
beyond. These explorations will encompass DNA sequences and the proteins they encode.
studies in transcriptomics, proteomics, struc- Researchers inactivate genes in living organ-
tural genomics, new experimental methodolo- isms and monitor any changes that could
gies, and comparative genomics. reveal the function of specific genes.
Transcriptomics involves large-scale analysis Comparative genomicsanalyzing DNA
of messenger RNAs (molecules that are sequence patterns of humans and well-
transcribed from active genes) to determine studied model organisms side by sidehas
when, where, and under what conditions become one of the most powerful strategies
genes are expressed. for identifying human genes and interpreting
their function.

oday, Future applications of this knowledge promise far-


T scientists
have in
reaching benefits to the nation:
Independence from foreign oil
hand the com-
Significant savings in toxic waste cleanup and
plete DNA
disposal
sequences of
genomes for Stabilization of atmospheric carbon dioxide to
many organisms counter global warming
from microbes to mice to humans. For the first Enhanced biowarfare agent detection and response
time, we can begin to explore the operating Genomes to life will build on the Human Genome
systems of life written into these genetic codes Project, both by exploiting its data and by extending
and put them to use. At the leading edge of this the whole-genome approach to biology to the next
great scientific frontier is the DOE Genomes to Life levelattaining a comprehensive understanding of
program, whose overarching goal is to use these how entire biological systems work. This will lead to
biological tools to target critical mission challenges models that enable scientists to understand living
in energy independence, global climate change, systems well enough to predict their behavior under
toxic waste cleanup, and human health protection. different environmental conditions. Applications of
this level of understanding will be revolutionary.

6
Medicine and the New Genetics:
Gene Testing, Pharmacogenomics, Gene Therapy
DNA underlies every aspect of colon cancers. Although they have
our health, both in function and limitations, these tests sometimes
dysfunction. Obtaining a are used to make risk estimates
detailed picture of how in presymptomatic individuals
genes and other DNA with a family history of the
sequences function together disorder. One potential benefit
and interact with environmental to using these gene tests is that
factors ultimately will lead to the they could provide information that
discovery of pathways involved in helps physicians and patients manage
normal processes and in disease the disease or condition more effec-
pathogenesis. Such knowledge will tively. Regular colonoscopies for those
have a profound impact on the way having mutations associated with
disorders are diagnosed, treated, colon cancer, for instance, could
and prevented and will bring about prevent thousands of deaths each year.
revolutionary changes in clinical and Some scientific limitations are that
public health practice. Some of these the tests may not detect every muta-
transformative developments are tion associated with a particular condi-
described below. tion (many are as yet undiscovered),
and the ones they do detect may
present different risks to different
Gene Tests people and populations. Another
DNA-based tests are among the important consideration in gene testing
first commercial medical applications is the lack of effective treatments or
of the new genetic discoveries. Gene preventive measures for many diseases
tests can be used to diagnose disease, and conditions now being diagnosed or
confirm a diagnosis, provide prognostic predicted.
information about the course of disease, con- Revealing information about risk of future
firm the existence of a disease in asymptomatic disease can have significant emotional and
individuals, and, with varying degrees of psychological effects as well. Moreover, the
accuracy, predict the risk of future disease in absence of privacy and antidiscrimination legal
healthy individuals or their progeny. protections can lead to discrimination in
Currently, several hundred genetic tests employment or insurance or other misuse of
are in clinical use, with many more under personal genetic information. Additionally,
development, and their numbers and varieties because genetic tests reveal information about
are expected to increase rapidly over the next individuals and their families, test results can
decade. Most current tests detect mutations affect family dynamics. Results also can pose
associated with rare genetic disorders that risks for population groups if they lead to
follow Mendelian inheritance patterns. These group stigmatization.
include myotonic and Duchenne muscular Other issues related to gene tests include
dystrophies, cystic fibrosis, neurofibromatosis their effective introduction into clinical practice,
type 1, sickle cell anemia, and Huntingtons the regulation of laboratory quality assurance,
disease. the availability of testing for rare diseases, and
Recently, tests have been developed to the education of healthcare providers and
detect mutations for a handful of more com- patients about correct interpretation and
plex conditions such as breast, ovarian, and attendant risks.

7
side effects than many current medicines.
Pharmacogenomics: Moving Ideally, the new genomic drugs could be given
Away from One-Size-Fits-All earlier in the disease process. As knowledge
becomes available to select patients most likely
Therapeutics to benefit from a potential drug,
Within the next decade, researchers will pharmacogenomics will speed the design of
begin to correlate DNA variants with individual clinical trials to bring the drugs to market
responses to medical treatments, identify particu- sooner.
lar subgroups of patients, and develop drugs
customized for those populations. The disci- Gene Therapy, Enhancement
pline that blends pharmacology with genomic
The potential for using genes themselves
capabilities is called pharmacogenomics.
to treat disease or enhance particular traits has
More than 100,000 people die each year
captured the imagination of the public and the
as the result of adverse responses to medications
biomedical community. This largely experimen-
that are beneficial to others. Another 2.2 mil-
tal fieldgene transfer or gene therapyholds
lion experience serious reactions, while potential for treating or even curing such
others fail to respond at all. DNA variants in genetic and acquired diseases as cancers and
genes involved in drug metabolism, particularly AIDS by using normal genes to supplement or
the cytochrome P450 multigene family, are the replace defective genes or bolster a normal
focus of much current research in this area. function such as immunity.
Enzymes encoded by these genes are respon- More than 500 clinical gene-therapy trials
sible for metabolizing most drugs used today, involving about 3500 patients have been
including many for treating psychiatric, neuro- identified worldwide (June 2001). The vast
logical, and cardiovascular diseases. Enzyme majority (78%) take place in the United States,
function affects patients responses to both the followed by Europe (18%). Most trials focus on
drug and the dose. Future advances will enable various types of cancer, although monogenic,
rapid testing to determine the patients geno- infectious, vascular, and other multigenic
type and drastically reduce hospitalization diseases are being studied as well. Protocols
resulting from adverse reactions. generally are aimed at establishing the
Genomic data and technologies safety of gene-delivery procedures rather
also are expected to make drug devel- than effectiveness, and no cures as yet
opment faster, cheaper, and more can be attributed to these trials.
effective. Most drugs today are based on Gene transfer still faces many scien-
about 500 molecular targets; genomic tific obstacles before it can become a
knowledge of the genes involved in practical approach for treating
diseases, disease pathways, and disease. According to the American
drug-response sites will lead to Society of Human Genetics State-
the discovery of thousands of new ment on Gene Therapy, effective
targets. New drugs, aimed at progress will be achieved only
specific sites in the body and at through continued rigorous
particular biochemical events research on the most fundamental
leading to mechanisms underlying gene
disease, probably will cause fewer delivery and gene expression in
animals.

8
Other Anticipated Benefits of Genetic Research

Rapid progress in genome science and a Bioarchaeology, Anthropology, Evolution,


glimpse into its potential applications have and Human Migration
spurred observers to predict that biology will Study evolution through germline mutations
be the foremost science of the 21st century. in lineages
Technology and resources generated by the
Human Genome Project and other genomic Study migration of different population
research already are having major impacts on groups based on maternal genetic inheritance
research across the life sciences. Doubling in Study mutations on the Y chromosome to trace
size between 1993 and 1999, according to lineage and migration of males
Ernst & Young,1 the biotechnology industry Compare breakpoints in the evolution of
generated 151,000 direct jobs and 287,000 mutations with ages of populations and
indirect jobs. Revenues totaled $20 billion historical events
directly and $27 billion indirectly.
DNA Identification
Some Current and Potential Identify potential suspects whose DNA may
Applications of Genome match evidence left at crime scenes

Research Exonerate persons wrongly accused of crimes


Identify crime, catastrophe, and other victims
Molecular Medicine
Establish paternity and other family relationships
Improve diagnosis of disease
Identify endangered and protected species as
Detect genetic predispositions to disease
an aid to wildlife officials (could be used for
Create drugs based on molecular information prosecuting poachers)
Use gene therapy and control systems as drugs Detect bacteria and other organisms that may
Design custom drugs based on individual pollute air, water, soil, and food
genetic profiles Match organ donors with recipients in
Microbial Genomics transplant programs
Rapidly detect and treat pathogens (disease- Determine pedigree for seed or livestock breeds
causing microbes) in clinical practice Authenticate consumables such as caviar and
Develop new energy sources (biofuels) wine
Monitor environments to detect pollutants Agriculture, Livestock Breeding, and
Protect citizenry from biological and chemi- Bioprocessing
cal warfare Grow disease-, insect-, and drought-resistant
crops
Clean up toxic waste safely and efficiently
Breed healthier, more productive, disease-
Risk Assessment resistant farm animals
Evaluate the health risks faced by individuals
Grow more nutritious produce
who may be exposed to radiation (including
low levels in industrial areas) and to cancer- Develop biopesticides
causing chemicals and toxins Incorporate edible vaccines into food products
1. The Economic Contributions of the Biotechnology Develop new environmental cleanup uses for
Industry to the U.S. Economy, May 2000, Ernst & Young. plants like tobacco

9
Societal Concerns Arising from the New Genetics

Since its inception, the Human Genome Uncertainties associated with gene tests
Project has dedicated funds toward studying for susceptibilities and complex conditions
the ethical, legal, and social issues (ELSI) sur- (e.g., heart disease, diabetes, and Alzheimers
rounding the availability of the new data and disease). Should testing be performed when no
capabilities. Examples of such issues follow. treatment is available or when interpretation is
unsure? Should children be tested for suscepti-
Privacy and confidentiality of genetic
bility to adult-onset diseases?
information. Who owns and controls genetic
information? Is genetic privacy different from Conceptual and philosophical implications
medical privacy? regarding human responsibility, free will vs
genetic determinism, and concepts of health
Fairness in the use of genetic information
and disease. Do our genes influence our behav-
by insurers, employers, courts, schools,
ior, and can we control it? What is considered
adoption agencies, and the military, among
acceptable diversity? Where is the line drawn
others. Who should have access to personal
between medical treatment and enhancement?
genetic information, and how will it be used?
Psychological impact, stigmatization, and Health and environmental issues concern-
discrimination due to an individuals genetic ing genetically modified (GM) foods and
differences. How does personal genetic informa- microbes. Are GM foods and other products safe
tion affect self-identity and societys perceptions? for humans and the environment? How will
these technologies affect developing nations
Reproductive issues including adequate and dependence on industrialized nations?
informed consent and the use of genetic
information in reproductive decision making. Commercialization of products including
Do healthcare personnel properly counsel property rights (patents, copyrights, and
parents about risks and limitations? What are trade secrets) and accessibility of data and
the larger societal issues raised by new repro- materials. Will patenting DNA sequences limit
ductive technologies? their accessibility and development into useful
products?
Clinical issues including the education
about capabilities, limitations, and social
risksof doctors and other health-service
providers, people identified with genetic
conditions, and the general public; and the
implementation of standards and quality
control measures. How do we prepare health
professionals for the new genetics? How do we
prepare the public to make informed choices?
How will genetic tests be evaluated and regu-
lated for accuracy, reliability, and usefulness?
(Currently, there is little regulation at the
federal level.) How do we as a society balance
current scientific limitations and social risk with
long-term benefits?
Fairness in access to advanced genomic
technologies. Who will benefit? Will there be
major worldwide inequities?

10
Human Genome Project Goals 19982003
Human DNA Sequencing clones and sequences for human and model-
The HGPs continued emphasis is on obtaining organism genes. Other functional-genomics goals
by 2003 a complete and highly accurate reference include studies into gene expression and control
sequence (1 error in 10,000 bases) that is largely and the development of experimental and compu-
continuous across each human chromosome. tational methods for understanding gene function.
Scientists believe that knowing this sequence is Comparative Genomics
critically important for understanding human The functions of human genes and other
biology and for applications to other fields. DNA regions often are revealed by studying their
A working draft of the sequence was parallels in nonhumans. HGP researchers have
completed 18 months ahead of schedule, in June obtained complete genomic sequences for the
2000. The achievement has provided scientists bacterium Escherichia coli, the yeast Saccharomyces
worldwide with a road map to an estimated 90% cerevisiae, the fruit fly Drosophila melanogaster, and
of genes on every chromosome. Although the draft the roundworm Caenorhabditis elegans. Sequenc-
contains gaps and errors and does not yet meet ing continues on the laboratory mouse. The
the criterion of 1 error in 10,000 bases outlined availability of complete genome sequences gener-
above, it provides a valuable scaffold for generating ated both inside and outside the HGP is driving a
a high-quality reference genome sequence. HGP major breakthrough in fundamental biology as
scientists make human DNA sequence available scientists compare entire genomes to gain new
broadly, rapidly, and free of charge via the Web. insights into evolutionary, biochemical, genetic,
Sequencing Technology metabolic, and physiological pathways.
Although current sequencing capacity is far Ethical, Legal, and Social Implications (ELSI)
greater than at the inception of the HGP, further Rapid advances in the science of genetics and
incremental progress in sequencing technologies, its applications present new and complex ethical
efficiency, and cost-reduction are needed. For and policy issues for individuals and society. ELSI
future sequencing applications, planners emphasize programs that identify and address these implica-
the importance of supporting novel technologies tions have been an integral part of the U.S. HGP
that may be 5 to 10 years in development. since its inception. These programs have resulted
Sequence Variation in a body of work that promotes education and helps
Although more than 99% of human DNA guide the conduct of genetic research and the
sequences are the same across the population, development of related medical and public policies.
variations in DNA sequence can have a major Bioinformatics and Computational Biology
impact on how humans respond to disease; to such Continued investment in current and new
environmental insults as bacteria, viruses, toxins, databases and analytical tools is critical to the
and chemicals; and to drugs and other therapies. success of the HGP and to the future usefulness of
Methods are being developed to detect different the data it produces. Databases must adapt to the
types of variation, particularly the most common evolving needs of the scientific community and
type called single-nucleotide polymorphisms (SNPs), must allow queries to be answered easily. Planners
which occur about once every 100 to 300 bases. suggest developing a human genome database,
Scientists believe SNP maps will help them identify analogous to model organism databases, that will
the multiple genes associated with such complex link to phenotypic information. Also needed are
diseases as cancer, diabetes, vascular disease, and databases and analytical tools for studying the
some forms of mental illness. These associations are expanding body of gene-expression and functional
difficult to establish with conventional gene- data, for modeling complex biological networks
hunting methods because a single altered gene and interactions, and for collecting and analyzing
may make only a small contribution to disease risk. sequence-variation data.
Functional Genomics Training
Efficient interpretation of the functions of Future genome scientists will require training in
human genes and other DNA sequences requires interdisciplinary areas including biology, computer
that strategies be developed to enable large-scale science, engineering, mathematics, physics, and
investigations across whole genomes. A first priority chemistry. Also, scientists with management skills will
is to generate complete sets of full-length cDNA be needed for leading large data-production efforts.

11
For More Information

Human Genome Project Information Publication of Genome Draft Sequence


www.ornl.gov/hgmis www.ornl.gov/hgmis/project/journals/
journals.html
Online Version of this Primer
www.ornl.gov/hgmis/publicat/primer/intro.html Image Gallery
www.ornl.gov/hgmis/education/images.html
Medicine and the New Genetics
www.ornl.gov/hgmis/medicine/medicine.html Resources for Teachers
www.ornl.gov/hgmis/education/education.html
Ethical, Legal, and Social Issues
www.ornl.gov/hgmis/elsi/elsi.html Resources for Students
www.ornl.gov/hgmis/education/
Genomes to Life (post-HGP research) students.html
DOEGenomesToLife.org
Careers in Genomics
www.ornl.gov/hgmis/education/careers.html

Wall Poster Available Free of Charge

A wall poster depicting the 24 human


chromosomes and many mapped genes
may be ordered from www.ornl.gov/hgmis/
posters/chromosome/ or from HGMIS.
Informative sidebars explain genetic terms
and provide URLs for finding more detailed
information.

October 2001. Produced by the Human


Genome Management Information
System (HGMIS) at Oak Ridge National
Laboratory, Oak Ridge, Tennessee, for
the U.S. Department of Energy Human
Genome Program.
For additional copies, contact:
caseydk@ornl.gov, 865/574-0597,
www.ornl.gov/hgmis.
A PDF version of this document and
accompanying PowerPoint slides may
be downloaded from www.ornl.gov/
hgmis/publicat/primer/intro.html.

12

Vous aimerez peut-être aussi