U.s. Hgp On Fast Track For Early Completion

ember 1998, advisory committees at DOE andtype called single-nucleotide polymorphisms (SNPs),
NIH approved new 5-year goals aimed atwhich occur about once every 100 to 300 bases.
completing the Human Genome Project (HGP) 2Scientists believe SNP maps will help them identify
years earlier than originally planned in 1990. Thethe multiple genes associated with such complex
target date of 2003 also will mark the 50thdiseases as cancer, diabetes, vascular disease, and
anniversary of Watson and Crick's description ofsome forms of mental illness. These associations
DNA's fundamental structure.are difficult to establish with conventional
The new plan was published in the October 23,gene-hunting methods because a single altered
1998, issue of Science, which also cited thegene may make only a small contribution to
contributions of international partners. Thesedisease risk.
partners include the Sanger Centre in the UnitedFunctional Genomics
Kingdom and research centers in Germany, Japan,Efficient interpretation of the functions of human
and France.genes and other DNA sequences requires that
The U.S. HGP began officially in 1990 as a $3-billion,resources and strategies be developed to enable
15-year program to find the estimated 80,000large-scale investigations across whole genomes.
human genes and determine the sequence of theA technically challenging first priority is to generate
3 billion DNA building blocks that underlie all ofcomplete sets of full-length cDNA clones and
human biology and its diversity. The early phasesequences for human and model-organism genes.
of the HGP was characterized by efforts toOther functional-genomics goals include studies into
create the biological, instrumentation, andgene expression and control, creation of
computing resources necessary for efficientmutations that cause loss or alteration of function
production-scale DNA sequencing. The first 5-yearin nonhuman organisms, and development of
plan was revised in 1993 due to remarkableexperimental and computational methods for
technological progress, and the second planprotein analyses.
projected goals through FY 1998. The latest planComparative Genomics
was developed during a series of individual andThe functions of human genes and other DNA
joint DOE and NIH workshops held over the pastregions often are revealed by studying their
2 years (see box, p. 3).parallels in nonhumans. To enable such
Observers have predicted that the 21th centurycomparisons, HGP researchers have obtained
will be the "biology century." The analytical powercomplete genomic sequences for the bacterium
arising from the reference DNA sequences ofEscherichia coli, the yeast Saccharomyces
several entire genomes and other genomiccerevisiae, and the roundworm Caenorhabditis
resources is anticipated to help jump start theelegans. Sequencing continues on Drosophila
new millennium.melanogaster and the laboratory mouse. The
Human DNA Sequencingavailability of complete genome sequences
The HGP's continued emphasis is on obtaining agenerated both inside and outside the HGP is
complete and highly accurate reference sequencedriving a major breakthrough in fundamental
(1 error in 10,000 bases) that is largely continuousbiology as scientists compare entire genomes to
across each human chromosome. Scientistsgain new insights into evolutionary, biochemical,
believe that knowing this sequence is criticallygenetic, metabolic, and physiological pathways.
important for understanding human biology andHGP planners stress the need for a sustainable
for applications to other fields.sequencing capacity to facilitate future
The plan calls for generating a "working draft" ofcomparisons.
the human genome DNA sequence by 2001. TheEthical, Legal, and Social Implications (ELSI)
working draft will comprise shotgun sequenceRapid advances in the science of genetics and its
data from mapped clones, with gaps andapplications present new and complex ethical and
ambiguities unresolved. If these data sets can bepolicy issues for individuals and society. ELSI
merged with those from the private sector, theyprograms that identify and address these
may increase the depth of the mapped draft,implications have been an integral part of the U.S.
which scientists expect will contain about half theHGP since its inception. These programs have
genes. Draft sequence will provide a foundationresulted in a body of work that promotes
for obtaining the high-quality finished sequence andeducation and helps guide the conduct of genetic
also will be a valuable tool for researchers huntingresearch and the development of related medical
disease genes.and public policies.
According to Ari Patrinos, DOE Associate DirectorA continuing challenge is to safeguard the privacy
for Biological and Environmental Research,of individuals and groups who contribute DNA
"Although we have as our primary goal thesamples for large-scale sequence-variation studies.
finished Book of Life' by the end of 2003, we alsoOther concerns are to anticipate how the resulting
want the working draft to be as useful asdata may affect concepts of race and ethnicity;
possible."identify potential uses (or misuses) of genetic
NIH and DOE sequencing centers expect theirdata in workplaces, schools, and courts; identify
facilities to generate about 60% to 70% of thecommercial uses; and foresee impacts of genetic
human DNA sequence, which will be madeadvances on the concepts of humanity and
available broadly and rapidly via the Web topersonal responsibility.
stimulate further research.Bioinformatics and Computational Biology
Sequencing TechnologyContinued investment in current and new
Although current sequencing capacity is fardatabases and analytical tools is critical to the
greater than at the inception of the HGP,success of the HGP and to the future usefulness
achieving the new sequencing goals will require aof the data it produces. Databases must adapt to
two- to threefold improvement. Furtherthe evolving needs of the scientific community
incremental advances in sequencing technologies,and must allow queries to be answered easily.
efficiency, and cost will be needed. For futurePlanners suggest developing a human genome
sequencing applications, planners emphasize thedatabase, analogous to model organism
importance of supporting novel technologies thatdatabases, that will link to phenotypic information.
may be 5 to 10 years in development.Also needed are databases and analytical tools for
Sequence Variationstudying the expanding body of gene-expression
A new goal focuses on identifying individualand functional data, for modeling complex biological
variations in the human genome. Although morenetworks and interactions, and for collecting and
than 99% of human DNA sequences are theanalyzing sequence-variation data.
same across the population, variations in DNATraining
sequence can have a major impact on howPlanners note that future genomic scientists will
humans respond to disease; environmental insultsrequire training in interdisciplinary areas that include
such as bacteria, viruses, toxins, and chemicals;biology, computer science, engineering,
and drugs and other therapies.mathematics, physics, and chemistry. Additionally,
Methods are being developed to detect differentscientists with management skills will be needed
types of variation, particularly the most commonfor leading large data-production efforts.