Click Here!

  eMJA     The Medical Journal of Australia

Home | Issues | eMJA shop | My account | Classifieds | Contact | More... | Topics | Search   

Cracking the Code

Milestones in the Human Genome Project: genesis to postgenome

Ronald J A Trent

Photo of Watson

The Human Genome Project (HGP) will change medicine and medical research irrevocably. The obvious gains in genetic knowledge from the HGP, together with the advances which will flow into bioinformatics, biotechnology and the potential for novel therapeutic agents, will ensure that the financial investment in the HGP is repaid many times over. The HGP's costs in terms of ethical and social issues remain to be determined, but it is to be hoped that these will not detract from the scientific and medical achievements. How did such an endeavour start, and what path did it follow?

MJA 2000; 173: 591-594

Years 1-5 - Years 6-10 - A new era - References - Authors' details
Make a comment -
Register to be notified of new articles by e-mail - Current contents list - More articles on Genetics


  A key player in the formative years of the Human Genome Project (HGP) was the United States Department of Energy (DOE). The DOE had a long term research focus on DNA because its early involvement in the World War II program that produced the atomic bomb had led to an interest in induced mutations. To understand the link between DNA and induced mutations, the DOE ultimately needed to characterise individual differences in DNA sequences. However, at the time, the sequences of only a few selected genes within the genome were known. Most of the estimated 40 000 human genes had yet to be discovered,1 and a considerable proportion of the 3 x 109 base-pairs making up the human haploid genome did not contain genes (about 30% is repetitive DNA2). Therefore, vast tracts of DNA with unknown function remained to be explored.

A concerted effort to sequence the entire human genome within a reasonable time frame required the development of better technology. In addition, no research group was big enough to take on such a mammoth task. Despite the enormous obstacles, the majority of scientific opinion by the end of the 1980s was that sequencing the entire human genome was feasible. However, not all scientists were convinced -- there was considerable apprehension that it was a monumental exercise in data gathering rather than "true" research. There were also fears that the potential huge costs of the HGP would divert funds from more traditional research.

Ultimately, the HGP, started in late 1990, was planned to be completed by 2005, and had a budget of US$3 billion. Politically, the HGP promised more than medical benefits -- it promised technological developments that would lead to economic wealth and job creation. David Smith, then Director of the DOE's Human Genome Program, described the HGP as "developing an infrastructure for future research". In reply to concerns about the potential for shrinking research funds in other areas, Smith said that, after the HGP was completed, "individual investigators would do things that they would never be able to do otherwise".3 A final point to note about the HGP is that the term "human" is a misnomer, as parallel work was also planned to sequence the genomes of model organisms, including a mouse, a fruit fly, various microorganisms, a worm, a plant and a fish. This work, called "comparative genomics", was undertaken to facilitate understanding of the human genome, as there are many similarities between human genes and those of other organisms (Box 1).


Years 1-5

The goals identified for the first five years of the Human Genome Project are described in Box 2.6 During this time, many laboratories throughout the world carefully constructed maps of the genome, and then identified, by DNA sequencing, each base in the segment they were allotted. The HGP became a truly international endeavour, with the British and French participating early. Interestingly, in both countries, substantial funding for HGP-related activities came from outside the public purses (the Wellcome Trust, in the UK, and the Muscular Dystrophy Association, in France). Although the DOE was a key leading player, it was soon partnered by the National Institutes of Health (NIH). Today, the NIH is the leading public-sector player in the HGP.

The Human Genome Organisation (HUGO) was formed to coordinate international efforts, as well as to facilitate education and rapid exchange of information. Australia has played a key role in HUGO, with Professor Grant Sutherland, from Adelaide, being one of the presidents of that body; the 1999 Human Genome Meeting (HGM'99) was held in Brisbane.


Years 6-10

In contrast to the relatively low-key and steady progress made in the early stages of the HGP, more recent years have been considerably more turbulent. By 1998, the impressive developments in DNA-sequencing technology, particularly automation, had brought forward the timing for specific goals -- it was now believed that the complete sequence of the human genome would be determined by 2003. The first success stories of the HGP centred around the completed sequencing of genomes from model organisms (Box 3). These successes increased the momentum of the human genome work, as they confirmed that genomes could be completely sequenced, and that the information obtained had both scientific value, and potential medical importance.

As the pace quickened towards the 10th year, so did the growing influence of the commercial sector (Box 3). As shown in Box 2, key goals of the HGP were the development of technology (particularly in relation to DNA sequencing), and the transfer of HGP outcomes into the private sector. The HGP represented an ideal venture in which the public and private sectors could cooperate (and compete). Governments and public funding bodies set aside substantial resources to participate in the HGP as part of the intellectual pursuit, but also as a base for economic development or alternative research funding. Developments coming from HGP were meant to be rapidly disseminated to users, an aim which would later come under some pressure. The HGP's goals requiring that DNA sequencing results were to be communicated freely and without delay were, to some extent, at odds with the protection of intellectual property through patenting. In 1991, the NIH was embroiled in an international controversy when it attempted to patent anonymous DNA sequences (those for which no function was known). Following public and international indignation, the NIH withdrew these patents.7 However, apart from the patent issue, free-flowing, publicly available information remained the rule rather than the exception, until the private sector became a major player.

A high profile example of commercialisation came in the late 1990s with Celera Genomics, a privately funded organisation sponsored by Applied Biosystems (now PE Biosystems). Celera (company motto, "Speed matters") took on the might of the NIH and the world when it publicly boasted that with its resources (300 of the most modern automated DNA sequencers, and supercomputers second only to those in the US military), and a different strategy for sequencing DNA, it would finish the first draft of the human sequence before the NIH or other countries, and at a much reduced cost of US$200 million.8,9

This challenge had some positive effects, as it focused the cumbersome and slow-moving multicampus, multinational Human Genome Project, but, on the negative side, it once again highlighted that big corporations and big money can get there first, but at a cost -- the availability and access to future databases would no longer be free, at least in the short term.


A new era

In June this year, President Clinton, flanked by Dr Francis Collins of the NIH and Dr Craig Venter of Celera, announced simultaneously with the UK's Prime Minister Blair that the first draft of the human DNA sequence was now complete, having reached this stage with input from both the public and private sectors. Whether Celera or the NIH-sponsored initiative won the "race" is less relevant than who will control access to the databases containing the DNA sequences. At this stage, Celera appears to hold the upper hand, and is well on its way to the stated goal of being the definitive source of genomic and related medical and agricultural information.9 Although the HGP has officially reached its goals, it is important to note that this is only the first draft of the DNA sequence, and considerable work remains to ensure that DNA sequencing errors are removed. This will take a few more years.

At the end of the HGP, the DNA sequence from the human genome will be deposited in various databases. What will be left is the mammoth task of working out the function of the genes. Hence, the post-genome era has been called functional genomics, which includes proteomics -- the technology and strategies required to determine the function of proteins. How this will be accomplished remains to be determined, but new technologies will be needed. The use of microarrays (a method by which the expression of many thousands of genes can be identified very rapidly with microchips) is an early, promising strategy in functional genomics. Bioinformatics will need to come up with more sophisticated programs by which the function of genes can be predicted. The traditional "wet-lab" approach to research might even give way to a complete "in-silico" (ie, computer) strategy! The first challenge -- sequencing the genome -- has been accomplished. Many more even larger challenges await us as we set out to determine the function of all human genes.10,11


References

  1. Aparicio AJR. How to count . . . human genes. Nat Genet 2000; 25: 129-130.
  2. Mueller RF, Young ID. Emery's elements of medical genetics. 10th ed. Churchill Livingstone, Edinburgh, 1998: 15.
  3. Smith D. Evolution of a vision: genome project origins, present and future challenges and far reaching benefits. Human Genome News 1995; 7: 3-4. (See <http://www.ornl.gov/hgmis/> accessed November 2000)
  4. Clark MS. Comparative genomics: the key to understanding the Human Genome Project. Bioessays 1999; 21: 121-130.
  5. Collins FS. Shattuck lecture -- medical and societal consequences of the Human Genome Project. N Engl J Med 1999; 341: 28-37.
  6. Human Genome Project Information <http://www.ornl.gov/hgmis/> (accessed November 2000).
  7. Anderson C. NIH drops bid for gene patents. Science 1994; 263: 909-910.
  8. Wadman M. Company aims to beat NIH human genome efforts. Nature 1998; 393: 101.
  9. Celera. A PE Corporation business. <http:www.celera.com/> (accessed November 2000).
  10. Burley SK, Almo SC, Bonanno JB, et al. Structural genomics: beyond the human genome project. Nat Genet 1999; 23: 151-157.
  11. Van Ommen GJ, Bakker E, den Dunnen JT. The human genome project and the future of diagnostics, treatment and prevention. Lancet 1999; 354 (Suppl 1): 5-10.



Authors' details

University of Sydney at the Royal Prince Alfred Hospital, Sydney, NSW.
Ronald J A Trent, DPhil(Oxon), FRACP, FRCPA, Professor of Molecular Genetics.

Reprints will not be available from the authors.
Correspondence: Professor R J A Trent, Department of Molecular and Clinical Genetics, Royal Prince Alfred Hospital, Missenden Road, Camperdown, NSW 2050.
rtrentATmed.usyd.edu.au

Top right-hand corner is a photo of James Watson (left) and Francis Crick (courtesy of A Barrington Brown, Sciences Source/Photo Researchers).

©MJA 2000
Make a comment

Home | Issues | eMJA shop | My account | Classifieds | More... | Contact | Topics | Search

The Medical Journal of Australia    eMJA  


Readers may print a single copy for personal use. No further reproduction or distribution of the articles should proceed without the permission of the publisher. For permission, contact the Australasian Medical Publishing Company.
Journalists are welcome to write news stories based on what they read here, but should acknowledge their source as "an article published on the Internet by The Medical Journal of Australia <http://www.mja.com.au>".

<URL: http://www.mja.com.au/> © 2000 Medical Journal of Australia.
We appreciate your comments.

1: Comparative genomics
Image of genes
Genes that are important to survival have been conserved during evolution, and remain common to organisms ranging from yeast to humans. For example, it is estimated that about 74% of known human genes have a corresponding homologue in the nematode Caenorhabditis elegans.4,5 This is why it is often possible to identify the function of a human gene by working on the corresponding gene in a model organism. In mice, a gene can be "knocked out" (rendered non-functional) by genetic manipulation and the effect of this observed; in fruit flies, observing the phenotypes for natural or induced mutant genes enables the function of the corresponding gene in humans to be predicted.
Back to text
 
2: Goals of the Human Genome Project6
  • Identify the 40 000 or so genes which make up the human genome.
  • Determine the sequence of the approximately 3 billion bases which make up the human genome, as well as sequencing the genomes of model organisms.
  • Store information in databases.*
  • Develop tools for data analysis.*
  • Transfer technologies to the private sector. (Private industry needed to be involved to make both technology development and research training effective.)
  • Address the ethical, legal, and social issues arising from the HGP (approximately 3%-5% of the Department of Energy/National Institutes of Health budget was directed to this goal).

* These goals would require very sophisticated bioinformatics capability, so the rapid development of bioinformatics (a discipline involving computational skills) has been a by-product of the HGP.
Back to text
 
3: Landmarks on the road to realising the Human Genome Project
Date Scientific achievement/Implications

  • 1954

Watson and Crick describe the double-stranded nature of DNA.

The beginning of "molecular medicine", as diseases could now be considered in molecular (DNA) terms.

  • 1975

Sanger, Maxam and Gilbert describe how DNA can be sequenced.

This would enable the human genome to be read base by base.

  • 1985

Mullis and colleagues describe how DNA can be amplified with the polymerase chain reaction (PCR).

PCR opened up the potential for diagnosing genetic disorders.

  • 1987

Automated means by which DNA can be sequenced becomes available through the Applied Biosystems company (later PE Biosystems).

Automation allowed sequencing of large segments of DNA. This was a critical development without which the Human Genome Project could not have been conceived.

  • 1990
Human Genome Project starts.
  • 1991

Controversy as the NIH attempts to patent anonymous DNA sequences. New genome research centres develop as start-up, for-profit companies. The private sector is involved because of the potential for DNA diagnostics and novel therapies based on DNA sequencing data.

The human genome work becomes increasingly commercialised, with the potential for future ethical dilemmas in terms of access.

  • 1995

The DNA sequence for the genome of the first model organism (Haemophilus influenzae) is published. This is achieved by TIGR, a company with commercial ties. Craig Venter from TIGR would later form Celera. In the next three years, the genomes for model organisms Escherichia coli, the yeast Saccharomyces cerevisiae and the nematode Caenorhabditis elegans are sequenced.

The potential to understand pathogenicity at the DNA level would identify targets for novel therapeutic agents.

  • 1996

Altruistic public policy reaffirmed that information coming from the Human Genome Project will be deposited in public databases within 24 hours.

This response to the increasing influence of the commercial sector is an attempt to assuage fears among the more conservative researchers. However, with industry becoming involved, the user (government or individual) must eventually pay.

  • 2000
The complete sequence of the fruit fly (Drosophila melanogaster) genome is announced by Celera and, as a sign of good faith, the company deposits the sequence in a public database.
 

The first draft of the human sequence is announced by President Clinton.

To date, knowledge of the DNA sequence has had a modest impact on medical practice, apart from increasing the potential for diagnosis of genetic disorders. However, with the functional genomics era will come novel therapeutics based on knowledge of DNA and its role in disease.

  Access to the Celera human database (as well as the mouse genome sequence due for completion soon) becomes available but only by subscription. The NHMRC is one of the first to subscribe to the Celera databases, allowing Australian scientists access to DNA sequence information not available in the public databases.
Back to text
 
Further historical information on the Human Genome Project can be found at the following web sites: http://www.ornl.gov/hgmis/ (the DOE version); http://www.gene.ucl.ac.uk/hugo/ (HUGO's role); http://www.nhgri.nih.gov/ (NIH information); http://www.fplc.edu/risk/vol5/spring/cookdeeg.htm (historical summary by Robert Cook-Deegan).