The status of the human gene catalogue
Authors:
Paulo Amaral,
Silvia Carbonell-Sala,
Francisco M. De La Vega,
Tiago Faial,
Adam Frankish,
Thomas Gingeras,
Roderic Guigo,
Jennifer L Harrow,
Artemis G. Hatzigeorgiou,
Rory Johnson,
Terence D. Murphy,
Mihaela Pertea,
Kim D. Pruitt,
Shashikant Pujar,
Hazuki Takahashi,
Igor Ulitsky,
Ales Varabyou,
Christine A. Wells,
Mark Yandell,
Piero Carninci,
Steven L. Salzberg
Abstract:
Scientists have been trying to identify all of the genes in the human genome since the initial draft of the genome was published in 2001. Over the intervening years, much progress has been made in identifying protein-coding genes, and the estimated number has shrunk to fewer than 20,000, although the number of distinct protein-coding isoforms has expanded dramatically. The invention of high-throughput RNA sequencing and other technological breakthroughs have led to an explosion in the number of reported non-coding RNA genes, although most of them do not yet have any known function. A combination of recent advances offers a path forward to identifying these functions and towards eventually completing the human gene catalogue. However, much work remains to be done before we have a universal annotation standard that includes all medically significant genes, maintains their relationships with different reference genomes, and describes clinically relevant genetic variants.
Submitted 24 March, 2023;
originally announced March 2023.
Needed for completion of the human genome: hypothesis driven experiments and biologically realistic mathematical models
Authors:
Roderic Guigo,
Ewan Birney,
Michael Brent,
Emmanouil Dermitzakis,
Lior Pachter,
Hugues Roest Crollius,
Victor Solovyev,
Michael Q. Zhang
Abstract:
With the sponsorship of "Fundacio La Caixa", we met in Barcelona on November 21st and 22nd to analyze the reasons why, after the completion of the human genome sequence, the identification of all protein-coding genes and their variants remains a distant goal. Here we report on our discussions and summarize some of the major challenges that need to be overcome in order to complete the human gene catalog.
Submitted 6 October, 2004;
originally announced October 2004.