Identifying protein-coding genes in genomic sequences

Bibliographic Details
Main authors: Harrow, Jennifer; Nagy, Alinda; Reymond, Alexandre; Alioto, Tyler; Patthy, Laszlo; Antonarakis, Stylianos E; Guigó, Roderic
Format: Text
Language: English
Published: BioMed Central 2009
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2687780/
https://www.ncbi.nlm.nih.gov/pubmed/19226436
http://dx.doi.org/10.1186/gb-2009-10-1-201
Description
Summary: The vast majority of the biology of a newly sequenced genome is inferred from the set of encoded proteins. Predicting this set is therefore invariably the first step after the completion of the genome DNA sequence. Here we review the main computational pipelines used to generate the human reference protein-coding gene sets.