Cargando…

HOPPSIGEN: a database of human and mouse processed pseudogenes

Processed pseudogenes result from reverse transcribed mRNAs. In general, because processed pseudogenes lack promoters, they are no longer functional from the moment they are inserted into the genome. Subsequently, they freely accumulate substitutions, insertions and deletions. Moreover, the ancestra...

Descripción completa

Detalles Bibliográficos
Autores principales:	Adel, Khelifi, Laurent, Duret, Dominique, Mouchiroud
Formato:	Texto
Lenguaje:	English
Publicado:	Oxford University Press 2005
Materias:	Articles
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC540038/ https://www.ncbi.nlm.nih.gov/pubmed/15608268 http://dx.doi.org/10.1093/nar/gki084

_version_	1782122109407854592
author	Adel, Khelifi Laurent, Duret Dominique, Mouchiroud
author_facet	Adel, Khelifi Laurent, Duret Dominique, Mouchiroud
author_sort	Adel, Khelifi
collection	PubMed
description	Processed pseudogenes result from reverse transcribed mRNAs. In general, because processed pseudogenes lack promoters, they are no longer functional from the moment they are inserted into the genome. Subsequently, they freely accumulate substitutions, insertions and deletions. Moreover, the ancestral structure of processed pseudogenes could be easily inferred using the sequence of their functional homologous genes. Owing to these characteristics, processed pseudogenes represent good neutral markers for studying genome evolution. Recently, there is an increasing interest for these markers, particularly to help gene prediction in the field of genome annotation, functional genomics and genome evolution analysis (patterns of substitution). For these reasons, we have developed a method to annotate processed pseudogenes in complete genomes. To make them useful to different fields of research, we stored them in a nucleic acid database after having annotated them. In this work, we screened both mouse and human complete genomes from ENSEMBL to find processed pseudogenes generated from functional genes with introns. We used a conservative method to detect processed pseudogenes in order to minimize the rate of false positive sequences. Within processed pseudogenes, some are still having a conserved open reading frame and some have overlapping gene locations. We designated as retroelements all reverse transcribed sequences and more strictly, we designated as processed pseudogenes, all retroelements not falling in the two former categories (having a conserved open reading or overlapping gene locations). We annotated 5823 retroelements (5206 processed pseudogenes) in the human genome and 3934 (3428 processed pseudogenes) in the mouse genome. Compared to previous estimations, the total number of processed pseudogenes was underestimated but the aim of this procedure was to generate a high-quality dataset. To facilitate the use of processed pseudogenes in studying genome structure and evolution, DNA sequences from processed pseudogenes, and their functional reverse transcribed homologs, are now stored in a nucleic acid database, HOPPSIGEN. HOPPSIGEN can be browsed on the PBIL (Pôle Bioinformatique Lyonnais) World Wide Web server (http://pbil.univ-lyon1.fr/) or fully downloaded for local installation.
format	Text
id	pubmed-540038
institution	National Center for Biotechnology Information
language	English
publishDate	2005
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-5400382005-01-04 HOPPSIGEN: a database of human and mouse processed pseudogenes Adel, Khelifi Laurent, Duret Dominique, Mouchiroud Nucleic Acids Res Articles Processed pseudogenes result from reverse transcribed mRNAs. In general, because processed pseudogenes lack promoters, they are no longer functional from the moment they are inserted into the genome. Subsequently, they freely accumulate substitutions, insertions and deletions. Moreover, the ancestral structure of processed pseudogenes could be easily inferred using the sequence of their functional homologous genes. Owing to these characteristics, processed pseudogenes represent good neutral markers for studying genome evolution. Recently, there is an increasing interest for these markers, particularly to help gene prediction in the field of genome annotation, functional genomics and genome evolution analysis (patterns of substitution). For these reasons, we have developed a method to annotate processed pseudogenes in complete genomes. To make them useful to different fields of research, we stored them in a nucleic acid database after having annotated them. In this work, we screened both mouse and human complete genomes from ENSEMBL to find processed pseudogenes generated from functional genes with introns. We used a conservative method to detect processed pseudogenes in order to minimize the rate of false positive sequences. Within processed pseudogenes, some are still having a conserved open reading frame and some have overlapping gene locations. We designated as retroelements all reverse transcribed sequences and more strictly, we designated as processed pseudogenes, all retroelements not falling in the two former categories (having a conserved open reading or overlapping gene locations). We annotated 5823 retroelements (5206 processed pseudogenes) in the human genome and 3934 (3428 processed pseudogenes) in the mouse genome. Compared to previous estimations, the total number of processed pseudogenes was underestimated but the aim of this procedure was to generate a high-quality dataset. To facilitate the use of processed pseudogenes in studying genome structure and evolution, DNA sequences from processed pseudogenes, and their functional reverse transcribed homologs, are now stored in a nucleic acid database, HOPPSIGEN. HOPPSIGEN can be browsed on the PBIL (Pôle Bioinformatique Lyonnais) World Wide Web server (http://pbil.univ-lyon1.fr/) or fully downloaded for local installation. Oxford University Press 2005-01-01 2004-12-17 /pmc/articles/PMC540038/ /pubmed/15608268 http://dx.doi.org/10.1093/nar/gki084 Text en Copyright © 2005 Oxford University Press
spellingShingle	Articles Adel, Khelifi Laurent, Duret Dominique, Mouchiroud HOPPSIGEN: a database of human and mouse processed pseudogenes
title	HOPPSIGEN: a database of human and mouse processed pseudogenes
title_full	HOPPSIGEN: a database of human and mouse processed pseudogenes
title_fullStr	HOPPSIGEN: a database of human and mouse processed pseudogenes
title_full_unstemmed	HOPPSIGEN: a database of human and mouse processed pseudogenes
title_short	HOPPSIGEN: a database of human and mouse processed pseudogenes
title_sort	hoppsigen: a database of human and mouse processed pseudogenes
topic	Articles
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC540038/ https://www.ncbi.nlm.nih.gov/pubmed/15608268 http://dx.doi.org/10.1093/nar/gki084
work_keys_str_mv	AT adelkhelifi hoppsigenadatabaseofhumanandmouseprocessedpseudogenes AT laurentduret hoppsigenadatabaseofhumanandmouseprocessedpseudogenes AT dominiquemouchiroud hoppsigenadatabaseofhumanandmouseprocessedpseudogenes

HOPPSIGEN: a database of human and mouse processed pseudogenes

Ejemplares similares