Cargando…

Pseudogenes and Their Genome-Wide Prediction in Plants

Pseudogenes are paralogs generated from ancestral functional genes (parents) during genome evolution, which contain critical defects in their sequences, such as lacking a promoter, having a premature stop codon or frameshift mutations. Generally, pseudogenes are functionless, but recent evidence dem...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiao, Jin, Sekhwal, Manoj Kumar, Li, Pingchuan, Ragupathy, Raja, Cloutier, Sylvie, Wang, Xiue, You, Frank M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5187791/
https://www.ncbi.nlm.nih.gov/pubmed/27916797
http://dx.doi.org/10.3390/ijms17121991
_version_ 1782486898743181312
author Xiao, Jin
Sekhwal, Manoj Kumar
Li, Pingchuan
Ragupathy, Raja
Cloutier, Sylvie
Wang, Xiue
You, Frank M.
author_facet Xiao, Jin
Sekhwal, Manoj Kumar
Li, Pingchuan
Ragupathy, Raja
Cloutier, Sylvie
Wang, Xiue
You, Frank M.
author_sort Xiao, Jin
collection PubMed
description Pseudogenes are paralogs generated from ancestral functional genes (parents) during genome evolution, which contain critical defects in their sequences, such as lacking a promoter, having a premature stop codon or frameshift mutations. Generally, pseudogenes are functionless, but recent evidence demonstrates that some of them have potential roles in regulation. The majority of pseudogenes are generated from functional progenitor genes either by gene duplication (duplicated pseudogenes) or retro-transposition (processed pseudogenes). Pseudogenes are primarily identified by comparison to their parent genes. Bioinformatics tools for pseudogene prediction have been developed, among which PseudoPipe, PSF and Shiu’s pipeline are publicly available. We compared these three tools using the well-annotated Arabidopsis thaliana genome and its known 924 pseudogenes as a test data set. PseudoPipe and Shiu’s pipeline identified ~80% of A. thaliana pseudogenes, of which 94% were shared, while PSF failed to generate adequate results. A need for improvement of the bioinformatics tools for pseudogene prediction accuracy in plant genomes was thus identified, with the ultimate goal of improving the quality of genome annotation in plants.
format Online
Article
Text
id pubmed-5187791
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-51877912016-12-30 Pseudogenes and Their Genome-Wide Prediction in Plants Xiao, Jin Sekhwal, Manoj Kumar Li, Pingchuan Ragupathy, Raja Cloutier, Sylvie Wang, Xiue You, Frank M. Int J Mol Sci Review Pseudogenes are paralogs generated from ancestral functional genes (parents) during genome evolution, which contain critical defects in their sequences, such as lacking a promoter, having a premature stop codon or frameshift mutations. Generally, pseudogenes are functionless, but recent evidence demonstrates that some of them have potential roles in regulation. The majority of pseudogenes are generated from functional progenitor genes either by gene duplication (duplicated pseudogenes) or retro-transposition (processed pseudogenes). Pseudogenes are primarily identified by comparison to their parent genes. Bioinformatics tools for pseudogene prediction have been developed, among which PseudoPipe, PSF and Shiu’s pipeline are publicly available. We compared these three tools using the well-annotated Arabidopsis thaliana genome and its known 924 pseudogenes as a test data set. PseudoPipe and Shiu’s pipeline identified ~80% of A. thaliana pseudogenes, of which 94% were shared, while PSF failed to generate adequate results. A need for improvement of the bioinformatics tools for pseudogene prediction accuracy in plant genomes was thus identified, with the ultimate goal of improving the quality of genome annotation in plants. MDPI 2016-11-28 /pmc/articles/PMC5187791/ /pubmed/27916797 http://dx.doi.org/10.3390/ijms17121991 Text en © 2016 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Review
Xiao, Jin
Sekhwal, Manoj Kumar
Li, Pingchuan
Ragupathy, Raja
Cloutier, Sylvie
Wang, Xiue
You, Frank M.
Pseudogenes and Their Genome-Wide Prediction in Plants
title Pseudogenes and Their Genome-Wide Prediction in Plants
title_full Pseudogenes and Their Genome-Wide Prediction in Plants
title_fullStr Pseudogenes and Their Genome-Wide Prediction in Plants
title_full_unstemmed Pseudogenes and Their Genome-Wide Prediction in Plants
title_short Pseudogenes and Their Genome-Wide Prediction in Plants
title_sort pseudogenes and their genome-wide prediction in plants
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5187791/
https://www.ncbi.nlm.nih.gov/pubmed/27916797
http://dx.doi.org/10.3390/ijms17121991
work_keys_str_mv AT xiaojin pseudogenesandtheirgenomewidepredictioninplants
AT sekhwalmanojkumar pseudogenesandtheirgenomewidepredictioninplants
AT lipingchuan pseudogenesandtheirgenomewidepredictioninplants
AT ragupathyraja pseudogenesandtheirgenomewidepredictioninplants
AT cloutiersylvie pseudogenesandtheirgenomewidepredictioninplants
AT wangxiue pseudogenesandtheirgenomewidepredictioninplants
AT youfrankm pseudogenesandtheirgenomewidepredictioninplants