
EST2Prot: Mapping EST sequences to proteins

BACKGROUND: EST libraries are used in various biological studies, from microarray experiments to proteomic and genetic screens. These libraries usually contain many uncharacterized ESTs that are typically ignored since they cannot be mapped to known genes. Consequently, new discoveries are possibly...


Bibliographic Details
Main authors: Shafer, Paul; Lin, David M; Yona, Golan
Format: Text
Language: English
Published: BioMed Central 2006
Subjects: Software
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1456965/
https://www.ncbi.nlm.nih.gov/pubmed/16515706
http://dx.doi.org/10.1186/1471-2164-7-41
author Shafer, Paul
Lin, David M
Yona, Golan
collection PubMed
description BACKGROUND: EST libraries are used in various biological studies, from microarray experiments to proteomic and genetic screens. These libraries usually contain many uncharacterized ESTs that are typically ignored since they cannot be mapped to known genes. Consequently, new discoveries are possibly overlooked. RESULTS: We describe a system (EST2Prot) that uses multiple elements to map EST sequences to their corresponding protein products. EST2Prot uses UniGene clusters, substring analysis, information about protein coding regions in existing DNA sequences and protein database searches to detect protein products related to a query EST sequence. Gene Ontology terms, Swiss-Prot keywords, and protein similarity data are used to map the ESTs to functional descriptors. CONCLUSION: EST2Prot extends and significantly enriches the popular UniGene mapping by utilizing multiple relations between known biological entities. It produces a mapping between ESTs and proteins in real-time through a simple web-interface. The system is part of the Biozon database and is accessible at .
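As a rough illustration of how two of the evidence sources named in the abstract could combine, the sketch below maps an EST to proteins through (a) substring containment in nucleotide records with annotated protein products and (b) shared cluster membership. It is not EST2Prot's implementation: reading "substring analysis" as exact containment on either strand is an assumption, and every name and sequence here (`DnaRecord`, `map_est_to_proteins`, `DNA_RECORDS`, `CLUSTERS`, the toy IDs) is hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set


@dataclass
class DnaRecord:
    """Toy nucleotide record plus the proteins its coding regions encode."""
    seq: str
    proteins: List[str] = field(default_factory=list)


_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def map_est_to_proteins(
    est_id: str,
    est_seq: str,
    dna_records: Dict[str, DnaRecord],
    clusters: Dict[str, Set[str]],
) -> Dict[str, Set[str]]:
    """Map an EST to protein IDs, recording which evidence supports each hit."""
    evidence: Dict[str, Set[str]] = {}
    est_seq = est_seq.upper()
    est_rc = reverse_complement(est_seq)

    # Evidence 1 (assumed form of substring analysis): the EST, on either
    # strand, occurs verbatim inside a record with known protein products.
    for dna_id, record in dna_records.items():
        if est_seq in record.seq or est_rc in record.seq:
            for protein in record.proteins:
                evidence.setdefault(protein, set()).add(f"substring:{dna_id}")

    # Evidence 2: the EST shares a cluster with a record that has products.
    for cluster_id, members in clusters.items():
        if est_id not in members:
            continue
        for member in members:
            record = dna_records.get(member)
            if record is None:
                continue  # cluster member without protein annotation
            for protein in record.proteins:
                evidence.setdefault(protein, set()).add(f"cluster:{cluster_id}")

    return evidence


# Hypothetical toy data, invented for this example only.
DNA_RECORDS = {
    "mRNA_1": DnaRecord("ATGGCCAAGTTTGACCTGTAA", ["P_alpha"]),
    "mRNA_2": DnaRecord("ATGCCCGGGAAATTTTAG", ["P_beta"]),
}
CLUSTERS = {"Hs.1": {"EST_A", "mRNA_2"}}

if __name__ == "__main__":
    hits = map_est_to_proteins("EST_A", "GCCAAGTTTGAC", DNA_RECORDS, CLUSTERS)
    for protein, sources in sorted(hits.items()):
        print(protein, sorted(sources))
```

Keeping the evidence labels alongside each protein ID lets a caller see whether a hit came from direct sequence containment or only from cluster membership. A real system would likely use alignment with mismatch tolerance rather than exact matching.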
format Text
id pubmed-1456965
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-1456965 2006-05-04 EST2Prot: Mapping EST sequences to proteins. Shafer, Paul; Lin, David M; Yona, Golan. BMC Genomics. Software. BioMed Central 2006-03-04 /pmc/articles/PMC1456965/ /pubmed/16515706 http://dx.doi.org/10.1186/1471-2164-7-41 Text en Copyright © 2006 Shafer et al; licensee BioMed Central Ltd.
title EST2Prot: Mapping EST sequences to proteins
topic Software