Cargando…

TranSeqAnnotator: large-scale analysis of transcriptomic data

BACKGROUND: The transcriptome of an organism can be studied with the analysis of expressed sequence tag (EST) data sets that offers a rapid and cost effective approach with several new and updated bioinformatics approaches and tools for assembly and annotation. The comprehensive analyses comprehend...

Descripción completa

Detalles Bibliográficos
Autores principales: Menon, Ranjeeta, Garg, Gagan, Gasser, Robin B, Ranganathan, Shoba
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3521237/
https://www.ncbi.nlm.nih.gov/pubmed/23282024
http://dx.doi.org/10.1186/1471-2105-13-S17-S24
_version_ 1782252911894462464
author Menon, Ranjeeta
Garg, Gagan
Gasser, Robin B
Ranganathan, Shoba
author_facet Menon, Ranjeeta
Garg, Gagan
Gasser, Robin B
Ranganathan, Shoba
author_sort Menon, Ranjeeta
collection PubMed
description BACKGROUND: The transcriptome of an organism can be studied with the analysis of expressed sequence tag (EST) data sets that offers a rapid and cost effective approach with several new and updated bioinformatics approaches and tools for assembly and annotation. The comprehensive analyses comprehend an organism along with the genome and proteome analysis. With the advent of large-scale sequencing projects and generation of sequence data at protein and cDNA levels, automated analysis pipeline is necessary to store, organize and annotate ESTs. RESULTS: TranSeqAnnotator is a workflow for large-scale analysis of transcriptomic data with the most appropriate bioinformatics tools for data management and analysis. The pipeline automatically cleans, clusters, assembles and generates consensus sequences, conceptually translates these into possible protein products and assigns putative function based on various DNA and protein similarity searches. Excretory/secretory (ES) proteins inferred from ESTs/short reads are also identified. The TranSeqAnnotator accepts FASTA format raw and quality ESTs along with protein and short read sequences and are analysed with user selected programs. After pre-processing and assembly, the dataset is annotated at the nucleotide, protein and ES protein levels. CONCLUSION: TranSeqAnnotator has been developed in a Linux cluster, to perform an exhaustive and reliable analysis and provide detailed annotation. TranSeqAnnotator outputs gene ontologies, protein functional identifications in terms of mapping to protein domains and metabolic pathways. The pipeline is applied to annotate large EST datasets to identify several novel and known genes with therapeutic experimental validations and could serve as potential targets for parasite intervention. TransSeqAnnotator is freely available for the scientific community at http://estexplorer.biolinfo.org/TranSeqAnnotator/.
format Online
Article
Text
id pubmed-3521237
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35212372012-12-14 TranSeqAnnotator: large-scale analysis of transcriptomic data Menon, Ranjeeta Garg, Gagan Gasser, Robin B Ranganathan, Shoba BMC Bioinformatics Proceedings BACKGROUND: The transcriptome of an organism can be studied with the analysis of expressed sequence tag (EST) data sets that offers a rapid and cost effective approach with several new and updated bioinformatics approaches and tools for assembly and annotation. The comprehensive analyses comprehend an organism along with the genome and proteome analysis. With the advent of large-scale sequencing projects and generation of sequence data at protein and cDNA levels, automated analysis pipeline is necessary to store, organize and annotate ESTs. RESULTS: TranSeqAnnotator is a workflow for large-scale analysis of transcriptomic data with the most appropriate bioinformatics tools for data management and analysis. The pipeline automatically cleans, clusters, assembles and generates consensus sequences, conceptually translates these into possible protein products and assigns putative function based on various DNA and protein similarity searches. Excretory/secretory (ES) proteins inferred from ESTs/short reads are also identified. The TranSeqAnnotator accepts FASTA format raw and quality ESTs along with protein and short read sequences and are analysed with user selected programs. After pre-processing and assembly, the dataset is annotated at the nucleotide, protein and ES protein levels. CONCLUSION: TranSeqAnnotator has been developed in a Linux cluster, to perform an exhaustive and reliable analysis and provide detailed annotation. TranSeqAnnotator outputs gene ontologies, protein functional identifications in terms of mapping to protein domains and metabolic pathways. The pipeline is applied to annotate large EST datasets to identify several novel and known genes with therapeutic experimental validations and could serve as potential targets for parasite intervention. TransSeqAnnotator is freely available for the scientific community at http://estexplorer.biolinfo.org/TranSeqAnnotator/. BioMed Central 2012-12-07 /pmc/articles/PMC3521237/ /pubmed/23282024 http://dx.doi.org/10.1186/1471-2105-13-S17-S24 Text en Copyright ©2012 Menon et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Menon, Ranjeeta
Garg, Gagan
Gasser, Robin B
Ranganathan, Shoba
TranSeqAnnotator: large-scale analysis of transcriptomic data
title TranSeqAnnotator: large-scale analysis of transcriptomic data
title_full TranSeqAnnotator: large-scale analysis of transcriptomic data
title_fullStr TranSeqAnnotator: large-scale analysis of transcriptomic data
title_full_unstemmed TranSeqAnnotator: large-scale analysis of transcriptomic data
title_short TranSeqAnnotator: large-scale analysis of transcriptomic data
title_sort transeqannotator: large-scale analysis of transcriptomic data
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3521237/
https://www.ncbi.nlm.nih.gov/pubmed/23282024
http://dx.doi.org/10.1186/1471-2105-13-S17-S24
work_keys_str_mv AT menonranjeeta transeqannotatorlargescaleanalysisoftranscriptomicdata
AT garggagan transeqannotatorlargescaleanalysisoftranscriptomicdata
AT gasserrobinb transeqannotatorlargescaleanalysisoftranscriptomicdata
AT ranganathanshoba transeqannotatorlargescaleanalysisoftranscriptomicdata