Cargando…

Workflow and web application for annotating NCBI BioProject transcriptome data

The volume of transcriptome data is growing exponentially due to rapid improvement of experimental technologies. In response, large central resources such as those of the National Center for Biotechnology Information (NCBI) are continually adapting their computational infrastructure to accommodate t...

Descripción completa

Detalles Bibliográficos
Autores principales:	Vera Alvarez, Roberto, Medeiros Vidal, Newton, Garzón-Martínez, Gina A., Barrero, Luz S., Landsman, David, Mariño-Ramírez, Leonardo
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2017
Materias:	Original Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5467576/ https://www.ncbi.nlm.nih.gov/pubmed/28605765 http://dx.doi.org/10.1093/database/bax008

_version_	1783243293173219328
author	Vera Alvarez, Roberto Medeiros Vidal, Newton Garzón-Martínez, Gina A. Barrero, Luz S. Landsman, David Mariño-Ramírez, Leonardo
author_facet	Vera Alvarez, Roberto Medeiros Vidal, Newton Garzón-Martínez, Gina A. Barrero, Luz S. Landsman, David Mariño-Ramírez, Leonardo
author_sort	Vera Alvarez, Roberto
collection	PubMed
description	The volume of transcriptome data is growing exponentially due to rapid improvement of experimental technologies. In response, large central resources such as those of the National Center for Biotechnology Information (NCBI) are continually adapting their computational infrastructure to accommodate this large influx of data. New and specialized databases, such as Transcriptome Shotgun Assembly Sequence Database (TSA) and Sequence Read Archive (SRA), have been created to aid the development and expansion of centralized repositories. Although the central resource databases are under continual development, they do not include automatic pipelines to increase annotation of newly deposited data. Therefore, third-party applications are required to achieve that aim. Here, we present an automatic workflow and web application for the annotation of transcriptome data. The workflow creates secondary data such as sequencing reads and BLAST alignments, which are available through the web application. They are based on freely available bioinformatics tools and scripts developed in-house. The interactive web application provides a search engine and several browser utilities. Graphical views of transcript alignments are available through SeqViewer, an embedded tool developed by NCBI for viewing biological sequence data. The web application is tightly integrated with other NCBI web applications and tools to extend the functionality of data processing and interconnectivity. We present a case study for the species Physalis peruviana with data generated from BioProject ID 67621. Database URL: http://www.ncbi.nlm.nih.gov/projects/physalis/
format	Online Article Text
id	pubmed-5467576
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-54675762017-06-19 Workflow and web application for annotating NCBI BioProject transcriptome data Vera Alvarez, Roberto Medeiros Vidal, Newton Garzón-Martínez, Gina A. Barrero, Luz S. Landsman, David Mariño-Ramírez, Leonardo Database (Oxford) Original Article The volume of transcriptome data is growing exponentially due to rapid improvement of experimental technologies. In response, large central resources such as those of the National Center for Biotechnology Information (NCBI) are continually adapting their computational infrastructure to accommodate this large influx of data. New and specialized databases, such as Transcriptome Shotgun Assembly Sequence Database (TSA) and Sequence Read Archive (SRA), have been created to aid the development and expansion of centralized repositories. Although the central resource databases are under continual development, they do not include automatic pipelines to increase annotation of newly deposited data. Therefore, third-party applications are required to achieve that aim. Here, we present an automatic workflow and web application for the annotation of transcriptome data. The workflow creates secondary data such as sequencing reads and BLAST alignments, which are available through the web application. They are based on freely available bioinformatics tools and scripts developed in-house. The interactive web application provides a search engine and several browser utilities. Graphical views of transcript alignments are available through SeqViewer, an embedded tool developed by NCBI for viewing biological sequence data. The web application is tightly integrated with other NCBI web applications and tools to extend the functionality of data processing and interconnectivity. We present a case study for the species Physalis peruviana with data generated from BioProject ID 67621. Database URL: http://www.ncbi.nlm.nih.gov/projects/physalis/ Oxford University Press 2017-04-17 /pmc/articles/PMC5467576/ /pubmed/28605765 http://dx.doi.org/10.1093/database/bax008 Text en Published by Oxford University Press 2017. This work is written by US Government employees and is in the public domain in the US.
spellingShingle	Original Article Vera Alvarez, Roberto Medeiros Vidal, Newton Garzón-Martínez, Gina A. Barrero, Luz S. Landsman, David Mariño-Ramírez, Leonardo Workflow and web application for annotating NCBI BioProject transcriptome data
title	Workflow and web application for annotating NCBI BioProject transcriptome data
title_full	Workflow and web application for annotating NCBI BioProject transcriptome data
title_fullStr	Workflow and web application for annotating NCBI BioProject transcriptome data
title_full_unstemmed	Workflow and web application for annotating NCBI BioProject transcriptome data
title_short	Workflow and web application for annotating NCBI BioProject transcriptome data
title_sort	workflow and web application for annotating ncbi bioproject transcriptome data
topic	Original Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5467576/ https://www.ncbi.nlm.nih.gov/pubmed/28605765 http://dx.doi.org/10.1093/database/bax008
work_keys_str_mv	AT veraalvarezroberto workflowandwebapplicationforannotatingncbibioprojecttranscriptomedata AT medeirosvidalnewton workflowandwebapplicationforannotatingncbibioprojecttranscriptomedata AT garzonmartinezginaa workflowandwebapplicationforannotatingncbibioprojecttranscriptomedata AT barreroluzs workflowandwebapplicationforannotatingncbibioprojecttranscriptomedata AT landsmandavid workflowandwebapplicationforannotatingncbibioprojecttranscriptomedata AT marinoramirezleonardo workflowandwebapplicationforannotatingncbibioprojecttranscriptomedata

Workflow and web application for annotating NCBI BioProject transcriptome data

Ejemplares similares