Cargando…

Next Generation Sequence Analysis and Computational Genomics Using Graphical Pipeline Workflows

Whole-genome and exome sequencing have already proven to be essential and powerful methods to identify genes responsible for simple Mendelian inherited disorders. These methods can be applied to complex disorders as well, and have been adopted as one of the current mainstream approaches in populatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Torri, Federica, Dinov, Ivo D., Zamanyan, Alen, Hobel, Sam, Genco, Alex, Petrosyan, Petros, Clark, Andrew P., Liu, Zhizhong, Eggert, Paul, Pierce, Jonathan, Knowles, James A., Ames, Joseph, Kesselman, Carl, Toga, Arthur W., Potkin, Steven G., Vawter, Marquis P., Macciardi, Fabio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3490498/
https://www.ncbi.nlm.nih.gov/pubmed/23139896
http://dx.doi.org/10.3390/genes3030545
_version_ 1782248853037121536
author Torri, Federica
Dinov, Ivo D.
Zamanyan, Alen
Hobel, Sam
Genco, Alex
Petrosyan, Petros
Clark, Andrew P.
Liu, Zhizhong
Eggert, Paul
Pierce, Jonathan
Knowles, James A.
Ames, Joseph
Kesselman, Carl
Toga, Arthur W.
Potkin, Steven G.
Vawter, Marquis P.
Macciardi, Fabio
author_facet Torri, Federica
Dinov, Ivo D.
Zamanyan, Alen
Hobel, Sam
Genco, Alex
Petrosyan, Petros
Clark, Andrew P.
Liu, Zhizhong
Eggert, Paul
Pierce, Jonathan
Knowles, James A.
Ames, Joseph
Kesselman, Carl
Toga, Arthur W.
Potkin, Steven G.
Vawter, Marquis P.
Macciardi, Fabio
author_sort Torri, Federica
collection PubMed
description Whole-genome and exome sequencing have already proven to be essential and powerful methods to identify genes responsible for simple Mendelian inherited disorders. These methods can be applied to complex disorders as well, and have been adopted as one of the current mainstream approaches in population genetics. These achievements have been made possible by next generation sequencing (NGS) technologies, which require substantial bioinformatics resources to analyze the dense and complex sequence data. The huge analytical burden of data from genome sequencing might be seen as a bottleneck slowing the publication of NGS papers at this time, especially in psychiatric genetics. We review the existing methods for processing NGS data, to place into context the rationale for the design of a computational resource. We describe our method, the Graphical Pipeline for Computational Genomics (GPCG), to perform the computational steps required to analyze NGS data. The GPCG implements flexible workflows for basic sequence alignment, sequence data quality control, single nucleotide polymorphism analysis, copy number variant identification, annotation, and visualization of results. These workflows cover all the analytical steps required for NGS data, from processing the raw reads to variant calling and annotation. The current version of the pipeline is freely available at http://pipeline.loni.ucla.edu. These applications of NGS analysis may gain clinical utility in the near future (e.g., identifying miRNA signatures in diseases) when the bioinformatics approach is made feasible. Taken together, the annotation tools and strategies that have been developed to retrieve information and test hypotheses about the functional role of variants present in the human genome will help to pinpoint the genetic risk factors for psychiatric disorders.
format Online
Article
Text
id pubmed-3490498
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-34904982012-11-06 Next Generation Sequence Analysis and Computational Genomics Using Graphical Pipeline Workflows Torri, Federica Dinov, Ivo D. Zamanyan, Alen Hobel, Sam Genco, Alex Petrosyan, Petros Clark, Andrew P. Liu, Zhizhong Eggert, Paul Pierce, Jonathan Knowles, James A. Ames, Joseph Kesselman, Carl Toga, Arthur W. Potkin, Steven G. Vawter, Marquis P. Macciardi, Fabio Genes (Basel) Article Whole-genome and exome sequencing have already proven to be essential and powerful methods to identify genes responsible for simple Mendelian inherited disorders. These methods can be applied to complex disorders as well, and have been adopted as one of the current mainstream approaches in population genetics. These achievements have been made possible by next generation sequencing (NGS) technologies, which require substantial bioinformatics resources to analyze the dense and complex sequence data. The huge analytical burden of data from genome sequencing might be seen as a bottleneck slowing the publication of NGS papers at this time, especially in psychiatric genetics. We review the existing methods for processing NGS data, to place into context the rationale for the design of a computational resource. We describe our method, the Graphical Pipeline for Computational Genomics (GPCG), to perform the computational steps required to analyze NGS data. The GPCG implements flexible workflows for basic sequence alignment, sequence data quality control, single nucleotide polymorphism analysis, copy number variant identification, annotation, and visualization of results. These workflows cover all the analytical steps required for NGS data, from processing the raw reads to variant calling and annotation. The current version of the pipeline is freely available at http://pipeline.loni.ucla.edu. These applications of NGS analysis may gain clinical utility in the near future (e.g., identifying miRNA signatures in diseases) when the bioinformatics approach is made feasible. Taken together, the annotation tools and strategies that have been developed to retrieve information and test hypotheses about the functional role of variants present in the human genome will help to pinpoint the genetic risk factors for psychiatric disorders. MDPI 2012-08-30 /pmc/articles/PMC3490498/ /pubmed/23139896 http://dx.doi.org/10.3390/genes3030545 Text en © 2012 by the authors; licensee MDPI, Basel, Switzerland. http://creativecommons.org/licenses/by/3.0/ This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Article
Torri, Federica
Dinov, Ivo D.
Zamanyan, Alen
Hobel, Sam
Genco, Alex
Petrosyan, Petros
Clark, Andrew P.
Liu, Zhizhong
Eggert, Paul
Pierce, Jonathan
Knowles, James A.
Ames, Joseph
Kesselman, Carl
Toga, Arthur W.
Potkin, Steven G.
Vawter, Marquis P.
Macciardi, Fabio
Next Generation Sequence Analysis and Computational Genomics Using Graphical Pipeline Workflows
title Next Generation Sequence Analysis and Computational Genomics Using Graphical Pipeline Workflows
title_full Next Generation Sequence Analysis and Computational Genomics Using Graphical Pipeline Workflows
title_fullStr Next Generation Sequence Analysis and Computational Genomics Using Graphical Pipeline Workflows
title_full_unstemmed Next Generation Sequence Analysis and Computational Genomics Using Graphical Pipeline Workflows
title_short Next Generation Sequence Analysis and Computational Genomics Using Graphical Pipeline Workflows
title_sort next generation sequence analysis and computational genomics using graphical pipeline workflows
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3490498/
https://www.ncbi.nlm.nih.gov/pubmed/23139896
http://dx.doi.org/10.3390/genes3030545
work_keys_str_mv AT torrifederica nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows
AT dinovivod nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows
AT zamanyanalen nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows
AT hobelsam nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows
AT gencoalex nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows
AT petrosyanpetros nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows
AT clarkandrewp nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows
AT liuzhizhong nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows
AT eggertpaul nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows
AT piercejonathan nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows
AT knowlesjamesa nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows
AT amesjoseph nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows
AT kesselmancarl nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows
AT togaarthurw nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows
AT potkinsteveng nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows
AT vawtermarquisp nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows
AT macciardifabio nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows