Cargando…
Next Generation Sequence Analysis and Computational Genomics Using Graphical Pipeline Workflows
Whole-genome and exome sequencing have already proven to be essential and powerful methods to identify genes responsible for simple Mendelian inherited disorders. These methods can be applied to complex disorders as well, and have been adopted as one of the current mainstream approaches in populatio...
Autores principales: | , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3490498/ https://www.ncbi.nlm.nih.gov/pubmed/23139896 http://dx.doi.org/10.3390/genes3030545 |
_version_ | 1782248853037121536 |
---|---|
author | Torri, Federica Dinov, Ivo D. Zamanyan, Alen Hobel, Sam Genco, Alex Petrosyan, Petros Clark, Andrew P. Liu, Zhizhong Eggert, Paul Pierce, Jonathan Knowles, James A. Ames, Joseph Kesselman, Carl Toga, Arthur W. Potkin, Steven G. Vawter, Marquis P. Macciardi, Fabio |
author_facet | Torri, Federica Dinov, Ivo D. Zamanyan, Alen Hobel, Sam Genco, Alex Petrosyan, Petros Clark, Andrew P. Liu, Zhizhong Eggert, Paul Pierce, Jonathan Knowles, James A. Ames, Joseph Kesselman, Carl Toga, Arthur W. Potkin, Steven G. Vawter, Marquis P. Macciardi, Fabio |
author_sort | Torri, Federica |
collection | PubMed |
description | Whole-genome and exome sequencing have already proven to be essential and powerful methods to identify genes responsible for simple Mendelian inherited disorders. These methods can be applied to complex disorders as well, and have been adopted as one of the current mainstream approaches in population genetics. These achievements have been made possible by next generation sequencing (NGS) technologies, which require substantial bioinformatics resources to analyze the dense and complex sequence data. The huge analytical burden of data from genome sequencing might be seen as a bottleneck slowing the publication of NGS papers at this time, especially in psychiatric genetics. We review the existing methods for processing NGS data, to place into context the rationale for the design of a computational resource. We describe our method, the Graphical Pipeline for Computational Genomics (GPCG), to perform the computational steps required to analyze NGS data. The GPCG implements flexible workflows for basic sequence alignment, sequence data quality control, single nucleotide polymorphism analysis, copy number variant identification, annotation, and visualization of results. These workflows cover all the analytical steps required for NGS data, from processing the raw reads to variant calling and annotation. The current version of the pipeline is freely available at http://pipeline.loni.ucla.edu. These applications of NGS analysis may gain clinical utility in the near future (e.g., identifying miRNA signatures in diseases) when the bioinformatics approach is made feasible. Taken together, the annotation tools and strategies that have been developed to retrieve information and test hypotheses about the functional role of variants present in the human genome will help to pinpoint the genetic risk factors for psychiatric disorders. |
format | Online Article Text |
id | pubmed-3490498 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-34904982012-11-06 Next Generation Sequence Analysis and Computational Genomics Using Graphical Pipeline Workflows Torri, Federica Dinov, Ivo D. Zamanyan, Alen Hobel, Sam Genco, Alex Petrosyan, Petros Clark, Andrew P. Liu, Zhizhong Eggert, Paul Pierce, Jonathan Knowles, James A. Ames, Joseph Kesselman, Carl Toga, Arthur W. Potkin, Steven G. Vawter, Marquis P. Macciardi, Fabio Genes (Basel) Article Whole-genome and exome sequencing have already proven to be essential and powerful methods to identify genes responsible for simple Mendelian inherited disorders. These methods can be applied to complex disorders as well, and have been adopted as one of the current mainstream approaches in population genetics. These achievements have been made possible by next generation sequencing (NGS) technologies, which require substantial bioinformatics resources to analyze the dense and complex sequence data. The huge analytical burden of data from genome sequencing might be seen as a bottleneck slowing the publication of NGS papers at this time, especially in psychiatric genetics. We review the existing methods for processing NGS data, to place into context the rationale for the design of a computational resource. We describe our method, the Graphical Pipeline for Computational Genomics (GPCG), to perform the computational steps required to analyze NGS data. The GPCG implements flexible workflows for basic sequence alignment, sequence data quality control, single nucleotide polymorphism analysis, copy number variant identification, annotation, and visualization of results. These workflows cover all the analytical steps required for NGS data, from processing the raw reads to variant calling and annotation. The current version of the pipeline is freely available at http://pipeline.loni.ucla.edu. These applications of NGS analysis may gain clinical utility in the near future (e.g., identifying miRNA signatures in diseases) when the bioinformatics approach is made feasible. Taken together, the annotation tools and strategies that have been developed to retrieve information and test hypotheses about the functional role of variants present in the human genome will help to pinpoint the genetic risk factors for psychiatric disorders. MDPI 2012-08-30 /pmc/articles/PMC3490498/ /pubmed/23139896 http://dx.doi.org/10.3390/genes3030545 Text en © 2012 by the authors; licensee MDPI, Basel, Switzerland. http://creativecommons.org/licenses/by/3.0/ This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/). |
spellingShingle | Article Torri, Federica Dinov, Ivo D. Zamanyan, Alen Hobel, Sam Genco, Alex Petrosyan, Petros Clark, Andrew P. Liu, Zhizhong Eggert, Paul Pierce, Jonathan Knowles, James A. Ames, Joseph Kesselman, Carl Toga, Arthur W. Potkin, Steven G. Vawter, Marquis P. Macciardi, Fabio Next Generation Sequence Analysis and Computational Genomics Using Graphical Pipeline Workflows |
title | Next Generation Sequence Analysis and Computational Genomics Using Graphical Pipeline Workflows |
title_full | Next Generation Sequence Analysis and Computational Genomics Using Graphical Pipeline Workflows |
title_fullStr | Next Generation Sequence Analysis and Computational Genomics Using Graphical Pipeline Workflows |
title_full_unstemmed | Next Generation Sequence Analysis and Computational Genomics Using Graphical Pipeline Workflows |
title_short | Next Generation Sequence Analysis and Computational Genomics Using Graphical Pipeline Workflows |
title_sort | next generation sequence analysis and computational genomics using graphical pipeline workflows |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3490498/ https://www.ncbi.nlm.nih.gov/pubmed/23139896 http://dx.doi.org/10.3390/genes3030545 |
work_keys_str_mv | AT torrifederica nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows AT dinovivod nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows AT zamanyanalen nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows AT hobelsam nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows AT gencoalex nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows AT petrosyanpetros nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows AT clarkandrewp nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows AT liuzhizhong nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows AT eggertpaul nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows AT piercejonathan nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows AT knowlesjamesa nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows AT amesjoseph nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows AT kesselmancarl nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows AT togaarthurw nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows AT potkinsteveng nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows AT vawtermarquisp nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows AT macciardifabio nextgenerationsequenceanalysisandcomputationalgenomicsusinggraphicalpipelineworkflows |