Cargando…

RICOPILI: Rapid Imputation for COnsortias PIpeLIne

SUMMARY: Genome-wide association study (GWAS) analyses, at sufficient sample sizes and power, have successfully revealed biological insights for several complex traits. RICOPILI, an open-sourced Perl-based pipeline was developed to address the challenges of rapidly processing large-scale multi-cohor...

Descripción completa

Detalles Bibliográficos
Autores principales: Lam, Max, Awasthi, Swapnil, Watson, Hunna J, Goldstein, Jackie, Panagiotaropoulou, Georgia, Trubetskoy, Vassily, Karlsson, Robert, Frei, Oleksander, Fan, Chun-Chieh, De Witte, Ward, Mota, Nina R, Mullins, Niamh, Brügger, Kim, Lee, S Hong, Wray, Naomi R, Skarabis, Nora, Huang, Hailiang, Neale, Benjamin, Daly, Mark J, Mattheisen, Manuel, Walters, Raymond, Ripke, Stephan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7868045/
https://www.ncbi.nlm.nih.gov/pubmed/31393554
http://dx.doi.org/10.1093/bioinformatics/btz633
_version_ 1783648390799687680
author Lam, Max
Awasthi, Swapnil
Watson, Hunna J
Goldstein, Jackie
Panagiotaropoulou, Georgia
Trubetskoy, Vassily
Karlsson, Robert
Frei, Oleksander
Fan, Chun-Chieh
De Witte, Ward
Mota, Nina R
Mullins, Niamh
Brügger, Kim
Lee, S Hong
Wray, Naomi R
Skarabis, Nora
Huang, Hailiang
Neale, Benjamin
Daly, Mark J
Mattheisen, Manuel
Walters, Raymond
Ripke, Stephan
author_facet Lam, Max
Awasthi, Swapnil
Watson, Hunna J
Goldstein, Jackie
Panagiotaropoulou, Georgia
Trubetskoy, Vassily
Karlsson, Robert
Frei, Oleksander
Fan, Chun-Chieh
De Witte, Ward
Mota, Nina R
Mullins, Niamh
Brügger, Kim
Lee, S Hong
Wray, Naomi R
Skarabis, Nora
Huang, Hailiang
Neale, Benjamin
Daly, Mark J
Mattheisen, Manuel
Walters, Raymond
Ripke, Stephan
author_sort Lam, Max
collection PubMed
description SUMMARY: Genome-wide association study (GWAS) analyses, at sufficient sample sizes and power, have successfully revealed biological insights for several complex traits. RICOPILI, an open-sourced Perl-based pipeline was developed to address the challenges of rapidly processing large-scale multi-cohort GWAS studies including quality control (QC), imputation and downstream analyses. The pipeline is computationally efficient with portability to a wide range of high-performance computing environments. RICOPILI was created as the Psychiatric Genomics Consortium pipeline for GWAS and adopted by other users. The pipeline features (i) technical and genomic QC in case-control and trio cohorts, (ii) genome-wide phasing and imputation, (iv) association analysis, (v) meta-analysis, (vi) polygenic risk scoring and (vii) replication analysis. Notably, a major differentiator from other GWAS pipelines, RICOPILI leverages on automated parallelization and cluster job management approaches for rapid production of imputed genome-wide data. A comprehensive meta-analysis of simulated GWAS data has been incorporated demonstrating each step of the pipeline. This includes all the associated visualization plots, to allow ease of data interpretation and manuscript preparation. Simulated GWAS datasets are also packaged with the pipeline for user training tutorials and developer work. AVAILABILITY AND IMPLEMENTATION: RICOPILI has a flexible architecture to allow for ongoing development and incorporation of newer available algorithms and is adaptable to various HPC environments (QSUB, BSUB, SLURM and others). Specific links for genomic resources are either directly provided in this paper or via tutorials and external links. The central location hosting scripts and tutorials is found at this URL: https://sites.google.com/a/broadinstitute.org/RICOPILI/home SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-7868045
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-78680452021-02-10 RICOPILI: Rapid Imputation for COnsortias PIpeLIne Lam, Max Awasthi, Swapnil Watson, Hunna J Goldstein, Jackie Panagiotaropoulou, Georgia Trubetskoy, Vassily Karlsson, Robert Frei, Oleksander Fan, Chun-Chieh De Witte, Ward Mota, Nina R Mullins, Niamh Brügger, Kim Lee, S Hong Wray, Naomi R Skarabis, Nora Huang, Hailiang Neale, Benjamin Daly, Mark J Mattheisen, Manuel Walters, Raymond Ripke, Stephan Bioinformatics Applications Note SUMMARY: Genome-wide association study (GWAS) analyses, at sufficient sample sizes and power, have successfully revealed biological insights for several complex traits. RICOPILI, an open-sourced Perl-based pipeline was developed to address the challenges of rapidly processing large-scale multi-cohort GWAS studies including quality control (QC), imputation and downstream analyses. The pipeline is computationally efficient with portability to a wide range of high-performance computing environments. RICOPILI was created as the Psychiatric Genomics Consortium pipeline for GWAS and adopted by other users. The pipeline features (i) technical and genomic QC in case-control and trio cohorts, (ii) genome-wide phasing and imputation, (iv) association analysis, (v) meta-analysis, (vi) polygenic risk scoring and (vii) replication analysis. Notably, a major differentiator from other GWAS pipelines, RICOPILI leverages on automated parallelization and cluster job management approaches for rapid production of imputed genome-wide data. A comprehensive meta-analysis of simulated GWAS data has been incorporated demonstrating each step of the pipeline. This includes all the associated visualization plots, to allow ease of data interpretation and manuscript preparation. Simulated GWAS datasets are also packaged with the pipeline for user training tutorials and developer work. AVAILABILITY AND IMPLEMENTATION: RICOPILI has a flexible architecture to allow for ongoing development and incorporation of newer available algorithms and is adaptable to various HPC environments (QSUB, BSUB, SLURM and others). Specific links for genomic resources are either directly provided in this paper or via tutorials and external links. The central location hosting scripts and tutorials is found at this URL: https://sites.google.com/a/broadinstitute.org/RICOPILI/home SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-08-08 /pmc/articles/PMC7868045/ /pubmed/31393554 http://dx.doi.org/10.1093/bioinformatics/btz633 Text en © The Author(s) 2019. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Applications Note
Lam, Max
Awasthi, Swapnil
Watson, Hunna J
Goldstein, Jackie
Panagiotaropoulou, Georgia
Trubetskoy, Vassily
Karlsson, Robert
Frei, Oleksander
Fan, Chun-Chieh
De Witte, Ward
Mota, Nina R
Mullins, Niamh
Brügger, Kim
Lee, S Hong
Wray, Naomi R
Skarabis, Nora
Huang, Hailiang
Neale, Benjamin
Daly, Mark J
Mattheisen, Manuel
Walters, Raymond
Ripke, Stephan
RICOPILI: Rapid Imputation for COnsortias PIpeLIne
title RICOPILI: Rapid Imputation for COnsortias PIpeLIne
title_full RICOPILI: Rapid Imputation for COnsortias PIpeLIne
title_fullStr RICOPILI: Rapid Imputation for COnsortias PIpeLIne
title_full_unstemmed RICOPILI: Rapid Imputation for COnsortias PIpeLIne
title_short RICOPILI: Rapid Imputation for COnsortias PIpeLIne
title_sort ricopili: rapid imputation for consortias pipeline
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7868045/
https://www.ncbi.nlm.nih.gov/pubmed/31393554
http://dx.doi.org/10.1093/bioinformatics/btz633
work_keys_str_mv AT lammax ricopilirapidimputationforconsortiaspipeline
AT awasthiswapnil ricopilirapidimputationforconsortiaspipeline
AT watsonhunnaj ricopilirapidimputationforconsortiaspipeline
AT goldsteinjackie ricopilirapidimputationforconsortiaspipeline
AT panagiotaropoulougeorgia ricopilirapidimputationforconsortiaspipeline
AT trubetskoyvassily ricopilirapidimputationforconsortiaspipeline
AT karlssonrobert ricopilirapidimputationforconsortiaspipeline
AT freioleksander ricopilirapidimputationforconsortiaspipeline
AT fanchunchieh ricopilirapidimputationforconsortiaspipeline
AT dewitteward ricopilirapidimputationforconsortiaspipeline
AT motaninar ricopilirapidimputationforconsortiaspipeline
AT mullinsniamh ricopilirapidimputationforconsortiaspipeline
AT bruggerkim ricopilirapidimputationforconsortiaspipeline
AT leeshong ricopilirapidimputationforconsortiaspipeline
AT wraynaomir ricopilirapidimputationforconsortiaspipeline
AT skarabisnora ricopilirapidimputationforconsortiaspipeline
AT huanghailiang ricopilirapidimputationforconsortiaspipeline
AT nealebenjamin ricopilirapidimputationforconsortiaspipeline
AT dalymarkj ricopilirapidimputationforconsortiaspipeline
AT mattheisenmanuel ricopilirapidimputationforconsortiaspipeline
AT waltersraymond ricopilirapidimputationforconsortiaspipeline
AT ripkestephan ricopilirapidimputationforconsortiaspipeline