Cargando…

EPIGEN-Brazil Initiative resources: a Latin American imputation panel and the Scientific Workflow

EPIGEN-Brazil is one of the largest Latin American initiatives at the interface of human genomics, public health, and computational biology. Here, we present two resources to address two challenges to the global dissemination of precision medicine and the development of the bioinformatics know-how t...

Descripción completa

Detalles Bibliográficos
Autores principales: Magalhães, Wagner C.S., Araujo, Nathalia M., Leal, Thiago P., Araujo, Gilderlanio S., Viriato, Paula J.S., Kehdy, Fernanda S., Costa, Gustavo N., Barreto, Mauricio L., Horta, Bernardo L., Lima-Costa, Maria Fernanda, Pereira, Alexandre C., Tarazona-Santos, Eduardo, Rodrigues, Maíra R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6028131/
https://www.ncbi.nlm.nih.gov/pubmed/29903722
http://dx.doi.org/10.1101/gr.225458.117
_version_ 1783336718833811456
author Magalhães, Wagner C.S.
Araujo, Nathalia M.
Leal, Thiago P.
Araujo, Gilderlanio S.
Viriato, Paula J.S.
Kehdy, Fernanda S.
Costa, Gustavo N.
Barreto, Mauricio L.
Horta, Bernardo L.
Lima-Costa, Maria Fernanda
Pereira, Alexandre C.
Tarazona-Santos, Eduardo
Rodrigues, Maíra R.
author_facet Magalhães, Wagner C.S.
Araujo, Nathalia M.
Leal, Thiago P.
Araujo, Gilderlanio S.
Viriato, Paula J.S.
Kehdy, Fernanda S.
Costa, Gustavo N.
Barreto, Mauricio L.
Horta, Bernardo L.
Lima-Costa, Maria Fernanda
Pereira, Alexandre C.
Tarazona-Santos, Eduardo
Rodrigues, Maíra R.
author_sort Magalhães, Wagner C.S.
collection PubMed
description EPIGEN-Brazil is one of the largest Latin American initiatives at the interface of human genomics, public health, and computational biology. Here, we present two resources to address two challenges to the global dissemination of precision medicine and the development of the bioinformatics know-how to support it. To address the underrepresentation of non-European individuals in human genome diversity studies, we present the EPIGEN-5M+1KGP imputation panel—the fusion of the public 1000 Genomes Project (1KGP) Phase 3 imputation panel with haplotypes derived from the EPIGEN-5M data set (a product of the genotyping of 4.3 million SNPs in 265 admixed individuals from the EPIGEN-Brazil Initiative). When we imputed a target SNPs data set (6487 admixed individuals genotyped for 2.2 million SNPs from the EPIGEN-Brazil project) with the EPIGEN-5M+1KGP panel, we gained 140,452 more SNPs in total than when using the 1KGP Phase 3 panel alone and 788,873 additional high confidence SNPs (info score ≥ 0.8). Thus, the major effect of the inclusion of the EPIGEN-5M data set in this new imputation panel is not only to gain more SNPs but also to improve the quality of imputation. To address the lack of transparency and reproducibility of bioinformatics protocols, we present a conceptual Scientific Workflow in the form of a website that models the scientific process (by including publications, flowcharts, masterscripts, documents, and bioinformatics protocols), making it accessible and interactive. Its applicability is shown in the context of the development of our EPIGEN-5M+1KGP imputation panel. The Scientific Workflow also serves as a repository of bioinformatics resources.
format Online
Article
Text
id pubmed-6028131
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-60281312019-01-01 EPIGEN-Brazil Initiative resources: a Latin American imputation panel and the Scientific Workflow Magalhães, Wagner C.S. Araujo, Nathalia M. Leal, Thiago P. Araujo, Gilderlanio S. Viriato, Paula J.S. Kehdy, Fernanda S. Costa, Gustavo N. Barreto, Mauricio L. Horta, Bernardo L. Lima-Costa, Maria Fernanda Pereira, Alexandre C. Tarazona-Santos, Eduardo Rodrigues, Maíra R. Genome Res Resource EPIGEN-Brazil is one of the largest Latin American initiatives at the interface of human genomics, public health, and computational biology. Here, we present two resources to address two challenges to the global dissemination of precision medicine and the development of the bioinformatics know-how to support it. To address the underrepresentation of non-European individuals in human genome diversity studies, we present the EPIGEN-5M+1KGP imputation panel—the fusion of the public 1000 Genomes Project (1KGP) Phase 3 imputation panel with haplotypes derived from the EPIGEN-5M data set (a product of the genotyping of 4.3 million SNPs in 265 admixed individuals from the EPIGEN-Brazil Initiative). When we imputed a target SNPs data set (6487 admixed individuals genotyped for 2.2 million SNPs from the EPIGEN-Brazil project) with the EPIGEN-5M+1KGP panel, we gained 140,452 more SNPs in total than when using the 1KGP Phase 3 panel alone and 788,873 additional high confidence SNPs (info score ≥ 0.8). Thus, the major effect of the inclusion of the EPIGEN-5M data set in this new imputation panel is not only to gain more SNPs but also to improve the quality of imputation. To address the lack of transparency and reproducibility of bioinformatics protocols, we present a conceptual Scientific Workflow in the form of a website that models the scientific process (by including publications, flowcharts, masterscripts, documents, and bioinformatics protocols), making it accessible and interactive. Its applicability is shown in the context of the development of our EPIGEN-5M+1KGP imputation panel. The Scientific Workflow also serves as a repository of bioinformatics resources. Cold Spring Harbor Laboratory Press 2018-07 /pmc/articles/PMC6028131/ /pubmed/29903722 http://dx.doi.org/10.1101/gr.225458.117 Text en © 2018 Magalhães et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Resource
Magalhães, Wagner C.S.
Araujo, Nathalia M.
Leal, Thiago P.
Araujo, Gilderlanio S.
Viriato, Paula J.S.
Kehdy, Fernanda S.
Costa, Gustavo N.
Barreto, Mauricio L.
Horta, Bernardo L.
Lima-Costa, Maria Fernanda
Pereira, Alexandre C.
Tarazona-Santos, Eduardo
Rodrigues, Maíra R.
EPIGEN-Brazil Initiative resources: a Latin American imputation panel and the Scientific Workflow
title EPIGEN-Brazil Initiative resources: a Latin American imputation panel and the Scientific Workflow
title_full EPIGEN-Brazil Initiative resources: a Latin American imputation panel and the Scientific Workflow
title_fullStr EPIGEN-Brazil Initiative resources: a Latin American imputation panel and the Scientific Workflow
title_full_unstemmed EPIGEN-Brazil Initiative resources: a Latin American imputation panel and the Scientific Workflow
title_short EPIGEN-Brazil Initiative resources: a Latin American imputation panel and the Scientific Workflow
title_sort epigen-brazil initiative resources: a latin american imputation panel and the scientific workflow
topic Resource
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6028131/
https://www.ncbi.nlm.nih.gov/pubmed/29903722
http://dx.doi.org/10.1101/gr.225458.117
work_keys_str_mv AT magalhaeswagnercs epigenbrazilinitiativeresourcesalatinamericanimputationpanelandthescientificworkflow
AT araujonathaliam epigenbrazilinitiativeresourcesalatinamericanimputationpanelandthescientificworkflow
AT lealthiagop epigenbrazilinitiativeresourcesalatinamericanimputationpanelandthescientificworkflow
AT araujogilderlanios epigenbrazilinitiativeresourcesalatinamericanimputationpanelandthescientificworkflow
AT viriatopaulajs epigenbrazilinitiativeresourcesalatinamericanimputationpanelandthescientificworkflow
AT kehdyfernandas epigenbrazilinitiativeresourcesalatinamericanimputationpanelandthescientificworkflow
AT costagustavon epigenbrazilinitiativeresourcesalatinamericanimputationpanelandthescientificworkflow
AT barretomauriciol epigenbrazilinitiativeresourcesalatinamericanimputationpanelandthescientificworkflow
AT hortabernardol epigenbrazilinitiativeresourcesalatinamericanimputationpanelandthescientificworkflow
AT limacostamariafernanda epigenbrazilinitiativeresourcesalatinamericanimputationpanelandthescientificworkflow
AT pereiraalexandrec epigenbrazilinitiativeresourcesalatinamericanimputationpanelandthescientificworkflow
AT tarazonasantoseduardo epigenbrazilinitiativeresourcesalatinamericanimputationpanelandthescientificworkflow
AT rodriguesmairar epigenbrazilinitiativeresourcesalatinamericanimputationpanelandthescientificworkflow
AT epigenbrazilinitiativeresourcesalatinamericanimputationpanelandthescientificworkflow