Cargando…

SPEARS: Standard Performance Evaluation of Ancestral haplotype Reconstruction through Simulation

MOTIVATION: Ancestral haplotype maps provide useful information about genomic variation and insights into biological processes. Reconstructing the descendent haplotype structure of homologous chromosomes, particularly for large numbers of individuals, can help with characterizing the recombination l...

Descripción completa

Detalles Bibliográficos
Autores principales: Manching, Heather, Wisser, Randall J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8097754/
https://www.ncbi.nlm.nih.gov/pubmed/32840564
http://dx.doi.org/10.1093/bioinformatics/btaa749
_version_ 1783688376430362624
author Manching, Heather
Wisser, Randall J
author_facet Manching, Heather
Wisser, Randall J
author_sort Manching, Heather
collection PubMed
description MOTIVATION: Ancestral haplotype maps provide useful information about genomic variation and insights into biological processes. Reconstructing the descendent haplotype structure of homologous chromosomes, particularly for large numbers of individuals, can help with characterizing the recombination landscape, elucidating genotype-to-phenotype relationships, improving genomic predictions and more. Inferring haplotype maps from sparse genotype data is an efficient approach to whole-genome haplotyping, but this is a non-trivial problem. A standardized approach is needed to validate whether haplotype reconstruction software, conceived population designs and existing data for a given population provides accurate haplotype information for further inference. RESULTS: We introduce SPEARS, a pipeline for the simulation-based appraisal of genome-wide haplotype maps constructed from sparse genotype data. Using a specified pedigree, the pipeline generates virtual genotypes (known data) with genotyping errors and missing data structure. It then proceeds to mimic analysis in practice, capturing sources of error due to genotyping, imputation and haplotype inference. Standard metrics allow researchers to assess different population designs and which features of haplotype structure or regions of the genome are sufficiently accurate for analysis. Haplotype maps for 1000 outcross progeny from a multi-parent population of maize are used to demonstrate SPEARS. AVAILABILITYAND IMPLEMENTATION: SPEARS, the protocol and suite of scripts, are publicly available under an MIT license at GitHub (https://github.com/maizeatlas/spears). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8097754
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-80977542021-05-10 SPEARS: Standard Performance Evaluation of Ancestral haplotype Reconstruction through Simulation Manching, Heather Wisser, Randall J Bioinformatics Applications Notes MOTIVATION: Ancestral haplotype maps provide useful information about genomic variation and insights into biological processes. Reconstructing the descendent haplotype structure of homologous chromosomes, particularly for large numbers of individuals, can help with characterizing the recombination landscape, elucidating genotype-to-phenotype relationships, improving genomic predictions and more. Inferring haplotype maps from sparse genotype data is an efficient approach to whole-genome haplotyping, but this is a non-trivial problem. A standardized approach is needed to validate whether haplotype reconstruction software, conceived population designs and existing data for a given population provides accurate haplotype information for further inference. RESULTS: We introduce SPEARS, a pipeline for the simulation-based appraisal of genome-wide haplotype maps constructed from sparse genotype data. Using a specified pedigree, the pipeline generates virtual genotypes (known data) with genotyping errors and missing data structure. It then proceeds to mimic analysis in practice, capturing sources of error due to genotyping, imputation and haplotype inference. Standard metrics allow researchers to assess different population designs and which features of haplotype structure or regions of the genome are sufficiently accurate for analysis. Haplotype maps for 1000 outcross progeny from a multi-parent population of maize are used to demonstrate SPEARS. AVAILABILITYAND IMPLEMENTATION: SPEARS, the protocol and suite of scripts, are publicly available under an MIT license at GitHub (https://github.com/maizeatlas/spears). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-08-25 /pmc/articles/PMC8097754/ /pubmed/32840564 http://dx.doi.org/10.1093/bioinformatics/btaa749 Text en © The Author(s) 2020. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Notes
Manching, Heather
Wisser, Randall J
SPEARS: Standard Performance Evaluation of Ancestral haplotype Reconstruction through Simulation
title SPEARS: Standard Performance Evaluation of Ancestral haplotype Reconstruction through Simulation
title_full SPEARS: Standard Performance Evaluation of Ancestral haplotype Reconstruction through Simulation
title_fullStr SPEARS: Standard Performance Evaluation of Ancestral haplotype Reconstruction through Simulation
title_full_unstemmed SPEARS: Standard Performance Evaluation of Ancestral haplotype Reconstruction through Simulation
title_short SPEARS: Standard Performance Evaluation of Ancestral haplotype Reconstruction through Simulation
title_sort spears: standard performance evaluation of ancestral haplotype reconstruction through simulation
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8097754/
https://www.ncbi.nlm.nih.gov/pubmed/32840564
http://dx.doi.org/10.1093/bioinformatics/btaa749
work_keys_str_mv AT manchingheather spearsstandardperformanceevaluationofancestralhaplotypereconstructionthroughsimulation
AT wisserrandallj spearsstandardperformanceevaluationofancestralhaplotypereconstructionthroughsimulation