Cargando…

Accurate human microsatellite genotypes from high-throughput resequencing data using informed error profiles

Repetitive sequences are biologically and clinically important because they can influence traits and disease, but repeats are challenging to analyse using short-read sequencing technology. We present a tool for genotyping microsatellite repeats called RepeatSeq, which uses Bayesian model selection g...

Descripción completa

Detalles Bibliográficos
Autores principales: Highnam, Gareth, Franck, Christopher, Martin, Andy, Stephens, Calvin, Puthige, Ashwin, Mittelman, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592458/
https://www.ncbi.nlm.nih.gov/pubmed/23090981
http://dx.doi.org/10.1093/nar/gks981
_version_ 1782262120567537664
author Highnam, Gareth
Franck, Christopher
Martin, Andy
Stephens, Calvin
Puthige, Ashwin
Mittelman, David
author_facet Highnam, Gareth
Franck, Christopher
Martin, Andy
Stephens, Calvin
Puthige, Ashwin
Mittelman, David
author_sort Highnam, Gareth
collection PubMed
description Repetitive sequences are biologically and clinically important because they can influence traits and disease, but repeats are challenging to analyse using short-read sequencing technology. We present a tool for genotyping microsatellite repeats called RepeatSeq, which uses Bayesian model selection guided by an empirically derived error model that incorporates sequence and read properties. Next, we apply RepeatSeq to high-coverage genomes from the 1000 Genomes Project to evaluate performance and accuracy. The software uses common formats, such as VCF, for compatibility with existing genome analysis pipelines. Source code and binaries are available at http://github.com/adaptivegenome/repeatseq.
format Online
Article
Text
id pubmed-3592458
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-35924582013-03-08 Accurate human microsatellite genotypes from high-throughput resequencing data using informed error profiles Highnam, Gareth Franck, Christopher Martin, Andy Stephens, Calvin Puthige, Ashwin Mittelman, David Nucleic Acids Res Methods Online Repetitive sequences are biologically and clinically important because they can influence traits and disease, but repeats are challenging to analyse using short-read sequencing technology. We present a tool for genotyping microsatellite repeats called RepeatSeq, which uses Bayesian model selection guided by an empirically derived error model that incorporates sequence and read properties. Next, we apply RepeatSeq to high-coverage genomes from the 1000 Genomes Project to evaluate performance and accuracy. The software uses common formats, such as VCF, for compatibility with existing genome analysis pipelines. Source code and binaries are available at http://github.com/adaptivegenome/repeatseq. Oxford University Press 2013-01 2012-10-22 /pmc/articles/PMC3592458/ /pubmed/23090981 http://dx.doi.org/10.1093/nar/gks981 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com.
spellingShingle Methods Online
Highnam, Gareth
Franck, Christopher
Martin, Andy
Stephens, Calvin
Puthige, Ashwin
Mittelman, David
Accurate human microsatellite genotypes from high-throughput resequencing data using informed error profiles
title Accurate human microsatellite genotypes from high-throughput resequencing data using informed error profiles
title_full Accurate human microsatellite genotypes from high-throughput resequencing data using informed error profiles
title_fullStr Accurate human microsatellite genotypes from high-throughput resequencing data using informed error profiles
title_full_unstemmed Accurate human microsatellite genotypes from high-throughput resequencing data using informed error profiles
title_short Accurate human microsatellite genotypes from high-throughput resequencing data using informed error profiles
title_sort accurate human microsatellite genotypes from high-throughput resequencing data using informed error profiles
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592458/
https://www.ncbi.nlm.nih.gov/pubmed/23090981
http://dx.doi.org/10.1093/nar/gks981
work_keys_str_mv AT highnamgareth accuratehumanmicrosatellitegenotypesfromhighthroughputresequencingdatausinginformederrorprofiles
AT franckchristopher accuratehumanmicrosatellitegenotypesfromhighthroughputresequencingdatausinginformederrorprofiles
AT martinandy accuratehumanmicrosatellitegenotypesfromhighthroughputresequencingdatausinginformederrorprofiles
AT stephenscalvin accuratehumanmicrosatellitegenotypesfromhighthroughputresequencingdatausinginformederrorprofiles
AT puthigeashwin accuratehumanmicrosatellitegenotypesfromhighthroughputresequencingdatausinginformederrorprofiles
AT mittelmandavid accuratehumanmicrosatellitegenotypesfromhighthroughputresequencingdatausinginformederrorprofiles