Cargando…
Accurate human microsatellite genotypes from high-throughput resequencing data using informed error profiles
Repetitive sequences are biologically and clinically important because they can influence traits and disease, but repeats are challenging to analyse using short-read sequencing technology. We present a tool for genotyping microsatellite repeats called RepeatSeq, which uses Bayesian model selection g...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592458/ https://www.ncbi.nlm.nih.gov/pubmed/23090981 http://dx.doi.org/10.1093/nar/gks981 |
_version_ | 1782262120567537664 |
---|---|
author | Highnam, Gareth Franck, Christopher Martin, Andy Stephens, Calvin Puthige, Ashwin Mittelman, David |
author_facet | Highnam, Gareth Franck, Christopher Martin, Andy Stephens, Calvin Puthige, Ashwin Mittelman, David |
author_sort | Highnam, Gareth |
collection | PubMed |
description | Repetitive sequences are biologically and clinically important because they can influence traits and disease, but repeats are challenging to analyse using short-read sequencing technology. We present a tool for genotyping microsatellite repeats called RepeatSeq, which uses Bayesian model selection guided by an empirically derived error model that incorporates sequence and read properties. Next, we apply RepeatSeq to high-coverage genomes from the 1000 Genomes Project to evaluate performance and accuracy. The software uses common formats, such as VCF, for compatibility with existing genome analysis pipelines. Source code and binaries are available at http://github.com/adaptivegenome/repeatseq. |
format | Online Article Text |
id | pubmed-3592458 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-35924582013-03-08 Accurate human microsatellite genotypes from high-throughput resequencing data using informed error profiles Highnam, Gareth Franck, Christopher Martin, Andy Stephens, Calvin Puthige, Ashwin Mittelman, David Nucleic Acids Res Methods Online Repetitive sequences are biologically and clinically important because they can influence traits and disease, but repeats are challenging to analyse using short-read sequencing technology. We present a tool for genotyping microsatellite repeats called RepeatSeq, which uses Bayesian model selection guided by an empirically derived error model that incorporates sequence and read properties. Next, we apply RepeatSeq to high-coverage genomes from the 1000 Genomes Project to evaluate performance and accuracy. The software uses common formats, such as VCF, for compatibility with existing genome analysis pipelines. Source code and binaries are available at http://github.com/adaptivegenome/repeatseq. Oxford University Press 2013-01 2012-10-22 /pmc/articles/PMC3592458/ /pubmed/23090981 http://dx.doi.org/10.1093/nar/gks981 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com. |
spellingShingle | Methods Online Highnam, Gareth Franck, Christopher Martin, Andy Stephens, Calvin Puthige, Ashwin Mittelman, David Accurate human microsatellite genotypes from high-throughput resequencing data using informed error profiles |
title | Accurate human microsatellite genotypes from high-throughput resequencing data using informed error profiles |
title_full | Accurate human microsatellite genotypes from high-throughput resequencing data using informed error profiles |
title_fullStr | Accurate human microsatellite genotypes from high-throughput resequencing data using informed error profiles |
title_full_unstemmed | Accurate human microsatellite genotypes from high-throughput resequencing data using informed error profiles |
title_short | Accurate human microsatellite genotypes from high-throughput resequencing data using informed error profiles |
title_sort | accurate human microsatellite genotypes from high-throughput resequencing data using informed error profiles |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592458/ https://www.ncbi.nlm.nih.gov/pubmed/23090981 http://dx.doi.org/10.1093/nar/gks981 |
work_keys_str_mv | AT highnamgareth accuratehumanmicrosatellitegenotypesfromhighthroughputresequencingdatausinginformederrorprofiles AT franckchristopher accuratehumanmicrosatellitegenotypesfromhighthroughputresequencingdatausinginformederrorprofiles AT martinandy accuratehumanmicrosatellitegenotypesfromhighthroughputresequencingdatausinginformederrorprofiles AT stephenscalvin accuratehumanmicrosatellitegenotypesfromhighthroughputresequencingdatausinginformederrorprofiles AT puthigeashwin accuratehumanmicrosatellitegenotypesfromhighthroughputresequencingdatausinginformederrorprofiles AT mittelmandavid accuratehumanmicrosatellitegenotypesfromhighthroughputresequencingdatausinginformederrorprofiles |