Cargando…

MIPSTR: a method for multiplex genotyping of germline and somatic STR variation across many individuals

Short tandem repeats (STRs) are highly mutable genetic elements that often reside in regulatory and coding DNA. The cumulative evidence of genetic studies on individual STRs suggests that STR variation profoundly affects phenotype and contributes to trait heritability. Despite recent advances in seq...

Descripción completa

Detalles Bibliográficos
Autores principales: Carlson, Keisha D., Sudmant, Peter H., Press, Maximilian O., Eichler, Evan E., Shendure, Jay, Queitsch, Christine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4417122/
https://www.ncbi.nlm.nih.gov/pubmed/25659649
http://dx.doi.org/10.1101/gr.182212.114
_version_ 1782369309709828096
author Carlson, Keisha D.
Sudmant, Peter H.
Press, Maximilian O.
Eichler, Evan E.
Shendure, Jay
Queitsch, Christine
author_facet Carlson, Keisha D.
Sudmant, Peter H.
Press, Maximilian O.
Eichler, Evan E.
Shendure, Jay
Queitsch, Christine
author_sort Carlson, Keisha D.
collection PubMed
description Short tandem repeats (STRs) are highly mutable genetic elements that often reside in regulatory and coding DNA. The cumulative evidence of genetic studies on individual STRs suggests that STR variation profoundly affects phenotype and contributes to trait heritability. Despite recent advances in sequencing technology, STR variation has remained largely inaccessible across many individuals compared to single nucleotide variation or copy number variation. STR genotyping with short-read sequence data is confounded by (1) the difficulty of uniquely mapping short, low-complexity reads; and (2) the high rate of STR amplification stutter. Here, we present MIPSTR, a robust, scalable, and affordable method that addresses these challenges. MIPSTR uses targeted capture of STR loci by single-molecule Molecular Inversion Probes (smMIPs) and a unique mapping strategy. Targeted capture and our mapping strategy resolve the first challenge; the use of single molecule information resolves the second challenge. Unlike previous methods, MIPSTR is capable of distinguishing technical error due to amplification stutter from somatic STR mutations. In proof-of-principle experiments, we use MIPSTR to determine germline STR genotypes for 102 STR loci with high accuracy across diverse populations of the plant A. thaliana. We show that putatively functional STRs may be identified by deviation from predicted STR variation and by association with quantitative phenotypes. Using DNA mixing experiments and a mutant deficient in DNA repair, we demonstrate that MIPSTR can detect low-frequency somatic STR variants. MIPSTR is applicable to any organism with a high-quality reference genome and is scalable to genotyping many thousands of STR loci in thousands of individuals.
format Online
Article
Text
id pubmed-4417122
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-44171222015-11-01 MIPSTR: a method for multiplex genotyping of germline and somatic STR variation across many individuals Carlson, Keisha D. Sudmant, Peter H. Press, Maximilian O. Eichler, Evan E. Shendure, Jay Queitsch, Christine Genome Res Method Short tandem repeats (STRs) are highly mutable genetic elements that often reside in regulatory and coding DNA. The cumulative evidence of genetic studies on individual STRs suggests that STR variation profoundly affects phenotype and contributes to trait heritability. Despite recent advances in sequencing technology, STR variation has remained largely inaccessible across many individuals compared to single nucleotide variation or copy number variation. STR genotyping with short-read sequence data is confounded by (1) the difficulty of uniquely mapping short, low-complexity reads; and (2) the high rate of STR amplification stutter. Here, we present MIPSTR, a robust, scalable, and affordable method that addresses these challenges. MIPSTR uses targeted capture of STR loci by single-molecule Molecular Inversion Probes (smMIPs) and a unique mapping strategy. Targeted capture and our mapping strategy resolve the first challenge; the use of single molecule information resolves the second challenge. Unlike previous methods, MIPSTR is capable of distinguishing technical error due to amplification stutter from somatic STR mutations. In proof-of-principle experiments, we use MIPSTR to determine germline STR genotypes for 102 STR loci with high accuracy across diverse populations of the plant A. thaliana. We show that putatively functional STRs may be identified by deviation from predicted STR variation and by association with quantitative phenotypes. Using DNA mixing experiments and a mutant deficient in DNA repair, we demonstrate that MIPSTR can detect low-frequency somatic STR variants. MIPSTR is applicable to any organism with a high-quality reference genome and is scalable to genotyping many thousands of STR loci in thousands of individuals. Cold Spring Harbor Laboratory Press 2015-05 /pmc/articles/PMC4417122/ /pubmed/25659649 http://dx.doi.org/10.1101/gr.182212.114 Text en © 2015 Carlson et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Method
Carlson, Keisha D.
Sudmant, Peter H.
Press, Maximilian O.
Eichler, Evan E.
Shendure, Jay
Queitsch, Christine
MIPSTR: a method for multiplex genotyping of germline and somatic STR variation across many individuals
title MIPSTR: a method for multiplex genotyping of germline and somatic STR variation across many individuals
title_full MIPSTR: a method for multiplex genotyping of germline and somatic STR variation across many individuals
title_fullStr MIPSTR: a method for multiplex genotyping of germline and somatic STR variation across many individuals
title_full_unstemmed MIPSTR: a method for multiplex genotyping of germline and somatic STR variation across many individuals
title_short MIPSTR: a method for multiplex genotyping of germline and somatic STR variation across many individuals
title_sort mipstr: a method for multiplex genotyping of germline and somatic str variation across many individuals
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4417122/
https://www.ncbi.nlm.nih.gov/pubmed/25659649
http://dx.doi.org/10.1101/gr.182212.114
work_keys_str_mv AT carlsonkeishad mipstramethodformultiplexgenotypingofgermlineandsomaticstrvariationacrossmanyindividuals
AT sudmantpeterh mipstramethodformultiplexgenotypingofgermlineandsomaticstrvariationacrossmanyindividuals
AT pressmaximiliano mipstramethodformultiplexgenotypingofgermlineandsomaticstrvariationacrossmanyindividuals
AT eichlerevane mipstramethodformultiplexgenotypingofgermlineandsomaticstrvariationacrossmanyindividuals
AT shendurejay mipstramethodformultiplexgenotypingofgermlineandsomaticstrvariationacrossmanyindividuals
AT queitschchristine mipstramethodformultiplexgenotypingofgermlineandsomaticstrvariationacrossmanyindividuals