Cargando…
MIPSTR: a method for multiplex genotyping of germline and somatic STR variation across many individuals
Short tandem repeats (STRs) are highly mutable genetic elements that often reside in regulatory and coding DNA. The cumulative evidence of genetic studies on individual STRs suggests that STR variation profoundly affects phenotype and contributes to trait heritability. Despite recent advances in seq...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory Press
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4417122/ https://www.ncbi.nlm.nih.gov/pubmed/25659649 http://dx.doi.org/10.1101/gr.182212.114 |
_version_ | 1782369309709828096 |
---|---|
author | Carlson, Keisha D. Sudmant, Peter H. Press, Maximilian O. Eichler, Evan E. Shendure, Jay Queitsch, Christine |
author_facet | Carlson, Keisha D. Sudmant, Peter H. Press, Maximilian O. Eichler, Evan E. Shendure, Jay Queitsch, Christine |
author_sort | Carlson, Keisha D. |
collection | PubMed |
description | Short tandem repeats (STRs) are highly mutable genetic elements that often reside in regulatory and coding DNA. The cumulative evidence of genetic studies on individual STRs suggests that STR variation profoundly affects phenotype and contributes to trait heritability. Despite recent advances in sequencing technology, STR variation has remained largely inaccessible across many individuals compared to single nucleotide variation or copy number variation. STR genotyping with short-read sequence data is confounded by (1) the difficulty of uniquely mapping short, low-complexity reads; and (2) the high rate of STR amplification stutter. Here, we present MIPSTR, a robust, scalable, and affordable method that addresses these challenges. MIPSTR uses targeted capture of STR loci by single-molecule Molecular Inversion Probes (smMIPs) and a unique mapping strategy. Targeted capture and our mapping strategy resolve the first challenge; the use of single molecule information resolves the second challenge. Unlike previous methods, MIPSTR is capable of distinguishing technical error due to amplification stutter from somatic STR mutations. In proof-of-principle experiments, we use MIPSTR to determine germline STR genotypes for 102 STR loci with high accuracy across diverse populations of the plant A. thaliana. We show that putatively functional STRs may be identified by deviation from predicted STR variation and by association with quantitative phenotypes. Using DNA mixing experiments and a mutant deficient in DNA repair, we demonstrate that MIPSTR can detect low-frequency somatic STR variants. MIPSTR is applicable to any organism with a high-quality reference genome and is scalable to genotyping many thousands of STR loci in thousands of individuals. |
format | Online Article Text |
id | pubmed-4417122 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Cold Spring Harbor Laboratory Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-44171222015-11-01 MIPSTR: a method for multiplex genotyping of germline and somatic STR variation across many individuals Carlson, Keisha D. Sudmant, Peter H. Press, Maximilian O. Eichler, Evan E. Shendure, Jay Queitsch, Christine Genome Res Method Short tandem repeats (STRs) are highly mutable genetic elements that often reside in regulatory and coding DNA. The cumulative evidence of genetic studies on individual STRs suggests that STR variation profoundly affects phenotype and contributes to trait heritability. Despite recent advances in sequencing technology, STR variation has remained largely inaccessible across many individuals compared to single nucleotide variation or copy number variation. STR genotyping with short-read sequence data is confounded by (1) the difficulty of uniquely mapping short, low-complexity reads; and (2) the high rate of STR amplification stutter. Here, we present MIPSTR, a robust, scalable, and affordable method that addresses these challenges. MIPSTR uses targeted capture of STR loci by single-molecule Molecular Inversion Probes (smMIPs) and a unique mapping strategy. Targeted capture and our mapping strategy resolve the first challenge; the use of single molecule information resolves the second challenge. Unlike previous methods, MIPSTR is capable of distinguishing technical error due to amplification stutter from somatic STR mutations. In proof-of-principle experiments, we use MIPSTR to determine germline STR genotypes for 102 STR loci with high accuracy across diverse populations of the plant A. thaliana. We show that putatively functional STRs may be identified by deviation from predicted STR variation and by association with quantitative phenotypes. Using DNA mixing experiments and a mutant deficient in DNA repair, we demonstrate that MIPSTR can detect low-frequency somatic STR variants. MIPSTR is applicable to any organism with a high-quality reference genome and is scalable to genotyping many thousands of STR loci in thousands of individuals. Cold Spring Harbor Laboratory Press 2015-05 /pmc/articles/PMC4417122/ /pubmed/25659649 http://dx.doi.org/10.1101/gr.182212.114 Text en © 2015 Carlson et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/. |
spellingShingle | Method Carlson, Keisha D. Sudmant, Peter H. Press, Maximilian O. Eichler, Evan E. Shendure, Jay Queitsch, Christine MIPSTR: a method for multiplex genotyping of germline and somatic STR variation across many individuals |
title | MIPSTR: a method for multiplex genotyping of germline and somatic STR variation across many individuals |
title_full | MIPSTR: a method for multiplex genotyping of germline and somatic STR variation across many individuals |
title_fullStr | MIPSTR: a method for multiplex genotyping of germline and somatic STR variation across many individuals |
title_full_unstemmed | MIPSTR: a method for multiplex genotyping of germline and somatic STR variation across many individuals |
title_short | MIPSTR: a method for multiplex genotyping of germline and somatic STR variation across many individuals |
title_sort | mipstr: a method for multiplex genotyping of germline and somatic str variation across many individuals |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4417122/ https://www.ncbi.nlm.nih.gov/pubmed/25659649 http://dx.doi.org/10.1101/gr.182212.114 |
work_keys_str_mv | AT carlsonkeishad mipstramethodformultiplexgenotypingofgermlineandsomaticstrvariationacrossmanyindividuals AT sudmantpeterh mipstramethodformultiplexgenotypingofgermlineandsomaticstrvariationacrossmanyindividuals AT pressmaximiliano mipstramethodformultiplexgenotypingofgermlineandsomaticstrvariationacrossmanyindividuals AT eichlerevane mipstramethodformultiplexgenotypingofgermlineandsomaticstrvariationacrossmanyindividuals AT shendurejay mipstramethodformultiplexgenotypingofgermlineandsomaticstrvariationacrossmanyindividuals AT queitschchristine mipstramethodformultiplexgenotypingofgermlineandsomaticstrvariationacrossmanyindividuals |