Cargando…
The Accuracy, Feasibility and Challenges of Sequencing Short Tandem Repeats Using Next-Generation Sequencing Platforms
To date we have little knowledge of how accurate next-generation sequencing (NGS) technologies are in sequencing repetitive sequences beyond known limitations to accurately sequence homopolymers. Only a handful of previous reports have evaluated the potential of NGS for sequencing short tandem repea...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4250034/ https://www.ncbi.nlm.nih.gov/pubmed/25436869 http://dx.doi.org/10.1371/journal.pone.0113862 |
_version_ | 1782346939360083968 |
---|---|
author | Zavodna, Monika Bagshaw, Andrew Brauning, Rudiger Gemmell, Neil J. |
author_facet | Zavodna, Monika Bagshaw, Andrew Brauning, Rudiger Gemmell, Neil J. |
author_sort | Zavodna, Monika |
collection | PubMed |
description | To date we have little knowledge of how accurate next-generation sequencing (NGS) technologies are in sequencing repetitive sequences beyond known limitations to accurately sequence homopolymers. Only a handful of previous reports have evaluated the potential of NGS for sequencing short tandem repeats (microsatellites) and no empirical study has compared and evaluated the performance of more than one NGS platform with the same dataset. Here we examined yeast microsatellite variants from both long-read (454-sequencing) and short-read (Illumina) NGS platforms and compared these to data derived through Sanger sequencing. In addition, we investigated any locus-specific biases and differences that might have resulted from variability in microsatellite repeat number, repeat motif or type of mutation. Out of 112 insertion/deletion variants identified among 45 microsatellite amplicons in our study, we found 87.5% agreement between the 454-platform and Sanger sequencing in frequency of variant detection after Benjamini-Hochberg correction for multiple tests. For a subset of 21 microsatellite amplicons derived from Illumina sequencing, the results of short-read platform were highly consistent with the other two platforms, with 100% agreement with 454-sequencing and 93.6% agreement with the Sanger method after Benjamini-Hochberg correction. We found that the microsatellite attributes copy number, repeat motif and type of mutation did not have a significant effect on differences seen between the sequencing platforms. We show that both long-read and short-read NGS platforms can be used to sequence short tandem repeats accurately, which makes it feasible to consider the use of these platforms in high-throughput genotyping. It appears the major requirement for achieving both high accuracy and rare variant detection in microsatellite genotyping is sufficient read depth coverage. This might be a challenge because each platform generates a consistent pattern of non-uniform sequence coverage, which, as our study suggests, may affect some types of tandem repeats more than others. |
format | Online Article Text |
id | pubmed-4250034 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-42500342014-12-05 The Accuracy, Feasibility and Challenges of Sequencing Short Tandem Repeats Using Next-Generation Sequencing Platforms Zavodna, Monika Bagshaw, Andrew Brauning, Rudiger Gemmell, Neil J. PLoS One Research Article To date we have little knowledge of how accurate next-generation sequencing (NGS) technologies are in sequencing repetitive sequences beyond known limitations to accurately sequence homopolymers. Only a handful of previous reports have evaluated the potential of NGS for sequencing short tandem repeats (microsatellites) and no empirical study has compared and evaluated the performance of more than one NGS platform with the same dataset. Here we examined yeast microsatellite variants from both long-read (454-sequencing) and short-read (Illumina) NGS platforms and compared these to data derived through Sanger sequencing. In addition, we investigated any locus-specific biases and differences that might have resulted from variability in microsatellite repeat number, repeat motif or type of mutation. Out of 112 insertion/deletion variants identified among 45 microsatellite amplicons in our study, we found 87.5% agreement between the 454-platform and Sanger sequencing in frequency of variant detection after Benjamini-Hochberg correction for multiple tests. For a subset of 21 microsatellite amplicons derived from Illumina sequencing, the results of short-read platform were highly consistent with the other two platforms, with 100% agreement with 454-sequencing and 93.6% agreement with the Sanger method after Benjamini-Hochberg correction. We found that the microsatellite attributes copy number, repeat motif and type of mutation did not have a significant effect on differences seen between the sequencing platforms. We show that both long-read and short-read NGS platforms can be used to sequence short tandem repeats accurately, which makes it feasible to consider the use of these platforms in high-throughput genotyping. It appears the major requirement for achieving both high accuracy and rare variant detection in microsatellite genotyping is sufficient read depth coverage. This might be a challenge because each platform generates a consistent pattern of non-uniform sequence coverage, which, as our study suggests, may affect some types of tandem repeats more than others. Public Library of Science 2014-12-01 /pmc/articles/PMC4250034/ /pubmed/25436869 http://dx.doi.org/10.1371/journal.pone.0113862 Text en © 2014 Zavodna et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Zavodna, Monika Bagshaw, Andrew Brauning, Rudiger Gemmell, Neil J. The Accuracy, Feasibility and Challenges of Sequencing Short Tandem Repeats Using Next-Generation Sequencing Platforms |
title | The Accuracy, Feasibility and Challenges of Sequencing Short Tandem Repeats Using Next-Generation Sequencing Platforms |
title_full | The Accuracy, Feasibility and Challenges of Sequencing Short Tandem Repeats Using Next-Generation Sequencing Platforms |
title_fullStr | The Accuracy, Feasibility and Challenges of Sequencing Short Tandem Repeats Using Next-Generation Sequencing Platforms |
title_full_unstemmed | The Accuracy, Feasibility and Challenges of Sequencing Short Tandem Repeats Using Next-Generation Sequencing Platforms |
title_short | The Accuracy, Feasibility and Challenges of Sequencing Short Tandem Repeats Using Next-Generation Sequencing Platforms |
title_sort | accuracy, feasibility and challenges of sequencing short tandem repeats using next-generation sequencing platforms |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4250034/ https://www.ncbi.nlm.nih.gov/pubmed/25436869 http://dx.doi.org/10.1371/journal.pone.0113862 |
work_keys_str_mv | AT zavodnamonika theaccuracyfeasibilityandchallengesofsequencingshorttandemrepeatsusingnextgenerationsequencingplatforms AT bagshawandrew theaccuracyfeasibilityandchallengesofsequencingshorttandemrepeatsusingnextgenerationsequencingplatforms AT brauningrudiger theaccuracyfeasibilityandchallengesofsequencingshorttandemrepeatsusingnextgenerationsequencingplatforms AT gemmellneilj theaccuracyfeasibilityandchallengesofsequencingshorttandemrepeatsusingnextgenerationsequencingplatforms AT zavodnamonika accuracyfeasibilityandchallengesofsequencingshorttandemrepeatsusingnextgenerationsequencingplatforms AT bagshawandrew accuracyfeasibilityandchallengesofsequencingshorttandemrepeatsusingnextgenerationsequencingplatforms AT brauningrudiger accuracyfeasibilityandchallengesofsequencingshorttandemrepeatsusingnextgenerationsequencingplatforms AT gemmellneilj accuracyfeasibilityandchallengesofsequencingshorttandemrepeatsusingnextgenerationsequencingplatforms |