Cargando…
Interrogating the “unsequenceable” genomic trinucleotide repeat disorders by long-read sequencing
Microsatellite expansion, such as trinucleotide repeat expansion (TRE), is known to cause a number of genetic diseases. Sanger sequencing and next-generation short-read sequencing are unable to interrogate TRE reliably. We developed a novel algorithm called RepeatHMM to estimate repeat counts from l...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5514472/ https://www.ncbi.nlm.nih.gov/pubmed/28720120 http://dx.doi.org/10.1186/s13073-017-0456-7 |
_version_ | 1783250844442951680 |
---|---|
author | Liu, Qian Zhang, Peng Wang, Depeng Gu, Weihong Wang, Kai |
author_facet | Liu, Qian Zhang, Peng Wang, Depeng Gu, Weihong Wang, Kai |
author_sort | Liu, Qian |
collection | PubMed |
description | Microsatellite expansion, such as trinucleotide repeat expansion (TRE), is known to cause a number of genetic diseases. Sanger sequencing and next-generation short-read sequencing are unable to interrogate TRE reliably. We developed a novel algorithm called RepeatHMM to estimate repeat counts from long-read sequencing data. Evaluation on simulation data, real amplicon sequencing data on two repeat expansion disorders, and whole-genome sequencing data generated by PacBio and Oxford Nanopore technologies showed superior performance over competing approaches. We concluded that long-read sequencing coupled with RepeatHMM can estimate repeat counts on microsatellites and can interrogate the “unsequenceable” genomic trinucleotide repeat disorders. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13073-017-0456-7) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5514472 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-55144722017-07-19 Interrogating the “unsequenceable” genomic trinucleotide repeat disorders by long-read sequencing Liu, Qian Zhang, Peng Wang, Depeng Gu, Weihong Wang, Kai Genome Med Method Microsatellite expansion, such as trinucleotide repeat expansion (TRE), is known to cause a number of genetic diseases. Sanger sequencing and next-generation short-read sequencing are unable to interrogate TRE reliably. We developed a novel algorithm called RepeatHMM to estimate repeat counts from long-read sequencing data. Evaluation on simulation data, real amplicon sequencing data on two repeat expansion disorders, and whole-genome sequencing data generated by PacBio and Oxford Nanopore technologies showed superior performance over competing approaches. We concluded that long-read sequencing coupled with RepeatHMM can estimate repeat counts on microsatellites and can interrogate the “unsequenceable” genomic trinucleotide repeat disorders. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13073-017-0456-7) contains supplementary material, which is available to authorized users. BioMed Central 2017-07-18 /pmc/articles/PMC5514472/ /pubmed/28720120 http://dx.doi.org/10.1186/s13073-017-0456-7 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Method Liu, Qian Zhang, Peng Wang, Depeng Gu, Weihong Wang, Kai Interrogating the “unsequenceable” genomic trinucleotide repeat disorders by long-read sequencing |
title | Interrogating the “unsequenceable” genomic trinucleotide repeat disorders by long-read sequencing |
title_full | Interrogating the “unsequenceable” genomic trinucleotide repeat disorders by long-read sequencing |
title_fullStr | Interrogating the “unsequenceable” genomic trinucleotide repeat disorders by long-read sequencing |
title_full_unstemmed | Interrogating the “unsequenceable” genomic trinucleotide repeat disorders by long-read sequencing |
title_short | Interrogating the “unsequenceable” genomic trinucleotide repeat disorders by long-read sequencing |
title_sort | interrogating the “unsequenceable” genomic trinucleotide repeat disorders by long-read sequencing |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5514472/ https://www.ncbi.nlm.nih.gov/pubmed/28720120 http://dx.doi.org/10.1186/s13073-017-0456-7 |
work_keys_str_mv | AT liuqian interrogatingtheunsequenceablegenomictrinucleotiderepeatdisordersbylongreadsequencing AT zhangpeng interrogatingtheunsequenceablegenomictrinucleotiderepeatdisordersbylongreadsequencing AT wangdepeng interrogatingtheunsequenceablegenomictrinucleotiderepeatdisordersbylongreadsequencing AT guweihong interrogatingtheunsequenceablegenomictrinucleotiderepeatdisordersbylongreadsequencing AT wangkai interrogatingtheunsequenceablegenomictrinucleotiderepeatdisordersbylongreadsequencing |