Cargando…

TRcaller: a novel tool for precise and ultrafast tandem repeat variant genotyping in massively parallel sequencing reads

Calling tandem repeat (TR) variants from DNA sequences is of both theoretical and practical significance. Some bioinformatics tools have been developed for detecting or genotyping TRs. However, little study has been done to genotyping TR alleles from long-read sequencing data, and the accuracy of ge...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Xuewen, Huang, Meng, Budowle, Bruce, Ge, Jianye
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10390829/
https://www.ncbi.nlm.nih.gov/pubmed/37533432
http://dx.doi.org/10.3389/fgene.2023.1227176
_version_ 1785082564145315840
author Wang, Xuewen
Huang, Meng
Budowle, Bruce
Ge, Jianye
author_facet Wang, Xuewen
Huang, Meng
Budowle, Bruce
Ge, Jianye
author_sort Wang, Xuewen
collection PubMed
description Calling tandem repeat (TR) variants from DNA sequences is of both theoretical and practical significance. Some bioinformatics tools have been developed for detecting or genotyping TRs. However, little study has been done to genotyping TR alleles from long-read sequencing data, and the accuracy of genotyping TR alleles from next-generation sequencing data still needs to be improved. Herein, a novel algorithm is described to retrieve TR regions from sequence alignment, and a software program TRcaller has been developed and integrated into a web portal to call TR alleles from both short- and long-read sequences, both whole genome and targeted sequences generated from multiple sequencing platforms. All TR alleles are genotyped as haplotypes and the robust alleles will be reported, even multiple alleles in a DNA mixture. TRcaller could provide substantially higher accuracy (>99% in 289 human individuals) in detecting TR alleles with magnitudes faster (e.g., ∼2 s for 300x human sequence data) than the mainstream software tools. The web portal preselected 119 TR loci from forensics, genealogy, and disease related TR loci. TRcaller is validated to be scalable in various applications, such as DNA forensics and disease diagnosis, which can be expanded into other fields like breeding programs. Availability: TRcaller is available at https://www.trcaller.com/SignIn.aspx.
format Online
Article
Text
id pubmed-10390829
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-103908292023-08-02 TRcaller: a novel tool for precise and ultrafast tandem repeat variant genotyping in massively parallel sequencing reads Wang, Xuewen Huang, Meng Budowle, Bruce Ge, Jianye Front Genet Genetics Calling tandem repeat (TR) variants from DNA sequences is of both theoretical and practical significance. Some bioinformatics tools have been developed for detecting or genotyping TRs. However, little study has been done to genotyping TR alleles from long-read sequencing data, and the accuracy of genotyping TR alleles from next-generation sequencing data still needs to be improved. Herein, a novel algorithm is described to retrieve TR regions from sequence alignment, and a software program TRcaller has been developed and integrated into a web portal to call TR alleles from both short- and long-read sequences, both whole genome and targeted sequences generated from multiple sequencing platforms. All TR alleles are genotyped as haplotypes and the robust alleles will be reported, even multiple alleles in a DNA mixture. TRcaller could provide substantially higher accuracy (>99% in 289 human individuals) in detecting TR alleles with magnitudes faster (e.g., ∼2 s for 300x human sequence data) than the mainstream software tools. The web portal preselected 119 TR loci from forensics, genealogy, and disease related TR loci. TRcaller is validated to be scalable in various applications, such as DNA forensics and disease diagnosis, which can be expanded into other fields like breeding programs. Availability: TRcaller is available at https://www.trcaller.com/SignIn.aspx. Frontiers Media S.A. 2023-07-18 /pmc/articles/PMC10390829/ /pubmed/37533432 http://dx.doi.org/10.3389/fgene.2023.1227176 Text en Copyright © 2023 Wang, Huang, Budowle and Ge. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Wang, Xuewen
Huang, Meng
Budowle, Bruce
Ge, Jianye
TRcaller: a novel tool for precise and ultrafast tandem repeat variant genotyping in massively parallel sequencing reads
title TRcaller: a novel tool for precise and ultrafast tandem repeat variant genotyping in massively parallel sequencing reads
title_full TRcaller: a novel tool for precise and ultrafast tandem repeat variant genotyping in massively parallel sequencing reads
title_fullStr TRcaller: a novel tool for precise and ultrafast tandem repeat variant genotyping in massively parallel sequencing reads
title_full_unstemmed TRcaller: a novel tool for precise and ultrafast tandem repeat variant genotyping in massively parallel sequencing reads
title_short TRcaller: a novel tool for precise and ultrafast tandem repeat variant genotyping in massively parallel sequencing reads
title_sort trcaller: a novel tool for precise and ultrafast tandem repeat variant genotyping in massively parallel sequencing reads
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10390829/
https://www.ncbi.nlm.nih.gov/pubmed/37533432
http://dx.doi.org/10.3389/fgene.2023.1227176
work_keys_str_mv AT wangxuewen trcalleranoveltoolforpreciseandultrafasttandemrepeatvariantgenotypinginmassivelyparallelsequencingreads
AT huangmeng trcalleranoveltoolforpreciseandultrafasttandemrepeatvariantgenotypinginmassivelyparallelsequencingreads
AT budowlebruce trcalleranoveltoolforpreciseandultrafasttandemrepeatvariantgenotypinginmassivelyparallelsequencingreads
AT gejianye trcalleranoveltoolforpreciseandultrafasttandemrepeatvariantgenotypinginmassivelyparallelsequencingreads