Cargando…

Nebula: ultra-efficient mapping-free structural variant genotyper

Large scale catalogs of common genetic variants (including indels and structural variants) are being created using data from second and third generation whole-genome sequencing technologies. However, the genotyping of these variants in newly sequenced samples is a nontrivial task that requires exten...

Descripción completa

Detalles Bibliográficos
Autores principales: Khorsand, Parsoa, Hormozdiari, Fereydoun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8096284/
https://www.ncbi.nlm.nih.gov/pubmed/33503255
http://dx.doi.org/10.1093/nar/gkab025
_version_ 1783688128226131968
author Khorsand, Parsoa
Hormozdiari, Fereydoun
author_facet Khorsand, Parsoa
Hormozdiari, Fereydoun
author_sort Khorsand, Parsoa
collection PubMed
description Large scale catalogs of common genetic variants (including indels and structural variants) are being created using data from second and third generation whole-genome sequencing technologies. However, the genotyping of these variants in newly sequenced samples is a nontrivial task that requires extensive computational resources. Furthermore, current approaches are mostly limited to only specific types of variants and are generally prone to various errors and ambiguities when genotyping complex events. We are proposing an ultra-efficient approach for genotyping any type of structural variation that is not limited by the shortcomings and complexities of current mapping-based approaches. Our method Nebula utilizes the changes in the count of k-mers to predict the genotype of structural variants. We have shown that not only Nebula is an order of magnitude faster than mapping based approaches for genotyping structural variants, but also has comparable accuracy to state-of-the-art approaches. Furthermore, Nebula is a generic framework not limited to any specific type of event. Nebula is publicly available at https://github.com/Parsoa/Nebula.
format Online
Article
Text
id pubmed-8096284
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-80962842021-05-10 Nebula: ultra-efficient mapping-free structural variant genotyper Khorsand, Parsoa Hormozdiari, Fereydoun Nucleic Acids Res Methods Online Large scale catalogs of common genetic variants (including indels and structural variants) are being created using data from second and third generation whole-genome sequencing technologies. However, the genotyping of these variants in newly sequenced samples is a nontrivial task that requires extensive computational resources. Furthermore, current approaches are mostly limited to only specific types of variants and are generally prone to various errors and ambiguities when genotyping complex events. We are proposing an ultra-efficient approach for genotyping any type of structural variation that is not limited by the shortcomings and complexities of current mapping-based approaches. Our method Nebula utilizes the changes in the count of k-mers to predict the genotype of structural variants. We have shown that not only Nebula is an order of magnitude faster than mapping based approaches for genotyping structural variants, but also has comparable accuracy to state-of-the-art approaches. Furthermore, Nebula is a generic framework not limited to any specific type of event. Nebula is publicly available at https://github.com/Parsoa/Nebula. Oxford University Press 2021-01-27 /pmc/articles/PMC8096284/ /pubmed/33503255 http://dx.doi.org/10.1093/nar/gkab025 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Khorsand, Parsoa
Hormozdiari, Fereydoun
Nebula: ultra-efficient mapping-free structural variant genotyper
title Nebula: ultra-efficient mapping-free structural variant genotyper
title_full Nebula: ultra-efficient mapping-free structural variant genotyper
title_fullStr Nebula: ultra-efficient mapping-free structural variant genotyper
title_full_unstemmed Nebula: ultra-efficient mapping-free structural variant genotyper
title_short Nebula: ultra-efficient mapping-free structural variant genotyper
title_sort nebula: ultra-efficient mapping-free structural variant genotyper
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8096284/
https://www.ncbi.nlm.nih.gov/pubmed/33503255
http://dx.doi.org/10.1093/nar/gkab025
work_keys_str_mv AT khorsandparsoa nebulaultraefficientmappingfreestructuralvariantgenotyper
AT hormozdiarifereydoun nebulaultraefficientmappingfreestructuralvariantgenotyper