Cargando…

NGS based haplotype assembly using matrix completion

We apply matrix completion methods for haplotype assembly from NGS reads to develop the new HapSVT, HapNuc, and HapOPT algorithms. This is performed by applying a mathematical model to convert the reads to an incomplete matrix and estimating unknown components. This process is followed by quantizing...

Descripción completa

Detalles Bibliográficos
Autores principales: Majidian, Sina, Kahaei, Mohammad Hossein
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6435133/
https://www.ncbi.nlm.nih.gov/pubmed/30913270
http://dx.doi.org/10.1371/journal.pone.0214455
_version_ 1783406596920967168
author Majidian, Sina
Kahaei, Mohammad Hossein
author_facet Majidian, Sina
Kahaei, Mohammad Hossein
author_sort Majidian, Sina
collection PubMed
description We apply matrix completion methods for haplotype assembly from NGS reads to develop the new HapSVT, HapNuc, and HapOPT algorithms. This is performed by applying a mathematical model to convert the reads to an incomplete matrix and estimating unknown components. This process is followed by quantizing and decoding the completed matrix in order to estimate haplotypes. These algorithms are compared to the state-of-the-art algorithms using simulated data as well as the real fosmid data. It is shown that the SNP missing rate and the haplotype block length of the proposed HapOPT are better than those of HapCUT2 with comparable accuracy in terms of reconstruction rate and switch error rate. A program implementing the proposed algorithms in MATLAB is freely available at https://github.com/smajidian/HapMC.
format Online
Article
Text
id pubmed-6435133
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-64351332019-04-08 NGS based haplotype assembly using matrix completion Majidian, Sina Kahaei, Mohammad Hossein PLoS One Research Article We apply matrix completion methods for haplotype assembly from NGS reads to develop the new HapSVT, HapNuc, and HapOPT algorithms. This is performed by applying a mathematical model to convert the reads to an incomplete matrix and estimating unknown components. This process is followed by quantizing and decoding the completed matrix in order to estimate haplotypes. These algorithms are compared to the state-of-the-art algorithms using simulated data as well as the real fosmid data. It is shown that the SNP missing rate and the haplotype block length of the proposed HapOPT are better than those of HapCUT2 with comparable accuracy in terms of reconstruction rate and switch error rate. A program implementing the proposed algorithms in MATLAB is freely available at https://github.com/smajidian/HapMC. Public Library of Science 2019-03-26 /pmc/articles/PMC6435133/ /pubmed/30913270 http://dx.doi.org/10.1371/journal.pone.0214455 Text en © 2019 Majidian, Kahaei http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Majidian, Sina
Kahaei, Mohammad Hossein
NGS based haplotype assembly using matrix completion
title NGS based haplotype assembly using matrix completion
title_full NGS based haplotype assembly using matrix completion
title_fullStr NGS based haplotype assembly using matrix completion
title_full_unstemmed NGS based haplotype assembly using matrix completion
title_short NGS based haplotype assembly using matrix completion
title_sort ngs based haplotype assembly using matrix completion
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6435133/
https://www.ncbi.nlm.nih.gov/pubmed/30913270
http://dx.doi.org/10.1371/journal.pone.0214455
work_keys_str_mv AT majidiansina ngsbasedhaplotypeassemblyusingmatrixcompletion
AT kahaeimohammadhossein ngsbasedhaplotypeassemblyusingmatrixcompletion