Cargando…

On the use of algebraic topology concepts to check the consistency of genome assembly

This paper presents a preliminary work consisting of two contributions. The first one is the design of a very efficient algorithm based on an “Overlap-Layout-Consensus” (OLC) graph to assemble the long reads provided by 3rd generation technologies. The second concerns the analysis of this graph usin...

Descripción completa

Detalles Bibliográficos
Autor principal: Gibrat, Jean-François
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Biophysical Society of Japan (BSJ) 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6975893/
https://www.ncbi.nlm.nih.gov/pubmed/31984196
http://dx.doi.org/10.2142/biophysico.16.0_444
_version_ 1783490297620070400
author Gibrat, Jean-François
author_facet Gibrat, Jean-François
author_sort Gibrat, Jean-François
collection PubMed
description This paper presents a preliminary work consisting of two contributions. The first one is the design of a very efficient algorithm based on an “Overlap-Layout-Consensus” (OLC) graph to assemble the long reads provided by 3rd generation technologies. The second concerns the analysis of this graph using algebraic topology concepts to determine, in advance, whether the assembly of the genome will be straightforward, i.e., whether it will lead to a pseudo-Hamiltonian path or cycle, or whether the results will need to be scrutinized. In the latter case, it will be necessary to look for “loops” in the OLC assembly graph caused by unresolved repeated genomic regions, and then try to untie the “knots” created by these regions.
format Online
Article
Text
id pubmed-6975893
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher The Biophysical Society of Japan (BSJ)
record_format MEDLINE/PubMed
spelling pubmed-69758932020-01-24 On the use of algebraic topology concepts to check the consistency of genome assembly Gibrat, Jean-François Biophys Physicobiol Regular Article This paper presents a preliminary work consisting of two contributions. The first one is the design of a very efficient algorithm based on an “Overlap-Layout-Consensus” (OLC) graph to assemble the long reads provided by 3rd generation technologies. The second concerns the analysis of this graph using algebraic topology concepts to determine, in advance, whether the assembly of the genome will be straightforward, i.e., whether it will lead to a pseudo-Hamiltonian path or cycle, or whether the results will need to be scrutinized. In the latter case, it will be necessary to look for “loops” in the OLC assembly graph caused by unresolved repeated genomic regions, and then try to untie the “knots” created by these regions. The Biophysical Society of Japan (BSJ) 2019-11-29 /pmc/articles/PMC6975893/ /pubmed/31984196 http://dx.doi.org/10.2142/biophysico.16.0_444 Text en 2019 © The Biophysical Society of Japan This article is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. To view a copy of this license, visit https://creativecommons.org/licenses/by-nc-sa/4.0/.
spellingShingle Regular Article
Gibrat, Jean-François
On the use of algebraic topology concepts to check the consistency of genome assembly
title On the use of algebraic topology concepts to check the consistency of genome assembly
title_full On the use of algebraic topology concepts to check the consistency of genome assembly
title_fullStr On the use of algebraic topology concepts to check the consistency of genome assembly
title_full_unstemmed On the use of algebraic topology concepts to check the consistency of genome assembly
title_short On the use of algebraic topology concepts to check the consistency of genome assembly
title_sort on the use of algebraic topology concepts to check the consistency of genome assembly
topic Regular Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6975893/
https://www.ncbi.nlm.nih.gov/pubmed/31984196
http://dx.doi.org/10.2142/biophysico.16.0_444
work_keys_str_mv AT gibratjeanfrancois ontheuseofalgebraictopologyconceptstochecktheconsistencyofgenomeassembly