Cargando…

A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing

We have developed a Windows-based program, ConPath, as a scaffold analyzer. ConPath constructs scaffolds by ordering and orienting separate sequence contigs by exploiting the mate-pair information between contig-pairs. Our algorithm builds directed graphs from link information and traverses them to...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Pan-Gyu, Cho, Hwan-Gue, Park, Kiejung
Formato: Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2291285/
https://www.ncbi.nlm.nih.gov/pubmed/18414585
http://dx.doi.org/10.1155/2008/675741
_version_ 1782152439446634496
author Kim, Pan-Gyu
Cho, Hwan-Gue
Park, Kiejung
author_facet Kim, Pan-Gyu
Cho, Hwan-Gue
Park, Kiejung
author_sort Kim, Pan-Gyu
collection PubMed
description We have developed a Windows-based program, ConPath, as a scaffold analyzer. ConPath constructs scaffolds by ordering and orienting separate sequence contigs by exploiting the mate-pair information between contig-pairs. Our algorithm builds directed graphs from link information and traverses them to find the longest acyclic graphs. Using end read pairs of fixed-sized mate-pair libraries, ConPath determines relative orientations of all contigs, estimates the gap size of each adjacent contig pair, and reports wrong assembly information by validating orientations and gap sizes. We have utilized ConPath in more than 10 microbial genome projects, including Mannheimia succiniciproducens and Vibro vulnificus, where we verified contig assembly and identified several erroneous contigs using the four types of error defined in ConPath. Also, ConPath supports some convenient features and viewers that permit investigation of each contig in detail; these include contig viewer, scaffold viewer, edge information list, mate-pair list, and the printing of complex scaffold structures.
format Text
id pubmed-2291285
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-22912852008-04-14 A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing Kim, Pan-Gyu Cho, Hwan-Gue Park, Kiejung J Biomed Biotechnol Research Article We have developed a Windows-based program, ConPath, as a scaffold analyzer. ConPath constructs scaffolds by ordering and orienting separate sequence contigs by exploiting the mate-pair information between contig-pairs. Our algorithm builds directed graphs from link information and traverses them to find the longest acyclic graphs. Using end read pairs of fixed-sized mate-pair libraries, ConPath determines relative orientations of all contigs, estimates the gap size of each adjacent contig pair, and reports wrong assembly information by validating orientations and gap sizes. We have utilized ConPath in more than 10 microbial genome projects, including Mannheimia succiniciproducens and Vibro vulnificus, where we verified contig assembly and identified several erroneous contigs using the four types of error defined in ConPath. Also, ConPath supports some convenient features and viewers that permit investigation of each contig in detail; these include contig viewer, scaffold viewer, edge information list, mate-pair list, and the printing of complex scaffold structures. Hindawi Publishing Corporation 2008 2008-04-03 /pmc/articles/PMC2291285/ /pubmed/18414585 http://dx.doi.org/10.1155/2008/675741 Text en Copyright © 2008 Pan-Gyu Kim et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Kim, Pan-Gyu
Cho, Hwan-Gue
Park, Kiejung
A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing
title A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing
title_full A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing
title_fullStr A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing
title_full_unstemmed A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing
title_short A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing
title_sort scaffold analysis tool using mate-pair information in genome sequencing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2291285/
https://www.ncbi.nlm.nih.gov/pubmed/18414585
http://dx.doi.org/10.1155/2008/675741
work_keys_str_mv AT kimpangyu ascaffoldanalysistoolusingmatepairinformationingenomesequencing
AT chohwangue ascaffoldanalysistoolusingmatepairinformationingenomesequencing
AT parkkiejung ascaffoldanalysistoolusingmatepairinformationingenomesequencing
AT kimpangyu scaffoldanalysistoolusingmatepairinformationingenomesequencing
AT chohwangue scaffoldanalysistoolusingmatepairinformationingenomesequencing
AT parkkiejung scaffoldanalysistoolusingmatepairinformationingenomesequencing