Cargando…
A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing
We have developed a Windows-based program, ConPath, as a scaffold analyzer. ConPath constructs scaffolds by ordering and orienting separate sequence contigs by exploiting the mate-pair information between contig-pairs. Our algorithm builds directed graphs from link information and traverses them to...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2291285/ https://www.ncbi.nlm.nih.gov/pubmed/18414585 http://dx.doi.org/10.1155/2008/675741 |
_version_ | 1782152439446634496 |
---|---|
author | Kim, Pan-Gyu Cho, Hwan-Gue Park, Kiejung |
author_facet | Kim, Pan-Gyu Cho, Hwan-Gue Park, Kiejung |
author_sort | Kim, Pan-Gyu |
collection | PubMed |
description | We have developed a Windows-based program, ConPath, as a scaffold analyzer. ConPath constructs scaffolds by ordering and orienting separate sequence contigs by exploiting the mate-pair information between contig-pairs. Our algorithm builds directed graphs from link information and traverses them to find the longest acyclic graphs. Using end read pairs of fixed-sized mate-pair libraries, ConPath determines relative orientations of all contigs, estimates the gap size of each adjacent contig pair, and reports wrong assembly information by validating orientations and gap sizes. We have utilized ConPath in more than 10 microbial genome projects, including Mannheimia succiniciproducens and Vibro vulnificus, where we verified contig assembly and identified several erroneous contigs using the four types of error defined in ConPath. Also, ConPath supports some convenient features and viewers that permit investigation of each contig in detail; these include contig viewer, scaffold viewer, edge information list, mate-pair list, and the printing of complex scaffold structures. |
format | Text |
id | pubmed-2291285 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-22912852008-04-14 A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing Kim, Pan-Gyu Cho, Hwan-Gue Park, Kiejung J Biomed Biotechnol Research Article We have developed a Windows-based program, ConPath, as a scaffold analyzer. ConPath constructs scaffolds by ordering and orienting separate sequence contigs by exploiting the mate-pair information between contig-pairs. Our algorithm builds directed graphs from link information and traverses them to find the longest acyclic graphs. Using end read pairs of fixed-sized mate-pair libraries, ConPath determines relative orientations of all contigs, estimates the gap size of each adjacent contig pair, and reports wrong assembly information by validating orientations and gap sizes. We have utilized ConPath in more than 10 microbial genome projects, including Mannheimia succiniciproducens and Vibro vulnificus, where we verified contig assembly and identified several erroneous contigs using the four types of error defined in ConPath. Also, ConPath supports some convenient features and viewers that permit investigation of each contig in detail; these include contig viewer, scaffold viewer, edge information list, mate-pair list, and the printing of complex scaffold structures. Hindawi Publishing Corporation 2008 2008-04-03 /pmc/articles/PMC2291285/ /pubmed/18414585 http://dx.doi.org/10.1155/2008/675741 Text en Copyright © 2008 Pan-Gyu Kim et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Kim, Pan-Gyu Cho, Hwan-Gue Park, Kiejung A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing |
title | A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing |
title_full | A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing |
title_fullStr | A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing |
title_full_unstemmed | A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing |
title_short | A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing |
title_sort | scaffold analysis tool using mate-pair information in genome sequencing |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2291285/ https://www.ncbi.nlm.nih.gov/pubmed/18414585 http://dx.doi.org/10.1155/2008/675741 |
work_keys_str_mv | AT kimpangyu ascaffoldanalysistoolusingmatepairinformationingenomesequencing AT chohwangue ascaffoldanalysistoolusingmatepairinformationingenomesequencing AT parkkiejung ascaffoldanalysistoolusingmatepairinformationingenomesequencing AT kimpangyu scaffoldanalysistoolusingmatepairinformationingenomesequencing AT chohwangue scaffoldanalysistoolusingmatepairinformationingenomesequencing AT parkkiejung scaffoldanalysistoolusingmatepairinformationingenomesequencing |