Cargando…

JunctionViewer: customizable annotation software for repeat-rich genomic regions

BACKGROUND: Repeat-rich regions such as centromeres receive less attention than their gene-rich euchromatic counterparts because the former are difficult to assemble and analyze. Our objectives were to 1) map all ten centromeres onto the maize genetic map and 2) characterize the sequence features of...

Descripción completa

Detalles Bibliográficos
Autores principales: Wolfgruber, Thomas K, Presting, Gernot G
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2824676/
https://www.ncbi.nlm.nih.gov/pubmed/20067643
http://dx.doi.org/10.1186/1471-2105-11-23
_version_ 1782177715592364032
author Wolfgruber, Thomas K
Presting, Gernot G
author_facet Wolfgruber, Thomas K
Presting, Gernot G
author_sort Wolfgruber, Thomas K
collection PubMed
description BACKGROUND: Repeat-rich regions such as centromeres receive less attention than their gene-rich euchromatic counterparts because the former are difficult to assemble and analyze. Our objectives were to 1) map all ten centromeres onto the maize genetic map and 2) characterize the sequence features of maize centromeres, each of which spans several megabases of highly repetitive DNA. Repetitive sequences can be mapped using special molecular markers that are based on PCR with primers designed from two unique "repeat junctions". Efficient screening of large amounts of maize genome sequence data for repeat junctions, as well as key centromere sequence features required the development of specific annotation software. RESULTS: We developed JunctionViewer to automate the process of identifying and differentiating closely related centromere repeats and repeat junctions, and to generate graphical displays of these and other features within centromeric sequences. JunctionViewer generates NCBI BLAST, WU-BLAST, cross_match and MUMmer alignments, and displays the optimal alignments and additional annotation data as concise graphical representations that can be viewed directly through the graphical interface or as PostScript(® )output. This software enabled us to quickly characterize millions of nucleotides of newly sequenced DNA ranging in size from single reads to assembled BACs and megabase-sized pseudochromosome regions. It expedited the process of generating repeat junction markers that were subsequently used to anchor all 10 centromeres to the maize map. It also enabled us to efficiently identify key features in large genomic regions, providing insight into the arrangement and evolution of maize centromeric DNA. CONCLUSIONS: JunctionViewer will be useful to scientists who wish to automatically generate concise graphical summaries of repeat sequences. It is particularly valuable for those needing to efficiently identify unique repeat junctions. The scalability and ability to customize homology search parameters for different classes of closely related repeat sequences make this software ideal for recurring annotation (e.g., genome projects that are in progress) of genomic regions that contain well-defined repeats, such as those in centromeres. Although originally customized for maize centromere sequence, we anticipate this software to facilitate the analysis of centromere and other repeat-rich regions in other organisms.
format Text
id pubmed-2824676
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28246762010-02-19 JunctionViewer: customizable annotation software for repeat-rich genomic regions Wolfgruber, Thomas K Presting, Gernot G BMC Bioinformatics Software BACKGROUND: Repeat-rich regions such as centromeres receive less attention than their gene-rich euchromatic counterparts because the former are difficult to assemble and analyze. Our objectives were to 1) map all ten centromeres onto the maize genetic map and 2) characterize the sequence features of maize centromeres, each of which spans several megabases of highly repetitive DNA. Repetitive sequences can be mapped using special molecular markers that are based on PCR with primers designed from two unique "repeat junctions". Efficient screening of large amounts of maize genome sequence data for repeat junctions, as well as key centromere sequence features required the development of specific annotation software. RESULTS: We developed JunctionViewer to automate the process of identifying and differentiating closely related centromere repeats and repeat junctions, and to generate graphical displays of these and other features within centromeric sequences. JunctionViewer generates NCBI BLAST, WU-BLAST, cross_match and MUMmer alignments, and displays the optimal alignments and additional annotation data as concise graphical representations that can be viewed directly through the graphical interface or as PostScript(® )output. This software enabled us to quickly characterize millions of nucleotides of newly sequenced DNA ranging in size from single reads to assembled BACs and megabase-sized pseudochromosome regions. It expedited the process of generating repeat junction markers that were subsequently used to anchor all 10 centromeres to the maize map. It also enabled us to efficiently identify key features in large genomic regions, providing insight into the arrangement and evolution of maize centromeric DNA. CONCLUSIONS: JunctionViewer will be useful to scientists who wish to automatically generate concise graphical summaries of repeat sequences. It is particularly valuable for those needing to efficiently identify unique repeat junctions. The scalability and ability to customize homology search parameters for different classes of closely related repeat sequences make this software ideal for recurring annotation (e.g., genome projects that are in progress) of genomic regions that contain well-defined repeats, such as those in centromeres. Although originally customized for maize centromere sequence, we anticipate this software to facilitate the analysis of centromere and other repeat-rich regions in other organisms. BioMed Central 2010-01-12 /pmc/articles/PMC2824676/ /pubmed/20067643 http://dx.doi.org/10.1186/1471-2105-11-23 Text en Copyright ©2010 Wolfgruber and Presting; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Wolfgruber, Thomas K
Presting, Gernot G
JunctionViewer: customizable annotation software for repeat-rich genomic regions
title JunctionViewer: customizable annotation software for repeat-rich genomic regions
title_full JunctionViewer: customizable annotation software for repeat-rich genomic regions
title_fullStr JunctionViewer: customizable annotation software for repeat-rich genomic regions
title_full_unstemmed JunctionViewer: customizable annotation software for repeat-rich genomic regions
title_short JunctionViewer: customizable annotation software for repeat-rich genomic regions
title_sort junctionviewer: customizable annotation software for repeat-rich genomic regions
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2824676/
https://www.ncbi.nlm.nih.gov/pubmed/20067643
http://dx.doi.org/10.1186/1471-2105-11-23
work_keys_str_mv AT wolfgruberthomask junctionviewercustomizableannotationsoftwareforrepeatrichgenomicregions
AT prestinggernotg junctionviewercustomizableannotationsoftwareforrepeatrichgenomicregions