Cargando…

OGRE: calculate, visualize, and analyze overlap between genomic input regions and public annotations

BACKGROUND: Modern genome sequencing leads to an ever-growing collection of genomic annotations. Combining these elements with a set of input regions (e.g. genes) would yield new insights in genomic associations, such as those involved in gene regulation. The required data are scattered across diffe...

Descripción completa

Detalles Bibliográficos
Autores principales: Berres, Sven, Gromoll, Jörg, Wöste, Marius, Sandmann, Sarah, Laurentino, Sandra
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10369718/
https://www.ncbi.nlm.nih.gov/pubmed/37496002
http://dx.doi.org/10.1186/s12859-023-05422-w
_version_ 1785077818327040000
author Berres, Sven
Gromoll, Jörg
Wöste, Marius
Sandmann, Sarah
Laurentino, Sandra
author_facet Berres, Sven
Gromoll, Jörg
Wöste, Marius
Sandmann, Sarah
Laurentino, Sandra
author_sort Berres, Sven
collection PubMed
description BACKGROUND: Modern genome sequencing leads to an ever-growing collection of genomic annotations. Combining these elements with a set of input regions (e.g. genes) would yield new insights in genomic associations, such as those involved in gene regulation. The required data are scattered across different databases making a manual approach tiresome, unpractical, and prone to error. Semi-automatic approaches require programming skills in data parsing, processing, overlap calculation, and visualization, which most biomedical researchers lack. Our aim was to develop an automated tool providing all necessary algorithms, benefiting both bioinformaticians and researchers without bioinformatic training. RESULTS: We developed overlapping annotated genomic regions (OGRE) as a comprehensive tool to associate and visualize input regions with genomic annotations. It does so by parsing regions of interest, mining publicly available annotations, and calculating possible overlaps between them. The user can thus identify location, type, and number of associated regulatory elements. Results are presented as easy to understand visualizations and result tables. We applied OGRE to recent studies and could show high reproducibility and potential new insights. To demonstrate OGRE’s performance in terms of running time and output, we have conducted a benchmark and compared its features with similar tools. CONCLUSIONS: OGRE’s functions and built-in annotations can be applied as a downstream overlap association step, which is compatible with most genomic sequencing outputs, and can thus enrich pre-existing analyses pipelines. Compared to similar tools, OGRE shows competitive performance, offers additional features, and has been successfully applied to two recent studies. Overall, OGRE addresses the lack of tools for automatic analysis, local genomic overlap calculation, and visualization by providing an easy to use, end-to-end solution for both biologists and computational scientists. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05422-w.
format Online
Article
Text
id pubmed-10369718
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-103697182023-07-27 OGRE: calculate, visualize, and analyze overlap between genomic input regions and public annotations Berres, Sven Gromoll, Jörg Wöste, Marius Sandmann, Sarah Laurentino, Sandra BMC Bioinformatics Software BACKGROUND: Modern genome sequencing leads to an ever-growing collection of genomic annotations. Combining these elements with a set of input regions (e.g. genes) would yield new insights in genomic associations, such as those involved in gene regulation. The required data are scattered across different databases making a manual approach tiresome, unpractical, and prone to error. Semi-automatic approaches require programming skills in data parsing, processing, overlap calculation, and visualization, which most biomedical researchers lack. Our aim was to develop an automated tool providing all necessary algorithms, benefiting both bioinformaticians and researchers without bioinformatic training. RESULTS: We developed overlapping annotated genomic regions (OGRE) as a comprehensive tool to associate and visualize input regions with genomic annotations. It does so by parsing regions of interest, mining publicly available annotations, and calculating possible overlaps between them. The user can thus identify location, type, and number of associated regulatory elements. Results are presented as easy to understand visualizations and result tables. We applied OGRE to recent studies and could show high reproducibility and potential new insights. To demonstrate OGRE’s performance in terms of running time and output, we have conducted a benchmark and compared its features with similar tools. CONCLUSIONS: OGRE’s functions and built-in annotations can be applied as a downstream overlap association step, which is compatible with most genomic sequencing outputs, and can thus enrich pre-existing analyses pipelines. Compared to similar tools, OGRE shows competitive performance, offers additional features, and has been successfully applied to two recent studies. Overall, OGRE addresses the lack of tools for automatic analysis, local genomic overlap calculation, and visualization by providing an easy to use, end-to-end solution for both biologists and computational scientists. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05422-w. BioMed Central 2023-07-26 /pmc/articles/PMC10369718/ /pubmed/37496002 http://dx.doi.org/10.1186/s12859-023-05422-w Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Berres, Sven
Gromoll, Jörg
Wöste, Marius
Sandmann, Sarah
Laurentino, Sandra
OGRE: calculate, visualize, and analyze overlap between genomic input regions and public annotations
title OGRE: calculate, visualize, and analyze overlap between genomic input regions and public annotations
title_full OGRE: calculate, visualize, and analyze overlap between genomic input regions and public annotations
title_fullStr OGRE: calculate, visualize, and analyze overlap between genomic input regions and public annotations
title_full_unstemmed OGRE: calculate, visualize, and analyze overlap between genomic input regions and public annotations
title_short OGRE: calculate, visualize, and analyze overlap between genomic input regions and public annotations
title_sort ogre: calculate, visualize, and analyze overlap between genomic input regions and public annotations
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10369718/
https://www.ncbi.nlm.nih.gov/pubmed/37496002
http://dx.doi.org/10.1186/s12859-023-05422-w
work_keys_str_mv AT berressven ogrecalculatevisualizeandanalyzeoverlapbetweengenomicinputregionsandpublicannotations
AT gromolljorg ogrecalculatevisualizeandanalyzeoverlapbetweengenomicinputregionsandpublicannotations
AT wostemarius ogrecalculatevisualizeandanalyzeoverlapbetweengenomicinputregionsandpublicannotations
AT sandmannsarah ogrecalculatevisualizeandanalyzeoverlapbetweengenomicinputregionsandpublicannotations
AT laurentinosandra ogrecalculatevisualizeandanalyzeoverlapbetweengenomicinputregionsandpublicannotations