Cargando…
OGRE: calculate, visualize, and analyze overlap between genomic input regions and public annotations
BACKGROUND: Modern genome sequencing leads to an ever-growing collection of genomic annotations. Combining these elements with a set of input regions (e.g. genes) would yield new insights in genomic associations, such as those involved in gene regulation. The required data are scattered across diffe...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10369718/ https://www.ncbi.nlm.nih.gov/pubmed/37496002 http://dx.doi.org/10.1186/s12859-023-05422-w |
_version_ | 1785077818327040000 |
---|---|
author | Berres, Sven Gromoll, Jörg Wöste, Marius Sandmann, Sarah Laurentino, Sandra |
author_facet | Berres, Sven Gromoll, Jörg Wöste, Marius Sandmann, Sarah Laurentino, Sandra |
author_sort | Berres, Sven |
collection | PubMed |
description | BACKGROUND: Modern genome sequencing leads to an ever-growing collection of genomic annotations. Combining these elements with a set of input regions (e.g. genes) would yield new insights in genomic associations, such as those involved in gene regulation. The required data are scattered across different databases making a manual approach tiresome, unpractical, and prone to error. Semi-automatic approaches require programming skills in data parsing, processing, overlap calculation, and visualization, which most biomedical researchers lack. Our aim was to develop an automated tool providing all necessary algorithms, benefiting both bioinformaticians and researchers without bioinformatic training. RESULTS: We developed overlapping annotated genomic regions (OGRE) as a comprehensive tool to associate and visualize input regions with genomic annotations. It does so by parsing regions of interest, mining publicly available annotations, and calculating possible overlaps between them. The user can thus identify location, type, and number of associated regulatory elements. Results are presented as easy to understand visualizations and result tables. We applied OGRE to recent studies and could show high reproducibility and potential new insights. To demonstrate OGRE’s performance in terms of running time and output, we have conducted a benchmark and compared its features with similar tools. CONCLUSIONS: OGRE’s functions and built-in annotations can be applied as a downstream overlap association step, which is compatible with most genomic sequencing outputs, and can thus enrich pre-existing analyses pipelines. Compared to similar tools, OGRE shows competitive performance, offers additional features, and has been successfully applied to two recent studies. Overall, OGRE addresses the lack of tools for automatic analysis, local genomic overlap calculation, and visualization by providing an easy to use, end-to-end solution for both biologists and computational scientists. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05422-w. |
format | Online Article Text |
id | pubmed-10369718 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-103697182023-07-27 OGRE: calculate, visualize, and analyze overlap between genomic input regions and public annotations Berres, Sven Gromoll, Jörg Wöste, Marius Sandmann, Sarah Laurentino, Sandra BMC Bioinformatics Software BACKGROUND: Modern genome sequencing leads to an ever-growing collection of genomic annotations. Combining these elements with a set of input regions (e.g. genes) would yield new insights in genomic associations, such as those involved in gene regulation. The required data are scattered across different databases making a manual approach tiresome, unpractical, and prone to error. Semi-automatic approaches require programming skills in data parsing, processing, overlap calculation, and visualization, which most biomedical researchers lack. Our aim was to develop an automated tool providing all necessary algorithms, benefiting both bioinformaticians and researchers without bioinformatic training. RESULTS: We developed overlapping annotated genomic regions (OGRE) as a comprehensive tool to associate and visualize input regions with genomic annotations. It does so by parsing regions of interest, mining publicly available annotations, and calculating possible overlaps between them. The user can thus identify location, type, and number of associated regulatory elements. Results are presented as easy to understand visualizations and result tables. We applied OGRE to recent studies and could show high reproducibility and potential new insights. To demonstrate OGRE’s performance in terms of running time and output, we have conducted a benchmark and compared its features with similar tools. CONCLUSIONS: OGRE’s functions and built-in annotations can be applied as a downstream overlap association step, which is compatible with most genomic sequencing outputs, and can thus enrich pre-existing analyses pipelines. Compared to similar tools, OGRE shows competitive performance, offers additional features, and has been successfully applied to two recent studies. Overall, OGRE addresses the lack of tools for automatic analysis, local genomic overlap calculation, and visualization by providing an easy to use, end-to-end solution for both biologists and computational scientists. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05422-w. BioMed Central 2023-07-26 /pmc/articles/PMC10369718/ /pubmed/37496002 http://dx.doi.org/10.1186/s12859-023-05422-w Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Software Berres, Sven Gromoll, Jörg Wöste, Marius Sandmann, Sarah Laurentino, Sandra OGRE: calculate, visualize, and analyze overlap between genomic input regions and public annotations |
title | OGRE: calculate, visualize, and analyze overlap between genomic input regions and public annotations |
title_full | OGRE: calculate, visualize, and analyze overlap between genomic input regions and public annotations |
title_fullStr | OGRE: calculate, visualize, and analyze overlap between genomic input regions and public annotations |
title_full_unstemmed | OGRE: calculate, visualize, and analyze overlap between genomic input regions and public annotations |
title_short | OGRE: calculate, visualize, and analyze overlap between genomic input regions and public annotations |
title_sort | ogre: calculate, visualize, and analyze overlap between genomic input regions and public annotations |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10369718/ https://www.ncbi.nlm.nih.gov/pubmed/37496002 http://dx.doi.org/10.1186/s12859-023-05422-w |
work_keys_str_mv | AT berressven ogrecalculatevisualizeandanalyzeoverlapbetweengenomicinputregionsandpublicannotations AT gromolljorg ogrecalculatevisualizeandanalyzeoverlapbetweengenomicinputregionsandpublicannotations AT wostemarius ogrecalculatevisualizeandanalyzeoverlapbetweengenomicinputregionsandpublicannotations AT sandmannsarah ogrecalculatevisualizeandanalyzeoverlapbetweengenomicinputregionsandpublicannotations AT laurentinosandra ogrecalculatevisualizeandanalyzeoverlapbetweengenomicinputregionsandpublicannotations |