Cargando…
Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers
With the rapid increase of large-scale datasets, biomedical data visualization is facing challenges. The data may be large, have different orders of magnitude, contain extreme values, and the data distribution is not clear. Here we present an R package ggbreak that allows users to create broken axes...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8593043/ https://www.ncbi.nlm.nih.gov/pubmed/34795698 http://dx.doi.org/10.3389/fgene.2021.774846 |
_version_ | 1784599625775185920 |
---|---|
author | Xu, Shuangbin Chen, Meijun Feng, Tingze Zhan, Li Zhou, Lang Yu, Guangchuang |
author_facet | Xu, Shuangbin Chen, Meijun Feng, Tingze Zhan, Li Zhou, Lang Yu, Guangchuang |
author_sort | Xu, Shuangbin |
collection | PubMed |
description | With the rapid increase of large-scale datasets, biomedical data visualization is facing challenges. The data may be large, have different orders of magnitude, contain extreme values, and the data distribution is not clear. Here we present an R package ggbreak that allows users to create broken axes using ggplot2 syntax. It can effectively use the plotting area to deal with large datasets (especially for long sequential data), data with different magnitudes, and contain outliers. The ggbreak package increases the available visual space for a better presentation of the data and detailed annotation, thus improves our ability to interpret the data. The ggbreak package is fully compatible with ggplot2 and it is easy to superpose additional layers and applies scale and theme to adjust the plot using the ggplot2 syntax. The ggbreak package is open-source software released under the Artistic-2.0 license, and it is freely available on CRAN (https://CRAN.R-project.org/package=ggbreak) and Github (https://github.com/YuLab-SMU/ggbreak). |
format | Online Article Text |
id | pubmed-8593043 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-85930432021-11-17 Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers Xu, Shuangbin Chen, Meijun Feng, Tingze Zhan, Li Zhou, Lang Yu, Guangchuang Front Genet Genetics With the rapid increase of large-scale datasets, biomedical data visualization is facing challenges. The data may be large, have different orders of magnitude, contain extreme values, and the data distribution is not clear. Here we present an R package ggbreak that allows users to create broken axes using ggplot2 syntax. It can effectively use the plotting area to deal with large datasets (especially for long sequential data), data with different magnitudes, and contain outliers. The ggbreak package increases the available visual space for a better presentation of the data and detailed annotation, thus improves our ability to interpret the data. The ggbreak package is fully compatible with ggplot2 and it is easy to superpose additional layers and applies scale and theme to adjust the plot using the ggplot2 syntax. The ggbreak package is open-source software released under the Artistic-2.0 license, and it is freely available on CRAN (https://CRAN.R-project.org/package=ggbreak) and Github (https://github.com/YuLab-SMU/ggbreak). Frontiers Media S.A. 2021-11-02 /pmc/articles/PMC8593043/ /pubmed/34795698 http://dx.doi.org/10.3389/fgene.2021.774846 Text en Copyright © 2021 Xu, Chen, Feng, Zhan, Zhou and Yu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Xu, Shuangbin Chen, Meijun Feng, Tingze Zhan, Li Zhou, Lang Yu, Guangchuang Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers |
title | Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers |
title_full | Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers |
title_fullStr | Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers |
title_full_unstemmed | Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers |
title_short | Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers |
title_sort | use ggbreak to effectively utilize plotting space to deal with large datasets and outliers |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8593043/ https://www.ncbi.nlm.nih.gov/pubmed/34795698 http://dx.doi.org/10.3389/fgene.2021.774846 |
work_keys_str_mv | AT xushuangbin useggbreaktoeffectivelyutilizeplottingspacetodealwithlargedatasetsandoutliers AT chenmeijun useggbreaktoeffectivelyutilizeplottingspacetodealwithlargedatasetsandoutliers AT fengtingze useggbreaktoeffectivelyutilizeplottingspacetodealwithlargedatasetsandoutliers AT zhanli useggbreaktoeffectivelyutilizeplottingspacetodealwithlargedatasetsandoutliers AT zhoulang useggbreaktoeffectivelyutilizeplottingspacetodealwithlargedatasetsandoutliers AT yuguangchuang useggbreaktoeffectivelyutilizeplottingspacetodealwithlargedatasetsandoutliers |