Cargando…

Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers

With the rapid increase of large-scale datasets, biomedical data visualization is facing challenges. The data may be large, have different orders of magnitude, contain extreme values, and the data distribution is not clear. Here we present an R package ggbreak that allows users to create broken axes...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Shuangbin, Chen, Meijun, Feng, Tingze, Zhan, Li, Zhou, Lang, Yu, Guangchuang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8593043/
https://www.ncbi.nlm.nih.gov/pubmed/34795698
http://dx.doi.org/10.3389/fgene.2021.774846
_version_ 1784599625775185920
author Xu, Shuangbin
Chen, Meijun
Feng, Tingze
Zhan, Li
Zhou, Lang
Yu, Guangchuang
author_facet Xu, Shuangbin
Chen, Meijun
Feng, Tingze
Zhan, Li
Zhou, Lang
Yu, Guangchuang
author_sort Xu, Shuangbin
collection PubMed
description With the rapid increase of large-scale datasets, biomedical data visualization is facing challenges. The data may be large, have different orders of magnitude, contain extreme values, and the data distribution is not clear. Here we present an R package ggbreak that allows users to create broken axes using ggplot2 syntax. It can effectively use the plotting area to deal with large datasets (especially for long sequential data), data with different magnitudes, and contain outliers. The ggbreak package increases the available visual space for a better presentation of the data and detailed annotation, thus improves our ability to interpret the data. The ggbreak package is fully compatible with ggplot2 and it is easy to superpose additional layers and applies scale and theme to adjust the plot using the ggplot2 syntax. The ggbreak package is open-source software released under the Artistic-2.0 license, and it is freely available on CRAN (https://CRAN.R-project.org/package=ggbreak) and Github (https://github.com/YuLab-SMU/ggbreak).
format Online
Article
Text
id pubmed-8593043
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-85930432021-11-17 Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers Xu, Shuangbin Chen, Meijun Feng, Tingze Zhan, Li Zhou, Lang Yu, Guangchuang Front Genet Genetics With the rapid increase of large-scale datasets, biomedical data visualization is facing challenges. The data may be large, have different orders of magnitude, contain extreme values, and the data distribution is not clear. Here we present an R package ggbreak that allows users to create broken axes using ggplot2 syntax. It can effectively use the plotting area to deal with large datasets (especially for long sequential data), data with different magnitudes, and contain outliers. The ggbreak package increases the available visual space for a better presentation of the data and detailed annotation, thus improves our ability to interpret the data. The ggbreak package is fully compatible with ggplot2 and it is easy to superpose additional layers and applies scale and theme to adjust the plot using the ggplot2 syntax. The ggbreak package is open-source software released under the Artistic-2.0 license, and it is freely available on CRAN (https://CRAN.R-project.org/package=ggbreak) and Github (https://github.com/YuLab-SMU/ggbreak). Frontiers Media S.A. 2021-11-02 /pmc/articles/PMC8593043/ /pubmed/34795698 http://dx.doi.org/10.3389/fgene.2021.774846 Text en Copyright © 2021 Xu, Chen, Feng, Zhan, Zhou and Yu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Xu, Shuangbin
Chen, Meijun
Feng, Tingze
Zhan, Li
Zhou, Lang
Yu, Guangchuang
Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers
title Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers
title_full Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers
title_fullStr Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers
title_full_unstemmed Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers
title_short Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers
title_sort use ggbreak to effectively utilize plotting space to deal with large datasets and outliers
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8593043/
https://www.ncbi.nlm.nih.gov/pubmed/34795698
http://dx.doi.org/10.3389/fgene.2021.774846
work_keys_str_mv AT xushuangbin useggbreaktoeffectivelyutilizeplottingspacetodealwithlargedatasetsandoutliers
AT chenmeijun useggbreaktoeffectivelyutilizeplottingspacetodealwithlargedatasetsandoutliers
AT fengtingze useggbreaktoeffectivelyutilizeplottingspacetodealwithlargedatasetsandoutliers
AT zhanli useggbreaktoeffectivelyutilizeplottingspacetodealwithlargedatasetsandoutliers
AT zhoulang useggbreaktoeffectivelyutilizeplottingspacetodealwithlargedatasetsandoutliers
AT yuguangchuang useggbreaktoeffectivelyutilizeplottingspacetodealwithlargedatasetsandoutliers