Cargando…
IMMerge: merging imputation data at scale
SUMMARY: Genomic data are often processed in batches and analyzed together to save time. However, it is challenging to combine multiple large VCFs and properly handle imputation quality and missing variants due to the limitations of available tools. To address these concerns, we developed IMMerge, a...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9805583/ https://www.ncbi.nlm.nih.gov/pubmed/36413071 http://dx.doi.org/10.1093/bioinformatics/btac750 |
_version_ | 1784862358956408832 |
---|---|
author | Zhu, Wanying Chen, Hung-Hsin Petty, Alexander S Petty, Lauren E Polikowsky, Hannah G Gamazon, Eric R Below, Jennifer E Highland, Heather M |
author_facet | Zhu, Wanying Chen, Hung-Hsin Petty, Alexander S Petty, Lauren E Polikowsky, Hannah G Gamazon, Eric R Below, Jennifer E Highland, Heather M |
author_sort | Zhu, Wanying |
collection | PubMed |
description | SUMMARY: Genomic data are often processed in batches and analyzed together to save time. However, it is challenging to combine multiple large VCFs and properly handle imputation quality and missing variants due to the limitations of available tools. To address these concerns, we developed IMMerge, a Python-based tool that takes advantage of multiprocessing to reduce running time. For the first time in a publicly available tool, imputation quality scores are correctly combined with Fisher’s z transformation. AVAILABILITY AND IMPLEMENTATION: IMMerge is an open-source project under MIT license. Source code and user manual are available at https://github.com/belowlab/IMMerge. |
format | Online Article Text |
id | pubmed-9805583 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-98055832023-01-03 IMMerge: merging imputation data at scale Zhu, Wanying Chen, Hung-Hsin Petty, Alexander S Petty, Lauren E Polikowsky, Hannah G Gamazon, Eric R Below, Jennifer E Highland, Heather M Bioinformatics Applications Note SUMMARY: Genomic data are often processed in batches and analyzed together to save time. However, it is challenging to combine multiple large VCFs and properly handle imputation quality and missing variants due to the limitations of available tools. To address these concerns, we developed IMMerge, a Python-based tool that takes advantage of multiprocessing to reduce running time. For the first time in a publicly available tool, imputation quality scores are correctly combined with Fisher’s z transformation. AVAILABILITY AND IMPLEMENTATION: IMMerge is an open-source project under MIT license. Source code and user manual are available at https://github.com/belowlab/IMMerge. Oxford University Press 2022-11-22 /pmc/articles/PMC9805583/ /pubmed/36413071 http://dx.doi.org/10.1093/bioinformatics/btac750 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Note Zhu, Wanying Chen, Hung-Hsin Petty, Alexander S Petty, Lauren E Polikowsky, Hannah G Gamazon, Eric R Below, Jennifer E Highland, Heather M IMMerge: merging imputation data at scale |
title | IMMerge: merging imputation data at scale |
title_full | IMMerge: merging imputation data at scale |
title_fullStr | IMMerge: merging imputation data at scale |
title_full_unstemmed | IMMerge: merging imputation data at scale |
title_short | IMMerge: merging imputation data at scale |
title_sort | immerge: merging imputation data at scale |
topic | Applications Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9805583/ https://www.ncbi.nlm.nih.gov/pubmed/36413071 http://dx.doi.org/10.1093/bioinformatics/btac750 |
work_keys_str_mv | AT zhuwanying immergemergingimputationdataatscale AT chenhunghsin immergemergingimputationdataatscale AT pettyalexanders immergemergingimputationdataatscale AT pettylaurene immergemergingimputationdataatscale AT polikowskyhannahg immergemergingimputationdataatscale AT gamazonericr immergemergingimputationdataatscale AT belowjennifere immergemergingimputationdataatscale AT highlandheatherm immergemergingimputationdataatscale |