
Gene Updater: a web tool that autocorrects and updates for Excel misidentified gene names

Opening and processing gene expression data files in Excel runs into the inadvertent risk of converting gene names to dates. As pathway analysis tools rely on gene symbols to query against pathway databases, the genes that are converted to dates will not be recognized, potentially causing voids in p...


Bibliographic Details
Main authors: Koh, Clara W. T., Ooi, Justin S. G., Joly, Gabrielle L. C., Chan, Kuan Rong
Format: Online Article Text
Language: English
Published: Nature Publishing Group UK 2022
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9325790/
https://www.ncbi.nlm.nih.gov/pubmed/35882976
http://dx.doi.org/10.1038/s41598-022-17104-3
author Koh, Clara W. T.
Ooi, Justin S. G.
Joly, Gabrielle L. C.
Chan, Kuan Rong
collection PubMed
description Opening and processing gene expression data files in Excel runs into the inadvertent risk of converting gene names to dates. As pathway analysis tools rely on gene symbols to query against pathway databases, the genes that are converted to dates will not be recognized, potentially causing voids in pathway analysis. Molecular pathways related to cell division, exocytosis, cilium assembly, protein ubiquitination and nitric oxide biosynthesis were found to be most affected by Excel auto-conversion. A plausible solution is hence to update these genes and dates to the newly approved gene names recommended by the HUGO Gene Nomenclature Committee (HGNC), which are resilient to Excel auto-conversion. Herein, we developed a web tool with Streamlit that can convert old gene names and dates back into the new gene names recommended by HGNC. The web app, named Gene Updater, is open source and can be hosted either locally or at https://share.streamlit.io/kuanrongchan/date-to-gene-converter/main/date_gene_tool.py. Additionally, as Mar-01 and Mar-02 can each potentially be mapped to two different gene names, users can assign these date terms to the appropriate gene names within the Gene Updater web tool. This user-friendly web tool ensures that the accuracy and integrity of gene expression data are preserved by minimizing errors in labelling gene names due to Excel auto-conversion.
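The conversion described in the abstract can be illustrated with a minimal Python sketch. This is not Gene Updater's actual code: the function names (`candidates`, `update_labels`), the `choices` argument, and the small hand-written rename table are assumptions made for illustration, and the table covers only a few well-known HGNC renames rather than the full list the tool would need. The sketch maps old symbols and Excel-style date strings to updated symbols, and requires an explicit user choice for the ambiguous Mar-01 and Mar-02 terms.

```python
import re

# Illustrative subset of old symbol -> HGNC-approved symbol renames.
# Assumption: a real tool would load the complete table, not this short list.
OLD_TO_NEW = {
    "SEPT1": "SEPTIN1",
    "SEPT2": "SEPTIN2",
    "MARCH1": "MARCHF1",
    "MARCH2": "MARCHF2",
    "MARC1": "MTARC1",
    "MARC2": "MTARC2",
    "DEC1": "DELEC1",
}

# Month abbreviation -> old symbol prefixes that Excel turns into that month.
MONTH_PREFIXES = {"SEP": ["SEPT"], "MAR": ["MARCH", "MARC"], "DEC": ["DEC"]}

# Matches date-like labels such as "1-Sep", "01-Sep", or "Sep-01".
DATE_RE = re.compile(r"^(?:(\d{1,2})-([A-Za-z]{3})|([A-Za-z]{3})-(\d{1,2}))$")


def candidates(label):
    """Return the updated symbols that a label could refer to."""
    text = label.strip()
    if text.upper() in OLD_TO_NEW:
        return [OLD_TO_NEW[text.upper()]]
    match = DATE_RE.match(text)
    if match:
        number = int(match.group(1) or match.group(4))
        month = (match.group(2) or match.group(3)).upper()
        found = []
        for prefix in MONTH_PREFIXES.get(month, []):
            old_symbol = f"{prefix}{number}"
            if old_symbol in OLD_TO_NEW:
                found.append(OLD_TO_NEW[old_symbol])
        if found:
            return found
    # Not an old symbol or a recognised date: leave the label unchanged.
    return [text]


def update_labels(labels, choices=None):
    """Update a list of labels; `choices` resolves ambiguous date terms."""
    choices = choices or {}
    updated = []
    for label in labels:
        options = candidates(label)
        if len(options) == 1:
            updated.append(options[0])
        elif choices.get(label) in options:
            updated.append(choices[label])
        else:
            raise ValueError(f"{label!r} is ambiguous; choose one of {options}")
    return updated


if __name__ == "__main__":
    print(update_labels(["1-Sep", "TP53", "DEC1"]))
    print(update_labels(["Mar-01"], choices={"Mar-01": "MTARC1"}))
```

Raising an error for an unresolved ambiguous term, instead of guessing, mirrors the abstract's point that the user assigns Mar-01 and Mar-02 to the appropriate gene.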
format Online
Article
Text
id pubmed-9325790
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-9325790 2022-07-28 Gene Updater: a web tool that autocorrects and updates for Excel misidentified gene names Koh, Clara W. T. Ooi, Justin S. G. Joly, Gabrielle L. C. Chan, Kuan Rong Sci Rep Article
Nature Publishing Group UK 2022-07-26 /pmc/articles/PMC9325790/ /pubmed/35882976 http://dx.doi.org/10.1038/s41598-022-17104-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/ Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
title Gene Updater: a web tool that autocorrects and updates for Excel misidentified gene names
topic Article