Cargando…

UGM: a more stable procedure for large-scale multiple testing problems, new solutions to identify oncogene

Variations of gene expression levels play an important role in tumors. There are numerous methods to identify differentially expressed genes in high-throughput sequencing. Several algorithms endeavor to identify distinctive genetic patterns susceptable to particular diseases. Although these processe...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Chengyou, Zhou, Leilei, Wang, Yuhe, Tian, Shuchang, Zhu, Junlin, Qin, Hang, Ding, Yong, Jiang, Hongbing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927121/
https://www.ncbi.nlm.nih.gov/pubmed/31865918
http://dx.doi.org/10.1186/s12976-019-0117-1
_version_ 1783482244232380416
author Liu, Chengyou
Zhou, Leilei
Wang, Yuhe
Tian, Shuchang
Zhu, Junlin
Qin, Hang
Ding, Yong
Jiang, Hongbing
author_facet Liu, Chengyou
Zhou, Leilei
Wang, Yuhe
Tian, Shuchang
Zhu, Junlin
Qin, Hang
Ding, Yong
Jiang, Hongbing
author_sort Liu, Chengyou
collection PubMed
description Variations of gene expression levels play an important role in tumors. There are numerous methods to identify differentially expressed genes in high-throughput sequencing. Several algorithms endeavor to identify distinctive genetic patterns susceptable to particular diseases. Although these processes have been proved successful, the probability that the number of non-differentially expressed genes measured by false discovery rate (FDR) has a large standard deviation, and the misidentification rate (type I error) grows rapidly when the number of genes to be detected become larger. In this study we developed a new method, Unit Gamma Measurement (UGM), accounting for multiple hypotheses test statistics distribution, which could reduce the dependency problem. Simulated expression profile data and breast cancer RNA-Seq data were utilized to testify the accuracy of UGM. The results show that the number of non-differentially expressed genes identified by the UGM is very close to the real-evidence data, and the UGM also has a smaller standard error, range, quartile range and RMS error. In addition, the UGM can be used to screen many breast cancer-associated genes, such as BRCA1, BRCA2, PTEN, BRIP1, etc., provides better accuracy, robustness and efficiency, the method of identification differentially expressed genes in high-throughput sequencing.
format Online
Article
Text
id pubmed-6927121
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-69271212019-12-30 UGM: a more stable procedure for large-scale multiple testing problems, new solutions to identify oncogene Liu, Chengyou Zhou, Leilei Wang, Yuhe Tian, Shuchang Zhu, Junlin Qin, Hang Ding, Yong Jiang, Hongbing Theor Biol Med Model Review Variations of gene expression levels play an important role in tumors. There are numerous methods to identify differentially expressed genes in high-throughput sequencing. Several algorithms endeavor to identify distinctive genetic patterns susceptable to particular diseases. Although these processes have been proved successful, the probability that the number of non-differentially expressed genes measured by false discovery rate (FDR) has a large standard deviation, and the misidentification rate (type I error) grows rapidly when the number of genes to be detected become larger. In this study we developed a new method, Unit Gamma Measurement (UGM), accounting for multiple hypotheses test statistics distribution, which could reduce the dependency problem. Simulated expression profile data and breast cancer RNA-Seq data were utilized to testify the accuracy of UGM. The results show that the number of non-differentially expressed genes identified by the UGM is very close to the real-evidence data, and the UGM also has a smaller standard error, range, quartile range and RMS error. In addition, the UGM can be used to screen many breast cancer-associated genes, such as BRCA1, BRCA2, PTEN, BRIP1, etc., provides better accuracy, robustness and efficiency, the method of identification differentially expressed genes in high-throughput sequencing. BioMed Central 2019-12-23 /pmc/articles/PMC6927121/ /pubmed/31865918 http://dx.doi.org/10.1186/s12976-019-0117-1 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Review
Liu, Chengyou
Zhou, Leilei
Wang, Yuhe
Tian, Shuchang
Zhu, Junlin
Qin, Hang
Ding, Yong
Jiang, Hongbing
UGM: a more stable procedure for large-scale multiple testing problems, new solutions to identify oncogene
title UGM: a more stable procedure for large-scale multiple testing problems, new solutions to identify oncogene
title_full UGM: a more stable procedure for large-scale multiple testing problems, new solutions to identify oncogene
title_fullStr UGM: a more stable procedure for large-scale multiple testing problems, new solutions to identify oncogene
title_full_unstemmed UGM: a more stable procedure for large-scale multiple testing problems, new solutions to identify oncogene
title_short UGM: a more stable procedure for large-scale multiple testing problems, new solutions to identify oncogene
title_sort ugm: a more stable procedure for large-scale multiple testing problems, new solutions to identify oncogene
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927121/
https://www.ncbi.nlm.nih.gov/pubmed/31865918
http://dx.doi.org/10.1186/s12976-019-0117-1
work_keys_str_mv AT liuchengyou ugmamorestableprocedureforlargescalemultipletestingproblemsnewsolutionstoidentifyoncogene
AT zhouleilei ugmamorestableprocedureforlargescalemultipletestingproblemsnewsolutionstoidentifyoncogene
AT wangyuhe ugmamorestableprocedureforlargescalemultipletestingproblemsnewsolutionstoidentifyoncogene
AT tianshuchang ugmamorestableprocedureforlargescalemultipletestingproblemsnewsolutionstoidentifyoncogene
AT zhujunlin ugmamorestableprocedureforlargescalemultipletestingproblemsnewsolutionstoidentifyoncogene
AT qinhang ugmamorestableprocedureforlargescalemultipletestingproblemsnewsolutionstoidentifyoncogene
AT dingyong ugmamorestableprocedureforlargescalemultipletestingproblemsnewsolutionstoidentifyoncogene
AT jianghongbing ugmamorestableprocedureforlargescalemultipletestingproblemsnewsolutionstoidentifyoncogene