Cargando…
UGM: a more stable procedure for large-scale multiple testing problems, new solutions to identify oncogene
Variations of gene expression levels play an important role in tumors. There are numerous methods to identify differentially expressed genes in high-throughput sequencing. Several algorithms endeavor to identify distinctive genetic patterns susceptable to particular diseases. Although these processe...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927121/ https://www.ncbi.nlm.nih.gov/pubmed/31865918 http://dx.doi.org/10.1186/s12976-019-0117-1 |
_version_ | 1783482244232380416 |
---|---|
author | Liu, Chengyou Zhou, Leilei Wang, Yuhe Tian, Shuchang Zhu, Junlin Qin, Hang Ding, Yong Jiang, Hongbing |
author_facet | Liu, Chengyou Zhou, Leilei Wang, Yuhe Tian, Shuchang Zhu, Junlin Qin, Hang Ding, Yong Jiang, Hongbing |
author_sort | Liu, Chengyou |
collection | PubMed |
description | Variations of gene expression levels play an important role in tumors. There are numerous methods to identify differentially expressed genes in high-throughput sequencing. Several algorithms endeavor to identify distinctive genetic patterns susceptable to particular diseases. Although these processes have been proved successful, the probability that the number of non-differentially expressed genes measured by false discovery rate (FDR) has a large standard deviation, and the misidentification rate (type I error) grows rapidly when the number of genes to be detected become larger. In this study we developed a new method, Unit Gamma Measurement (UGM), accounting for multiple hypotheses test statistics distribution, which could reduce the dependency problem. Simulated expression profile data and breast cancer RNA-Seq data were utilized to testify the accuracy of UGM. The results show that the number of non-differentially expressed genes identified by the UGM is very close to the real-evidence data, and the UGM also has a smaller standard error, range, quartile range and RMS error. In addition, the UGM can be used to screen many breast cancer-associated genes, such as BRCA1, BRCA2, PTEN, BRIP1, etc., provides better accuracy, robustness and efficiency, the method of identification differentially expressed genes in high-throughput sequencing. |
format | Online Article Text |
id | pubmed-6927121 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-69271212019-12-30 UGM: a more stable procedure for large-scale multiple testing problems, new solutions to identify oncogene Liu, Chengyou Zhou, Leilei Wang, Yuhe Tian, Shuchang Zhu, Junlin Qin, Hang Ding, Yong Jiang, Hongbing Theor Biol Med Model Review Variations of gene expression levels play an important role in tumors. There are numerous methods to identify differentially expressed genes in high-throughput sequencing. Several algorithms endeavor to identify distinctive genetic patterns susceptable to particular diseases. Although these processes have been proved successful, the probability that the number of non-differentially expressed genes measured by false discovery rate (FDR) has a large standard deviation, and the misidentification rate (type I error) grows rapidly when the number of genes to be detected become larger. In this study we developed a new method, Unit Gamma Measurement (UGM), accounting for multiple hypotheses test statistics distribution, which could reduce the dependency problem. Simulated expression profile data and breast cancer RNA-Seq data were utilized to testify the accuracy of UGM. The results show that the number of non-differentially expressed genes identified by the UGM is very close to the real-evidence data, and the UGM also has a smaller standard error, range, quartile range and RMS error. In addition, the UGM can be used to screen many breast cancer-associated genes, such as BRCA1, BRCA2, PTEN, BRIP1, etc., provides better accuracy, robustness and efficiency, the method of identification differentially expressed genes in high-throughput sequencing. BioMed Central 2019-12-23 /pmc/articles/PMC6927121/ /pubmed/31865918 http://dx.doi.org/10.1186/s12976-019-0117-1 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Review Liu, Chengyou Zhou, Leilei Wang, Yuhe Tian, Shuchang Zhu, Junlin Qin, Hang Ding, Yong Jiang, Hongbing UGM: a more stable procedure for large-scale multiple testing problems, new solutions to identify oncogene |
title | UGM: a more stable procedure for large-scale multiple testing problems, new solutions to identify oncogene |
title_full | UGM: a more stable procedure for large-scale multiple testing problems, new solutions to identify oncogene |
title_fullStr | UGM: a more stable procedure for large-scale multiple testing problems, new solutions to identify oncogene |
title_full_unstemmed | UGM: a more stable procedure for large-scale multiple testing problems, new solutions to identify oncogene |
title_short | UGM: a more stable procedure for large-scale multiple testing problems, new solutions to identify oncogene |
title_sort | ugm: a more stable procedure for large-scale multiple testing problems, new solutions to identify oncogene |
topic | Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927121/ https://www.ncbi.nlm.nih.gov/pubmed/31865918 http://dx.doi.org/10.1186/s12976-019-0117-1 |
work_keys_str_mv | AT liuchengyou ugmamorestableprocedureforlargescalemultipletestingproblemsnewsolutionstoidentifyoncogene AT zhouleilei ugmamorestableprocedureforlargescalemultipletestingproblemsnewsolutionstoidentifyoncogene AT wangyuhe ugmamorestableprocedureforlargescalemultipletestingproblemsnewsolutionstoidentifyoncogene AT tianshuchang ugmamorestableprocedureforlargescalemultipletestingproblemsnewsolutionstoidentifyoncogene AT zhujunlin ugmamorestableprocedureforlargescalemultipletestingproblemsnewsolutionstoidentifyoncogene AT qinhang ugmamorestableprocedureforlargescalemultipletestingproblemsnewsolutionstoidentifyoncogene AT dingyong ugmamorestableprocedureforlargescalemultipletestingproblemsnewsolutionstoidentifyoncogene AT jianghongbing ugmamorestableprocedureforlargescalemultipletestingproblemsnewsolutionstoidentifyoncogene |