Cargando…
PseudoFuN: Deriving functional potentials of pseudogenes from integrative relationships with genes and microRNAs across 32 cancers
BACKGROUND: Long thought “relics” of evolution, not until recently have pseudogenes been of medical interest regarding regulation in cancer. Often, these regulatory roles are a direct by-product of their close sequence homology to protein-coding genes. Novel pseudogene-gene (PGG) functional associat...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6486473/ https://www.ncbi.nlm.nih.gov/pubmed/31029062 http://dx.doi.org/10.1093/gigascience/giz046 |
_version_ | 1783414345578840064 |
---|---|
author | Johnson, Travis S Li, Sihong Franz, Eric Huang, Zhi Dan Li, Shuyu Campbell, Moray J Huang, Kun Zhang, Yan |
author_facet | Johnson, Travis S Li, Sihong Franz, Eric Huang, Zhi Dan Li, Shuyu Campbell, Moray J Huang, Kun Zhang, Yan |
author_sort | Johnson, Travis S |
collection | PubMed |
description | BACKGROUND: Long thought “relics” of evolution, not until recently have pseudogenes been of medical interest regarding regulation in cancer. Often, these regulatory roles are a direct by-product of their close sequence homology to protein-coding genes. Novel pseudogene-gene (PGG) functional associations can be identified through the integration of biomedical data, such as sequence homology, functional pathways, gene expression, pseudogene expression, and microRNA expression. However, not all of the information has been integrated, and almost all previous pseudogene studies relied on 1:1 pseudogene–parent gene relationships without leveraging other homologous genes/pseudogenes. RESULTS: We produce PGG families that expand beyond the current 1:1 paradigm. First, we construct expansive PGG databases by (i) CUDAlign graphics processing unit (GPU) accelerated local alignment of all pseudogenes to gene families (totaling 1.6 billion individual local alignments and >40,000 GPU hours) and (ii) BLAST-based assignment of pseudogenes to gene families. Second, we create an open-source web application (PseudoFuN [Pseudogene Functional Networks]) to search for integrative functional relationships of sequence homology, microRNA expression, gene expression, pseudogene expression, and gene ontology. We produce four “flavors” of CUDAlign-based databases (>462,000,000 PGG pairwise alignments and 133,770 PGG families) that can be queried and downloaded using PseudoFuN. These databases are consistent with previous 1:1 PGG annotation and also are much more powerful including millions of de novo PGG associations. For example, we find multiple known (e.g., miR-20a-PTEN-PTENP1) and novel (e.g., miR-375-SOX15-PPP4R1L) microRNA-gene-pseudogene associations in prostate cancer. PseudoFuN provides a “one stop shop” for identifying and visualizing thousands of potential regulatory relationships related to pseudogenes in The Cancer Genome Atlas cancers. CONCLUSIONS: Thousands of new PGG associations can be explored in the context of microRNA-gene-pseudogene co-expression and differential expression with a simple-to-use online tool by bioinformaticians and oncologists alike. |
format | Online Article Text |
id | pubmed-6486473 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-64864732019-05-01 PseudoFuN: Deriving functional potentials of pseudogenes from integrative relationships with genes and microRNAs across 32 cancers Johnson, Travis S Li, Sihong Franz, Eric Huang, Zhi Dan Li, Shuyu Campbell, Moray J Huang, Kun Zhang, Yan Gigascience Technical Note BACKGROUND: Long thought “relics” of evolution, not until recently have pseudogenes been of medical interest regarding regulation in cancer. Often, these regulatory roles are a direct by-product of their close sequence homology to protein-coding genes. Novel pseudogene-gene (PGG) functional associations can be identified through the integration of biomedical data, such as sequence homology, functional pathways, gene expression, pseudogene expression, and microRNA expression. However, not all of the information has been integrated, and almost all previous pseudogene studies relied on 1:1 pseudogene–parent gene relationships without leveraging other homologous genes/pseudogenes. RESULTS: We produce PGG families that expand beyond the current 1:1 paradigm. First, we construct expansive PGG databases by (i) CUDAlign graphics processing unit (GPU) accelerated local alignment of all pseudogenes to gene families (totaling 1.6 billion individual local alignments and >40,000 GPU hours) and (ii) BLAST-based assignment of pseudogenes to gene families. Second, we create an open-source web application (PseudoFuN [Pseudogene Functional Networks]) to search for integrative functional relationships of sequence homology, microRNA expression, gene expression, pseudogene expression, and gene ontology. We produce four “flavors” of CUDAlign-based databases (>462,000,000 PGG pairwise alignments and 133,770 PGG families) that can be queried and downloaded using PseudoFuN. These databases are consistent with previous 1:1 PGG annotation and also are much more powerful including millions of de novo PGG associations. For example, we find multiple known (e.g., miR-20a-PTEN-PTENP1) and novel (e.g., miR-375-SOX15-PPP4R1L) microRNA-gene-pseudogene associations in prostate cancer. PseudoFuN provides a “one stop shop” for identifying and visualizing thousands of potential regulatory relationships related to pseudogenes in The Cancer Genome Atlas cancers. CONCLUSIONS: Thousands of new PGG associations can be explored in the context of microRNA-gene-pseudogene co-expression and differential expression with a simple-to-use online tool by bioinformaticians and oncologists alike. Oxford University Press 2019-04-26 /pmc/articles/PMC6486473/ /pubmed/31029062 http://dx.doi.org/10.1093/gigascience/giz046 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Technical Note Johnson, Travis S Li, Sihong Franz, Eric Huang, Zhi Dan Li, Shuyu Campbell, Moray J Huang, Kun Zhang, Yan PseudoFuN: Deriving functional potentials of pseudogenes from integrative relationships with genes and microRNAs across 32 cancers |
title | PseudoFuN: Deriving functional potentials of pseudogenes from integrative relationships with genes and microRNAs across 32 cancers |
title_full | PseudoFuN: Deriving functional potentials of pseudogenes from integrative relationships with genes and microRNAs across 32 cancers |
title_fullStr | PseudoFuN: Deriving functional potentials of pseudogenes from integrative relationships with genes and microRNAs across 32 cancers |
title_full_unstemmed | PseudoFuN: Deriving functional potentials of pseudogenes from integrative relationships with genes and microRNAs across 32 cancers |
title_short | PseudoFuN: Deriving functional potentials of pseudogenes from integrative relationships with genes and microRNAs across 32 cancers |
title_sort | pseudofun: deriving functional potentials of pseudogenes from integrative relationships with genes and micrornas across 32 cancers |
topic | Technical Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6486473/ https://www.ncbi.nlm.nih.gov/pubmed/31029062 http://dx.doi.org/10.1093/gigascience/giz046 |
work_keys_str_mv | AT johnsontraviss pseudofunderivingfunctionalpotentialsofpseudogenesfromintegrativerelationshipswithgenesandmicrornasacross32cancers AT lisihong pseudofunderivingfunctionalpotentialsofpseudogenesfromintegrativerelationshipswithgenesandmicrornasacross32cancers AT franzeric pseudofunderivingfunctionalpotentialsofpseudogenesfromintegrativerelationshipswithgenesandmicrornasacross32cancers AT huangzhi pseudofunderivingfunctionalpotentialsofpseudogenesfromintegrativerelationshipswithgenesandmicrornasacross32cancers AT danlishuyu pseudofunderivingfunctionalpotentialsofpseudogenesfromintegrativerelationshipswithgenesandmicrornasacross32cancers AT campbellmorayj pseudofunderivingfunctionalpotentialsofpseudogenesfromintegrativerelationshipswithgenesandmicrornasacross32cancers AT huangkun pseudofunderivingfunctionalpotentialsofpseudogenesfromintegrativerelationshipswithgenesandmicrornasacross32cancers AT zhangyan pseudofunderivingfunctionalpotentialsofpseudogenesfromintegrativerelationshipswithgenesandmicrornasacross32cancers |