Cargando…

PseudoFuN: Deriving functional potentials of pseudogenes from integrative relationships with genes and microRNAs across 32 cancers

BACKGROUND: Long thought “relics” of evolution, not until recently have pseudogenes been of medical interest regarding regulation in cancer. Often, these regulatory roles are a direct by-product of their close sequence homology to protein-coding genes. Novel pseudogene-gene (PGG) functional associat...

Descripción completa

Detalles Bibliográficos
Autores principales: Johnson, Travis S, Li, Sihong, Franz, Eric, Huang, Zhi, Dan Li, Shuyu, Campbell, Moray J, Huang, Kun, Zhang, Yan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6486473/
https://www.ncbi.nlm.nih.gov/pubmed/31029062
http://dx.doi.org/10.1093/gigascience/giz046
_version_ 1783414345578840064
author Johnson, Travis S
Li, Sihong
Franz, Eric
Huang, Zhi
Dan Li, Shuyu
Campbell, Moray J
Huang, Kun
Zhang, Yan
author_facet Johnson, Travis S
Li, Sihong
Franz, Eric
Huang, Zhi
Dan Li, Shuyu
Campbell, Moray J
Huang, Kun
Zhang, Yan
author_sort Johnson, Travis S
collection PubMed
description BACKGROUND: Long thought “relics” of evolution, not until recently have pseudogenes been of medical interest regarding regulation in cancer. Often, these regulatory roles are a direct by-product of their close sequence homology to protein-coding genes. Novel pseudogene-gene (PGG) functional associations can be identified through the integration of biomedical data, such as sequence homology, functional pathways, gene expression, pseudogene expression, and microRNA expression. However, not all of the information has been integrated, and almost all previous pseudogene studies relied on 1:1 pseudogene–parent gene relationships without leveraging other homologous genes/pseudogenes. RESULTS: We produce PGG families that expand beyond the current 1:1 paradigm. First, we construct expansive PGG databases by (i) CUDAlign graphics processing unit (GPU) accelerated local alignment of all pseudogenes to gene families (totaling 1.6 billion individual local alignments and >40,000 GPU hours) and (ii) BLAST-based assignment of pseudogenes to gene families. Second, we create an open-source web application (PseudoFuN [Pseudogene Functional Networks]) to search for integrative functional relationships of sequence homology, microRNA expression, gene expression, pseudogene expression, and gene ontology. We produce four “flavors” of CUDAlign-based databases (>462,000,000 PGG pairwise alignments and 133,770 PGG families) that can be queried and downloaded using PseudoFuN. These databases are consistent with previous 1:1 PGG annotation and also are much more powerful including millions of de novo PGG associations. For example, we find multiple known (e.g., miR-20a-PTEN-PTENP1) and novel (e.g., miR-375-SOX15-PPP4R1L) microRNA-gene-pseudogene associations in prostate cancer. PseudoFuN provides a “one stop shop” for identifying and visualizing thousands of potential regulatory relationships related to pseudogenes in The Cancer Genome Atlas cancers. CONCLUSIONS: Thousands of new PGG associations can be explored in the context of microRNA-gene-pseudogene co-expression and differential expression with a simple-to-use online tool by bioinformaticians and oncologists alike.
format Online
Article
Text
id pubmed-6486473
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-64864732019-05-01 PseudoFuN: Deriving functional potentials of pseudogenes from integrative relationships with genes and microRNAs across 32 cancers Johnson, Travis S Li, Sihong Franz, Eric Huang, Zhi Dan Li, Shuyu Campbell, Moray J Huang, Kun Zhang, Yan Gigascience Technical Note BACKGROUND: Long thought “relics” of evolution, not until recently have pseudogenes been of medical interest regarding regulation in cancer. Often, these regulatory roles are a direct by-product of their close sequence homology to protein-coding genes. Novel pseudogene-gene (PGG) functional associations can be identified through the integration of biomedical data, such as sequence homology, functional pathways, gene expression, pseudogene expression, and microRNA expression. However, not all of the information has been integrated, and almost all previous pseudogene studies relied on 1:1 pseudogene–parent gene relationships without leveraging other homologous genes/pseudogenes. RESULTS: We produce PGG families that expand beyond the current 1:1 paradigm. First, we construct expansive PGG databases by (i) CUDAlign graphics processing unit (GPU) accelerated local alignment of all pseudogenes to gene families (totaling 1.6 billion individual local alignments and >40,000 GPU hours) and (ii) BLAST-based assignment of pseudogenes to gene families. Second, we create an open-source web application (PseudoFuN [Pseudogene Functional Networks]) to search for integrative functional relationships of sequence homology, microRNA expression, gene expression, pseudogene expression, and gene ontology. We produce four “flavors” of CUDAlign-based databases (>462,000,000 PGG pairwise alignments and 133,770 PGG families) that can be queried and downloaded using PseudoFuN. These databases are consistent with previous 1:1 PGG annotation and also are much more powerful including millions of de novo PGG associations. For example, we find multiple known (e.g., miR-20a-PTEN-PTENP1) and novel (e.g., miR-375-SOX15-PPP4R1L) microRNA-gene-pseudogene associations in prostate cancer. PseudoFuN provides a “one stop shop” for identifying and visualizing thousands of potential regulatory relationships related to pseudogenes in The Cancer Genome Atlas cancers. CONCLUSIONS: Thousands of new PGG associations can be explored in the context of microRNA-gene-pseudogene co-expression and differential expression with a simple-to-use online tool by bioinformaticians and oncologists alike. Oxford University Press 2019-04-26 /pmc/articles/PMC6486473/ /pubmed/31029062 http://dx.doi.org/10.1093/gigascience/giz046 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Johnson, Travis S
Li, Sihong
Franz, Eric
Huang, Zhi
Dan Li, Shuyu
Campbell, Moray J
Huang, Kun
Zhang, Yan
PseudoFuN: Deriving functional potentials of pseudogenes from integrative relationships with genes and microRNAs across 32 cancers
title PseudoFuN: Deriving functional potentials of pseudogenes from integrative relationships with genes and microRNAs across 32 cancers
title_full PseudoFuN: Deriving functional potentials of pseudogenes from integrative relationships with genes and microRNAs across 32 cancers
title_fullStr PseudoFuN: Deriving functional potentials of pseudogenes from integrative relationships with genes and microRNAs across 32 cancers
title_full_unstemmed PseudoFuN: Deriving functional potentials of pseudogenes from integrative relationships with genes and microRNAs across 32 cancers
title_short PseudoFuN: Deriving functional potentials of pseudogenes from integrative relationships with genes and microRNAs across 32 cancers
title_sort pseudofun: deriving functional potentials of pseudogenes from integrative relationships with genes and micrornas across 32 cancers
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6486473/
https://www.ncbi.nlm.nih.gov/pubmed/31029062
http://dx.doi.org/10.1093/gigascience/giz046
work_keys_str_mv AT johnsontraviss pseudofunderivingfunctionalpotentialsofpseudogenesfromintegrativerelationshipswithgenesandmicrornasacross32cancers
AT lisihong pseudofunderivingfunctionalpotentialsofpseudogenesfromintegrativerelationshipswithgenesandmicrornasacross32cancers
AT franzeric pseudofunderivingfunctionalpotentialsofpseudogenesfromintegrativerelationshipswithgenesandmicrornasacross32cancers
AT huangzhi pseudofunderivingfunctionalpotentialsofpseudogenesfromintegrativerelationshipswithgenesandmicrornasacross32cancers
AT danlishuyu pseudofunderivingfunctionalpotentialsofpseudogenesfromintegrativerelationshipswithgenesandmicrornasacross32cancers
AT campbellmorayj pseudofunderivingfunctionalpotentialsofpseudogenesfromintegrativerelationshipswithgenesandmicrornasacross32cancers
AT huangkun pseudofunderivingfunctionalpotentialsofpseudogenesfromintegrativerelationshipswithgenesandmicrornasacross32cancers
AT zhangyan pseudofunderivingfunctionalpotentialsofpseudogenesfromintegrativerelationshipswithgenesandmicrornasacross32cancers