Cargando…
A candidate gene identification strategy utilizing mouse to human big-data mining: “3R-tenet” in COPD genetic research
BACKGROUND: Early life impairments leading to lower lung function by adulthood are considered as risk factors for chronic obstructive pulmonary disease (COPD). Recently, we compared the lung transcriptomic profile between two mouse strains with extreme total lung capacities to identify plausible pul...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5989378/ https://www.ncbi.nlm.nih.gov/pubmed/29871630 http://dx.doi.org/10.1186/s12931-018-0795-y |
_version_ | 1783329449186426880 |
---|---|
author | Vishweswaraiah, Sangeetha George, Leema Purushothaman, Natarajan Ganguly, Koustav |
author_facet | Vishweswaraiah, Sangeetha George, Leema Purushothaman, Natarajan Ganguly, Koustav |
author_sort | Vishweswaraiah, Sangeetha |
collection | PubMed |
description | BACKGROUND: Early life impairments leading to lower lung function by adulthood are considered as risk factors for chronic obstructive pulmonary disease (COPD). Recently, we compared the lung transcriptomic profile between two mouse strains with extreme total lung capacities to identify plausible pulmonary function determining genes using microarray analysis (GSE80078). Advancement of high-throughput techniques like deep sequencing (eg. RNA-seq) and microarray have resulted in an explosion of genomic data in the online public repositories which however remains under-exploited. Strategic curation of publicly available genomic data with a mouse-human translational approach can effectively implement “3R- Tenet” by reducing screening experiments with animals and performing mechanistic studies using physiologically relevant in vitro model systems. Therefore, we sought to analyze the association of functional variations within human orthologs of mouse lung function candidate genes in a publicly available COPD lung RNA-seq data-set. METHODS: Association of missense single nucleotide polymorphisms, insertions, deletions, and splice junction variants were analyzed for susceptibility to COPD using RNA-seq data of a Korean population (GSE57148). Expression of the associated genes were studied using the Gene Paint (mouse embryo) and Human Protein Atlas (normal adult human lung) databases. The genes were also assessed for replication of the associations and expression in COPD−/mouse cigarette smoke exposed lung tissues using other datasets. RESULTS: Significant association (p < 0.05) of variations in 20 genes to higher COPD susceptibility have been detected within the investigated cohort. Association of HJURP, MCRS1 and TLR8 are novel in relation to COPD. The associated ADAM19 and KIT loci have been reported earlier. The remaining 15 genes have also been previously associated to COPD. Differential transcript expression levels of the associated genes in COPD- and/ or mouse emphysematous lung tissues have been detected. CONCLUSION: Our findings suggest strategic mouse-human datamining approaches can identify novel COPD candidate genes using existing datasets in the online repositories. The candidates can be further evaluated for mechanistic role through in vitro studies using appropriate primary cells/cell lines. Functional studies can be limited to transgenic animal models of only well supported candidate genes. This approach will lead to a significant reduction of animal experimentation in respiratory research. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12931-018-0795-y) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5989378 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-59893782018-06-20 A candidate gene identification strategy utilizing mouse to human big-data mining: “3R-tenet” in COPD genetic research Vishweswaraiah, Sangeetha George, Leema Purushothaman, Natarajan Ganguly, Koustav Respir Res Research BACKGROUND: Early life impairments leading to lower lung function by adulthood are considered as risk factors for chronic obstructive pulmonary disease (COPD). Recently, we compared the lung transcriptomic profile between two mouse strains with extreme total lung capacities to identify plausible pulmonary function determining genes using microarray analysis (GSE80078). Advancement of high-throughput techniques like deep sequencing (eg. RNA-seq) and microarray have resulted in an explosion of genomic data in the online public repositories which however remains under-exploited. Strategic curation of publicly available genomic data with a mouse-human translational approach can effectively implement “3R- Tenet” by reducing screening experiments with animals and performing mechanistic studies using physiologically relevant in vitro model systems. Therefore, we sought to analyze the association of functional variations within human orthologs of mouse lung function candidate genes in a publicly available COPD lung RNA-seq data-set. METHODS: Association of missense single nucleotide polymorphisms, insertions, deletions, and splice junction variants were analyzed for susceptibility to COPD using RNA-seq data of a Korean population (GSE57148). Expression of the associated genes were studied using the Gene Paint (mouse embryo) and Human Protein Atlas (normal adult human lung) databases. The genes were also assessed for replication of the associations and expression in COPD−/mouse cigarette smoke exposed lung tissues using other datasets. RESULTS: Significant association (p < 0.05) of variations in 20 genes to higher COPD susceptibility have been detected within the investigated cohort. Association of HJURP, MCRS1 and TLR8 are novel in relation to COPD. The associated ADAM19 and KIT loci have been reported earlier. The remaining 15 genes have also been previously associated to COPD. Differential transcript expression levels of the associated genes in COPD- and/ or mouse emphysematous lung tissues have been detected. CONCLUSION: Our findings suggest strategic mouse-human datamining approaches can identify novel COPD candidate genes using existing datasets in the online repositories. The candidates can be further evaluated for mechanistic role through in vitro studies using appropriate primary cells/cell lines. Functional studies can be limited to transgenic animal models of only well supported candidate genes. This approach will lead to a significant reduction of animal experimentation in respiratory research. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12931-018-0795-y) contains supplementary material, which is available to authorized users. BioMed Central 2018-06-06 2018 /pmc/articles/PMC5989378/ /pubmed/29871630 http://dx.doi.org/10.1186/s12931-018-0795-y Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Vishweswaraiah, Sangeetha George, Leema Purushothaman, Natarajan Ganguly, Koustav A candidate gene identification strategy utilizing mouse to human big-data mining: “3R-tenet” in COPD genetic research |
title | A candidate gene identification strategy utilizing mouse to human big-data mining: “3R-tenet” in COPD genetic research |
title_full | A candidate gene identification strategy utilizing mouse to human big-data mining: “3R-tenet” in COPD genetic research |
title_fullStr | A candidate gene identification strategy utilizing mouse to human big-data mining: “3R-tenet” in COPD genetic research |
title_full_unstemmed | A candidate gene identification strategy utilizing mouse to human big-data mining: “3R-tenet” in COPD genetic research |
title_short | A candidate gene identification strategy utilizing mouse to human big-data mining: “3R-tenet” in COPD genetic research |
title_sort | candidate gene identification strategy utilizing mouse to human big-data mining: “3r-tenet” in copd genetic research |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5989378/ https://www.ncbi.nlm.nih.gov/pubmed/29871630 http://dx.doi.org/10.1186/s12931-018-0795-y |
work_keys_str_mv | AT vishweswaraiahsangeetha acandidategeneidentificationstrategyutilizingmousetohumanbigdatamining3rtenetincopdgeneticresearch AT georgeleema acandidategeneidentificationstrategyutilizingmousetohumanbigdatamining3rtenetincopdgeneticresearch AT purushothamannatarajan acandidategeneidentificationstrategyutilizingmousetohumanbigdatamining3rtenetincopdgeneticresearch AT gangulykoustav acandidategeneidentificationstrategyutilizingmousetohumanbigdatamining3rtenetincopdgeneticresearch AT vishweswaraiahsangeetha candidategeneidentificationstrategyutilizingmousetohumanbigdatamining3rtenetincopdgeneticresearch AT georgeleema candidategeneidentificationstrategyutilizingmousetohumanbigdatamining3rtenetincopdgeneticresearch AT purushothamannatarajan candidategeneidentificationstrategyutilizingmousetohumanbigdatamining3rtenetincopdgeneticresearch AT gangulykoustav candidategeneidentificationstrategyutilizingmousetohumanbigdatamining3rtenetincopdgeneticresearch |