Cargando…

A candidate gene identification strategy utilizing mouse to human big-data mining: “3R-tenet” in COPD genetic research

BACKGROUND: Early life impairments leading to lower lung function by adulthood are considered as risk factors for chronic obstructive pulmonary disease (COPD). Recently, we compared the lung transcriptomic profile between two mouse strains with extreme total lung capacities to identify plausible pul...

Descripción completa

Detalles Bibliográficos
Autores principales: Vishweswaraiah, Sangeetha, George, Leema, Purushothaman, Natarajan, Ganguly, Koustav
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5989378/
https://www.ncbi.nlm.nih.gov/pubmed/29871630
http://dx.doi.org/10.1186/s12931-018-0795-y
_version_ 1783329449186426880
author Vishweswaraiah, Sangeetha
George, Leema
Purushothaman, Natarajan
Ganguly, Koustav
author_facet Vishweswaraiah, Sangeetha
George, Leema
Purushothaman, Natarajan
Ganguly, Koustav
author_sort Vishweswaraiah, Sangeetha
collection PubMed
description BACKGROUND: Early life impairments leading to lower lung function by adulthood are considered as risk factors for chronic obstructive pulmonary disease (COPD). Recently, we compared the lung transcriptomic profile between two mouse strains with extreme total lung capacities to identify plausible pulmonary function determining genes using microarray analysis (GSE80078). Advancement of high-throughput techniques like deep sequencing (eg. RNA-seq) and microarray have resulted in an explosion of genomic data in the online public repositories which however remains under-exploited. Strategic curation of publicly available genomic data with a mouse-human translational approach can effectively implement “3R- Tenet” by reducing screening experiments with animals and performing mechanistic studies using physiologically relevant in vitro model systems. Therefore, we sought to analyze the association of functional variations within human orthologs of mouse lung function candidate genes in a publicly available COPD lung RNA-seq data-set. METHODS: Association of missense single nucleotide polymorphisms, insertions, deletions, and splice junction variants were analyzed for susceptibility to COPD using RNA-seq data of a Korean population (GSE57148). Expression of the associated genes were studied using the Gene Paint (mouse embryo) and Human Protein Atlas (normal adult human lung) databases. The genes were also assessed for replication of the associations and expression in COPD−/mouse cigarette smoke exposed lung tissues using other datasets. RESULTS: Significant association (p <  0.05) of variations in 20 genes to higher COPD susceptibility have been detected within the investigated cohort. Association of HJURP, MCRS1 and TLR8 are novel in relation to COPD. The associated ADAM19 and KIT loci have been reported earlier. The remaining 15 genes have also been previously associated to COPD. Differential transcript expression levels of the associated genes in COPD- and/ or mouse emphysematous lung tissues have been detected. CONCLUSION: Our findings suggest strategic mouse-human datamining approaches can identify novel COPD candidate genes using existing datasets in the online repositories. The candidates can be further evaluated for mechanistic role through in vitro studies using appropriate primary cells/cell lines. Functional studies can be limited to transgenic animal models of only well supported candidate genes. This approach will lead to a significant reduction of animal experimentation in respiratory research. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12931-018-0795-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5989378
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-59893782018-06-20 A candidate gene identification strategy utilizing mouse to human big-data mining: “3R-tenet” in COPD genetic research Vishweswaraiah, Sangeetha George, Leema Purushothaman, Natarajan Ganguly, Koustav Respir Res Research BACKGROUND: Early life impairments leading to lower lung function by adulthood are considered as risk factors for chronic obstructive pulmonary disease (COPD). Recently, we compared the lung transcriptomic profile between two mouse strains with extreme total lung capacities to identify plausible pulmonary function determining genes using microarray analysis (GSE80078). Advancement of high-throughput techniques like deep sequencing (eg. RNA-seq) and microarray have resulted in an explosion of genomic data in the online public repositories which however remains under-exploited. Strategic curation of publicly available genomic data with a mouse-human translational approach can effectively implement “3R- Tenet” by reducing screening experiments with animals and performing mechanistic studies using physiologically relevant in vitro model systems. Therefore, we sought to analyze the association of functional variations within human orthologs of mouse lung function candidate genes in a publicly available COPD lung RNA-seq data-set. METHODS: Association of missense single nucleotide polymorphisms, insertions, deletions, and splice junction variants were analyzed for susceptibility to COPD using RNA-seq data of a Korean population (GSE57148). Expression of the associated genes were studied using the Gene Paint (mouse embryo) and Human Protein Atlas (normal adult human lung) databases. The genes were also assessed for replication of the associations and expression in COPD−/mouse cigarette smoke exposed lung tissues using other datasets. RESULTS: Significant association (p <  0.05) of variations in 20 genes to higher COPD susceptibility have been detected within the investigated cohort. Association of HJURP, MCRS1 and TLR8 are novel in relation to COPD. The associated ADAM19 and KIT loci have been reported earlier. The remaining 15 genes have also been previously associated to COPD. Differential transcript expression levels of the associated genes in COPD- and/ or mouse emphysematous lung tissues have been detected. CONCLUSION: Our findings suggest strategic mouse-human datamining approaches can identify novel COPD candidate genes using existing datasets in the online repositories. The candidates can be further evaluated for mechanistic role through in vitro studies using appropriate primary cells/cell lines. Functional studies can be limited to transgenic animal models of only well supported candidate genes. This approach will lead to a significant reduction of animal experimentation in respiratory research. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12931-018-0795-y) contains supplementary material, which is available to authorized users. BioMed Central 2018-06-06 2018 /pmc/articles/PMC5989378/ /pubmed/29871630 http://dx.doi.org/10.1186/s12931-018-0795-y Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Vishweswaraiah, Sangeetha
George, Leema
Purushothaman, Natarajan
Ganguly, Koustav
A candidate gene identification strategy utilizing mouse to human big-data mining: “3R-tenet” in COPD genetic research
title A candidate gene identification strategy utilizing mouse to human big-data mining: “3R-tenet” in COPD genetic research
title_full A candidate gene identification strategy utilizing mouse to human big-data mining: “3R-tenet” in COPD genetic research
title_fullStr A candidate gene identification strategy utilizing mouse to human big-data mining: “3R-tenet” in COPD genetic research
title_full_unstemmed A candidate gene identification strategy utilizing mouse to human big-data mining: “3R-tenet” in COPD genetic research
title_short A candidate gene identification strategy utilizing mouse to human big-data mining: “3R-tenet” in COPD genetic research
title_sort candidate gene identification strategy utilizing mouse to human big-data mining: “3r-tenet” in copd genetic research
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5989378/
https://www.ncbi.nlm.nih.gov/pubmed/29871630
http://dx.doi.org/10.1186/s12931-018-0795-y
work_keys_str_mv AT vishweswaraiahsangeetha acandidategeneidentificationstrategyutilizingmousetohumanbigdatamining3rtenetincopdgeneticresearch
AT georgeleema acandidategeneidentificationstrategyutilizingmousetohumanbigdatamining3rtenetincopdgeneticresearch
AT purushothamannatarajan acandidategeneidentificationstrategyutilizingmousetohumanbigdatamining3rtenetincopdgeneticresearch
AT gangulykoustav acandidategeneidentificationstrategyutilizingmousetohumanbigdatamining3rtenetincopdgeneticresearch
AT vishweswaraiahsangeetha candidategeneidentificationstrategyutilizingmousetohumanbigdatamining3rtenetincopdgeneticresearch
AT georgeleema candidategeneidentificationstrategyutilizingmousetohumanbigdatamining3rtenetincopdgeneticresearch
AT purushothamannatarajan candidategeneidentificationstrategyutilizingmousetohumanbigdatamining3rtenetincopdgeneticresearch
AT gangulykoustav candidategeneidentificationstrategyutilizingmousetohumanbigdatamining3rtenetincopdgeneticresearch