Cargando…

In silico candidate variant and gene identification using inbred mouse strains

Mice are the most widely used animal model to study genotype to phenotype relationships. Inbred mice are genetically identical, which eliminates genetic heterogeneity and makes them particularly useful for genetic studies. Many different strains have been bred over decades and a vast amount of pheno...

Descripción completa

Detalles Bibliográficos
Autores principales: Munz, Matthias, Khodaygani, Mohammad, Aherrahrou, Zouhair, Busch, Hauke, Wohlers, Inken
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7956000/
https://www.ncbi.nlm.nih.gov/pubmed/33763305
http://dx.doi.org/10.7717/peerj.11017
_version_ 1783664357160255488
author Munz, Matthias
Khodaygani, Mohammad
Aherrahrou, Zouhair
Busch, Hauke
Wohlers, Inken
author_facet Munz, Matthias
Khodaygani, Mohammad
Aherrahrou, Zouhair
Busch, Hauke
Wohlers, Inken
author_sort Munz, Matthias
collection PubMed
description Mice are the most widely used animal model to study genotype to phenotype relationships. Inbred mice are genetically identical, which eliminates genetic heterogeneity and makes them particularly useful for genetic studies. Many different strains have been bred over decades and a vast amount of phenotypic data has been generated. In addition, recently whole genome sequencing-based genome-wide genotype data for many widely used inbred strains has been released. Here, we present an approach for in silico fine-mapping that uses genotypic data of 37 inbred mouse strains together with phenotypic data provided by the user to propose candidate variants and genes for the phenotype under study. Public genome-wide genotype data covering more than 74 million variant sites is queried efficiently in real-time to provide those variants that are compatible with the observed phenotype differences between strains. Variants can be filtered by molecular consequences and by corresponding molecular impact. Candidate gene lists can be generated from variant lists on the fly. Fine-mapping together with annotation or filtering of results is provided in a Bioconductor package called MouseFM. In order to characterize candidate variant lists under various settings, MouseFM was applied to two expression data sets across 20 inbred mouse strains, one from neutrophils and one from CD4(+) T cells. Fine-mapping was assessed for about 10,000 genes, respectively, and identified candidate variants and haplotypes for many expression quantitative trait loci (eQTLs) reported previously based on these data. For albinism, MouseFM reports only one variant allele of moderate or high molecular impact that only albino mice share: a missense variant in the Tyr gene, reported previously to be causal for this phenotype. Performing in silico fine-mapping for interfrontal bone formation in mice using four strains with and five strains without interfrontal bone results in 12 genes. Of these, three are related to skull shaping abnormality. Finally performing fine-mapping for dystrophic cardiac calcification by comparing 9 strains showing the phenotype with eight strains lacking it, we identify only one moderate impact variant in the known causal gene Abcc6. In summary, this illustrates the benefit of using MouseFM for candidate variant and gene identification.
format Online
Article
Text
id pubmed-7956000
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-79560002021-03-23 In silico candidate variant and gene identification using inbred mouse strains Munz, Matthias Khodaygani, Mohammad Aherrahrou, Zouhair Busch, Hauke Wohlers, Inken PeerJ Bioinformatics Mice are the most widely used animal model to study genotype to phenotype relationships. Inbred mice are genetically identical, which eliminates genetic heterogeneity and makes them particularly useful for genetic studies. Many different strains have been bred over decades and a vast amount of phenotypic data has been generated. In addition, recently whole genome sequencing-based genome-wide genotype data for many widely used inbred strains has been released. Here, we present an approach for in silico fine-mapping that uses genotypic data of 37 inbred mouse strains together with phenotypic data provided by the user to propose candidate variants and genes for the phenotype under study. Public genome-wide genotype data covering more than 74 million variant sites is queried efficiently in real-time to provide those variants that are compatible with the observed phenotype differences between strains. Variants can be filtered by molecular consequences and by corresponding molecular impact. Candidate gene lists can be generated from variant lists on the fly. Fine-mapping together with annotation or filtering of results is provided in a Bioconductor package called MouseFM. In order to characterize candidate variant lists under various settings, MouseFM was applied to two expression data sets across 20 inbred mouse strains, one from neutrophils and one from CD4(+) T cells. Fine-mapping was assessed for about 10,000 genes, respectively, and identified candidate variants and haplotypes for many expression quantitative trait loci (eQTLs) reported previously based on these data. For albinism, MouseFM reports only one variant allele of moderate or high molecular impact that only albino mice share: a missense variant in the Tyr gene, reported previously to be causal for this phenotype. Performing in silico fine-mapping for interfrontal bone formation in mice using four strains with and five strains without interfrontal bone results in 12 genes. Of these, three are related to skull shaping abnormality. Finally performing fine-mapping for dystrophic cardiac calcification by comparing 9 strains showing the phenotype with eight strains lacking it, we identify only one moderate impact variant in the known causal gene Abcc6. In summary, this illustrates the benefit of using MouseFM for candidate variant and gene identification. PeerJ Inc. 2021-03-11 /pmc/articles/PMC7956000/ /pubmed/33763305 http://dx.doi.org/10.7717/peerj.11017 Text en ©2021 Munz et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Munz, Matthias
Khodaygani, Mohammad
Aherrahrou, Zouhair
Busch, Hauke
Wohlers, Inken
In silico candidate variant and gene identification using inbred mouse strains
title In silico candidate variant and gene identification using inbred mouse strains
title_full In silico candidate variant and gene identification using inbred mouse strains
title_fullStr In silico candidate variant and gene identification using inbred mouse strains
title_full_unstemmed In silico candidate variant and gene identification using inbred mouse strains
title_short In silico candidate variant and gene identification using inbred mouse strains
title_sort in silico candidate variant and gene identification using inbred mouse strains
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7956000/
https://www.ncbi.nlm.nih.gov/pubmed/33763305
http://dx.doi.org/10.7717/peerj.11017
work_keys_str_mv AT munzmatthias insilicocandidatevariantandgeneidentificationusinginbredmousestrains
AT khodayganimohammad insilicocandidatevariantandgeneidentificationusinginbredmousestrains
AT aherrahrouzouhair insilicocandidatevariantandgeneidentificationusinginbredmousestrains
AT buschhauke insilicocandidatevariantandgeneidentificationusinginbredmousestrains
AT wohlersinken insilicocandidatevariantandgeneidentificationusinginbredmousestrains