Cargando…
In silico candidate variant and gene identification using inbred mouse strains
Mice are the most widely used animal model to study genotype to phenotype relationships. Inbred mice are genetically identical, which eliminates genetic heterogeneity and makes them particularly useful for genetic studies. Many different strains have been bred over decades and a vast amount of pheno...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7956000/ https://www.ncbi.nlm.nih.gov/pubmed/33763305 http://dx.doi.org/10.7717/peerj.11017 |
_version_ | 1783664357160255488 |
---|---|
author | Munz, Matthias Khodaygani, Mohammad Aherrahrou, Zouhair Busch, Hauke Wohlers, Inken |
author_facet | Munz, Matthias Khodaygani, Mohammad Aherrahrou, Zouhair Busch, Hauke Wohlers, Inken |
author_sort | Munz, Matthias |
collection | PubMed |
description | Mice are the most widely used animal model to study genotype to phenotype relationships. Inbred mice are genetically identical, which eliminates genetic heterogeneity and makes them particularly useful for genetic studies. Many different strains have been bred over decades and a vast amount of phenotypic data has been generated. In addition, recently whole genome sequencing-based genome-wide genotype data for many widely used inbred strains has been released. Here, we present an approach for in silico fine-mapping that uses genotypic data of 37 inbred mouse strains together with phenotypic data provided by the user to propose candidate variants and genes for the phenotype under study. Public genome-wide genotype data covering more than 74 million variant sites is queried efficiently in real-time to provide those variants that are compatible with the observed phenotype differences between strains. Variants can be filtered by molecular consequences and by corresponding molecular impact. Candidate gene lists can be generated from variant lists on the fly. Fine-mapping together with annotation or filtering of results is provided in a Bioconductor package called MouseFM. In order to characterize candidate variant lists under various settings, MouseFM was applied to two expression data sets across 20 inbred mouse strains, one from neutrophils and one from CD4(+) T cells. Fine-mapping was assessed for about 10,000 genes, respectively, and identified candidate variants and haplotypes for many expression quantitative trait loci (eQTLs) reported previously based on these data. For albinism, MouseFM reports only one variant allele of moderate or high molecular impact that only albino mice share: a missense variant in the Tyr gene, reported previously to be causal for this phenotype. Performing in silico fine-mapping for interfrontal bone formation in mice using four strains with and five strains without interfrontal bone results in 12 genes. Of these, three are related to skull shaping abnormality. Finally performing fine-mapping for dystrophic cardiac calcification by comparing 9 strains showing the phenotype with eight strains lacking it, we identify only one moderate impact variant in the known causal gene Abcc6. In summary, this illustrates the benefit of using MouseFM for candidate variant and gene identification. |
format | Online Article Text |
id | pubmed-7956000 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-79560002021-03-23 In silico candidate variant and gene identification using inbred mouse strains Munz, Matthias Khodaygani, Mohammad Aherrahrou, Zouhair Busch, Hauke Wohlers, Inken PeerJ Bioinformatics Mice are the most widely used animal model to study genotype to phenotype relationships. Inbred mice are genetically identical, which eliminates genetic heterogeneity and makes them particularly useful for genetic studies. Many different strains have been bred over decades and a vast amount of phenotypic data has been generated. In addition, recently whole genome sequencing-based genome-wide genotype data for many widely used inbred strains has been released. Here, we present an approach for in silico fine-mapping that uses genotypic data of 37 inbred mouse strains together with phenotypic data provided by the user to propose candidate variants and genes for the phenotype under study. Public genome-wide genotype data covering more than 74 million variant sites is queried efficiently in real-time to provide those variants that are compatible with the observed phenotype differences between strains. Variants can be filtered by molecular consequences and by corresponding molecular impact. Candidate gene lists can be generated from variant lists on the fly. Fine-mapping together with annotation or filtering of results is provided in a Bioconductor package called MouseFM. In order to characterize candidate variant lists under various settings, MouseFM was applied to two expression data sets across 20 inbred mouse strains, one from neutrophils and one from CD4(+) T cells. Fine-mapping was assessed for about 10,000 genes, respectively, and identified candidate variants and haplotypes for many expression quantitative trait loci (eQTLs) reported previously based on these data. For albinism, MouseFM reports only one variant allele of moderate or high molecular impact that only albino mice share: a missense variant in the Tyr gene, reported previously to be causal for this phenotype. Performing in silico fine-mapping for interfrontal bone formation in mice using four strains with and five strains without interfrontal bone results in 12 genes. Of these, three are related to skull shaping abnormality. Finally performing fine-mapping for dystrophic cardiac calcification by comparing 9 strains showing the phenotype with eight strains lacking it, we identify only one moderate impact variant in the known causal gene Abcc6. In summary, this illustrates the benefit of using MouseFM for candidate variant and gene identification. PeerJ Inc. 2021-03-11 /pmc/articles/PMC7956000/ /pubmed/33763305 http://dx.doi.org/10.7717/peerj.11017 Text en ©2021 Munz et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Bioinformatics Munz, Matthias Khodaygani, Mohammad Aherrahrou, Zouhair Busch, Hauke Wohlers, Inken In silico candidate variant and gene identification using inbred mouse strains |
title | In silico candidate variant and gene identification using inbred mouse strains |
title_full | In silico candidate variant and gene identification using inbred mouse strains |
title_fullStr | In silico candidate variant and gene identification using inbred mouse strains |
title_full_unstemmed | In silico candidate variant and gene identification using inbred mouse strains |
title_short | In silico candidate variant and gene identification using inbred mouse strains |
title_sort | in silico candidate variant and gene identification using inbred mouse strains |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7956000/ https://www.ncbi.nlm.nih.gov/pubmed/33763305 http://dx.doi.org/10.7717/peerj.11017 |
work_keys_str_mv | AT munzmatthias insilicocandidatevariantandgeneidentificationusinginbredmousestrains AT khodayganimohammad insilicocandidatevariantandgeneidentificationusinginbredmousestrains AT aherrahrouzouhair insilicocandidatevariantandgeneidentificationusinginbredmousestrains AT buschhauke insilicocandidatevariantandgeneidentificationusinginbredmousestrains AT wohlersinken insilicocandidatevariantandgeneidentificationusinginbredmousestrains |