Cargando…

Using whole-genome sequences of the LG/J and SM/J inbred mouse strains to prioritize quantitative trait genes and nucleotides

BACKGROUND: The laboratory mouse is the most commonly used model for studying variation in complex traits relevant to human disease. Here we present the whole-genome sequences of two inbred strains, LG/J and SM/J, which are frequently used to study variation in complex traits as diverse as aging, bo...

Descripción completa

Detalles Bibliográficos
Autores principales: Nikolskiy, Igor, Conrad, Donald F, Chun, Sung, Fay, Justin C, Cheverud, James M, Lawson, Heather A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4445795/
https://www.ncbi.nlm.nih.gov/pubmed/26016481
http://dx.doi.org/10.1186/s12864-015-1592-3
_version_ 1782373328578674688
author Nikolskiy, Igor
Conrad, Donald F
Chun, Sung
Fay, Justin C
Cheverud, James M
Lawson, Heather A
author_facet Nikolskiy, Igor
Conrad, Donald F
Chun, Sung
Fay, Justin C
Cheverud, James M
Lawson, Heather A
author_sort Nikolskiy, Igor
collection PubMed
description BACKGROUND: The laboratory mouse is the most commonly used model for studying variation in complex traits relevant to human disease. Here we present the whole-genome sequences of two inbred strains, LG/J and SM/J, which are frequently used to study variation in complex traits as diverse as aging, bone-growth, adiposity, maternal behavior, and methamphetamine sensitivity. RESULTS: We identified small nucleotide variants (SNVs) and structural variants (SVs) in the LG/J and SM/J strains relative to the reference genome and discovered novel variants in these two strains by comparing their sequences to other mouse genomes. We find that 39% of the LG/J and SM/J genomes are identical-by-descent (IBD). We characterized amino-acid changing mutations using three algorithms: LRT, PolyPhen-2 and SIFT. We also identified polymorphisms between LG/J and SM/J that fall in regulatory regions and highly informative transcription factor binding sites (TFBS). We intersected these functional predictions with quantitative trait loci (QTL) mapped in advanced intercrosses of these two strains. We find that QTL are both over-represented in non-IBD regions and highly enriched for variants predicted to have a functional impact. Variants in QTL associated with metabolic (231 QTL identified in an F(16) generation) and developmental (41 QTL identified in an F(34) generation) traits were interrogated and we highlight candidate quantitative trait genes (QTG) and nucleotides (QTN) in a QTL on chr13 associated with variation in basal glucose levels and in a QTL on chr6 associated with variation in tibia length. CONCLUSIONS: We show how integrating genomic sequence with QTL reduces the QTL search space and helps researchers prioritize candidate genes and nucleotides for experimental follow-up. Additionally, given the LG/J and SM/J phylogenetic context among inbred strains, these data contribute important information to the genomic landscape of the laboratory mouse. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1592-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4445795
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44457952015-05-28 Using whole-genome sequences of the LG/J and SM/J inbred mouse strains to prioritize quantitative trait genes and nucleotides Nikolskiy, Igor Conrad, Donald F Chun, Sung Fay, Justin C Cheverud, James M Lawson, Heather A BMC Genomics Research Article BACKGROUND: The laboratory mouse is the most commonly used model for studying variation in complex traits relevant to human disease. Here we present the whole-genome sequences of two inbred strains, LG/J and SM/J, which are frequently used to study variation in complex traits as diverse as aging, bone-growth, adiposity, maternal behavior, and methamphetamine sensitivity. RESULTS: We identified small nucleotide variants (SNVs) and structural variants (SVs) in the LG/J and SM/J strains relative to the reference genome and discovered novel variants in these two strains by comparing their sequences to other mouse genomes. We find that 39% of the LG/J and SM/J genomes are identical-by-descent (IBD). We characterized amino-acid changing mutations using three algorithms: LRT, PolyPhen-2 and SIFT. We also identified polymorphisms between LG/J and SM/J that fall in regulatory regions and highly informative transcription factor binding sites (TFBS). We intersected these functional predictions with quantitative trait loci (QTL) mapped in advanced intercrosses of these two strains. We find that QTL are both over-represented in non-IBD regions and highly enriched for variants predicted to have a functional impact. Variants in QTL associated with metabolic (231 QTL identified in an F(16) generation) and developmental (41 QTL identified in an F(34) generation) traits were interrogated and we highlight candidate quantitative trait genes (QTG) and nucleotides (QTN) in a QTL on chr13 associated with variation in basal glucose levels and in a QTL on chr6 associated with variation in tibia length. CONCLUSIONS: We show how integrating genomic sequence with QTL reduces the QTL search space and helps researchers prioritize candidate genes and nucleotides for experimental follow-up. Additionally, given the LG/J and SM/J phylogenetic context among inbred strains, these data contribute important information to the genomic landscape of the laboratory mouse. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1592-3) contains supplementary material, which is available to authorized users. BioMed Central 2015-05-28 /pmc/articles/PMC4445795/ /pubmed/26016481 http://dx.doi.org/10.1186/s12864-015-1592-3 Text en © Nikolskiy et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Nikolskiy, Igor
Conrad, Donald F
Chun, Sung
Fay, Justin C
Cheverud, James M
Lawson, Heather A
Using whole-genome sequences of the LG/J and SM/J inbred mouse strains to prioritize quantitative trait genes and nucleotides
title Using whole-genome sequences of the LG/J and SM/J inbred mouse strains to prioritize quantitative trait genes and nucleotides
title_full Using whole-genome sequences of the LG/J and SM/J inbred mouse strains to prioritize quantitative trait genes and nucleotides
title_fullStr Using whole-genome sequences of the LG/J and SM/J inbred mouse strains to prioritize quantitative trait genes and nucleotides
title_full_unstemmed Using whole-genome sequences of the LG/J and SM/J inbred mouse strains to prioritize quantitative trait genes and nucleotides
title_short Using whole-genome sequences of the LG/J and SM/J inbred mouse strains to prioritize quantitative trait genes and nucleotides
title_sort using whole-genome sequences of the lg/j and sm/j inbred mouse strains to prioritize quantitative trait genes and nucleotides
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4445795/
https://www.ncbi.nlm.nih.gov/pubmed/26016481
http://dx.doi.org/10.1186/s12864-015-1592-3
work_keys_str_mv AT nikolskiyigor usingwholegenomesequencesofthelgjandsmjinbredmousestrainstoprioritizequantitativetraitgenesandnucleotides
AT conraddonaldf usingwholegenomesequencesofthelgjandsmjinbredmousestrainstoprioritizequantitativetraitgenesandnucleotides
AT chunsung usingwholegenomesequencesofthelgjandsmjinbredmousestrainstoprioritizequantitativetraitgenesandnucleotides
AT fayjustinc usingwholegenomesequencesofthelgjandsmjinbredmousestrainstoprioritizequantitativetraitgenesandnucleotides
AT cheverudjamesm usingwholegenomesequencesofthelgjandsmjinbredmousestrainstoprioritizequantitativetraitgenesandnucleotides
AT lawsonheathera usingwholegenomesequencesofthelgjandsmjinbredmousestrainstoprioritizequantitativetraitgenesandnucleotides