Cargando…
Genomes of the Mouse Collaborative Cross
The Collaborative Cross (CC) is a multiparent panel of recombinant inbred (RI) mouse strains derived from eight founder laboratory strains. RI panels are popular because of their long-term genetic stability, which enhances reproducibility and integration of data collected across time and conditions....
Autores principales: | , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Genetics Society of America
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5499171/ https://www.ncbi.nlm.nih.gov/pubmed/28592495 http://dx.doi.org/10.1534/genetics.116.198838 |
_version_ | 1783248427612635136 |
---|---|
author | Srivastava, Anuj Morgan, Andrew P. Najarian, Maya L. Sarsani, Vishal Kumar Sigmon, J. Sebastian Shorter, John R. Kashfeen, Anwica McMullan, Rachel C. Williams, Lucy H. Giusti-Rodríguez, Paola Ferris, Martin T. Sullivan, Patrick Hock, Pablo Miller, Darla R. Bell, Timothy A. McMillan, Leonard Churchill, Gary A. de Villena, Fernando Pardo-Manuel |
author_facet | Srivastava, Anuj Morgan, Andrew P. Najarian, Maya L. Sarsani, Vishal Kumar Sigmon, J. Sebastian Shorter, John R. Kashfeen, Anwica McMullan, Rachel C. Williams, Lucy H. Giusti-Rodríguez, Paola Ferris, Martin T. Sullivan, Patrick Hock, Pablo Miller, Darla R. Bell, Timothy A. McMillan, Leonard Churchill, Gary A. de Villena, Fernando Pardo-Manuel |
author_sort | Srivastava, Anuj |
collection | PubMed |
description | The Collaborative Cross (CC) is a multiparent panel of recombinant inbred (RI) mouse strains derived from eight founder laboratory strains. RI panels are popular because of their long-term genetic stability, which enhances reproducibility and integration of data collected across time and conditions. Characterization of their genomes can be a community effort, reducing the burden on individual users. Here we present the genomes of the CC strains using two complementary approaches as a resource to improve power and interpretation of genetic experiments. Our study also provides a cautionary tale regarding the limitations imposed by such basic biological processes as mutation and selection. A distinct advantage of inbred panels is that genotyping only needs to be performed on the panel, not on each individual mouse. The initial CC genome data were haplotype reconstructions based on dense genotyping of the most recent common ancestors (MRCAs) of each strain followed by imputation from the genome sequence of the corresponding founder inbred strain. The MRCA resource captured segregating regions in strains that were not fully inbred, but it had limited resolution in the transition regions between founder haplotypes, and there was uncertainty about founder assignment in regions of limited diversity. Here we report the whole genome sequence of 69 CC strains generated by paired-end short reads at 30× coverage of a single male per strain. Sequencing leads to a substantial improvement in the fine structure and completeness of the genomes of the CC. Both MRCAs and sequenced samples show a significant reduction in the genome-wide haplotype frequencies from two wild-derived strains, CAST/EiJ and PWK/PhJ. In addition, analysis of the evolution of the patterns of heterozygosity indicates that selection against three wild-derived founder strains played a significant role in shaping the genomes of the CC. The sequencing resource provides the first description of tens of thousands of new genetic variants introduced by mutation and drift in the CC genomes. We estimate that new SNP mutations are accumulating in each CC strain at a rate of 2.4 ± 0.4 per gigabase per generation. The fixation of new mutations by genetic drift has introduced thousands of new variants into the CC strains. The majority of these mutations are novel compared to currently sequenced laboratory stocks and wild mice, and some are predicted to alter gene function. Approximately one-third of the CC inbred strains have acquired large deletions (>10 kb) many of which overlap known coding genes and functional elements. The sequence of these mice is a critical resource to CC users, increases threefold the number of mouse inbred strain genomes available publicly, and provides insight into the effect of mutation and drift on common resources. |
format | Online Article Text |
id | pubmed-5499171 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Genetics Society of America |
record_format | MEDLINE/PubMed |
spelling | pubmed-54991712017-07-10 Genomes of the Mouse Collaborative Cross Srivastava, Anuj Morgan, Andrew P. Najarian, Maya L. Sarsani, Vishal Kumar Sigmon, J. Sebastian Shorter, John R. Kashfeen, Anwica McMullan, Rachel C. Williams, Lucy H. Giusti-Rodríguez, Paola Ferris, Martin T. Sullivan, Patrick Hock, Pablo Miller, Darla R. Bell, Timothy A. McMillan, Leonard Churchill, Gary A. de Villena, Fernando Pardo-Manuel Genetics Multiparental Populations The Collaborative Cross (CC) is a multiparent panel of recombinant inbred (RI) mouse strains derived from eight founder laboratory strains. RI panels are popular because of their long-term genetic stability, which enhances reproducibility and integration of data collected across time and conditions. Characterization of their genomes can be a community effort, reducing the burden on individual users. Here we present the genomes of the CC strains using two complementary approaches as a resource to improve power and interpretation of genetic experiments. Our study also provides a cautionary tale regarding the limitations imposed by such basic biological processes as mutation and selection. A distinct advantage of inbred panels is that genotyping only needs to be performed on the panel, not on each individual mouse. The initial CC genome data were haplotype reconstructions based on dense genotyping of the most recent common ancestors (MRCAs) of each strain followed by imputation from the genome sequence of the corresponding founder inbred strain. The MRCA resource captured segregating regions in strains that were not fully inbred, but it had limited resolution in the transition regions between founder haplotypes, and there was uncertainty about founder assignment in regions of limited diversity. Here we report the whole genome sequence of 69 CC strains generated by paired-end short reads at 30× coverage of a single male per strain. Sequencing leads to a substantial improvement in the fine structure and completeness of the genomes of the CC. Both MRCAs and sequenced samples show a significant reduction in the genome-wide haplotype frequencies from two wild-derived strains, CAST/EiJ and PWK/PhJ. In addition, analysis of the evolution of the patterns of heterozygosity indicates that selection against three wild-derived founder strains played a significant role in shaping the genomes of the CC. The sequencing resource provides the first description of tens of thousands of new genetic variants introduced by mutation and drift in the CC genomes. We estimate that new SNP mutations are accumulating in each CC strain at a rate of 2.4 ± 0.4 per gigabase per generation. The fixation of new mutations by genetic drift has introduced thousands of new variants into the CC strains. The majority of these mutations are novel compared to currently sequenced laboratory stocks and wild mice, and some are predicted to alter gene function. Approximately one-third of the CC inbred strains have acquired large deletions (>10 kb) many of which overlap known coding genes and functional elements. The sequence of these mice is a critical resource to CC users, increases threefold the number of mouse inbred strain genomes available publicly, and provides insight into the effect of mutation and drift on common resources. Genetics Society of America 2017-06 2017-06-06 /pmc/articles/PMC5499171/ /pubmed/28592495 http://dx.doi.org/10.1534/genetics.116.198838 Text en Copyright © 2017 Srivastava et al. Available freely online through the author-supported open access option. This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Multiparental Populations Srivastava, Anuj Morgan, Andrew P. Najarian, Maya L. Sarsani, Vishal Kumar Sigmon, J. Sebastian Shorter, John R. Kashfeen, Anwica McMullan, Rachel C. Williams, Lucy H. Giusti-Rodríguez, Paola Ferris, Martin T. Sullivan, Patrick Hock, Pablo Miller, Darla R. Bell, Timothy A. McMillan, Leonard Churchill, Gary A. de Villena, Fernando Pardo-Manuel Genomes of the Mouse Collaborative Cross |
title | Genomes of the Mouse Collaborative Cross |
title_full | Genomes of the Mouse Collaborative Cross |
title_fullStr | Genomes of the Mouse Collaborative Cross |
title_full_unstemmed | Genomes of the Mouse Collaborative Cross |
title_short | Genomes of the Mouse Collaborative Cross |
title_sort | genomes of the mouse collaborative cross |
topic | Multiparental Populations |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5499171/ https://www.ncbi.nlm.nih.gov/pubmed/28592495 http://dx.doi.org/10.1534/genetics.116.198838 |
work_keys_str_mv | AT srivastavaanuj genomesofthemousecollaborativecross AT morganandrewp genomesofthemousecollaborativecross AT najarianmayal genomesofthemousecollaborativecross AT sarsanivishalkumar genomesofthemousecollaborativecross AT sigmonjsebastian genomesofthemousecollaborativecross AT shorterjohnr genomesofthemousecollaborativecross AT kashfeenanwica genomesofthemousecollaborativecross AT mcmullanrachelc genomesofthemousecollaborativecross AT williamslucyh genomesofthemousecollaborativecross AT giustirodriguezpaola genomesofthemousecollaborativecross AT ferrismartint genomesofthemousecollaborativecross AT sullivanpatrick genomesofthemousecollaborativecross AT hockpablo genomesofthemousecollaborativecross AT millerdarlar genomesofthemousecollaborativecross AT belltimothya genomesofthemousecollaborativecross AT mcmillanleonard genomesofthemousecollaborativecross AT churchillgarya genomesofthemousecollaborativecross AT devillenafernandopardomanuel genomesofthemousecollaborativecross |