Cargando…
Massively parallel sequencing of the mouse exome to accurately identify rare, induced mutations: an immediate source for thousands of new mouse models
Accurate identification of sparse heterozygous single-nucleotide variants (SNVs) is a critical challenge for identifying the causative mutations in mouse genetic screens, human genetic diseases and cancer. When seeking to identify causal DNA variants that occur at such low rates, they are overwhelme...
Autores principales: | , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3376740/ https://www.ncbi.nlm.nih.gov/pubmed/22724066 http://dx.doi.org/10.1098/rsob.120061 |
_version_ | 1782235863245127680 |
---|---|
author | Andrews, T. D. Whittle, B. Field, M. A. Balakishnan, B. Zhang, Y. Shao, Y. Cho, V. Kirk, M. Singh, M. Xia, Y. Hager, J. Winslade, S. Sjollema, G. Beutler, B. Enders, A. Goodnow, C. C. |
author_facet | Andrews, T. D. Whittle, B. Field, M. A. Balakishnan, B. Zhang, Y. Shao, Y. Cho, V. Kirk, M. Singh, M. Xia, Y. Hager, J. Winslade, S. Sjollema, G. Beutler, B. Enders, A. Goodnow, C. C. |
author_sort | Andrews, T. D. |
collection | PubMed |
description | Accurate identification of sparse heterozygous single-nucleotide variants (SNVs) is a critical challenge for identifying the causative mutations in mouse genetic screens, human genetic diseases and cancer. When seeking to identify causal DNA variants that occur at such low rates, they are overwhelmed by false-positive calls that arise from a range of technical and biological sources. We describe a strategy using whole-exome capture, massively parallel DNA sequencing and computational analysis, which identifies with a low false-positive rate the majority of heterozygous and homozygous SNVs arising de novo with a frequency of one nucleotide substitution per megabase in progeny of N-ethyl-N-nitrosourea (ENU)-mutated C57BL/6j mice. We found that by applying a strategy of filtering raw SNV calls against known and platform-specific variants we could call true SNVs with a false-positive rate of 19.4 per cent and an estimated false-negative rate of 21.3 per cent. These error rates are small enough to enable calling a causative mutation from both homozygous and heterozygous candidate mutation lists with little or no further experimental validation. The efficacy of this approach is demonstrated by identifying the causative mutation in the Ptprc gene in a lymphocyte-deficient strain and in 11 other strains with immune disorders or obesity, without the need for meiotic mapping. Exome sequencing of first-generation mutant mice revealed hundreds of unphenotyped protein-changing mutations, 52 per cent of which are predicted to be deleterious, which now become available for breeding and experimental analysis. We show that exome sequencing data alone are sufficient to identify induced mutations. This approach transforms genetic screens in mice, establishes a general strategy for analysing rare DNA variants and opens up a large new source for experimental models of human disease. |
format | Online Article Text |
id | pubmed-3376740 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | The Royal Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-33767402012-06-21 Massively parallel sequencing of the mouse exome to accurately identify rare, induced mutations: an immediate source for thousands of new mouse models Andrews, T. D. Whittle, B. Field, M. A. Balakishnan, B. Zhang, Y. Shao, Y. Cho, V. Kirk, M. Singh, M. Xia, Y. Hager, J. Winslade, S. Sjollema, G. Beutler, B. Enders, A. Goodnow, C. C. Open Biol Research Accurate identification of sparse heterozygous single-nucleotide variants (SNVs) is a critical challenge for identifying the causative mutations in mouse genetic screens, human genetic diseases and cancer. When seeking to identify causal DNA variants that occur at such low rates, they are overwhelmed by false-positive calls that arise from a range of technical and biological sources. We describe a strategy using whole-exome capture, massively parallel DNA sequencing and computational analysis, which identifies with a low false-positive rate the majority of heterozygous and homozygous SNVs arising de novo with a frequency of one nucleotide substitution per megabase in progeny of N-ethyl-N-nitrosourea (ENU)-mutated C57BL/6j mice. We found that by applying a strategy of filtering raw SNV calls against known and platform-specific variants we could call true SNVs with a false-positive rate of 19.4 per cent and an estimated false-negative rate of 21.3 per cent. These error rates are small enough to enable calling a causative mutation from both homozygous and heterozygous candidate mutation lists with little or no further experimental validation. The efficacy of this approach is demonstrated by identifying the causative mutation in the Ptprc gene in a lymphocyte-deficient strain and in 11 other strains with immune disorders or obesity, without the need for meiotic mapping. Exome sequencing of first-generation mutant mice revealed hundreds of unphenotyped protein-changing mutations, 52 per cent of which are predicted to be deleterious, which now become available for breeding and experimental analysis. We show that exome sequencing data alone are sufficient to identify induced mutations. This approach transforms genetic screens in mice, establishes a general strategy for analysing rare DNA variants and opens up a large new source for experimental models of human disease. The Royal Society 2012-05 /pmc/articles/PMC3376740/ /pubmed/22724066 http://dx.doi.org/10.1098/rsob.120061 Text en http://creativecommons.org/licenses/by/3.0/ © 2012 The Authors. Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/3.0/, which permits unrestricted use, provided the original author and source are credited. |
spellingShingle | Research Andrews, T. D. Whittle, B. Field, M. A. Balakishnan, B. Zhang, Y. Shao, Y. Cho, V. Kirk, M. Singh, M. Xia, Y. Hager, J. Winslade, S. Sjollema, G. Beutler, B. Enders, A. Goodnow, C. C. Massively parallel sequencing of the mouse exome to accurately identify rare, induced mutations: an immediate source for thousands of new mouse models |
title | Massively parallel sequencing of the mouse exome to accurately identify rare, induced mutations: an immediate source for thousands of new mouse models |
title_full | Massively parallel sequencing of the mouse exome to accurately identify rare, induced mutations: an immediate source for thousands of new mouse models |
title_fullStr | Massively parallel sequencing of the mouse exome to accurately identify rare, induced mutations: an immediate source for thousands of new mouse models |
title_full_unstemmed | Massively parallel sequencing of the mouse exome to accurately identify rare, induced mutations: an immediate source for thousands of new mouse models |
title_short | Massively parallel sequencing of the mouse exome to accurately identify rare, induced mutations: an immediate source for thousands of new mouse models |
title_sort | massively parallel sequencing of the mouse exome to accurately identify rare, induced mutations: an immediate source for thousands of new mouse models |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3376740/ https://www.ncbi.nlm.nih.gov/pubmed/22724066 http://dx.doi.org/10.1098/rsob.120061 |
work_keys_str_mv | AT andrewstd massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels AT whittleb massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels AT fieldma massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels AT balakishnanb massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels AT zhangy massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels AT shaoy massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels AT chov massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels AT kirkm massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels AT singhm massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels AT xiay massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels AT hagerj massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels AT winslades massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels AT sjollemag massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels AT beutlerb massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels AT endersa massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels AT goodnowcc massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels |