Cargando…

Massively parallel sequencing of the mouse exome to accurately identify rare, induced mutations: an immediate source for thousands of new mouse models

Accurate identification of sparse heterozygous single-nucleotide variants (SNVs) is a critical challenge for identifying the causative mutations in mouse genetic screens, human genetic diseases and cancer. When seeking to identify causal DNA variants that occur at such low rates, they are overwhelme...

Descripción completa

Detalles Bibliográficos
Autores principales: Andrews, T. D., Whittle, B., Field, M. A., Balakishnan, B., Zhang, Y., Shao, Y., Cho, V., Kirk, M., Singh, M., Xia, Y., Hager, J., Winslade, S., Sjollema, G., Beutler, B., Enders, A., Goodnow, C. C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3376740/
https://www.ncbi.nlm.nih.gov/pubmed/22724066
http://dx.doi.org/10.1098/rsob.120061
_version_ 1782235863245127680
author Andrews, T. D.
Whittle, B.
Field, M. A.
Balakishnan, B.
Zhang, Y.
Shao, Y.
Cho, V.
Kirk, M.
Singh, M.
Xia, Y.
Hager, J.
Winslade, S.
Sjollema, G.
Beutler, B.
Enders, A.
Goodnow, C. C.
author_facet Andrews, T. D.
Whittle, B.
Field, M. A.
Balakishnan, B.
Zhang, Y.
Shao, Y.
Cho, V.
Kirk, M.
Singh, M.
Xia, Y.
Hager, J.
Winslade, S.
Sjollema, G.
Beutler, B.
Enders, A.
Goodnow, C. C.
author_sort Andrews, T. D.
collection PubMed
description Accurate identification of sparse heterozygous single-nucleotide variants (SNVs) is a critical challenge for identifying the causative mutations in mouse genetic screens, human genetic diseases and cancer. When seeking to identify causal DNA variants that occur at such low rates, they are overwhelmed by false-positive calls that arise from a range of technical and biological sources. We describe a strategy using whole-exome capture, massively parallel DNA sequencing and computational analysis, which identifies with a low false-positive rate the majority of heterozygous and homozygous SNVs arising de novo with a frequency of one nucleotide substitution per megabase in progeny of N-ethyl-N-nitrosourea (ENU)-mutated C57BL/6j mice. We found that by applying a strategy of filtering raw SNV calls against known and platform-specific variants we could call true SNVs with a false-positive rate of 19.4 per cent and an estimated false-negative rate of 21.3 per cent. These error rates are small enough to enable calling a causative mutation from both homozygous and heterozygous candidate mutation lists with little or no further experimental validation. The efficacy of this approach is demonstrated by identifying the causative mutation in the Ptprc gene in a lymphocyte-deficient strain and in 11 other strains with immune disorders or obesity, without the need for meiotic mapping. Exome sequencing of first-generation mutant mice revealed hundreds of unphenotyped protein-changing mutations, 52 per cent of which are predicted to be deleterious, which now become available for breeding and experimental analysis. We show that exome sequencing data alone are sufficient to identify induced mutations. This approach transforms genetic screens in mice, establishes a general strategy for analysing rare DNA variants and opens up a large new source for experimental models of human disease.
format Online
Article
Text
id pubmed-3376740
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher The Royal Society
record_format MEDLINE/PubMed
spelling pubmed-33767402012-06-21 Massively parallel sequencing of the mouse exome to accurately identify rare, induced mutations: an immediate source for thousands of new mouse models Andrews, T. D. Whittle, B. Field, M. A. Balakishnan, B. Zhang, Y. Shao, Y. Cho, V. Kirk, M. Singh, M. Xia, Y. Hager, J. Winslade, S. Sjollema, G. Beutler, B. Enders, A. Goodnow, C. C. Open Biol Research Accurate identification of sparse heterozygous single-nucleotide variants (SNVs) is a critical challenge for identifying the causative mutations in mouse genetic screens, human genetic diseases and cancer. When seeking to identify causal DNA variants that occur at such low rates, they are overwhelmed by false-positive calls that arise from a range of technical and biological sources. We describe a strategy using whole-exome capture, massively parallel DNA sequencing and computational analysis, which identifies with a low false-positive rate the majority of heterozygous and homozygous SNVs arising de novo with a frequency of one nucleotide substitution per megabase in progeny of N-ethyl-N-nitrosourea (ENU)-mutated C57BL/6j mice. We found that by applying a strategy of filtering raw SNV calls against known and platform-specific variants we could call true SNVs with a false-positive rate of 19.4 per cent and an estimated false-negative rate of 21.3 per cent. These error rates are small enough to enable calling a causative mutation from both homozygous and heterozygous candidate mutation lists with little or no further experimental validation. The efficacy of this approach is demonstrated by identifying the causative mutation in the Ptprc gene in a lymphocyte-deficient strain and in 11 other strains with immune disorders or obesity, without the need for meiotic mapping. Exome sequencing of first-generation mutant mice revealed hundreds of unphenotyped protein-changing mutations, 52 per cent of which are predicted to be deleterious, which now become available for breeding and experimental analysis. We show that exome sequencing data alone are sufficient to identify induced mutations. This approach transforms genetic screens in mice, establishes a general strategy for analysing rare DNA variants and opens up a large new source for experimental models of human disease. The Royal Society 2012-05 /pmc/articles/PMC3376740/ /pubmed/22724066 http://dx.doi.org/10.1098/rsob.120061 Text en http://creativecommons.org/licenses/by/3.0/ © 2012 The Authors. Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/3.0/, which permits unrestricted use, provided the original author and source are credited.
spellingShingle Research
Andrews, T. D.
Whittle, B.
Field, M. A.
Balakishnan, B.
Zhang, Y.
Shao, Y.
Cho, V.
Kirk, M.
Singh, M.
Xia, Y.
Hager, J.
Winslade, S.
Sjollema, G.
Beutler, B.
Enders, A.
Goodnow, C. C.
Massively parallel sequencing of the mouse exome to accurately identify rare, induced mutations: an immediate source for thousands of new mouse models
title Massively parallel sequencing of the mouse exome to accurately identify rare, induced mutations: an immediate source for thousands of new mouse models
title_full Massively parallel sequencing of the mouse exome to accurately identify rare, induced mutations: an immediate source for thousands of new mouse models
title_fullStr Massively parallel sequencing of the mouse exome to accurately identify rare, induced mutations: an immediate source for thousands of new mouse models
title_full_unstemmed Massively parallel sequencing of the mouse exome to accurately identify rare, induced mutations: an immediate source for thousands of new mouse models
title_short Massively parallel sequencing of the mouse exome to accurately identify rare, induced mutations: an immediate source for thousands of new mouse models
title_sort massively parallel sequencing of the mouse exome to accurately identify rare, induced mutations: an immediate source for thousands of new mouse models
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3376740/
https://www.ncbi.nlm.nih.gov/pubmed/22724066
http://dx.doi.org/10.1098/rsob.120061
work_keys_str_mv AT andrewstd massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels
AT whittleb massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels
AT fieldma massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels
AT balakishnanb massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels
AT zhangy massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels
AT shaoy massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels
AT chov massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels
AT kirkm massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels
AT singhm massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels
AT xiay massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels
AT hagerj massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels
AT winslades massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels
AT sjollemag massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels
AT beutlerb massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels
AT endersa massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels
AT goodnowcc massivelyparallelsequencingofthemouseexometoaccuratelyidentifyrareinducedmutationsanimmediatesourceforthousandsofnewmousemodels