Cargando…

Systematic prediction of EMS‐induced mutations in a sorghum mutant population

The precise detection of causal DNA mutations (deoxyribonucleic acid) is very crucial for forward genetic studies. Several sources of errors contribute to false‐positive detections by current variant‐calling algorithms, which impact associating phenotypes with genotypes. To improve the accuracy of m...

Descripción completa

Detalles Bibliográficos
Autores principales: Simons, Jared M., Herbert, Tim C., Kauffman, Coleby, Batete, Marc Y., Simpson, Andrew T., Katsuki, Yuka, Le, Dong, Amundson, Danielle, Buescher, Elizabeth M., Weil, Clifford, Tuinstra, Mitch, Addo‐Quaye, Charles
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9132608/
https://www.ncbi.nlm.nih.gov/pubmed/35647479
http://dx.doi.org/10.1002/pld3.404
_version_ 1784713416002240512
author Simons, Jared M.
Herbert, Tim C.
Kauffman, Coleby
Batete, Marc Y.
Simpson, Andrew T.
Katsuki, Yuka
Le, Dong
Amundson, Danielle
Buescher, Elizabeth M.
Weil, Clifford
Tuinstra, Mitch
Addo‐Quaye, Charles
author_facet Simons, Jared M.
Herbert, Tim C.
Kauffman, Coleby
Batete, Marc Y.
Simpson, Andrew T.
Katsuki, Yuka
Le, Dong
Amundson, Danielle
Buescher, Elizabeth M.
Weil, Clifford
Tuinstra, Mitch
Addo‐Quaye, Charles
author_sort Simons, Jared M.
collection PubMed
description The precise detection of causal DNA mutations (deoxyribonucleic acid) is very crucial for forward genetic studies. Several sources of errors contribute to false‐positive detections by current variant‐calling algorithms, which impact associating phenotypes with genotypes. To improve the accuracy of mutation detection, we implemented a binning method for the accurate detection of likely ethyl methanesulfonate (EMS)‐induced mutations in a sequenced mutant population. We also implemented a clustering algorithm for detecting likely false negatives with high accuracy. Sorghum bicolor is a very valuable crop species with tremendous potential for uncovering novel gene functions associated with highly desirable agronomical traits. We demonstrate the precision of the described approach in the detection of likely EMS‐induced mutations from the publicly available low‐cost sequencing of the M(3) generation from 600 sorghum BTx623 mutants. The approach detected 3,274,606 single nucleotide polymorphisms (SNPs), of which 96% (3,141,908) were G/C to A/T DNA substitutions, as expected by EMS‐mutagenesis mode of action. We demonstrated the general applicability of the described method and showed a high concordance, 94% (3,074,759) SNPs overlap between SAMtools‐based and GATK‐based variant‐calling algorithms. Our clustering algorithm uncovered evidence for an additional 223,048 likely false‐negative shared EMS‐induced mutations. The final 3,497,654 SNPs represent an 87% increase in SNPs detected from the previous analysis of the mutant population, with an average of one SNP per 125 kb in the sorghum genome. Annotation of the final SNPs revealed 10,263 high‐impact and 136,639 moderate‐impact SNPs, including 7217 stop‐gained mutations, which averages 12 stop‐gained mutations per mutant, and four high‐ or medium‐impact SNPs per sorghum gene. We have implemented a public search database for this new genetic resource of 30,285 distinct sorghum genes containing medium‐ or high‐impact EMS‐induced mutations. Seedstock for a select 486 of the 600 described mutants are publicly available in the Germplasm Resources Information Network (GRIN) database.
format Online
Article
Text
id pubmed-9132608
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-91326082022-05-26 Systematic prediction of EMS‐induced mutations in a sorghum mutant population Simons, Jared M. Herbert, Tim C. Kauffman, Coleby Batete, Marc Y. Simpson, Andrew T. Katsuki, Yuka Le, Dong Amundson, Danielle Buescher, Elizabeth M. Weil, Clifford Tuinstra, Mitch Addo‐Quaye, Charles Plant Direct Original Research The precise detection of causal DNA mutations (deoxyribonucleic acid) is very crucial for forward genetic studies. Several sources of errors contribute to false‐positive detections by current variant‐calling algorithms, which impact associating phenotypes with genotypes. To improve the accuracy of mutation detection, we implemented a binning method for the accurate detection of likely ethyl methanesulfonate (EMS)‐induced mutations in a sequenced mutant population. We also implemented a clustering algorithm for detecting likely false negatives with high accuracy. Sorghum bicolor is a very valuable crop species with tremendous potential for uncovering novel gene functions associated with highly desirable agronomical traits. We demonstrate the precision of the described approach in the detection of likely EMS‐induced mutations from the publicly available low‐cost sequencing of the M(3) generation from 600 sorghum BTx623 mutants. The approach detected 3,274,606 single nucleotide polymorphisms (SNPs), of which 96% (3,141,908) were G/C to A/T DNA substitutions, as expected by EMS‐mutagenesis mode of action. We demonstrated the general applicability of the described method and showed a high concordance, 94% (3,074,759) SNPs overlap between SAMtools‐based and GATK‐based variant‐calling algorithms. Our clustering algorithm uncovered evidence for an additional 223,048 likely false‐negative shared EMS‐induced mutations. The final 3,497,654 SNPs represent an 87% increase in SNPs detected from the previous analysis of the mutant population, with an average of one SNP per 125 kb in the sorghum genome. Annotation of the final SNPs revealed 10,263 high‐impact and 136,639 moderate‐impact SNPs, including 7217 stop‐gained mutations, which averages 12 stop‐gained mutations per mutant, and four high‐ or medium‐impact SNPs per sorghum gene. We have implemented a public search database for this new genetic resource of 30,285 distinct sorghum genes containing medium‐ or high‐impact EMS‐induced mutations. Seedstock for a select 486 of the 600 described mutants are publicly available in the Germplasm Resources Information Network (GRIN) database. John Wiley and Sons Inc. 2022-05-25 /pmc/articles/PMC9132608/ /pubmed/35647479 http://dx.doi.org/10.1002/pld3.404 Text en © 2022 The Authors. Plant Direct published by American Society of Plant Biologists and the Society for Experimental Biology and John Wiley & Sons Ltd. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made.
spellingShingle Original Research
Simons, Jared M.
Herbert, Tim C.
Kauffman, Coleby
Batete, Marc Y.
Simpson, Andrew T.
Katsuki, Yuka
Le, Dong
Amundson, Danielle
Buescher, Elizabeth M.
Weil, Clifford
Tuinstra, Mitch
Addo‐Quaye, Charles
Systematic prediction of EMS‐induced mutations in a sorghum mutant population
title Systematic prediction of EMS‐induced mutations in a sorghum mutant population
title_full Systematic prediction of EMS‐induced mutations in a sorghum mutant population
title_fullStr Systematic prediction of EMS‐induced mutations in a sorghum mutant population
title_full_unstemmed Systematic prediction of EMS‐induced mutations in a sorghum mutant population
title_short Systematic prediction of EMS‐induced mutations in a sorghum mutant population
title_sort systematic prediction of ems‐induced mutations in a sorghum mutant population
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9132608/
https://www.ncbi.nlm.nih.gov/pubmed/35647479
http://dx.doi.org/10.1002/pld3.404
work_keys_str_mv AT simonsjaredm systematicpredictionofemsinducedmutationsinasorghummutantpopulation
AT herberttimc systematicpredictionofemsinducedmutationsinasorghummutantpopulation
AT kauffmancoleby systematicpredictionofemsinducedmutationsinasorghummutantpopulation
AT batetemarcy systematicpredictionofemsinducedmutationsinasorghummutantpopulation
AT simpsonandrewt systematicpredictionofemsinducedmutationsinasorghummutantpopulation
AT katsukiyuka systematicpredictionofemsinducedmutationsinasorghummutantpopulation
AT ledong systematicpredictionofemsinducedmutationsinasorghummutantpopulation
AT amundsondanielle systematicpredictionofemsinducedmutationsinasorghummutantpopulation
AT buescherelizabethm systematicpredictionofemsinducedmutationsinasorghummutantpopulation
AT weilclifford systematicpredictionofemsinducedmutationsinasorghummutantpopulation
AT tuinstramitch systematicpredictionofemsinducedmutationsinasorghummutantpopulation
AT addoquayecharles systematicpredictionofemsinducedmutationsinasorghummutantpopulation