Cargando…

Accurate, Efficient and User-Friendly Mutation Calling and Sample Identification for TILLING Experiments

TILLING (Targeting Induced Local Lesions IN Genomes) is a powerful reverse genetics method in plant functional genomics and breeding to identify mutagenized individuals with improved behavior for a trait of interest. Pooled high throughput sequencing (HTS) of the targeted genes allows efficient iden...

Descripción completa

Detalles Bibliográficos
Autores principales: Gil, Juanita, Andrade-Martínez, Juan Sebastian, Duitama, Jorge
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7886796/
https://www.ncbi.nlm.nih.gov/pubmed/33613641
http://dx.doi.org/10.3389/fgene.2021.624513
_version_ 1783651873297793024
author Gil, Juanita
Andrade-Martínez, Juan Sebastian
Duitama, Jorge
author_facet Gil, Juanita
Andrade-Martínez, Juan Sebastian
Duitama, Jorge
author_sort Gil, Juanita
collection PubMed
description TILLING (Targeting Induced Local Lesions IN Genomes) is a powerful reverse genetics method in plant functional genomics and breeding to identify mutagenized individuals with improved behavior for a trait of interest. Pooled high throughput sequencing (HTS) of the targeted genes allows efficient identification and sample assignment of variants within genes of interest in hundreds of individuals. Although TILLING has been used successfully in different crops and even applied to natural populations, one of the main issues for a successful TILLING experiment is that most currently available bioinformatics tools for variant detection are not designed to identify mutations with low frequencies in pooled samples or to perform sample identification from variants identified in overlapping pools. Our research group maintains the Next Generation Sequencing Experience Platform (NGSEP), an open source solution for analysis of HTS data. In this manuscript, we present three novel components within NGSEP to facilitate the design and analysis of TILLING experiments: a pooled variants detector, a sample identifier from variants detected in overlapping pools and a simulator of TILLING experiments. A new implementation of the NGSEP calling model for variant detection allows accurate detection of low frequency mutations within pools. The samples identifier implements the process to triangulate the mutations called within overlapping pools in order to assign mutations to single individuals whenever possible. Finally, we developed a complete simulator of TILLING experiments to enable benchmarking of different tools and to facilitate the design of experimental alternatives varying the number of pools and individuals per pool. Simulation experiments based on genes from the common bean genome indicate that NGSEP provides similar accuracy and better efficiency than other tools to perform pooled variants detection. To the best of our knowledge, NGSEP is currently the only tool that generates individual assignments of the mutations discovered from the pooled data. We expect that this development will be of great use for different groups implementing TILLING as an alternative for plant breeding and even to research groups performing pooled sequencing for other applications.
format Online
Article
Text
id pubmed-7886796
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-78867962021-02-18 Accurate, Efficient and User-Friendly Mutation Calling and Sample Identification for TILLING Experiments Gil, Juanita Andrade-Martínez, Juan Sebastian Duitama, Jorge Front Genet Genetics TILLING (Targeting Induced Local Lesions IN Genomes) is a powerful reverse genetics method in plant functional genomics and breeding to identify mutagenized individuals with improved behavior for a trait of interest. Pooled high throughput sequencing (HTS) of the targeted genes allows efficient identification and sample assignment of variants within genes of interest in hundreds of individuals. Although TILLING has been used successfully in different crops and even applied to natural populations, one of the main issues for a successful TILLING experiment is that most currently available bioinformatics tools for variant detection are not designed to identify mutations with low frequencies in pooled samples or to perform sample identification from variants identified in overlapping pools. Our research group maintains the Next Generation Sequencing Experience Platform (NGSEP), an open source solution for analysis of HTS data. In this manuscript, we present three novel components within NGSEP to facilitate the design and analysis of TILLING experiments: a pooled variants detector, a sample identifier from variants detected in overlapping pools and a simulator of TILLING experiments. A new implementation of the NGSEP calling model for variant detection allows accurate detection of low frequency mutations within pools. The samples identifier implements the process to triangulate the mutations called within overlapping pools in order to assign mutations to single individuals whenever possible. Finally, we developed a complete simulator of TILLING experiments to enable benchmarking of different tools and to facilitate the design of experimental alternatives varying the number of pools and individuals per pool. Simulation experiments based on genes from the common bean genome indicate that NGSEP provides similar accuracy and better efficiency than other tools to perform pooled variants detection. To the best of our knowledge, NGSEP is currently the only tool that generates individual assignments of the mutations discovered from the pooled data. We expect that this development will be of great use for different groups implementing TILLING as an alternative for plant breeding and even to research groups performing pooled sequencing for other applications. Frontiers Media S.A. 2021-02-03 /pmc/articles/PMC7886796/ /pubmed/33613641 http://dx.doi.org/10.3389/fgene.2021.624513 Text en Copyright © 2021 Gil, Andrade-Martínez and Duitama. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Gil, Juanita
Andrade-Martínez, Juan Sebastian
Duitama, Jorge
Accurate, Efficient and User-Friendly Mutation Calling and Sample Identification for TILLING Experiments
title Accurate, Efficient and User-Friendly Mutation Calling and Sample Identification for TILLING Experiments
title_full Accurate, Efficient and User-Friendly Mutation Calling and Sample Identification for TILLING Experiments
title_fullStr Accurate, Efficient and User-Friendly Mutation Calling and Sample Identification for TILLING Experiments
title_full_unstemmed Accurate, Efficient and User-Friendly Mutation Calling and Sample Identification for TILLING Experiments
title_short Accurate, Efficient and User-Friendly Mutation Calling and Sample Identification for TILLING Experiments
title_sort accurate, efficient and user-friendly mutation calling and sample identification for tilling experiments
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7886796/
https://www.ncbi.nlm.nih.gov/pubmed/33613641
http://dx.doi.org/10.3389/fgene.2021.624513
work_keys_str_mv AT giljuanita accurateefficientanduserfriendlymutationcallingandsampleidentificationfortillingexperiments
AT andrademartinezjuansebastian accurateefficientanduserfriendlymutationcallingandsampleidentificationfortillingexperiments
AT duitamajorge accurateefficientanduserfriendlymutationcallingandsampleidentificationfortillingexperiments