Cargando…

Analysis of amplicon-based NGS data from neurological disease gene panels: a new method for allele drop-out management

BACKGROUND: Amplicon-based targeted resequencing is a commonly adopted solution for next-generation sequencing applications focused on specific genomic regions. The reliability of such approaches rests on the high specificity and deep coverage, although sequencing artifacts attributable to PCR-like...

Descripción completa

Detalles Bibliográficos
Autores principales: Zucca, Susanna, Villaraggia, Margherita, Gagliardi, Stella, Grieco, Gaetano Salvatore, Valente, Marialuisa, Cereda, Cristina, Magni, Paolo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5123238/
https://www.ncbi.nlm.nih.gov/pubmed/28185542
http://dx.doi.org/10.1186/s12859-016-1189-0
_version_ 1782469692320907264
author Zucca, Susanna
Villaraggia, Margherita
Gagliardi, Stella
Grieco, Gaetano Salvatore
Valente, Marialuisa
Cereda, Cristina
Magni, Paolo
author_facet Zucca, Susanna
Villaraggia, Margherita
Gagliardi, Stella
Grieco, Gaetano Salvatore
Valente, Marialuisa
Cereda, Cristina
Magni, Paolo
author_sort Zucca, Susanna
collection PubMed
description BACKGROUND: Amplicon-based targeted resequencing is a commonly adopted solution for next-generation sequencing applications focused on specific genomic regions. The reliability of such approaches rests on the high specificity and deep coverage, although sequencing artifacts attributable to PCR-like amplification can be encountered. Between these artifacts, allele drop-out, which is the preferential amplification of one allele, causes an artificial increase in homozygosity when heterozygous mutations fall on a primer pairing region. Here, a procedure to manage such artifacts, based on a pipeline composed of two steps of alignment and variant calling, is proposed. This methodology has been compared to the Illumina Custom Amplicon workflow, available on Illumina MiSeq, on the analysis of data obtained with four newly designed TruSeq Custom Amplicon gene panels. RESULTS: Four gene panels, specific for Parkinson disease, for Intracerebral Hemorrhage Diseases (COL4A1 and COL4A2 genes) and for Familial Hemiplegic Migraine (CACNA1A and ATP1A2 genes) were designed. A total of 119 samples were re-sequenced with Illumina MiSeq sequencer and panel characterization in terms of coverage, number of variants found and allele drop-out potential impact has been carried out. Results show that 14 % of identified variants is potentially affected by allele drop-out artifacts and that both the Custom Amplicon workflow and the procedure proposed here could correctly identify them. Furthermore, a more complex configuration in presence of two mutations was simulated in silico. In this configuration, our proposed methodology outperforms Custom Amplicon workflow, being able to correctly identify two mutations in all the studied configurations. CONCLUSIONS: Allele drop-out plays a crucial role in amplicon-based targeted re-sequencing and specific procedures in data analysis of amplicon data should be adopted. Although a consensus has been established in the elimination of primer sequences from aligned data (e.g., via primer sequence trimming or soft clipping), more complex configurations need to be managed in order to increase the retrieved information from available data. Our method shows how to manage one of these complex configurations, when two mutations occur. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1189-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5123238
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-51232382016-12-06 Analysis of amplicon-based NGS data from neurological disease gene panels: a new method for allele drop-out management Zucca, Susanna Villaraggia, Margherita Gagliardi, Stella Grieco, Gaetano Salvatore Valente, Marialuisa Cereda, Cristina Magni, Paolo BMC Bioinformatics Research BACKGROUND: Amplicon-based targeted resequencing is a commonly adopted solution for next-generation sequencing applications focused on specific genomic regions. The reliability of such approaches rests on the high specificity and deep coverage, although sequencing artifacts attributable to PCR-like amplification can be encountered. Between these artifacts, allele drop-out, which is the preferential amplification of one allele, causes an artificial increase in homozygosity when heterozygous mutations fall on a primer pairing region. Here, a procedure to manage such artifacts, based on a pipeline composed of two steps of alignment and variant calling, is proposed. This methodology has been compared to the Illumina Custom Amplicon workflow, available on Illumina MiSeq, on the analysis of data obtained with four newly designed TruSeq Custom Amplicon gene panels. RESULTS: Four gene panels, specific for Parkinson disease, for Intracerebral Hemorrhage Diseases (COL4A1 and COL4A2 genes) and for Familial Hemiplegic Migraine (CACNA1A and ATP1A2 genes) were designed. A total of 119 samples were re-sequenced with Illumina MiSeq sequencer and panel characterization in terms of coverage, number of variants found and allele drop-out potential impact has been carried out. Results show that 14 % of identified variants is potentially affected by allele drop-out artifacts and that both the Custom Amplicon workflow and the procedure proposed here could correctly identify them. Furthermore, a more complex configuration in presence of two mutations was simulated in silico. In this configuration, our proposed methodology outperforms Custom Amplicon workflow, being able to correctly identify two mutations in all the studied configurations. CONCLUSIONS: Allele drop-out plays a crucial role in amplicon-based targeted re-sequencing and specific procedures in data analysis of amplicon data should be adopted. Although a consensus has been established in the elimination of primer sequences from aligned data (e.g., via primer sequence trimming or soft clipping), more complex configurations need to be managed in order to increase the retrieved information from available data. Our method shows how to manage one of these complex configurations, when two mutations occur. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1189-0) contains supplementary material, which is available to authorized users. BioMed Central 2016-11-08 /pmc/articles/PMC5123238/ /pubmed/28185542 http://dx.doi.org/10.1186/s12859-016-1189-0 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Zucca, Susanna
Villaraggia, Margherita
Gagliardi, Stella
Grieco, Gaetano Salvatore
Valente, Marialuisa
Cereda, Cristina
Magni, Paolo
Analysis of amplicon-based NGS data from neurological disease gene panels: a new method for allele drop-out management
title Analysis of amplicon-based NGS data from neurological disease gene panels: a new method for allele drop-out management
title_full Analysis of amplicon-based NGS data from neurological disease gene panels: a new method for allele drop-out management
title_fullStr Analysis of amplicon-based NGS data from neurological disease gene panels: a new method for allele drop-out management
title_full_unstemmed Analysis of amplicon-based NGS data from neurological disease gene panels: a new method for allele drop-out management
title_short Analysis of amplicon-based NGS data from neurological disease gene panels: a new method for allele drop-out management
title_sort analysis of amplicon-based ngs data from neurological disease gene panels: a new method for allele drop-out management
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5123238/
https://www.ncbi.nlm.nih.gov/pubmed/28185542
http://dx.doi.org/10.1186/s12859-016-1189-0
work_keys_str_mv AT zuccasusanna analysisofampliconbasedngsdatafromneurologicaldiseasegenepanelsanewmethodforalleledropoutmanagement
AT villaraggiamargherita analysisofampliconbasedngsdatafromneurologicaldiseasegenepanelsanewmethodforalleledropoutmanagement
AT gagliardistella analysisofampliconbasedngsdatafromneurologicaldiseasegenepanelsanewmethodforalleledropoutmanagement
AT griecogaetanosalvatore analysisofampliconbasedngsdatafromneurologicaldiseasegenepanelsanewmethodforalleledropoutmanagement
AT valentemarialuisa analysisofampliconbasedngsdatafromneurologicaldiseasegenepanelsanewmethodforalleledropoutmanagement
AT ceredacristina analysisofampliconbasedngsdatafromneurologicaldiseasegenepanelsanewmethodforalleledropoutmanagement
AT magnipaolo analysisofampliconbasedngsdatafromneurologicaldiseasegenepanelsanewmethodforalleledropoutmanagement