Cargando…
Analysis of amplicon-based NGS data from neurological disease gene panels: a new method for allele drop-out management
BACKGROUND: Amplicon-based targeted resequencing is a commonly adopted solution for next-generation sequencing applications focused on specific genomic regions. The reliability of such approaches rests on the high specificity and deep coverage, although sequencing artifacts attributable to PCR-like...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5123238/ https://www.ncbi.nlm.nih.gov/pubmed/28185542 http://dx.doi.org/10.1186/s12859-016-1189-0 |
_version_ | 1782469692320907264 |
---|---|
author | Zucca, Susanna Villaraggia, Margherita Gagliardi, Stella Grieco, Gaetano Salvatore Valente, Marialuisa Cereda, Cristina Magni, Paolo |
author_facet | Zucca, Susanna Villaraggia, Margherita Gagliardi, Stella Grieco, Gaetano Salvatore Valente, Marialuisa Cereda, Cristina Magni, Paolo |
author_sort | Zucca, Susanna |
collection | PubMed |
description | BACKGROUND: Amplicon-based targeted resequencing is a commonly adopted solution for next-generation sequencing applications focused on specific genomic regions. The reliability of such approaches rests on the high specificity and deep coverage, although sequencing artifacts attributable to PCR-like amplification can be encountered. Between these artifacts, allele drop-out, which is the preferential amplification of one allele, causes an artificial increase in homozygosity when heterozygous mutations fall on a primer pairing region. Here, a procedure to manage such artifacts, based on a pipeline composed of two steps of alignment and variant calling, is proposed. This methodology has been compared to the Illumina Custom Amplicon workflow, available on Illumina MiSeq, on the analysis of data obtained with four newly designed TruSeq Custom Amplicon gene panels. RESULTS: Four gene panels, specific for Parkinson disease, for Intracerebral Hemorrhage Diseases (COL4A1 and COL4A2 genes) and for Familial Hemiplegic Migraine (CACNA1A and ATP1A2 genes) were designed. A total of 119 samples were re-sequenced with Illumina MiSeq sequencer and panel characterization in terms of coverage, number of variants found and allele drop-out potential impact has been carried out. Results show that 14 % of identified variants is potentially affected by allele drop-out artifacts and that both the Custom Amplicon workflow and the procedure proposed here could correctly identify them. Furthermore, a more complex configuration in presence of two mutations was simulated in silico. In this configuration, our proposed methodology outperforms Custom Amplicon workflow, being able to correctly identify two mutations in all the studied configurations. CONCLUSIONS: Allele drop-out plays a crucial role in amplicon-based targeted re-sequencing and specific procedures in data analysis of amplicon data should be adopted. Although a consensus has been established in the elimination of primer sequences from aligned data (e.g., via primer sequence trimming or soft clipping), more complex configurations need to be managed in order to increase the retrieved information from available data. Our method shows how to manage one of these complex configurations, when two mutations occur. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1189-0) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5123238 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-51232382016-12-06 Analysis of amplicon-based NGS data from neurological disease gene panels: a new method for allele drop-out management Zucca, Susanna Villaraggia, Margherita Gagliardi, Stella Grieco, Gaetano Salvatore Valente, Marialuisa Cereda, Cristina Magni, Paolo BMC Bioinformatics Research BACKGROUND: Amplicon-based targeted resequencing is a commonly adopted solution for next-generation sequencing applications focused on specific genomic regions. The reliability of such approaches rests on the high specificity and deep coverage, although sequencing artifacts attributable to PCR-like amplification can be encountered. Between these artifacts, allele drop-out, which is the preferential amplification of one allele, causes an artificial increase in homozygosity when heterozygous mutations fall on a primer pairing region. Here, a procedure to manage such artifacts, based on a pipeline composed of two steps of alignment and variant calling, is proposed. This methodology has been compared to the Illumina Custom Amplicon workflow, available on Illumina MiSeq, on the analysis of data obtained with four newly designed TruSeq Custom Amplicon gene panels. RESULTS: Four gene panels, specific for Parkinson disease, for Intracerebral Hemorrhage Diseases (COL4A1 and COL4A2 genes) and for Familial Hemiplegic Migraine (CACNA1A and ATP1A2 genes) were designed. A total of 119 samples were re-sequenced with Illumina MiSeq sequencer and panel characterization in terms of coverage, number of variants found and allele drop-out potential impact has been carried out. Results show that 14 % of identified variants is potentially affected by allele drop-out artifacts and that both the Custom Amplicon workflow and the procedure proposed here could correctly identify them. Furthermore, a more complex configuration in presence of two mutations was simulated in silico. In this configuration, our proposed methodology outperforms Custom Amplicon workflow, being able to correctly identify two mutations in all the studied configurations. CONCLUSIONS: Allele drop-out plays a crucial role in amplicon-based targeted re-sequencing and specific procedures in data analysis of amplicon data should be adopted. Although a consensus has been established in the elimination of primer sequences from aligned data (e.g., via primer sequence trimming or soft clipping), more complex configurations need to be managed in order to increase the retrieved information from available data. Our method shows how to manage one of these complex configurations, when two mutations occur. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1189-0) contains supplementary material, which is available to authorized users. BioMed Central 2016-11-08 /pmc/articles/PMC5123238/ /pubmed/28185542 http://dx.doi.org/10.1186/s12859-016-1189-0 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Zucca, Susanna Villaraggia, Margherita Gagliardi, Stella Grieco, Gaetano Salvatore Valente, Marialuisa Cereda, Cristina Magni, Paolo Analysis of amplicon-based NGS data from neurological disease gene panels: a new method for allele drop-out management |
title | Analysis of amplicon-based NGS data from neurological disease gene panels: a new method for allele drop-out management |
title_full | Analysis of amplicon-based NGS data from neurological disease gene panels: a new method for allele drop-out management |
title_fullStr | Analysis of amplicon-based NGS data from neurological disease gene panels: a new method for allele drop-out management |
title_full_unstemmed | Analysis of amplicon-based NGS data from neurological disease gene panels: a new method for allele drop-out management |
title_short | Analysis of amplicon-based NGS data from neurological disease gene panels: a new method for allele drop-out management |
title_sort | analysis of amplicon-based ngs data from neurological disease gene panels: a new method for allele drop-out management |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5123238/ https://www.ncbi.nlm.nih.gov/pubmed/28185542 http://dx.doi.org/10.1186/s12859-016-1189-0 |
work_keys_str_mv | AT zuccasusanna analysisofampliconbasedngsdatafromneurologicaldiseasegenepanelsanewmethodforalleledropoutmanagement AT villaraggiamargherita analysisofampliconbasedngsdatafromneurologicaldiseasegenepanelsanewmethodforalleledropoutmanagement AT gagliardistella analysisofampliconbasedngsdatafromneurologicaldiseasegenepanelsanewmethodforalleledropoutmanagement AT griecogaetanosalvatore analysisofampliconbasedngsdatafromneurologicaldiseasegenepanelsanewmethodforalleledropoutmanagement AT valentemarialuisa analysisofampliconbasedngsdatafromneurologicaldiseasegenepanelsanewmethodforalleledropoutmanagement AT ceredacristina analysisofampliconbasedngsdatafromneurologicaldiseasegenepanelsanewmethodforalleledropoutmanagement AT magnipaolo analysisofampliconbasedngsdatafromneurologicaldiseasegenepanelsanewmethodforalleledropoutmanagement |