Cargando…

An integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays

BACKGROUND: Affymetrix Axiom single nucleotide polymorphism (SNP) arrays provide a cost-effective, high-density, and high-throughput genotyping solution for population-optimized analyses. However, no public software is available for the integrated genomic analysis of hybridization intensities and ge...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Mei-Chu, Chuang, Tzu-Po, Chen, Chien-Hsiun, Wu, Jer-Yuarn, Chen, Yuan-Tsong, Li, Ling-Hui, Yang, Hsin-Chou
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4815280/
https://www.ncbi.nlm.nih.gov/pubmed/27029637
http://dx.doi.org/10.1186/s12864-016-2478-8
_version_ 1782424572963848192
author Huang, Mei-Chu
Chuang, Tzu-Po
Chen, Chien-Hsiun
Wu, Jer-Yuarn
Chen, Yuan-Tsong
Li, Ling-Hui
Yang, Hsin-Chou
author_facet Huang, Mei-Chu
Chuang, Tzu-Po
Chen, Chien-Hsiun
Wu, Jer-Yuarn
Chen, Yuan-Tsong
Li, Ling-Hui
Yang, Hsin-Chou
author_sort Huang, Mei-Chu
collection PubMed
description BACKGROUND: Affymetrix Axiom single nucleotide polymorphism (SNP) arrays provide a cost-effective, high-density, and high-throughput genotyping solution for population-optimized analyses. However, no public software is available for the integrated genomic analysis of hybridization intensities and genotypes for this new-generation population-optimized genotyping platform. RESULTS: A set of statistical methods was developed for an integrated analysis of allele frequency (AF), allelic imbalance (AI), loss of heterozygosity (LOH), long contiguous stretch of homozygosity (LCSH), and copy number variation or alteration (CNV/CNA) on the basis of SNP probe hybridization intensities and genotypes. This study analyzed 3,236 samples that were genotyped using different SNP platforms. The proposed AF adjustment method considerably increased the accuracy of AF estimation. The proposed quick circular binary segmentation algorithm for segmenting copy number reduced the computation time of the original segmentation method by 30–67 %. The proposed CNV/CNA detection, which integrates AI and LOH/LCSH detection, had a promising true positive rate and well-controlled false positive rate in simulation studies. Moreover, our real-time quantitative polymerase chain reaction experiments successfully validated the CNVs/CNAs that were identified in the Axiom data analyses using the proposed methods; some of the validated CNVs/CNAs were not detected in the Affymetrix Array 6.0 data analysis using the Affymetrix Genotyping Console. All the analysis functions are packaged into the ALICE (AF/LOH/LCSH/AI/CNV/CNA Enterprise) software. CONCLUSIONS: ALICE and the used genomic reference databases, which can be downloaded from http://hcyang.stat.sinica.edu.tw/software/ALICE.html, are useful resources for analyzing genomic data from the Axiom and other SNP arrays. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2478-8) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4815280
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-48152802016-04-01 An integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays Huang, Mei-Chu Chuang, Tzu-Po Chen, Chien-Hsiun Wu, Jer-Yuarn Chen, Yuan-Tsong Li, Ling-Hui Yang, Hsin-Chou BMC Genomics Research Article BACKGROUND: Affymetrix Axiom single nucleotide polymorphism (SNP) arrays provide a cost-effective, high-density, and high-throughput genotyping solution for population-optimized analyses. However, no public software is available for the integrated genomic analysis of hybridization intensities and genotypes for this new-generation population-optimized genotyping platform. RESULTS: A set of statistical methods was developed for an integrated analysis of allele frequency (AF), allelic imbalance (AI), loss of heterozygosity (LOH), long contiguous stretch of homozygosity (LCSH), and copy number variation or alteration (CNV/CNA) on the basis of SNP probe hybridization intensities and genotypes. This study analyzed 3,236 samples that were genotyped using different SNP platforms. The proposed AF adjustment method considerably increased the accuracy of AF estimation. The proposed quick circular binary segmentation algorithm for segmenting copy number reduced the computation time of the original segmentation method by 30–67 %. The proposed CNV/CNA detection, which integrates AI and LOH/LCSH detection, had a promising true positive rate and well-controlled false positive rate in simulation studies. Moreover, our real-time quantitative polymerase chain reaction experiments successfully validated the CNVs/CNAs that were identified in the Axiom data analyses using the proposed methods; some of the validated CNVs/CNAs were not detected in the Affymetrix Array 6.0 data analysis using the Affymetrix Genotyping Console. All the analysis functions are packaged into the ALICE (AF/LOH/LCSH/AI/CNV/CNA Enterprise) software. CONCLUSIONS: ALICE and the used genomic reference databases, which can be downloaded from http://hcyang.stat.sinica.edu.tw/software/ALICE.html, are useful resources for analyzing genomic data from the Axiom and other SNP arrays. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2478-8) contains supplementary material, which is available to authorized users. BioMed Central 2016-03-31 /pmc/articles/PMC4815280/ /pubmed/27029637 http://dx.doi.org/10.1186/s12864-016-2478-8 Text en © Huang et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Huang, Mei-Chu
Chuang, Tzu-Po
Chen, Chien-Hsiun
Wu, Jer-Yuarn
Chen, Yuan-Tsong
Li, Ling-Hui
Yang, Hsin-Chou
An integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays
title An integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays
title_full An integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays
title_fullStr An integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays
title_full_unstemmed An integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays
title_short An integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays
title_sort integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4815280/
https://www.ncbi.nlm.nih.gov/pubmed/27029637
http://dx.doi.org/10.1186/s12864-016-2478-8
work_keys_str_mv AT huangmeichu anintegratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays
AT chuangtzupo anintegratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays
AT chenchienhsiun anintegratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays
AT wujeryuarn anintegratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays
AT chenyuantsong anintegratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays
AT lilinghui anintegratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays
AT yanghsinchou anintegratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays
AT huangmeichu integratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays
AT chuangtzupo integratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays
AT chenchienhsiun integratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays
AT wujeryuarn integratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays
AT chenyuantsong integratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays
AT lilinghui integratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays
AT yanghsinchou integratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays