Cargando…
An integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays
BACKGROUND: Affymetrix Axiom single nucleotide polymorphism (SNP) arrays provide a cost-effective, high-density, and high-throughput genotyping solution for population-optimized analyses. However, no public software is available for the integrated genomic analysis of hybridization intensities and ge...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4815280/ https://www.ncbi.nlm.nih.gov/pubmed/27029637 http://dx.doi.org/10.1186/s12864-016-2478-8 |
_version_ | 1782424572963848192 |
---|---|
author | Huang, Mei-Chu Chuang, Tzu-Po Chen, Chien-Hsiun Wu, Jer-Yuarn Chen, Yuan-Tsong Li, Ling-Hui Yang, Hsin-Chou |
author_facet | Huang, Mei-Chu Chuang, Tzu-Po Chen, Chien-Hsiun Wu, Jer-Yuarn Chen, Yuan-Tsong Li, Ling-Hui Yang, Hsin-Chou |
author_sort | Huang, Mei-Chu |
collection | PubMed |
description | BACKGROUND: Affymetrix Axiom single nucleotide polymorphism (SNP) arrays provide a cost-effective, high-density, and high-throughput genotyping solution for population-optimized analyses. However, no public software is available for the integrated genomic analysis of hybridization intensities and genotypes for this new-generation population-optimized genotyping platform. RESULTS: A set of statistical methods was developed for an integrated analysis of allele frequency (AF), allelic imbalance (AI), loss of heterozygosity (LOH), long contiguous stretch of homozygosity (LCSH), and copy number variation or alteration (CNV/CNA) on the basis of SNP probe hybridization intensities and genotypes. This study analyzed 3,236 samples that were genotyped using different SNP platforms. The proposed AF adjustment method considerably increased the accuracy of AF estimation. The proposed quick circular binary segmentation algorithm for segmenting copy number reduced the computation time of the original segmentation method by 30–67 %. The proposed CNV/CNA detection, which integrates AI and LOH/LCSH detection, had a promising true positive rate and well-controlled false positive rate in simulation studies. Moreover, our real-time quantitative polymerase chain reaction experiments successfully validated the CNVs/CNAs that were identified in the Axiom data analyses using the proposed methods; some of the validated CNVs/CNAs were not detected in the Affymetrix Array 6.0 data analysis using the Affymetrix Genotyping Console. All the analysis functions are packaged into the ALICE (AF/LOH/LCSH/AI/CNV/CNA Enterprise) software. CONCLUSIONS: ALICE and the used genomic reference databases, which can be downloaded from http://hcyang.stat.sinica.edu.tw/software/ALICE.html, are useful resources for analyzing genomic data from the Axiom and other SNP arrays. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2478-8) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4815280 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-48152802016-04-01 An integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays Huang, Mei-Chu Chuang, Tzu-Po Chen, Chien-Hsiun Wu, Jer-Yuarn Chen, Yuan-Tsong Li, Ling-Hui Yang, Hsin-Chou BMC Genomics Research Article BACKGROUND: Affymetrix Axiom single nucleotide polymorphism (SNP) arrays provide a cost-effective, high-density, and high-throughput genotyping solution for population-optimized analyses. However, no public software is available for the integrated genomic analysis of hybridization intensities and genotypes for this new-generation population-optimized genotyping platform. RESULTS: A set of statistical methods was developed for an integrated analysis of allele frequency (AF), allelic imbalance (AI), loss of heterozygosity (LOH), long contiguous stretch of homozygosity (LCSH), and copy number variation or alteration (CNV/CNA) on the basis of SNP probe hybridization intensities and genotypes. This study analyzed 3,236 samples that were genotyped using different SNP platforms. The proposed AF adjustment method considerably increased the accuracy of AF estimation. The proposed quick circular binary segmentation algorithm for segmenting copy number reduced the computation time of the original segmentation method by 30–67 %. The proposed CNV/CNA detection, which integrates AI and LOH/LCSH detection, had a promising true positive rate and well-controlled false positive rate in simulation studies. Moreover, our real-time quantitative polymerase chain reaction experiments successfully validated the CNVs/CNAs that were identified in the Axiom data analyses using the proposed methods; some of the validated CNVs/CNAs were not detected in the Affymetrix Array 6.0 data analysis using the Affymetrix Genotyping Console. All the analysis functions are packaged into the ALICE (AF/LOH/LCSH/AI/CNV/CNA Enterprise) software. CONCLUSIONS: ALICE and the used genomic reference databases, which can be downloaded from http://hcyang.stat.sinica.edu.tw/software/ALICE.html, are useful resources for analyzing genomic data from the Axiom and other SNP arrays. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2478-8) contains supplementary material, which is available to authorized users. BioMed Central 2016-03-31 /pmc/articles/PMC4815280/ /pubmed/27029637 http://dx.doi.org/10.1186/s12864-016-2478-8 Text en © Huang et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Huang, Mei-Chu Chuang, Tzu-Po Chen, Chien-Hsiun Wu, Jer-Yuarn Chen, Yuan-Tsong Li, Ling-Hui Yang, Hsin-Chou An integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays |
title | An integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays |
title_full | An integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays |
title_fullStr | An integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays |
title_full_unstemmed | An integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays |
title_short | An integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays |
title_sort | integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4815280/ https://www.ncbi.nlm.nih.gov/pubmed/27029637 http://dx.doi.org/10.1186/s12864-016-2478-8 |
work_keys_str_mv | AT huangmeichu anintegratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays AT chuangtzupo anintegratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays AT chenchienhsiun anintegratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays AT wujeryuarn anintegratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays AT chenyuantsong anintegratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays AT lilinghui anintegratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays AT yanghsinchou anintegratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays AT huangmeichu integratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays AT chuangtzupo integratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays AT chenchienhsiun integratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays AT wujeryuarn integratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays AT chenyuantsong integratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays AT lilinghui integratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays AT yanghsinchou integratedanalysistoolforanalyzinghybridizationintensitiesandgenotypesusingnewgenerationpopulationoptimizedhumanarrays |