Cargando…

A verification protocol for the probe sequences of Affymetrix genome arrays reveals high probe accuracy for studies in mouse, human and rat

BACKGROUND: The Affymetrix GeneChip technology uses multiple probes per gene to measure its expression level. Individual probe signals can vary widely, which hampers proper interpretation. This variation can be caused by probes that do not properly match their target gene or that match multiple gene...

Descripción completa

Detalles Bibliográficos
Autores principales: Alberts, Rudi, Terpstra, Peter, Hardonk, Menno, Bystrykh, Leonid V, de Haan, Gerald, Breitling, Rainer, Nap, Jan-Peter, Jansen, Ritsert C
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1865557/
https://www.ncbi.nlm.nih.gov/pubmed/17448222
http://dx.doi.org/10.1186/1471-2105-8-132
_version_ 1782133234591596544
author Alberts, Rudi
Terpstra, Peter
Hardonk, Menno
Bystrykh, Leonid V
de Haan, Gerald
Breitling, Rainer
Nap, Jan-Peter
Jansen, Ritsert C
author_facet Alberts, Rudi
Terpstra, Peter
Hardonk, Menno
Bystrykh, Leonid V
de Haan, Gerald
Breitling, Rainer
Nap, Jan-Peter
Jansen, Ritsert C
author_sort Alberts, Rudi
collection PubMed
description BACKGROUND: The Affymetrix GeneChip technology uses multiple probes per gene to measure its expression level. Individual probe signals can vary widely, which hampers proper interpretation. This variation can be caused by probes that do not properly match their target gene or that match multiple genes. To determine the accuracy of Affymetrix arrays, we developed an extensive verification protocol, for mouse arrays incorporating the NCBI RefSeq, NCBI UniGene Unique, NIA Mouse Gene Index, and UCSC mouse genome databases. RESULTS: Applying this protocol to Affymetrix Mouse Genome arrays (the earlier U74Av2 and the newer 430 2.0 array), the number of sequence-verified probes with perfect matches was no less than 85% and 95%, respectively; and for 74% and 85% of the probe sets all probes were sequence verified. The latter percentages increased to 80% and 94% after discarding one or two unverifiable probes per probe set, and even further to 84% and 97% when, in addition, allowing for one or two mismatches between probe and target gene. Similar results were obtained for other mouse arrays, as well as for human and rat arrays. Based on these data, refined chip definition files for all arrays are provided online. Researchers can choose the version appropriate for their study to (re)analyze expression data. CONCLUSION: The accuracy of Affymetrix probe sequences is higher than previously reported, particularly on newer arrays. Yet, refined probe set definitions have clear effects on the detection of differentially expressed genes. We demonstrate that the interpretation of the results of Affymetrix arrays is improved when the new chip definition files are used.
format Text
id pubmed-1865557
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-18655572007-05-05 A verification protocol for the probe sequences of Affymetrix genome arrays reveals high probe accuracy for studies in mouse, human and rat Alberts, Rudi Terpstra, Peter Hardonk, Menno Bystrykh, Leonid V de Haan, Gerald Breitling, Rainer Nap, Jan-Peter Jansen, Ritsert C BMC Bioinformatics Research Article BACKGROUND: The Affymetrix GeneChip technology uses multiple probes per gene to measure its expression level. Individual probe signals can vary widely, which hampers proper interpretation. This variation can be caused by probes that do not properly match their target gene or that match multiple genes. To determine the accuracy of Affymetrix arrays, we developed an extensive verification protocol, for mouse arrays incorporating the NCBI RefSeq, NCBI UniGene Unique, NIA Mouse Gene Index, and UCSC mouse genome databases. RESULTS: Applying this protocol to Affymetrix Mouse Genome arrays (the earlier U74Av2 and the newer 430 2.0 array), the number of sequence-verified probes with perfect matches was no less than 85% and 95%, respectively; and for 74% and 85% of the probe sets all probes were sequence verified. The latter percentages increased to 80% and 94% after discarding one or two unverifiable probes per probe set, and even further to 84% and 97% when, in addition, allowing for one or two mismatches between probe and target gene. Similar results were obtained for other mouse arrays, as well as for human and rat arrays. Based on these data, refined chip definition files for all arrays are provided online. Researchers can choose the version appropriate for their study to (re)analyze expression data. CONCLUSION: The accuracy of Affymetrix probe sequences is higher than previously reported, particularly on newer arrays. Yet, refined probe set definitions have clear effects on the detection of differentially expressed genes. We demonstrate that the interpretation of the results of Affymetrix arrays is improved when the new chip definition files are used. BioMed Central 2007-04-20 /pmc/articles/PMC1865557/ /pubmed/17448222 http://dx.doi.org/10.1186/1471-2105-8-132 Text en Copyright © 2007 Alberts et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Alberts, Rudi
Terpstra, Peter
Hardonk, Menno
Bystrykh, Leonid V
de Haan, Gerald
Breitling, Rainer
Nap, Jan-Peter
Jansen, Ritsert C
A verification protocol for the probe sequences of Affymetrix genome arrays reveals high probe accuracy for studies in mouse, human and rat
title A verification protocol for the probe sequences of Affymetrix genome arrays reveals high probe accuracy for studies in mouse, human and rat
title_full A verification protocol for the probe sequences of Affymetrix genome arrays reveals high probe accuracy for studies in mouse, human and rat
title_fullStr A verification protocol for the probe sequences of Affymetrix genome arrays reveals high probe accuracy for studies in mouse, human and rat
title_full_unstemmed A verification protocol for the probe sequences of Affymetrix genome arrays reveals high probe accuracy for studies in mouse, human and rat
title_short A verification protocol for the probe sequences of Affymetrix genome arrays reveals high probe accuracy for studies in mouse, human and rat
title_sort verification protocol for the probe sequences of affymetrix genome arrays reveals high probe accuracy for studies in mouse, human and rat
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1865557/
https://www.ncbi.nlm.nih.gov/pubmed/17448222
http://dx.doi.org/10.1186/1471-2105-8-132
work_keys_str_mv AT albertsrudi averificationprotocolfortheprobesequencesofaffymetrixgenomearraysrevealshighprobeaccuracyforstudiesinmousehumanandrat
AT terpstrapeter averificationprotocolfortheprobesequencesofaffymetrixgenomearraysrevealshighprobeaccuracyforstudiesinmousehumanandrat
AT hardonkmenno averificationprotocolfortheprobesequencesofaffymetrixgenomearraysrevealshighprobeaccuracyforstudiesinmousehumanandrat
AT bystrykhleonidv averificationprotocolfortheprobesequencesofaffymetrixgenomearraysrevealshighprobeaccuracyforstudiesinmousehumanandrat
AT dehaangerald averificationprotocolfortheprobesequencesofaffymetrixgenomearraysrevealshighprobeaccuracyforstudiesinmousehumanandrat
AT breitlingrainer averificationprotocolfortheprobesequencesofaffymetrixgenomearraysrevealshighprobeaccuracyforstudiesinmousehumanandrat
AT napjanpeter averificationprotocolfortheprobesequencesofaffymetrixgenomearraysrevealshighprobeaccuracyforstudiesinmousehumanandrat
AT jansenritsertc averificationprotocolfortheprobesequencesofaffymetrixgenomearraysrevealshighprobeaccuracyforstudiesinmousehumanandrat
AT albertsrudi verificationprotocolfortheprobesequencesofaffymetrixgenomearraysrevealshighprobeaccuracyforstudiesinmousehumanandrat
AT terpstrapeter verificationprotocolfortheprobesequencesofaffymetrixgenomearraysrevealshighprobeaccuracyforstudiesinmousehumanandrat
AT hardonkmenno verificationprotocolfortheprobesequencesofaffymetrixgenomearraysrevealshighprobeaccuracyforstudiesinmousehumanandrat
AT bystrykhleonidv verificationprotocolfortheprobesequencesofaffymetrixgenomearraysrevealshighprobeaccuracyforstudiesinmousehumanandrat
AT dehaangerald verificationprotocolfortheprobesequencesofaffymetrixgenomearraysrevealshighprobeaccuracyforstudiesinmousehumanandrat
AT breitlingrainer verificationprotocolfortheprobesequencesofaffymetrixgenomearraysrevealshighprobeaccuracyforstudiesinmousehumanandrat
AT napjanpeter verificationprotocolfortheprobesequencesofaffymetrixgenomearraysrevealshighprobeaccuracyforstudiesinmousehumanandrat
AT jansenritsertc verificationprotocolfortheprobesequencesofaffymetrixgenomearraysrevealshighprobeaccuracyforstudiesinmousehumanandrat