Cargando…

Simulation of microarray data with realistic characteristics

BACKGROUND: Microarray technologies have become common tools in biological research. As a result, a need for effective computational methods for data analysis has emerged. Numerous different algorithms have been proposed for analyzing the data. However, an objective evaluation of the proposed algori...

Descripción completa

Detalles Bibliográficos
Autores principales: Nykter, Matti, Aho, Tommi, Ahdesmäki, Miika, Ruusuvuori, Pekka, Lehmussola, Antti, Yli-Harja, Olli
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1574357/
https://www.ncbi.nlm.nih.gov/pubmed/16848902
http://dx.doi.org/10.1186/1471-2105-7-349
_version_ 1782130294243983360
author Nykter, Matti
Aho, Tommi
Ahdesmäki, Miika
Ruusuvuori, Pekka
Lehmussola, Antti
Yli-Harja, Olli
author_facet Nykter, Matti
Aho, Tommi
Ahdesmäki, Miika
Ruusuvuori, Pekka
Lehmussola, Antti
Yli-Harja, Olli
author_sort Nykter, Matti
collection PubMed
description BACKGROUND: Microarray technologies have become common tools in biological research. As a result, a need for effective computational methods for data analysis has emerged. Numerous different algorithms have been proposed for analyzing the data. However, an objective evaluation of the proposed algorithms is not possible due to the lack of biological ground truth information. To overcome this fundamental problem, the use of simulated microarray data for algorithm validation has been proposed. RESULTS: We present a microarray simulation model which can be used to validate different kinds of data analysis algorithms. The proposed model is unique in the sense that it includes all the steps that affect the quality of real microarray data. These steps include the simulation of biological ground truth data, applying biological and measurement technology specific error models, and finally simulating the microarray slide manufacturing and hybridization. After all these steps are taken into account, the simulated data has realistic biological and statistical characteristics. The applicability of the proposed model is demonstrated by several examples. CONCLUSION: The proposed microarray simulation model is modular and can be used in different kinds of applications. It includes several error models that have been proposed earlier and it can be used with different types of input data. The model can be used to simulate both spotted two-channel and oligonucleotide based single-channel microarrays. All this makes the model a valuable tool for example in validation of data analysis algorithms.
format Text
id pubmed-1574357
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15743572006-09-26 Simulation of microarray data with realistic characteristics Nykter, Matti Aho, Tommi Ahdesmäki, Miika Ruusuvuori, Pekka Lehmussola, Antti Yli-Harja, Olli BMC Bioinformatics Methodology Article BACKGROUND: Microarray technologies have become common tools in biological research. As a result, a need for effective computational methods for data analysis has emerged. Numerous different algorithms have been proposed for analyzing the data. However, an objective evaluation of the proposed algorithms is not possible due to the lack of biological ground truth information. To overcome this fundamental problem, the use of simulated microarray data for algorithm validation has been proposed. RESULTS: We present a microarray simulation model which can be used to validate different kinds of data analysis algorithms. The proposed model is unique in the sense that it includes all the steps that affect the quality of real microarray data. These steps include the simulation of biological ground truth data, applying biological and measurement technology specific error models, and finally simulating the microarray slide manufacturing and hybridization. After all these steps are taken into account, the simulated data has realistic biological and statistical characteristics. The applicability of the proposed model is demonstrated by several examples. CONCLUSION: The proposed microarray simulation model is modular and can be used in different kinds of applications. It includes several error models that have been proposed earlier and it can be used with different types of input data. The model can be used to simulate both spotted two-channel and oligonucleotide based single-channel microarrays. All this makes the model a valuable tool for example in validation of data analysis algorithms. BioMed Central 2006-07-18 /pmc/articles/PMC1574357/ /pubmed/16848902 http://dx.doi.org/10.1186/1471-2105-7-349 Text en Copyright © 2006 Nykter et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Nykter, Matti
Aho, Tommi
Ahdesmäki, Miika
Ruusuvuori, Pekka
Lehmussola, Antti
Yli-Harja, Olli
Simulation of microarray data with realistic characteristics
title Simulation of microarray data with realistic characteristics
title_full Simulation of microarray data with realistic characteristics
title_fullStr Simulation of microarray data with realistic characteristics
title_full_unstemmed Simulation of microarray data with realistic characteristics
title_short Simulation of microarray data with realistic characteristics
title_sort simulation of microarray data with realistic characteristics
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1574357/
https://www.ncbi.nlm.nih.gov/pubmed/16848902
http://dx.doi.org/10.1186/1471-2105-7-349
work_keys_str_mv AT nyktermatti simulationofmicroarraydatawithrealisticcharacteristics
AT ahotommi simulationofmicroarraydatawithrealisticcharacteristics
AT ahdesmakimiika simulationofmicroarraydatawithrealisticcharacteristics
AT ruusuvuoripekka simulationofmicroarraydatawithrealisticcharacteristics
AT lehmussolaantti simulationofmicroarraydatawithrealisticcharacteristics
AT yliharjaolli simulationofmicroarraydatawithrealisticcharacteristics