Cargando…

Challenges of the information age: the impact of false discovery on pathway identification

BACKGROUND: Pathways with members that have known relevance to a disease are used to support hypotheses generated from analyses of gene expression and proteomic studies. Using cancer as an example, the pitfalls of searching pathways databases as support for genes and proteins that could represent fa...

Descripción completa

Detalles Bibliográficos
Autores principales: Rog, Colin J, Chekuri, Srinivasa C, Edgerton, Mary E
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3534555/
https://www.ncbi.nlm.nih.gov/pubmed/23171633
http://dx.doi.org/10.1186/1756-0500-5-647
_version_ 1782475354030473216
author Rog, Colin J
Chekuri, Srinivasa C
Edgerton, Mary E
author_facet Rog, Colin J
Chekuri, Srinivasa C
Edgerton, Mary E
author_sort Rog, Colin J
collection PubMed
description BACKGROUND: Pathways with members that have known relevance to a disease are used to support hypotheses generated from analyses of gene expression and proteomic studies. Using cancer as an example, the pitfalls of searching pathways databases as support for genes and proteins that could represent false discoveries are explored. FINDINGS: The frequency with which networks could be generated from 100 instances each of randomly selected five and ten genes sets as input to MetaCore, a commercial pathways database, was measured. A PubMed search enumerated cancer-related literature published for any gene in the networks. Using three, two, and one maximum intervening step between input genes to populate the network, networks were generated with frequencies of 97%, 77%, and 7% using ten gene sets and 73%, 27%, and 1% using five gene sets. PubMed reported an average of 4225 cancer-related articles per network gene. DISCUSSION: This can be attributed to the richly populated pathways databases and the interest in the molecular basis of cancer. As information sources become enriched, they are more likely to generate plausible mechanisms for false discoveries.
format Online
Article
Text
id pubmed-3534555
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35345552013-01-03 Challenges of the information age: the impact of false discovery on pathway identification Rog, Colin J Chekuri, Srinivasa C Edgerton, Mary E BMC Res Notes Correspondence BACKGROUND: Pathways with members that have known relevance to a disease are used to support hypotheses generated from analyses of gene expression and proteomic studies. Using cancer as an example, the pitfalls of searching pathways databases as support for genes and proteins that could represent false discoveries are explored. FINDINGS: The frequency with which networks could be generated from 100 instances each of randomly selected five and ten genes sets as input to MetaCore, a commercial pathways database, was measured. A PubMed search enumerated cancer-related literature published for any gene in the networks. Using three, two, and one maximum intervening step between input genes to populate the network, networks were generated with frequencies of 97%, 77%, and 7% using ten gene sets and 73%, 27%, and 1% using five gene sets. PubMed reported an average of 4225 cancer-related articles per network gene. DISCUSSION: This can be attributed to the richly populated pathways databases and the interest in the molecular basis of cancer. As information sources become enriched, they are more likely to generate plausible mechanisms for false discoveries. BioMed Central 2012-11-21 /pmc/articles/PMC3534555/ /pubmed/23171633 http://dx.doi.org/10.1186/1756-0500-5-647 Text en Copyright ©2012 Rog et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Correspondence
Rog, Colin J
Chekuri, Srinivasa C
Edgerton, Mary E
Challenges of the information age: the impact of false discovery on pathway identification
title Challenges of the information age: the impact of false discovery on pathway identification
title_full Challenges of the information age: the impact of false discovery on pathway identification
title_fullStr Challenges of the information age: the impact of false discovery on pathway identification
title_full_unstemmed Challenges of the information age: the impact of false discovery on pathway identification
title_short Challenges of the information age: the impact of false discovery on pathway identification
title_sort challenges of the information age: the impact of false discovery on pathway identification
topic Correspondence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3534555/
https://www.ncbi.nlm.nih.gov/pubmed/23171633
http://dx.doi.org/10.1186/1756-0500-5-647
work_keys_str_mv AT rogcolinj challengesoftheinformationagetheimpactoffalsediscoveryonpathwayidentification
AT chekurisrinivasac challengesoftheinformationagetheimpactoffalsediscoveryonpathwayidentification
AT edgertonmarye challengesoftheinformationagetheimpactoffalsediscoveryonpathwayidentification