Cargando…
Scoring functions for drug-effect similarity
MOTIVATION: The difficulty to find new drugs and bring them to the market has led to an increased interest to find new applications for known compounds. Biological samples from many disease contexts have been extensively profiled by transcriptomics, and, intuitively, this motivates to search for com...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8138836/ https://www.ncbi.nlm.nih.gov/pubmed/32484516 http://dx.doi.org/10.1093/bib/bbaa072 |
_version_ | 1783695886965014528 |
---|---|
author | Struckmann, Stephan Ernst, Mathias Fischer, Sarah Mah, Nancy Fuellen, Georg Möller, Steffen |
author_facet | Struckmann, Stephan Ernst, Mathias Fischer, Sarah Mah, Nancy Fuellen, Georg Möller, Steffen |
author_sort | Struckmann, Stephan |
collection | PubMed |
description | MOTIVATION: The difficulty to find new drugs and bring them to the market has led to an increased interest to find new applications for known compounds. Biological samples from many disease contexts have been extensively profiled by transcriptomics, and, intuitively, this motivates to search for compounds with a reversing effect on the expression of characteristic disease genes. However, disease effects may be cell line-specific and also depend on other factors, such as genetics and environment. Transcription profile changes between healthy and diseased cells relate in complex ways to profile changes gathered from cell lines upon stimulation with a drug. Despite these differences, we expect that there will be some similarity in the gene regulatory networks at play in both situations. The challenge is to match transcriptomes for both diseases and drugs alike, even though the exact molecular pathology/pharmacogenomics may not be known. RESULTS: We substitute the challenge to match a drug effect to a disease effect with the challenge to match a drug effect to the effect of the same drug at another concentration or in another cell line. This is welldefined, reproducible in vitro and in silico and extendable with external data. Based on the Connectivity Map (CMap) dataset, we combined 26 different similarity scores with six different heuristics to reduce the number of genes in the model. Such gene filters may also utilize external knowledge e.g. from biological networks. We found that no similarity score always outperforms all others for all drugs, but the Pearson correlation finds the same drug with the highest reliability. Results are improved by filtering for highly expressed genes and to a lesser degree for genes with large fold changes. Also a network-based reduction of contributing transcripts was beneficial, here implemented by the FocusHeuristics. We found no drop in prediction accuracy when reducing the whole transcriptome to the set of 1000 landmark genes of the CMap’s successor project Library of Integrated Network-based Cellular Signatures. All source code to re-analyze and extend the CMap data, the source code of heuristics, filters and their evaluation are available to propel the development of new methods for drug repurposing. AVAILABILITY: https://bitbucket.org/ibima/moldrugeffectsdb CONTACT: steffen.moeller@uni-rostock.de SUPPLEMENTARY INFORMATION: Supplementary data are available at Briefings in Bioinformatics online. |
format | Online Article Text |
id | pubmed-8138836 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-81388362021-05-25 Scoring functions for drug-effect similarity Struckmann, Stephan Ernst, Mathias Fischer, Sarah Mah, Nancy Fuellen, Georg Möller, Steffen Brief Bioinform Methods Review MOTIVATION: The difficulty to find new drugs and bring them to the market has led to an increased interest to find new applications for known compounds. Biological samples from many disease contexts have been extensively profiled by transcriptomics, and, intuitively, this motivates to search for compounds with a reversing effect on the expression of characteristic disease genes. However, disease effects may be cell line-specific and also depend on other factors, such as genetics and environment. Transcription profile changes between healthy and diseased cells relate in complex ways to profile changes gathered from cell lines upon stimulation with a drug. Despite these differences, we expect that there will be some similarity in the gene regulatory networks at play in both situations. The challenge is to match transcriptomes for both diseases and drugs alike, even though the exact molecular pathology/pharmacogenomics may not be known. RESULTS: We substitute the challenge to match a drug effect to a disease effect with the challenge to match a drug effect to the effect of the same drug at another concentration or in another cell line. This is welldefined, reproducible in vitro and in silico and extendable with external data. Based on the Connectivity Map (CMap) dataset, we combined 26 different similarity scores with six different heuristics to reduce the number of genes in the model. Such gene filters may also utilize external knowledge e.g. from biological networks. We found that no similarity score always outperforms all others for all drugs, but the Pearson correlation finds the same drug with the highest reliability. Results are improved by filtering for highly expressed genes and to a lesser degree for genes with large fold changes. Also a network-based reduction of contributing transcripts was beneficial, here implemented by the FocusHeuristics. We found no drop in prediction accuracy when reducing the whole transcriptome to the set of 1000 landmark genes of the CMap’s successor project Library of Integrated Network-based Cellular Signatures. All source code to re-analyze and extend the CMap data, the source code of heuristics, filters and their evaluation are available to propel the development of new methods for drug repurposing. AVAILABILITY: https://bitbucket.org/ibima/moldrugeffectsdb CONTACT: steffen.moeller@uni-rostock.de SUPPLEMENTARY INFORMATION: Supplementary data are available at Briefings in Bioinformatics online. Oxford University Press 2020-06-02 /pmc/articles/PMC8138836/ /pubmed/32484516 http://dx.doi.org/10.1093/bib/bbaa072 Text en © The Author(s) 2020. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Review Struckmann, Stephan Ernst, Mathias Fischer, Sarah Mah, Nancy Fuellen, Georg Möller, Steffen Scoring functions for drug-effect similarity |
title | Scoring functions for drug-effect similarity |
title_full | Scoring functions for drug-effect similarity |
title_fullStr | Scoring functions for drug-effect similarity |
title_full_unstemmed | Scoring functions for drug-effect similarity |
title_short | Scoring functions for drug-effect similarity |
title_sort | scoring functions for drug-effect similarity |
topic | Methods Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8138836/ https://www.ncbi.nlm.nih.gov/pubmed/32484516 http://dx.doi.org/10.1093/bib/bbaa072 |
work_keys_str_mv | AT struckmannstephan scoringfunctionsfordrugeffectsimilarity AT ernstmathias scoringfunctionsfordrugeffectsimilarity AT fischersarah scoringfunctionsfordrugeffectsimilarity AT mahnancy scoringfunctionsfordrugeffectsimilarity AT fuellengeorg scoringfunctionsfordrugeffectsimilarity AT mollersteffen scoringfunctionsfordrugeffectsimilarity |