Cargando…

Scoring functions for drug-effect similarity

MOTIVATION: The difficulty to find new drugs and bring them to the market has led to an increased interest to find new applications for known compounds. Biological samples from many disease contexts have been extensively profiled by transcriptomics, and, intuitively, this motivates to search for com...

Descripción completa

Detalles Bibliográficos
Autores principales: Struckmann, Stephan, Ernst, Mathias, Fischer, Sarah, Mah, Nancy, Fuellen, Georg, Möller, Steffen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8138836/
https://www.ncbi.nlm.nih.gov/pubmed/32484516
http://dx.doi.org/10.1093/bib/bbaa072
_version_ 1783695886965014528
author Struckmann, Stephan
Ernst, Mathias
Fischer, Sarah
Mah, Nancy
Fuellen, Georg
Möller, Steffen
author_facet Struckmann, Stephan
Ernst, Mathias
Fischer, Sarah
Mah, Nancy
Fuellen, Georg
Möller, Steffen
author_sort Struckmann, Stephan
collection PubMed
description MOTIVATION: The difficulty to find new drugs and bring them to the market has led to an increased interest to find new applications for known compounds. Biological samples from many disease contexts have been extensively profiled by transcriptomics, and, intuitively, this motivates to search for compounds with a reversing effect on the expression of characteristic disease genes. However, disease effects may be cell line-specific and also depend on other factors, such as genetics and environment. Transcription profile changes between healthy and diseased cells relate in complex ways to profile changes gathered from cell lines upon stimulation with a drug. Despite these differences, we expect that there will be some similarity in the gene regulatory networks at play in both situations. The challenge is to match transcriptomes for both diseases and drugs alike, even though the exact molecular pathology/pharmacogenomics may not be known. RESULTS: We substitute the challenge to match a drug effect to a disease effect with the challenge to match a drug effect to the effect of the same drug at another concentration or in another cell line. This is welldefined, reproducible in vitro and in silico and extendable with external data. Based on the Connectivity Map (CMap) dataset, we combined 26 different similarity scores with six different heuristics to reduce the number of genes in the model. Such gene filters may also utilize external knowledge e.g. from biological networks. We found that no similarity score always outperforms all others for all drugs, but the Pearson correlation finds the same drug with the highest reliability. Results are improved by filtering for highly expressed genes and to a lesser degree for genes with large fold changes. Also a network-based reduction of contributing transcripts was beneficial, here implemented by the FocusHeuristics. We found no drop in prediction accuracy when reducing the whole transcriptome to the set of 1000 landmark genes of the CMap’s successor project Library of Integrated Network-based Cellular Signatures. All source code to re-analyze and extend the CMap data, the source code of heuristics, filters and their evaluation are available to propel the development of new methods for drug repurposing. AVAILABILITY: https://bitbucket.org/ibima/moldrugeffectsdb CONTACT: steffen.moeller@uni-rostock.de SUPPLEMENTARY INFORMATION: Supplementary data are available at Briefings in Bioinformatics online.
format Online
Article
Text
id pubmed-8138836
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-81388362021-05-25 Scoring functions for drug-effect similarity Struckmann, Stephan Ernst, Mathias Fischer, Sarah Mah, Nancy Fuellen, Georg Möller, Steffen Brief Bioinform Methods Review MOTIVATION: The difficulty to find new drugs and bring them to the market has led to an increased interest to find new applications for known compounds. Biological samples from many disease contexts have been extensively profiled by transcriptomics, and, intuitively, this motivates to search for compounds with a reversing effect on the expression of characteristic disease genes. However, disease effects may be cell line-specific and also depend on other factors, such as genetics and environment. Transcription profile changes between healthy and diseased cells relate in complex ways to profile changes gathered from cell lines upon stimulation with a drug. Despite these differences, we expect that there will be some similarity in the gene regulatory networks at play in both situations. The challenge is to match transcriptomes for both diseases and drugs alike, even though the exact molecular pathology/pharmacogenomics may not be known. RESULTS: We substitute the challenge to match a drug effect to a disease effect with the challenge to match a drug effect to the effect of the same drug at another concentration or in another cell line. This is welldefined, reproducible in vitro and in silico and extendable with external data. Based on the Connectivity Map (CMap) dataset, we combined 26 different similarity scores with six different heuristics to reduce the number of genes in the model. Such gene filters may also utilize external knowledge e.g. from biological networks. We found that no similarity score always outperforms all others for all drugs, but the Pearson correlation finds the same drug with the highest reliability. Results are improved by filtering for highly expressed genes and to a lesser degree for genes with large fold changes. Also a network-based reduction of contributing transcripts was beneficial, here implemented by the FocusHeuristics. We found no drop in prediction accuracy when reducing the whole transcriptome to the set of 1000 landmark genes of the CMap’s successor project Library of Integrated Network-based Cellular Signatures. All source code to re-analyze and extend the CMap data, the source code of heuristics, filters and their evaluation are available to propel the development of new methods for drug repurposing. AVAILABILITY: https://bitbucket.org/ibima/moldrugeffectsdb CONTACT: steffen.moeller@uni-rostock.de SUPPLEMENTARY INFORMATION: Supplementary data are available at Briefings in Bioinformatics online. Oxford University Press 2020-06-02 /pmc/articles/PMC8138836/ /pubmed/32484516 http://dx.doi.org/10.1093/bib/bbaa072 Text en © The Author(s) 2020. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Review
Struckmann, Stephan
Ernst, Mathias
Fischer, Sarah
Mah, Nancy
Fuellen, Georg
Möller, Steffen
Scoring functions for drug-effect similarity
title Scoring functions for drug-effect similarity
title_full Scoring functions for drug-effect similarity
title_fullStr Scoring functions for drug-effect similarity
title_full_unstemmed Scoring functions for drug-effect similarity
title_short Scoring functions for drug-effect similarity
title_sort scoring functions for drug-effect similarity
topic Methods Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8138836/
https://www.ncbi.nlm.nih.gov/pubmed/32484516
http://dx.doi.org/10.1093/bib/bbaa072
work_keys_str_mv AT struckmannstephan scoringfunctionsfordrugeffectsimilarity
AT ernstmathias scoringfunctionsfordrugeffectsimilarity
AT fischersarah scoringfunctionsfordrugeffectsimilarity
AT mahnancy scoringfunctionsfordrugeffectsimilarity
AT fuellengeorg scoringfunctionsfordrugeffectsimilarity
AT mollersteffen scoringfunctionsfordrugeffectsimilarity