Cargando…

Ecological interactions and the Netflix problem

Species interactions are a key component of ecosystems but we generally have an incomplete picture of who-eats-who in a given community. Different techniques have been devised to predict species interactions using theoretical models or abundances. Here, we explore the K nearest neighbour approach, w...

Descripción completa

Detalles Bibliográficos
Autores principales: Desjardins-Proulx, Philippe, Laigle, Idaline, Poisot, Timothée, Gravel, Dominique
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5554597/
https://www.ncbi.nlm.nih.gov/pubmed/28828250
http://dx.doi.org/10.7717/peerj.3644
_version_ 1783256820102463488
author Desjardins-Proulx, Philippe
Laigle, Idaline
Poisot, Timothée
Gravel, Dominique
author_facet Desjardins-Proulx, Philippe
Laigle, Idaline
Poisot, Timothée
Gravel, Dominique
author_sort Desjardins-Proulx, Philippe
collection PubMed
description Species interactions are a key component of ecosystems but we generally have an incomplete picture of who-eats-who in a given community. Different techniques have been devised to predict species interactions using theoretical models or abundances. Here, we explore the K nearest neighbour approach, with a special emphasis on recommendation, along with a supervised machine learning technique. Recommenders are algorithms developed for companies like Netflix to predict whether a customer will like a product given the preferences of similar customers. These machine learning techniques are well-suited to study binary ecological interactions since they focus on positive-only data. By removing a prey from a predator, we find that recommenders can guess the missing prey around 50% of the times on the first try, with up to 881 possibilities. Traits do not improve significantly the results for the K nearest neighbour, although a simple test with a supervised learning approach (random forests) show we can predict interactions with high accuracy using only three traits per species. This result shows that binary interactions can be predicted without regard to the ecological community given only three variables: body mass and two variables for the species’ phylogeny. These techniques are complementary, as recommenders can predict interactions in the absence of traits, using only information about other species’ interactions, while supervised learning algorithms such as random forests base their predictions on traits only but do not exploit other species’ interactions. Further work should focus on developing custom similarity measures specialized for ecology to improve the KNN algorithms and using richer data to capture indirect relationships between species.
format Online
Article
Text
id pubmed-5554597
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-55545972017-08-21 Ecological interactions and the Netflix problem Desjardins-Proulx, Philippe Laigle, Idaline Poisot, Timothée Gravel, Dominique PeerJ Bioinformatics Species interactions are a key component of ecosystems but we generally have an incomplete picture of who-eats-who in a given community. Different techniques have been devised to predict species interactions using theoretical models or abundances. Here, we explore the K nearest neighbour approach, with a special emphasis on recommendation, along with a supervised machine learning technique. Recommenders are algorithms developed for companies like Netflix to predict whether a customer will like a product given the preferences of similar customers. These machine learning techniques are well-suited to study binary ecological interactions since they focus on positive-only data. By removing a prey from a predator, we find that recommenders can guess the missing prey around 50% of the times on the first try, with up to 881 possibilities. Traits do not improve significantly the results for the K nearest neighbour, although a simple test with a supervised learning approach (random forests) show we can predict interactions with high accuracy using only three traits per species. This result shows that binary interactions can be predicted without regard to the ecological community given only three variables: body mass and two variables for the species’ phylogeny. These techniques are complementary, as recommenders can predict interactions in the absence of traits, using only information about other species’ interactions, while supervised learning algorithms such as random forests base their predictions on traits only but do not exploit other species’ interactions. Further work should focus on developing custom similarity measures specialized for ecology to improve the KNN algorithms and using richer data to capture indirect relationships between species. PeerJ Inc. 2017-08-10 /pmc/articles/PMC5554597/ /pubmed/28828250 http://dx.doi.org/10.7717/peerj.3644 Text en ©2017 Desjardins-Proulx et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Desjardins-Proulx, Philippe
Laigle, Idaline
Poisot, Timothée
Gravel, Dominique
Ecological interactions and the Netflix problem
title Ecological interactions and the Netflix problem
title_full Ecological interactions and the Netflix problem
title_fullStr Ecological interactions and the Netflix problem
title_full_unstemmed Ecological interactions and the Netflix problem
title_short Ecological interactions and the Netflix problem
title_sort ecological interactions and the netflix problem
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5554597/
https://www.ncbi.nlm.nih.gov/pubmed/28828250
http://dx.doi.org/10.7717/peerj.3644
work_keys_str_mv AT desjardinsproulxphilippe ecologicalinteractionsandthenetflixproblem
AT laigleidaline ecologicalinteractionsandthenetflixproblem
AT poisottimothee ecologicalinteractionsandthenetflixproblem
AT graveldominique ecologicalinteractionsandthenetflixproblem