Cargando…

Collective intelligence defines biological functions in Wikipedia as communities in the hidden protein connection network

English Wikipedia, containing more than five millions articles, has approximately eleven thousands web pages devoted to proteins or genes most of which were generated by the Gene Wiki project. These pages contain information about interactions between proteins and their functional relationships. At...

Descripción completa

Detalles Bibliográficos
Autores principales: Zinovyev, Andrei, Czerwinska, Urszula, Cantini, Laura, Barillot, Emmanuel, Frahm, Klaus M., Shepelyansky, Dima L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7048313/
https://www.ncbi.nlm.nih.gov/pubmed/32069277
http://dx.doi.org/10.1371/journal.pcbi.1007652
_version_ 1783502277665882112
author Zinovyev, Andrei
Czerwinska, Urszula
Cantini, Laura
Barillot, Emmanuel
Frahm, Klaus M.
Shepelyansky, Dima L.
author_facet Zinovyev, Andrei
Czerwinska, Urszula
Cantini, Laura
Barillot, Emmanuel
Frahm, Klaus M.
Shepelyansky, Dima L.
author_sort Zinovyev, Andrei
collection PubMed
description English Wikipedia, containing more than five millions articles, has approximately eleven thousands web pages devoted to proteins or genes most of which were generated by the Gene Wiki project. These pages contain information about interactions between proteins and their functional relationships. At the same time, they are interconnected with other Wikipedia pages describing biological functions, diseases, drugs and other topics curated by independent, not coordinated collective efforts. Therefore, Wikipedia contains a directed network of protein functional relations or physical interactions embedded into the global network of the encyclopedia terms, which defines hidden (indirect) functional proximity between proteins. We applied the recently developed reduced Google Matrix (REGOMAX) algorithm in order to extract the network of hidden functional connections between proteins in Wikipedia. In this network we discovered tight communities which reflect areas of interest in molecular biology or medicine and can be considered as definitions of biological functions shaped by collective intelligence. Moreover, by comparing two snapshots of Wikipedia graph (from years 2013 and 2017), we studied the evolution of the network of direct and hidden protein connections. We concluded that the hidden connections are more dynamic compared to the direct ones and that the size of the hidden interaction communities grows with time. We recapitulate the results of Wikipedia protein community analysis and annotation in the form of an interactive online map, which can serve as a portal to the Gene Wiki project.
format Online
Article
Text
id pubmed-7048313
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-70483132020-03-09 Collective intelligence defines biological functions in Wikipedia as communities in the hidden protein connection network Zinovyev, Andrei Czerwinska, Urszula Cantini, Laura Barillot, Emmanuel Frahm, Klaus M. Shepelyansky, Dima L. PLoS Comput Biol Research Article English Wikipedia, containing more than five millions articles, has approximately eleven thousands web pages devoted to proteins or genes most of which were generated by the Gene Wiki project. These pages contain information about interactions between proteins and their functional relationships. At the same time, they are interconnected with other Wikipedia pages describing biological functions, diseases, drugs and other topics curated by independent, not coordinated collective efforts. Therefore, Wikipedia contains a directed network of protein functional relations or physical interactions embedded into the global network of the encyclopedia terms, which defines hidden (indirect) functional proximity between proteins. We applied the recently developed reduced Google Matrix (REGOMAX) algorithm in order to extract the network of hidden functional connections between proteins in Wikipedia. In this network we discovered tight communities which reflect areas of interest in molecular biology or medicine and can be considered as definitions of biological functions shaped by collective intelligence. Moreover, by comparing two snapshots of Wikipedia graph (from years 2013 and 2017), we studied the evolution of the network of direct and hidden protein connections. We concluded that the hidden connections are more dynamic compared to the direct ones and that the size of the hidden interaction communities grows with time. We recapitulate the results of Wikipedia protein community analysis and annotation in the form of an interactive online map, which can serve as a portal to the Gene Wiki project. Public Library of Science 2020-02-18 /pmc/articles/PMC7048313/ /pubmed/32069277 http://dx.doi.org/10.1371/journal.pcbi.1007652 Text en © 2020 Zinovyev et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zinovyev, Andrei
Czerwinska, Urszula
Cantini, Laura
Barillot, Emmanuel
Frahm, Klaus M.
Shepelyansky, Dima L.
Collective intelligence defines biological functions in Wikipedia as communities in the hidden protein connection network
title Collective intelligence defines biological functions in Wikipedia as communities in the hidden protein connection network
title_full Collective intelligence defines biological functions in Wikipedia as communities in the hidden protein connection network
title_fullStr Collective intelligence defines biological functions in Wikipedia as communities in the hidden protein connection network
title_full_unstemmed Collective intelligence defines biological functions in Wikipedia as communities in the hidden protein connection network
title_short Collective intelligence defines biological functions in Wikipedia as communities in the hidden protein connection network
title_sort collective intelligence defines biological functions in wikipedia as communities in the hidden protein connection network
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7048313/
https://www.ncbi.nlm.nih.gov/pubmed/32069277
http://dx.doi.org/10.1371/journal.pcbi.1007652
work_keys_str_mv AT zinovyevandrei collectiveintelligencedefinesbiologicalfunctionsinwikipediaascommunitiesinthehiddenproteinconnectionnetwork
AT czerwinskaurszula collectiveintelligencedefinesbiologicalfunctionsinwikipediaascommunitiesinthehiddenproteinconnectionnetwork
AT cantinilaura collectiveintelligencedefinesbiologicalfunctionsinwikipediaascommunitiesinthehiddenproteinconnectionnetwork
AT barillotemmanuel collectiveintelligencedefinesbiologicalfunctionsinwikipediaascommunitiesinthehiddenproteinconnectionnetwork
AT frahmklausm collectiveintelligencedefinesbiologicalfunctionsinwikipediaascommunitiesinthehiddenproteinconnectionnetwork
AT shepelyanskydimal collectiveintelligencedefinesbiologicalfunctionsinwikipediaascommunitiesinthehiddenproteinconnectionnetwork