Cargando…

PecanPy: a fast, efficient and parallelized Python implementation of node2vec

SUMMARY: Learning low-dimensional representations (embeddings) of nodes in large graphs is key to applying machine learning on massive biological networks. Node2vec is the most widely used method for node embedding. However, its original Python and C++ implementations scale poorly with network densi...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Renming, Krishnan, Arjun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8504639/
https://www.ncbi.nlm.nih.gov/pubmed/33760066
http://dx.doi.org/10.1093/bioinformatics/btab202
_version_ 1784581360808099840
author Liu, Renming
Krishnan, Arjun
author_facet Liu, Renming
Krishnan, Arjun
author_sort Liu, Renming
collection PubMed
description SUMMARY: Learning low-dimensional representations (embeddings) of nodes in large graphs is key to applying machine learning on massive biological networks. Node2vec is the most widely used method for node embedding. However, its original Python and C++ implementations scale poorly with network density, failing for dense biological networks with hundreds of millions of edges. We have developed PecanPy, a new Python implementation of node2vec that uses cache-optimized compact graph data structures and precomputing/parallelization to result in fast, high-quality node embeddings for biological networks of all sizes and densities. AVAILABILITYAND IMPLEMENTATION: PecanPy software is freely available at https://github.com/krishnanlab/PecanPy. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8504639
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-85046392021-10-13 PecanPy: a fast, efficient and parallelized Python implementation of node2vec Liu, Renming Krishnan, Arjun Bioinformatics Applications Notes SUMMARY: Learning low-dimensional representations (embeddings) of nodes in large graphs is key to applying machine learning on massive biological networks. Node2vec is the most widely used method for node embedding. However, its original Python and C++ implementations scale poorly with network density, failing for dense biological networks with hundreds of millions of edges. We have developed PecanPy, a new Python implementation of node2vec that uses cache-optimized compact graph data structures and precomputing/parallelization to result in fast, high-quality node embeddings for biological networks of all sizes and densities. AVAILABILITYAND IMPLEMENTATION: PecanPy software is freely available at https://github.com/krishnanlab/PecanPy. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-03-24 /pmc/articles/PMC8504639/ /pubmed/33760066 http://dx.doi.org/10.1093/bioinformatics/btab202 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Notes
Liu, Renming
Krishnan, Arjun
PecanPy: a fast, efficient and parallelized Python implementation of node2vec
title PecanPy: a fast, efficient and parallelized Python implementation of node2vec
title_full PecanPy: a fast, efficient and parallelized Python implementation of node2vec
title_fullStr PecanPy: a fast, efficient and parallelized Python implementation of node2vec
title_full_unstemmed PecanPy: a fast, efficient and parallelized Python implementation of node2vec
title_short PecanPy: a fast, efficient and parallelized Python implementation of node2vec
title_sort pecanpy: a fast, efficient and parallelized python implementation of node2vec
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8504639/
https://www.ncbi.nlm.nih.gov/pubmed/33760066
http://dx.doi.org/10.1093/bioinformatics/btab202
work_keys_str_mv AT liurenming pecanpyafastefficientandparallelizedpythonimplementationofnode2vec
AT krishnanarjun pecanpyafastefficientandparallelizedpythonimplementationofnode2vec