Cargando…
PecanPy: a fast, efficient and parallelized Python implementation of node2vec
SUMMARY: Learning low-dimensional representations (embeddings) of nodes in large graphs is key to applying machine learning on massive biological networks. Node2vec is the most widely used method for node embedding. However, its original Python and C++ implementations scale poorly with network densi...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8504639/ https://www.ncbi.nlm.nih.gov/pubmed/33760066 http://dx.doi.org/10.1093/bioinformatics/btab202 |
_version_ | 1784581360808099840 |
---|---|
author | Liu, Renming Krishnan, Arjun |
author_facet | Liu, Renming Krishnan, Arjun |
author_sort | Liu, Renming |
collection | PubMed |
description | SUMMARY: Learning low-dimensional representations (embeddings) of nodes in large graphs is key to applying machine learning on massive biological networks. Node2vec is the most widely used method for node embedding. However, its original Python and C++ implementations scale poorly with network density, failing for dense biological networks with hundreds of millions of edges. We have developed PecanPy, a new Python implementation of node2vec that uses cache-optimized compact graph data structures and precomputing/parallelization to result in fast, high-quality node embeddings for biological networks of all sizes and densities. AVAILABILITYAND IMPLEMENTATION: PecanPy software is freely available at https://github.com/krishnanlab/PecanPy. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-8504639 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-85046392021-10-13 PecanPy: a fast, efficient and parallelized Python implementation of node2vec Liu, Renming Krishnan, Arjun Bioinformatics Applications Notes SUMMARY: Learning low-dimensional representations (embeddings) of nodes in large graphs is key to applying machine learning on massive biological networks. Node2vec is the most widely used method for node embedding. However, its original Python and C++ implementations scale poorly with network density, failing for dense biological networks with hundreds of millions of edges. We have developed PecanPy, a new Python implementation of node2vec that uses cache-optimized compact graph data structures and precomputing/parallelization to result in fast, high-quality node embeddings for biological networks of all sizes and densities. AVAILABILITYAND IMPLEMENTATION: PecanPy software is freely available at https://github.com/krishnanlab/PecanPy. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-03-24 /pmc/articles/PMC8504639/ /pubmed/33760066 http://dx.doi.org/10.1093/bioinformatics/btab202 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Notes Liu, Renming Krishnan, Arjun PecanPy: a fast, efficient and parallelized Python implementation of node2vec |
title | PecanPy: a fast, efficient and parallelized Python implementation of node2vec |
title_full | PecanPy: a fast, efficient and parallelized Python implementation of node2vec |
title_fullStr | PecanPy: a fast, efficient and parallelized Python implementation of node2vec |
title_full_unstemmed | PecanPy: a fast, efficient and parallelized Python implementation of node2vec |
title_short | PecanPy: a fast, efficient and parallelized Python implementation of node2vec |
title_sort | pecanpy: a fast, efficient and parallelized python implementation of node2vec |
topic | Applications Notes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8504639/ https://www.ncbi.nlm.nih.gov/pubmed/33760066 http://dx.doi.org/10.1093/bioinformatics/btab202 |
work_keys_str_mv | AT liurenming pecanpyafastefficientandparallelizedpythonimplementationofnode2vec AT krishnanarjun pecanpyafastefficientandparallelizedpythonimplementationofnode2vec |