Cargando…
Code4ML: a large-scale dataset of annotated Machine Learning code
The use of program code as a data source is increasingly expanding among data scientists. The purpose of the usage varies from the semantic classification of code to the automatic generation of programs. However, the machine learning model application is somewhat limited without annotating the code...
Autores principales: | Drozdova, Anastasia, Trofimova, Ekaterina, Guseva, Polina, Scherbakova, Anna, Ustyuzhanin, Andrey |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280557/ https://www.ncbi.nlm.nih.gov/pubmed/37346615 http://dx.doi.org/10.7717/peerj-cs.1230 |
Ejemplares similares
-
Yandex and ML
por: Ustyuzhanin, Andrey
Publicado: (2017) -
Interval Coded Scoring: a toolbox for interpretable scoring systems
por: Billiet, Lieven, et al.
Publicado: (2018) -
Multi-label multi-class COVID-19 Arabic Twitter dataset with fine-grained misinformation and situational information annotations
por: Obeidat, Rasha, et al.
Publicado: (2022) -
Clone-advisor: recommending code tokens and clone methods with deep learning and information retrieval
por: Hammad, Muhammad, et al.
Publicado: (2021) -
An experimental study on the performance of collaborative filtering based on user reviews for large-scale datasets
por: AL-Ghuribi, Sumaia, et al.
Publicado: (2023)