Cargando…

A Compressed Language Model Embedding Dataset of ICD 10 CM Descriptions

This paper presents novel datasets providing numerical representations of ICD-10-CM codes by generating description embeddings using a large language model followed by a dimension reduction via autoencoder. The embeddings serve as informative input features for machine learning models by capturing r...

Descripción completa

Detalles Bibliográficos
Autores principales: Kane, Michael J., King, Casey, Esserman, Denise, Latham, Nancy K., Greene, Erich J., Ganz, David A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10168496/
https://www.ncbi.nlm.nih.gov/pubmed/37162903
http://dx.doi.org/10.1101/2023.04.24.23289046