Cargando…
A Compressed Language Model Embedding Dataset of ICD 10 CM Descriptions
This paper presents novel datasets providing numerical representations of ICD-10-CM codes by generating description embeddings using a large language model followed by a dimension reduction via autoencoder. The embeddings serve as informative input features for machine learning models by capturing r...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10168496/ https://www.ncbi.nlm.nih.gov/pubmed/37162903 http://dx.doi.org/10.1101/2023.04.24.23289046 |