Cargando…

Random Steinhaus Distances for Robust Syntax-Based Classification of Partially Inconsistent Linguistic Data

We use the Steinhaus transform of metric distances to deal with inconsistency in linguistic classification. We focus on data due to G. Longobardi’s school: languages are represented through yes-no strings of length 53, each string position corresponding to a syntactic feature which can be present or...

Descripción completa

Detalles Bibliográficos
Autores principales: Franzoi, Laura, Sgarro, Andrea, Dinu, Anca, Dinu, Liviu P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7274662/
http://dx.doi.org/10.1007/978-3-030-50153-2_2
Descripción
Sumario:We use the Steinhaus transform of metric distances to deal with inconsistency in linguistic classification. We focus on data due to G. Longobardi’s school: languages are represented through yes-no strings of length 53, each string position corresponding to a syntactic feature which can be present or absent. However, due to a complex network of logical implications which constrain features, some positions might be undefined (logically inconsistent). To take into account linguistic inconsistency, the distances we use are Steinhaus metric distances generalizing the normalized Hamming distance. To validate the robustness of classifications based on Longobardi’s data we resort to randomized transforms. Experimental results are provided and commented upon.