Netlang: A software for the linguistic analysis of corpora by means of complex networks

To date there is no software that directly connects the linguistic analysis of a conversation to a network program. Network programs can extract statistical information from databases that describe systems of interacting elements. Language has also been conceived and studied as a...

Bibliographic Details
Main authors: Barceló-Coblijn, Lluís, Serna Salazar, Diego, Isaza, Gustavo, Castillo Ossa, Luis F., Bedia, Manuel G.
Format: Online Article Text
Language: English
Published: Public Library of Science 2017
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5568436/
https://www.ncbi.nlm.nih.gov/pubmed/28832598
http://dx.doi.org/10.1371/journal.pone.0181341
_version_ 1783258862702297088
author Barceló-Coblijn, Lluís
Serna Salazar, Diego
Isaza, Gustavo
Castillo Ossa, Luis F.
Bedia, Manuel G.
author_facet Barceló-Coblijn, Lluís
Serna Salazar, Diego
Isaza, Gustavo
Castillo Ossa, Luis F.
Bedia, Manuel G.
author_sort Barceló-Coblijn, Lluís
collection PubMed
description To date there is no software that directly connects the linguistic analysis of a conversation to a network program. Network programs can extract statistical information from databases that describe systems of interacting elements. Language has also been conceived and studied as a complex system. However, most proposals do not analyze language according to linguistic theory; instead they use computational systems that save time at the price of leaving aside many aspects that are crucial for linguistic theory. Some approaches to network studies of language do apply precise linguistic analyses made by a linguist. The problem until now has been the lack of an interface between the analysis of a sentence and its integration into the network, one that could be managed by a linguist and that could save the analysis of any language. Previous works have used old software that was not created for these purposes and that often had problems with idiosyncrasies of the target language. The desired interface should be able to deal with the syntactic peculiarities of a particular language, the options of linguistic theory preferred by the user, and the preservation of morpho-syntactic information (lexical categories and syntactic relations between items). Netlang is the first program able to do that. Recently, a new kind of linguistic analysis has been developed that extracts a complexity pattern from the speaker's linguistic production. The pattern is depicted as a network in which words are nodes, and the nodes are connected to each other by edges or links (the information carried by an edge can be syntactic, semantic, etc.). The Netlang software has become the bridge between raw linguistic data and the network program. Netlang has integrated and improved the functions of programs used in the past, namely the DGA annotator and two scripts (ToXML.pl and Xml2Pairs.py) used for transforming and pruning data.
Netlang allows the researcher to carry out accurate linguistic analyses by means of syntactic dependency relations between words, while keeping a record of the nature of those syntactic relationships (subject, object, etc.). The Netlang software is presented as a new tool that solves many problems detected in the past. The most important improvement is that Netlang integrates three past applications into one program and can produce a series of file formats that can be read by a network program. Through the Netlang software, linguistic network analysis based on syntactic analyses, characterized by its low cost and completely non-invasive procedure, aims to evolve into a sufficiently fine-grained tool for clinical diagnosis in potential cases of language disorders.
format Online
Article
Text
id pubmed-5568436
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-5568436 2017-09-09 Netlang: A software for the linguistic analysis of corpora by means of complex networks Barceló-Coblijn, Lluís Serna Salazar, Diego Isaza, Gustavo Castillo Ossa, Luis F. Bedia, Manuel G. PLoS One Research Article Public Library of Science 2017-08-23 /pmc/articles/PMC5568436/ /pubmed/28832598 http://dx.doi.org/10.1371/journal.pone.0181341 Text en © 2017 Barceló-Coblijn et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
title Netlang: A software for the linguistic analysis of corpora by means of complex networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5568436/
https://www.ncbi.nlm.nih.gov/pubmed/28832598
http://dx.doi.org/10.1371/journal.pone.0181341