Cargando…

A corpus and a concordancer of academic journal articles

This data article presents a corpus (i.e. a selection of a big number of words in an electronic form) and a concordancer (i.e. a tool to show the word in its context of use) of academic journal articles. As the title suggests, the data were collected from research articles published in academic jour...

Descripción completa

Detalles Bibliográficos
Autor principal: Kwary, Deny A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5694964/
https://www.ncbi.nlm.nih.gov/pubmed/29188227
http://dx.doi.org/10.1016/j.dib.2017.11.023
_version_ 1783280227498065920
author Kwary, Deny A.
author_facet Kwary, Deny A.
author_sort Kwary, Deny A.
collection PubMed
description This data article presents a corpus (i.e. a selection of a big number of words in an electronic form) and a concordancer (i.e. a tool to show the word in its context of use) of academic journal articles. As the title suggests, the data were collected from research articles published in academic journals. The corpus contains 5,686,428 words selected from 895 journal articles published by Elsevier in 2011–2015. The corpus is classified into four subject areas: Health sciences, Life sciences, Physical Sciences, and Social Sciences, following the classifications of Scopus, which is the largest abstract and citation database of peer-reviewed scientific journals, books and conference proceedings. To ease the access and utilization of the corpus, a program to produce the key word in context (KWIC) and word frequency was created and placed on the website: corpus.kwary.net. The corpus is a valuable resource for researchers, teachers, and translators working on academic English.
format Online
Article
Text
id pubmed-5694964
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-56949642017-11-29 A corpus and a concordancer of academic journal articles Kwary, Deny A. Data Brief Arts and Humanity This data article presents a corpus (i.e. a selection of a big number of words in an electronic form) and a concordancer (i.e. a tool to show the word in its context of use) of academic journal articles. As the title suggests, the data were collected from research articles published in academic journals. The corpus contains 5,686,428 words selected from 895 journal articles published by Elsevier in 2011–2015. The corpus is classified into four subject areas: Health sciences, Life sciences, Physical Sciences, and Social Sciences, following the classifications of Scopus, which is the largest abstract and citation database of peer-reviewed scientific journals, books and conference proceedings. To ease the access and utilization of the corpus, a program to produce the key word in context (KWIC) and word frequency was created and placed on the website: corpus.kwary.net. The corpus is a valuable resource for researchers, teachers, and translators working on academic English. Elsevier 2017-11-08 /pmc/articles/PMC5694964/ /pubmed/29188227 http://dx.doi.org/10.1016/j.dib.2017.11.023 Text en © 2017 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Arts and Humanity
Kwary, Deny A.
A corpus and a concordancer of academic journal articles
title A corpus and a concordancer of academic journal articles
title_full A corpus and a concordancer of academic journal articles
title_fullStr A corpus and a concordancer of academic journal articles
title_full_unstemmed A corpus and a concordancer of academic journal articles
title_short A corpus and a concordancer of academic journal articles
title_sort corpus and a concordancer of academic journal articles
topic Arts and Humanity
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5694964/
https://www.ncbi.nlm.nih.gov/pubmed/29188227
http://dx.doi.org/10.1016/j.dib.2017.11.023
work_keys_str_mv AT kwarydenya acorpusandaconcordancerofacademicjournalarticles
AT kwarydenya corpusandaconcordancerofacademicjournalarticles