Cargando…
A corpus and a concordancer of academic journal articles
This data article presents a corpus (i.e. a selection of a big number of words in an electronic form) and a concordancer (i.e. a tool to show the word in its context of use) of academic journal articles. As the title suggests, the data were collected from research articles published in academic jour...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5694964/ https://www.ncbi.nlm.nih.gov/pubmed/29188227 http://dx.doi.org/10.1016/j.dib.2017.11.023 |
_version_ | 1783280227498065920 |
---|---|
author | Kwary, Deny A. |
author_facet | Kwary, Deny A. |
author_sort | Kwary, Deny A. |
collection | PubMed |
description | This data article presents a corpus (i.e. a selection of a big number of words in an electronic form) and a concordancer (i.e. a tool to show the word in its context of use) of academic journal articles. As the title suggests, the data were collected from research articles published in academic journals. The corpus contains 5,686,428 words selected from 895 journal articles published by Elsevier in 2011–2015. The corpus is classified into four subject areas: Health sciences, Life sciences, Physical Sciences, and Social Sciences, following the classifications of Scopus, which is the largest abstract and citation database of peer-reviewed scientific journals, books and conference proceedings. To ease the access and utilization of the corpus, a program to produce the key word in context (KWIC) and word frequency was created and placed on the website: corpus.kwary.net. The corpus is a valuable resource for researchers, teachers, and translators working on academic English. |
format | Online Article Text |
id | pubmed-5694964 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-56949642017-11-29 A corpus and a concordancer of academic journal articles Kwary, Deny A. Data Brief Arts and Humanity This data article presents a corpus (i.e. a selection of a big number of words in an electronic form) and a concordancer (i.e. a tool to show the word in its context of use) of academic journal articles. As the title suggests, the data were collected from research articles published in academic journals. The corpus contains 5,686,428 words selected from 895 journal articles published by Elsevier in 2011–2015. The corpus is classified into four subject areas: Health sciences, Life sciences, Physical Sciences, and Social Sciences, following the classifications of Scopus, which is the largest abstract and citation database of peer-reviewed scientific journals, books and conference proceedings. To ease the access and utilization of the corpus, a program to produce the key word in context (KWIC) and word frequency was created and placed on the website: corpus.kwary.net. The corpus is a valuable resource for researchers, teachers, and translators working on academic English. Elsevier 2017-11-08 /pmc/articles/PMC5694964/ /pubmed/29188227 http://dx.doi.org/10.1016/j.dib.2017.11.023 Text en © 2017 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Arts and Humanity Kwary, Deny A. A corpus and a concordancer of academic journal articles |
title | A corpus and a concordancer of academic journal articles |
title_full | A corpus and a concordancer of academic journal articles |
title_fullStr | A corpus and a concordancer of academic journal articles |
title_full_unstemmed | A corpus and a concordancer of academic journal articles |
title_short | A corpus and a concordancer of academic journal articles |
title_sort | corpus and a concordancer of academic journal articles |
topic | Arts and Humanity |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5694964/ https://www.ncbi.nlm.nih.gov/pubmed/29188227 http://dx.doi.org/10.1016/j.dib.2017.11.023 |
work_keys_str_mv | AT kwarydenya acorpusandaconcordancerofacademicjournalarticles AT kwarydenya corpusandaconcordancerofacademicjournalarticles |