Cargando…

Discovering new pathways toward integration between health and sustainable development goals with natural language processing and network science

BACKGROUND: Research on health and sustainable development is growing at a pace such that conventional literature review methods appear increasingly unable to synthesize all relevant evidence. This paper employs a novel combination of natural language processing (NLP) and network science techniques...

Descripción completa

Detalles Bibliográficos
Autores principales: Smith, Thomas Bryan, Vacca, Raffaele, Mantegazza, Luca, Capua, Ilaria
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10311734/
https://www.ncbi.nlm.nih.gov/pubmed/37386579
http://dx.doi.org/10.1186/s12992-023-00943-8
_version_ 1785066802539134976
author Smith, Thomas Bryan
Vacca, Raffaele
Mantegazza, Luca
Capua, Ilaria
author_facet Smith, Thomas Bryan
Vacca, Raffaele
Mantegazza, Luca
Capua, Ilaria
author_sort Smith, Thomas Bryan
collection PubMed
description BACKGROUND: Research on health and sustainable development is growing at a pace such that conventional literature review methods appear increasingly unable to synthesize all relevant evidence. This paper employs a novel combination of natural language processing (NLP) and network science techniques to address this problem and to answer two questions: (1) how is health thematically interconnected with the Sustainable Development Goals (SDGs) in global science? (2) What specific themes have emerged in research at the intersection between SDG 3 (“Good health and well-being”) and other sustainability goals? METHODS: After a descriptive analysis of the integration between SDGs in twenty years of global science (2001–2020) as indexed by dimensions.ai, we analyze abstracts of articles that are simultaneously relevant to SDG 3 and at least one other SDG (N = 27,928). We use the top2vec algorithm to discover topics in this corpus and measure semantic closeness between these topics. We then use network science methods to describe the network of substantive relationships between the topics and identify ‘zipper themes’, actionable domains of research and policy to co-advance health and other sustainability goals simultaneously. RESULTS: We observe a clear increase in scientific research integrating SDG 3 and other SDGs since 2001, both in absolute and relative terms, especially on topics relevant to interconnections between health and SDGs 2 (“Zero hunger”), 4 (“Quality education”), and 11 (“Sustainable cities and communities”). We distill a network of 197 topics from literature on health and sustainable development, with 19 distinct network communities – areas of growing integration with potential to further bridge health and sustainability science and policy. Literature focused explicitly on the SDGs is highly central in this network, while topical overlaps between SDG 3 and the environmental SDGs (12–15) are under-developed. CONCLUSION: Our analysis demonstrates the feasibility and promise of NLP and network science for synthesizing large amounts of health-related scientific literature and for suggesting novel research and policy domains to co-advance multiple SDGs. Many of the ‘zipper themes’ identified by our method resonate with the One Health perspective that human, animal, and plant health are closely interdependent. This and similar perspectives will help meet the challenge of ‘rewiring’ sustainability research to co-advance goals in health and sustainability. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12992-023-00943-8.
format Online
Article
Text
id pubmed-10311734
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-103117342023-07-01 Discovering new pathways toward integration between health and sustainable development goals with natural language processing and network science Smith, Thomas Bryan Vacca, Raffaele Mantegazza, Luca Capua, Ilaria Global Health Research BACKGROUND: Research on health and sustainable development is growing at a pace such that conventional literature review methods appear increasingly unable to synthesize all relevant evidence. This paper employs a novel combination of natural language processing (NLP) and network science techniques to address this problem and to answer two questions: (1) how is health thematically interconnected with the Sustainable Development Goals (SDGs) in global science? (2) What specific themes have emerged in research at the intersection between SDG 3 (“Good health and well-being”) and other sustainability goals? METHODS: After a descriptive analysis of the integration between SDGs in twenty years of global science (2001–2020) as indexed by dimensions.ai, we analyze abstracts of articles that are simultaneously relevant to SDG 3 and at least one other SDG (N = 27,928). We use the top2vec algorithm to discover topics in this corpus and measure semantic closeness between these topics. We then use network science methods to describe the network of substantive relationships between the topics and identify ‘zipper themes’, actionable domains of research and policy to co-advance health and other sustainability goals simultaneously. RESULTS: We observe a clear increase in scientific research integrating SDG 3 and other SDGs since 2001, both in absolute and relative terms, especially on topics relevant to interconnections between health and SDGs 2 (“Zero hunger”), 4 (“Quality education”), and 11 (“Sustainable cities and communities”). We distill a network of 197 topics from literature on health and sustainable development, with 19 distinct network communities – areas of growing integration with potential to further bridge health and sustainability science and policy. Literature focused explicitly on the SDGs is highly central in this network, while topical overlaps between SDG 3 and the environmental SDGs (12–15) are under-developed. CONCLUSION: Our analysis demonstrates the feasibility and promise of NLP and network science for synthesizing large amounts of health-related scientific literature and for suggesting novel research and policy domains to co-advance multiple SDGs. Many of the ‘zipper themes’ identified by our method resonate with the One Health perspective that human, animal, and plant health are closely interdependent. This and similar perspectives will help meet the challenge of ‘rewiring’ sustainability research to co-advance goals in health and sustainability. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12992-023-00943-8. BioMed Central 2023-06-29 /pmc/articles/PMC10311734/ /pubmed/37386579 http://dx.doi.org/10.1186/s12992-023-00943-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Smith, Thomas Bryan
Vacca, Raffaele
Mantegazza, Luca
Capua, Ilaria
Discovering new pathways toward integration between health and sustainable development goals with natural language processing and network science
title Discovering new pathways toward integration between health and sustainable development goals with natural language processing and network science
title_full Discovering new pathways toward integration between health and sustainable development goals with natural language processing and network science
title_fullStr Discovering new pathways toward integration between health and sustainable development goals with natural language processing and network science
title_full_unstemmed Discovering new pathways toward integration between health and sustainable development goals with natural language processing and network science
title_short Discovering new pathways toward integration between health and sustainable development goals with natural language processing and network science
title_sort discovering new pathways toward integration between health and sustainable development goals with natural language processing and network science
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10311734/
https://www.ncbi.nlm.nih.gov/pubmed/37386579
http://dx.doi.org/10.1186/s12992-023-00943-8
work_keys_str_mv AT smiththomasbryan discoveringnewpathwaystowardintegrationbetweenhealthandsustainabledevelopmentgoalswithnaturallanguageprocessingandnetworkscience
AT vaccaraffaele discoveringnewpathwaystowardintegrationbetweenhealthandsustainabledevelopmentgoalswithnaturallanguageprocessingandnetworkscience
AT mantegazzaluca discoveringnewpathwaystowardintegrationbetweenhealthandsustainabledevelopmentgoalswithnaturallanguageprocessingandnetworkscience
AT capuailaria discoveringnewpathwaystowardintegrationbetweenhealthandsustainabledevelopmentgoalswithnaturallanguageprocessingandnetworkscience