Cargando…

Autonomous schema markups based on intelligent computing for search engine optimization

With advances in artificial intelligence and semantic technology, search engines are integrating semantics to address complex search queries to improve the results. This requires identification of well-known concepts or entities and their relationship from web page contents. But the increase in comp...

Descripción completa

Detalles Bibliográficos
Autores principales:	Abbasi, Burhan Ud Din, Fatima, Iram, Mukhtar, Hamid, Khan, Sharifullah, Alhumam, Abdulaziz, Ahmad, Hafiz Farooq
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	PeerJ Inc. 2022
Materias:	Data Mining and Machine Learning
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9748814/ https://www.ncbi.nlm.nih.gov/pubmed/36532807 http://dx.doi.org/10.7717/peerj-cs.1163

_version_	1784849908675641344
author	Abbasi, Burhan Ud Din Fatima, Iram Mukhtar, Hamid Khan, Sharifullah Alhumam, Abdulaziz Ahmad, Hafiz Farooq
author_facet	Abbasi, Burhan Ud Din Fatima, Iram Mukhtar, Hamid Khan, Sharifullah Alhumam, Abdulaziz Ahmad, Hafiz Farooq
author_sort	Abbasi, Burhan Ud Din
collection	PubMed
description	With advances in artificial intelligence and semantic technology, search engines are integrating semantics to address complex search queries to improve the results. This requires identification of well-known concepts or entities and their relationship from web page contents. But the increase in complex unstructured data on web pages has made the task of concept identification overly complex. Existing research focuses on entity recognition from the perspective of linguistic structures such as complete sentences and paragraphs, whereas a huge part of the data on web pages exists as unstructured text fragments enclosed in HTML tags. Ontologies provide schemas to structure the data on the web. However, including them in the web pages requires additional resources and expertise from organizations or webmasters and thus becoming a major hindrance in their large-scale adoption. We propose an approach for autonomous identification of entities from short text present in web pages to populate semantic models based on a specific ontology model. The proposed approach has been applied to a public dataset containing academic web pages. We employ a long short-term memory (LSTM) deep learning network and the random forest machine learning algorithm to predict entities. The proposed methodology gives an overall accuracy of 0.94 on the test dataset, indicating a potential for automated prediction even in the case of a limited number of training samples for various entities, thus, significantly reducing the required manual workload in practical applications.
format	Online Article Text
id	pubmed-9748814
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	PeerJ Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-97488142022-12-15 Autonomous schema markups based on intelligent computing for search engine optimization Abbasi, Burhan Ud Din Fatima, Iram Mukhtar, Hamid Khan, Sharifullah Alhumam, Abdulaziz Ahmad, Hafiz Farooq PeerJ Comput Sci Data Mining and Machine Learning With advances in artificial intelligence and semantic technology, search engines are integrating semantics to address complex search queries to improve the results. This requires identification of well-known concepts or entities and their relationship from web page contents. But the increase in complex unstructured data on web pages has made the task of concept identification overly complex. Existing research focuses on entity recognition from the perspective of linguistic structures such as complete sentences and paragraphs, whereas a huge part of the data on web pages exists as unstructured text fragments enclosed in HTML tags. Ontologies provide schemas to structure the data on the web. However, including them in the web pages requires additional resources and expertise from organizations or webmasters and thus becoming a major hindrance in their large-scale adoption. We propose an approach for autonomous identification of entities from short text present in web pages to populate semantic models based on a specific ontology model. The proposed approach has been applied to a public dataset containing academic web pages. We employ a long short-term memory (LSTM) deep learning network and the random forest machine learning algorithm to predict entities. The proposed methodology gives an overall accuracy of 0.94 on the test dataset, indicating a potential for automated prediction even in the case of a limited number of training samples for various entities, thus, significantly reducing the required manual workload in practical applications. PeerJ Inc. 2022-12-08 /pmc/articles/PMC9748814/ /pubmed/36532807 http://dx.doi.org/10.7717/peerj-cs.1163 Text en © 2022 Abbasi et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle	Data Mining and Machine Learning Abbasi, Burhan Ud Din Fatima, Iram Mukhtar, Hamid Khan, Sharifullah Alhumam, Abdulaziz Ahmad, Hafiz Farooq Autonomous schema markups based on intelligent computing for search engine optimization
title	Autonomous schema markups based on intelligent computing for search engine optimization
title_full	Autonomous schema markups based on intelligent computing for search engine optimization
title_fullStr	Autonomous schema markups based on intelligent computing for search engine optimization
title_full_unstemmed	Autonomous schema markups based on intelligent computing for search engine optimization
title_short	Autonomous schema markups based on intelligent computing for search engine optimization
title_sort	autonomous schema markups based on intelligent computing for search engine optimization
topic	Data Mining and Machine Learning
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9748814/ https://www.ncbi.nlm.nih.gov/pubmed/36532807 http://dx.doi.org/10.7717/peerj-cs.1163
work_keys_str_mv	AT abbasiburhanuddin autonomousschemamarkupsbasedonintelligentcomputingforsearchengineoptimization AT fatimairam autonomousschemamarkupsbasedonintelligentcomputingforsearchengineoptimization AT mukhtarhamid autonomousschemamarkupsbasedonintelligentcomputingforsearchengineoptimization AT khansharifullah autonomousschemamarkupsbasedonintelligentcomputingforsearchengineoptimization AT alhumamabdulaziz autonomousschemamarkupsbasedonintelligentcomputingforsearchengineoptimization AT ahmadhafizfarooq autonomousschemamarkupsbasedonintelligentcomputingforsearchengineoptimization

Autonomous schema markups based on intelligent computing for search engine optimization

Ejemplares similares