Cargando…

‘Sciencenet’—towards a global search and share engine for all scientific knowledge

Summary: Modern biological experiments create vast amounts of data which are geographically distributed. These datasets consist of petabytes of raw data and billions of documents. Yet to the best of our knowledge, a search engine technology that searches and cross-links all different data types in l...

Descripción completa

Detalles Bibliográficos
Autores principales: Lütjohann, Dominic S., Shah, Asmi H., Christen, Michael P., Richter, Florian, Knese, Karsten, Liebel, Urban
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3106183/
https://www.ncbi.nlm.nih.gov/pubmed/21493657
http://dx.doi.org/10.1093/bioinformatics/btr181
_version_ 1782204761399885824
author Lütjohann, Dominic S.
Shah, Asmi H.
Christen, Michael P.
Richter, Florian
Knese, Karsten
Liebel, Urban
author_facet Lütjohann, Dominic S.
Shah, Asmi H.
Christen, Michael P.
Richter, Florian
Knese, Karsten
Liebel, Urban
author_sort Lütjohann, Dominic S.
collection PubMed
description Summary: Modern biological experiments create vast amounts of data which are geographically distributed. These datasets consist of petabytes of raw data and billions of documents. Yet to the best of our knowledge, a search engine technology that searches and cross-links all different data types in life sciences does not exist. We have developed a prototype distributed scientific search engine technology, ‘Sciencenet’, which facilitates rapid searching over this large data space. By ‘bringing the search engine to the data’, we do not require server farms. This platform also allows users to contribute to the search index and publish their large-scale data to support e-Science. Furthermore, a community-driven method guarantees that only scientific content is crawled and presented. Our peer-to-peer approach is sufficiently scalable for the science web without performance or capacity tradeoff. Availability and Implementation: The free to use search portal web page and the downloadable client are accessible at: http://sciencenet.kit.edu. The web portal for index administration is implemented in ASP.NET, the ‘AskMe’ experiment publisher is written in Python 2.7, and the backend ‘YaCy’ search engine is based on Java 1.6. Contact: urban.liebel@kit.edu Supplementary Material: Detailed instructions and descriptions can be found on the project homepage: http://sciencenet.kit.edu.
format Text
id pubmed-3106183
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-31061832011-06-03 ‘Sciencenet’—towards a global search and share engine for all scientific knowledge Lütjohann, Dominic S. Shah, Asmi H. Christen, Michael P. Richter, Florian Knese, Karsten Liebel, Urban Bioinformatics Applications Note Summary: Modern biological experiments create vast amounts of data which are geographically distributed. These datasets consist of petabytes of raw data and billions of documents. Yet to the best of our knowledge, a search engine technology that searches and cross-links all different data types in life sciences does not exist. We have developed a prototype distributed scientific search engine technology, ‘Sciencenet’, which facilitates rapid searching over this large data space. By ‘bringing the search engine to the data’, we do not require server farms. This platform also allows users to contribute to the search index and publish their large-scale data to support e-Science. Furthermore, a community-driven method guarantees that only scientific content is crawled and presented. Our peer-to-peer approach is sufficiently scalable for the science web without performance or capacity tradeoff. Availability and Implementation: The free to use search portal web page and the downloadable client are accessible at: http://sciencenet.kit.edu. The web portal for index administration is implemented in ASP.NET, the ‘AskMe’ experiment publisher is written in Python 2.7, and the backend ‘YaCy’ search engine is based on Java 1.6. Contact: urban.liebel@kit.edu Supplementary Material: Detailed instructions and descriptions can be found on the project homepage: http://sciencenet.kit.edu. Oxford University Press 2011-06-15 2011-04-14 /pmc/articles/PMC3106183/ /pubmed/21493657 http://dx.doi.org/10.1093/bioinformatics/btr181 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Note
Lütjohann, Dominic S.
Shah, Asmi H.
Christen, Michael P.
Richter, Florian
Knese, Karsten
Liebel, Urban
‘Sciencenet’—towards a global search and share engine for all scientific knowledge
title ‘Sciencenet’—towards a global search and share engine for all scientific knowledge
title_full ‘Sciencenet’—towards a global search and share engine for all scientific knowledge
title_fullStr ‘Sciencenet’—towards a global search and share engine for all scientific knowledge
title_full_unstemmed ‘Sciencenet’—towards a global search and share engine for all scientific knowledge
title_short ‘Sciencenet’—towards a global search and share engine for all scientific knowledge
title_sort ‘sciencenet’—towards a global search and share engine for all scientific knowledge
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3106183/
https://www.ncbi.nlm.nih.gov/pubmed/21493657
http://dx.doi.org/10.1093/bioinformatics/btr181
work_keys_str_mv AT lutjohanndominics sciencenettowardsaglobalsearchandshareengineforallscientificknowledge
AT shahasmih sciencenettowardsaglobalsearchandshareengineforallscientificknowledge
AT christenmichaelp sciencenettowardsaglobalsearchandshareengineforallscientificknowledge
AT richterflorian sciencenettowardsaglobalsearchandshareengineforallscientificknowledge
AT knesekarsten sciencenettowardsaglobalsearchandshareengineforallscientificknowledge
AT liebelurban sciencenettowardsaglobalsearchandshareengineforallscientificknowledge