Cargando…
‘Sciencenet’—towards a global search and share engine for all scientific knowledge
Summary: Modern biological experiments create vast amounts of data which are geographically distributed. These datasets consist of petabytes of raw data and billions of documents. Yet to the best of our knowledge, a search engine technology that searches and cross-links all different data types in l...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3106183/ https://www.ncbi.nlm.nih.gov/pubmed/21493657 http://dx.doi.org/10.1093/bioinformatics/btr181 |
_version_ | 1782204761399885824 |
---|---|
author | Lütjohann, Dominic S. Shah, Asmi H. Christen, Michael P. Richter, Florian Knese, Karsten Liebel, Urban |
author_facet | Lütjohann, Dominic S. Shah, Asmi H. Christen, Michael P. Richter, Florian Knese, Karsten Liebel, Urban |
author_sort | Lütjohann, Dominic S. |
collection | PubMed |
description | Summary: Modern biological experiments create vast amounts of data which are geographically distributed. These datasets consist of petabytes of raw data and billions of documents. Yet to the best of our knowledge, a search engine technology that searches and cross-links all different data types in life sciences does not exist. We have developed a prototype distributed scientific search engine technology, ‘Sciencenet’, which facilitates rapid searching over this large data space. By ‘bringing the search engine to the data’, we do not require server farms. This platform also allows users to contribute to the search index and publish their large-scale data to support e-Science. Furthermore, a community-driven method guarantees that only scientific content is crawled and presented. Our peer-to-peer approach is sufficiently scalable for the science web without performance or capacity tradeoff. Availability and Implementation: The free to use search portal web page and the downloadable client are accessible at: http://sciencenet.kit.edu. The web portal for index administration is implemented in ASP.NET, the ‘AskMe’ experiment publisher is written in Python 2.7, and the backend ‘YaCy’ search engine is based on Java 1.6. Contact: urban.liebel@kit.edu Supplementary Material: Detailed instructions and descriptions can be found on the project homepage: http://sciencenet.kit.edu. |
format | Text |
id | pubmed-3106183 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-31061832011-06-03 ‘Sciencenet’—towards a global search and share engine for all scientific knowledge Lütjohann, Dominic S. Shah, Asmi H. Christen, Michael P. Richter, Florian Knese, Karsten Liebel, Urban Bioinformatics Applications Note Summary: Modern biological experiments create vast amounts of data which are geographically distributed. These datasets consist of petabytes of raw data and billions of documents. Yet to the best of our knowledge, a search engine technology that searches and cross-links all different data types in life sciences does not exist. We have developed a prototype distributed scientific search engine technology, ‘Sciencenet’, which facilitates rapid searching over this large data space. By ‘bringing the search engine to the data’, we do not require server farms. This platform also allows users to contribute to the search index and publish their large-scale data to support e-Science. Furthermore, a community-driven method guarantees that only scientific content is crawled and presented. Our peer-to-peer approach is sufficiently scalable for the science web without performance or capacity tradeoff. Availability and Implementation: The free to use search portal web page and the downloadable client are accessible at: http://sciencenet.kit.edu. The web portal for index administration is implemented in ASP.NET, the ‘AskMe’ experiment publisher is written in Python 2.7, and the backend ‘YaCy’ search engine is based on Java 1.6. Contact: urban.liebel@kit.edu Supplementary Material: Detailed instructions and descriptions can be found on the project homepage: http://sciencenet.kit.edu. Oxford University Press 2011-06-15 2011-04-14 /pmc/articles/PMC3106183/ /pubmed/21493657 http://dx.doi.org/10.1093/bioinformatics/btr181 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Note Lütjohann, Dominic S. Shah, Asmi H. Christen, Michael P. Richter, Florian Knese, Karsten Liebel, Urban ‘Sciencenet’—towards a global search and share engine for all scientific knowledge |
title | ‘Sciencenet’—towards a global search and share engine for all scientific knowledge |
title_full | ‘Sciencenet’—towards a global search and share engine for all scientific knowledge |
title_fullStr | ‘Sciencenet’—towards a global search and share engine for all scientific knowledge |
title_full_unstemmed | ‘Sciencenet’—towards a global search and share engine for all scientific knowledge |
title_short | ‘Sciencenet’—towards a global search and share engine for all scientific knowledge |
title_sort | ‘sciencenet’—towards a global search and share engine for all scientific knowledge |
topic | Applications Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3106183/ https://www.ncbi.nlm.nih.gov/pubmed/21493657 http://dx.doi.org/10.1093/bioinformatics/btr181 |
work_keys_str_mv | AT lutjohanndominics sciencenettowardsaglobalsearchandshareengineforallscientificknowledge AT shahasmih sciencenettowardsaglobalsearchandshareengineforallscientificknowledge AT christenmichaelp sciencenettowardsaglobalsearchandshareengineforallscientificknowledge AT richterflorian sciencenettowardsaglobalsearchandshareengineforallscientificknowledge AT knesekarsten sciencenettowardsaglobalsearchandshareengineforallscientificknowledge AT liebelurban sciencenettowardsaglobalsearchandshareengineforallscientificknowledge |