Cargando…

Large Scale Explorative Oligonucleotide Probe Selection for Thousands of Genetic Groups on a Computing Grid: Application to Phylogenetic Probe Design Using a Curated Small Subunit Ribosomal RNA Gene Database

Phylogenetic Oligonucleotide Arrays (POAs) were recently adapted for studying the huge microbial communities in a flexible and easy-to-use way. POA coupled with the use of explorative probes to detect the unknown part is now one of the most powerful approaches for a better understanding of microbial...

Descripción completa

Detalles Bibliográficos
Autores principales: Jaziri, Faouzi, Peyretaillade, Eric, Missaoui, Mohieddine, Parisot, Nicolas, Cipière, Sébastien, Denonfoux, Jérémie, Mahul, Antoine, Peyret, Pierre, Hill, David R. C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3913353/
https://www.ncbi.nlm.nih.gov/pubmed/24516366
http://dx.doi.org/10.1155/2014/350487
_version_ 1782302209942224896
author Jaziri, Faouzi
Peyretaillade, Eric
Missaoui, Mohieddine
Parisot, Nicolas
Cipière, Sébastien
Denonfoux, Jérémie
Mahul, Antoine
Peyret, Pierre
Hill, David R. C.
author_facet Jaziri, Faouzi
Peyretaillade, Eric
Missaoui, Mohieddine
Parisot, Nicolas
Cipière, Sébastien
Denonfoux, Jérémie
Mahul, Antoine
Peyret, Pierre
Hill, David R. C.
author_sort Jaziri, Faouzi
collection PubMed
description Phylogenetic Oligonucleotide Arrays (POAs) were recently adapted for studying the huge microbial communities in a flexible and easy-to-use way. POA coupled with the use of explorative probes to detect the unknown part is now one of the most powerful approaches for a better understanding of microbial community functioning. However, the selection of probes remains a very difficult task. The rapid growth of environmental databases has led to an exponential increase of data to be managed for an efficient design. Consequently, the use of high performance computing facilities is mandatory. In this paper, we present an efficient parallelization method to select known and explorative oligonucleotide probes at large scale using computing grids. We implemented a software that generates and monitors thousands of jobs over the European Computing Grid Infrastructure (EGI). We also developed a new algorithm for the construction of a high-quality curated phylogenetic database to avoid erroneous design due to bad sequence affiliation. We present here the performance and statistics of our method on real biological datasets based on a phylogenetic prokaryotic database at the genus level and a complete design of about 20,000 probes for 2,069 genera of prokaryotes.
format Online
Article
Text
id pubmed-3913353
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-39133532014-02-10 Large Scale Explorative Oligonucleotide Probe Selection for Thousands of Genetic Groups on a Computing Grid: Application to Phylogenetic Probe Design Using a Curated Small Subunit Ribosomal RNA Gene Database Jaziri, Faouzi Peyretaillade, Eric Missaoui, Mohieddine Parisot, Nicolas Cipière, Sébastien Denonfoux, Jérémie Mahul, Antoine Peyret, Pierre Hill, David R. C. ScientificWorldJournal Research Article Phylogenetic Oligonucleotide Arrays (POAs) were recently adapted for studying the huge microbial communities in a flexible and easy-to-use way. POA coupled with the use of explorative probes to detect the unknown part is now one of the most powerful approaches for a better understanding of microbial community functioning. However, the selection of probes remains a very difficult task. The rapid growth of environmental databases has led to an exponential increase of data to be managed for an efficient design. Consequently, the use of high performance computing facilities is mandatory. In this paper, we present an efficient parallelization method to select known and explorative oligonucleotide probes at large scale using computing grids. We implemented a software that generates and monitors thousands of jobs over the European Computing Grid Infrastructure (EGI). We also developed a new algorithm for the construction of a high-quality curated phylogenetic database to avoid erroneous design due to bad sequence affiliation. We present here the performance and statistics of our method on real biological datasets based on a phylogenetic prokaryotic database at the genus level and a complete design of about 20,000 probes for 2,069 genera of prokaryotes. Hindawi Publishing Corporation 2014-01-06 /pmc/articles/PMC3913353/ /pubmed/24516366 http://dx.doi.org/10.1155/2014/350487 Text en Copyright © 2014 Faouzi Jaziri et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Jaziri, Faouzi
Peyretaillade, Eric
Missaoui, Mohieddine
Parisot, Nicolas
Cipière, Sébastien
Denonfoux, Jérémie
Mahul, Antoine
Peyret, Pierre
Hill, David R. C.
Large Scale Explorative Oligonucleotide Probe Selection for Thousands of Genetic Groups on a Computing Grid: Application to Phylogenetic Probe Design Using a Curated Small Subunit Ribosomal RNA Gene Database
title Large Scale Explorative Oligonucleotide Probe Selection for Thousands of Genetic Groups on a Computing Grid: Application to Phylogenetic Probe Design Using a Curated Small Subunit Ribosomal RNA Gene Database
title_full Large Scale Explorative Oligonucleotide Probe Selection for Thousands of Genetic Groups on a Computing Grid: Application to Phylogenetic Probe Design Using a Curated Small Subunit Ribosomal RNA Gene Database
title_fullStr Large Scale Explorative Oligonucleotide Probe Selection for Thousands of Genetic Groups on a Computing Grid: Application to Phylogenetic Probe Design Using a Curated Small Subunit Ribosomal RNA Gene Database
title_full_unstemmed Large Scale Explorative Oligonucleotide Probe Selection for Thousands of Genetic Groups on a Computing Grid: Application to Phylogenetic Probe Design Using a Curated Small Subunit Ribosomal RNA Gene Database
title_short Large Scale Explorative Oligonucleotide Probe Selection for Thousands of Genetic Groups on a Computing Grid: Application to Phylogenetic Probe Design Using a Curated Small Subunit Ribosomal RNA Gene Database
title_sort large scale explorative oligonucleotide probe selection for thousands of genetic groups on a computing grid: application to phylogenetic probe design using a curated small subunit ribosomal rna gene database
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3913353/
https://www.ncbi.nlm.nih.gov/pubmed/24516366
http://dx.doi.org/10.1155/2014/350487
work_keys_str_mv AT jazirifaouzi largescaleexplorativeoligonucleotideprobeselectionforthousandsofgeneticgroupsonacomputinggridapplicationtophylogeneticprobedesignusingacuratedsmallsubunitribosomalrnagenedatabase
AT peyretailladeeric largescaleexplorativeoligonucleotideprobeselectionforthousandsofgeneticgroupsonacomputinggridapplicationtophylogeneticprobedesignusingacuratedsmallsubunitribosomalrnagenedatabase
AT missaouimohieddine largescaleexplorativeoligonucleotideprobeselectionforthousandsofgeneticgroupsonacomputinggridapplicationtophylogeneticprobedesignusingacuratedsmallsubunitribosomalrnagenedatabase
AT parisotnicolas largescaleexplorativeoligonucleotideprobeselectionforthousandsofgeneticgroupsonacomputinggridapplicationtophylogeneticprobedesignusingacuratedsmallsubunitribosomalrnagenedatabase
AT cipieresebastien largescaleexplorativeoligonucleotideprobeselectionforthousandsofgeneticgroupsonacomputinggridapplicationtophylogeneticprobedesignusingacuratedsmallsubunitribosomalrnagenedatabase
AT denonfouxjeremie largescaleexplorativeoligonucleotideprobeselectionforthousandsofgeneticgroupsonacomputinggridapplicationtophylogeneticprobedesignusingacuratedsmallsubunitribosomalrnagenedatabase
AT mahulantoine largescaleexplorativeoligonucleotideprobeselectionforthousandsofgeneticgroupsonacomputinggridapplicationtophylogeneticprobedesignusingacuratedsmallsubunitribosomalrnagenedatabase
AT peyretpierre largescaleexplorativeoligonucleotideprobeselectionforthousandsofgeneticgroupsonacomputinggridapplicationtophylogeneticprobedesignusingacuratedsmallsubunitribosomalrnagenedatabase
AT hilldavidrc largescaleexplorativeoligonucleotideprobeselectionforthousandsofgeneticgroupsonacomputinggridapplicationtophylogeneticprobedesignusingacuratedsmallsubunitribosomalrnagenedatabase