Cargando…

Smiles2Monomers: a link between chemical and biological structures for polymers

BACKGROUND: The monomeric composition of polymers is powerful for structure comparison and synthetic biology, among others. Many databases give access to the atomic structure of compounds but the monomeric structure of polymers is often lacking. We have designed a smart algorithm, implemented in the...

Descripción completa

Detalles Bibliográficos
Autores principales: Dufresne, Yoann, Noé, Laurent, Leclère, Valérie, Pupin, Maude
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4693424/
https://www.ncbi.nlm.nih.gov/pubmed/26715946
http://dx.doi.org/10.1186/s13321-015-0111-5
_version_ 1782407386809499648
author Dufresne, Yoann
Noé, Laurent
Leclère, Valérie
Pupin, Maude
author_facet Dufresne, Yoann
Noé, Laurent
Leclère, Valérie
Pupin, Maude
author_sort Dufresne, Yoann
collection PubMed
description BACKGROUND: The monomeric composition of polymers is powerful for structure comparison and synthetic biology, among others. Many databases give access to the atomic structure of compounds but the monomeric structure of polymers is often lacking. We have designed a smart algorithm, implemented in the tool Smiles2Monomers (s2m), to infer efficiently and accurately the monomeric structure of a polymer from its chemical structure. RESULTS: Our strategy is divided into two steps: first, monomers are mapped on the atomic structure by an efficient subgraph-isomorphism algorithm ; second, the best tiling is computed so that non-overlapping monomers cover all the structure of the target polymer. The mapping is based on a Markovian index built by a dynamic programming algorithm. The index enables s2m to search quickly all the given monomers on a target polymer. After, a greedy algorithm combines the mapped monomers into a consistent monomeric structure. Finally, a local branch and cut algorithm refines the structure. We tested this method on two manually annotated databases of polymers and reconstructed the structures de novo with a sensitivity over 90 %. The average computation time per polymer is 2 s. CONCLUSION: s2m automatically creates de novo monomeric annotations for polymers, efficiently in terms of time computation and sensitivity. s2m allowed us to detect annotation errors in the tested databases and to easily find the accurate structures. So, s2m could be integrated into the curation process of databases of small compounds to verify the current entries and accelerate the annotation of new polymers. The full method can be downloaded or accessed via a website for peptide-like polymers at http://bioinfo.lifl.fr/norine/smiles2monomers.jsp. [Figure: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13321-015-0111-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4693424
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-46934242015-12-30 Smiles2Monomers: a link between chemical and biological structures for polymers Dufresne, Yoann Noé, Laurent Leclère, Valérie Pupin, Maude J Cheminform Research Article BACKGROUND: The monomeric composition of polymers is powerful for structure comparison and synthetic biology, among others. Many databases give access to the atomic structure of compounds but the monomeric structure of polymers is often lacking. We have designed a smart algorithm, implemented in the tool Smiles2Monomers (s2m), to infer efficiently and accurately the monomeric structure of a polymer from its chemical structure. RESULTS: Our strategy is divided into two steps: first, monomers are mapped on the atomic structure by an efficient subgraph-isomorphism algorithm ; second, the best tiling is computed so that non-overlapping monomers cover all the structure of the target polymer. The mapping is based on a Markovian index built by a dynamic programming algorithm. The index enables s2m to search quickly all the given monomers on a target polymer. After, a greedy algorithm combines the mapped monomers into a consistent monomeric structure. Finally, a local branch and cut algorithm refines the structure. We tested this method on two manually annotated databases of polymers and reconstructed the structures de novo with a sensitivity over 90 %. The average computation time per polymer is 2 s. CONCLUSION: s2m automatically creates de novo monomeric annotations for polymers, efficiently in terms of time computation and sensitivity. s2m allowed us to detect annotation errors in the tested databases and to easily find the accurate structures. So, s2m could be integrated into the curation process of databases of small compounds to verify the current entries and accelerate the annotation of new polymers. The full method can be downloaded or accessed via a website for peptide-like polymers at http://bioinfo.lifl.fr/norine/smiles2monomers.jsp. [Figure: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13321-015-0111-5) contains supplementary material, which is available to authorized users. Springer International Publishing 2015-12-29 /pmc/articles/PMC4693424/ /pubmed/26715946 http://dx.doi.org/10.1186/s13321-015-0111-5 Text en © Dufresne et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Dufresne, Yoann
Noé, Laurent
Leclère, Valérie
Pupin, Maude
Smiles2Monomers: a link between chemical and biological structures for polymers
title Smiles2Monomers: a link between chemical and biological structures for polymers
title_full Smiles2Monomers: a link between chemical and biological structures for polymers
title_fullStr Smiles2Monomers: a link between chemical and biological structures for polymers
title_full_unstemmed Smiles2Monomers: a link between chemical and biological structures for polymers
title_short Smiles2Monomers: a link between chemical and biological structures for polymers
title_sort smiles2monomers: a link between chemical and biological structures for polymers
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4693424/
https://www.ncbi.nlm.nih.gov/pubmed/26715946
http://dx.doi.org/10.1186/s13321-015-0111-5
work_keys_str_mv AT dufresneyoann smiles2monomersalinkbetweenchemicalandbiologicalstructuresforpolymers
AT noelaurent smiles2monomersalinkbetweenchemicalandbiologicalstructuresforpolymers
AT leclerevalerie smiles2monomersalinkbetweenchemicalandbiologicalstructuresforpolymers
AT pupinmaude smiles2monomersalinkbetweenchemicalandbiologicalstructuresforpolymers