Cargando…

QUAM-AFM: A Free Database for Molecular Identification by Atomic Force Microscopy

[Image: see text] This paper introduces Quasar Science Resources–Autonomous University of Madrid atomic force microscopy image data set (QUAM-AFM), the largest data set of simulated atomic force microscopy (AFM) images generated from a selection of 685,513 molecules that span the most relevant bondi...

Descripción completa

Detalles Bibliográficos
Autores principales: Carracedo-Cosme, Jaime, Romero-Muñiz, Carlos, Pou, Pablo, Pérez, Rubén
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2022
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9942089/
https://www.ncbi.nlm.nih.gov/pubmed/35234034
http://dx.doi.org/10.1021/acs.jcim.1c01323
_version_ 1784891418975666176
author Carracedo-Cosme, Jaime
Romero-Muñiz, Carlos
Pou, Pablo
Pérez, Rubén
author_facet Carracedo-Cosme, Jaime
Romero-Muñiz, Carlos
Pou, Pablo
Pérez, Rubén
author_sort Carracedo-Cosme, Jaime
collection PubMed
description [Image: see text] This paper introduces Quasar Science Resources–Autonomous University of Madrid atomic force microscopy image data set (QUAM-AFM), the largest data set of simulated atomic force microscopy (AFM) images generated from a selection of 685,513 molecules that span the most relevant bonding structures and chemical species in organic chemistry. QUAM-AFM contains, for each molecule, 24 3D image stacks, each consisting of constant-height images simulated for 10 tip–sample distances with a different combination of AFM operational parameters, resulting in a total of 165 million images with a resolution of 256 × 256 pixels. The 3D stacks are especially appropriate to tackle the goal of the chemical identification within AFM experiments by using deep learning techniques. The data provided for each molecule include, besides a set of AFM images, ball-and-stick depictions, IUPAC names, chemical formulas, atomic coordinates, and map of atom heights. In order to simplify the use of the collection as a source of information, we have developed a graphical user interface that allows the search for structures by CID number, IUPAC name, or chemical formula.
format Online
Article
Text
id pubmed-9942089
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-99420892023-02-22 QUAM-AFM: A Free Database for Molecular Identification by Atomic Force Microscopy Carracedo-Cosme, Jaime Romero-Muñiz, Carlos Pou, Pablo Pérez, Rubén J Chem Inf Model [Image: see text] This paper introduces Quasar Science Resources–Autonomous University of Madrid atomic force microscopy image data set (QUAM-AFM), the largest data set of simulated atomic force microscopy (AFM) images generated from a selection of 685,513 molecules that span the most relevant bonding structures and chemical species in organic chemistry. QUAM-AFM contains, for each molecule, 24 3D image stacks, each consisting of constant-height images simulated for 10 tip–sample distances with a different combination of AFM operational parameters, resulting in a total of 165 million images with a resolution of 256 × 256 pixels. The 3D stacks are especially appropriate to tackle the goal of the chemical identification within AFM experiments by using deep learning techniques. The data provided for each molecule include, besides a set of AFM images, ball-and-stick depictions, IUPAC names, chemical formulas, atomic coordinates, and map of atom heights. In order to simplify the use of the collection as a source of information, we have developed a graphical user interface that allows the search for structures by CID number, IUPAC name, or chemical formula. American Chemical Society 2022-03-02 /pmc/articles/PMC9942089/ /pubmed/35234034 http://dx.doi.org/10.1021/acs.jcim.1c01323 Text en © 2022 American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Carracedo-Cosme, Jaime
Romero-Muñiz, Carlos
Pou, Pablo
Pérez, Rubén
QUAM-AFM: A Free Database for Molecular Identification by Atomic Force Microscopy
title QUAM-AFM: A Free Database for Molecular Identification by Atomic Force Microscopy
title_full QUAM-AFM: A Free Database for Molecular Identification by Atomic Force Microscopy
title_fullStr QUAM-AFM: A Free Database for Molecular Identification by Atomic Force Microscopy
title_full_unstemmed QUAM-AFM: A Free Database for Molecular Identification by Atomic Force Microscopy
title_short QUAM-AFM: A Free Database for Molecular Identification by Atomic Force Microscopy
title_sort quam-afm: a free database for molecular identification by atomic force microscopy
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9942089/
https://www.ncbi.nlm.nih.gov/pubmed/35234034
http://dx.doi.org/10.1021/acs.jcim.1c01323
work_keys_str_mv AT carracedocosmejaime quamafmafreedatabaseformolecularidentificationbyatomicforcemicroscopy
AT romeromunizcarlos quamafmafreedatabaseformolecularidentificationbyatomicforcemicroscopy
AT poupablo quamafmafreedatabaseformolecularidentificationbyatomicforcemicroscopy
AT perezruben quamafmafreedatabaseformolecularidentificationbyatomicforcemicroscopy