Accurate cryo-EM protein particle picking by integrating the foundational AI image segmentation model and specialized U-Net

Cryo-electron microscopy (cryo-EM) has revolutionized the field of structural biology by enabling the precise determination of large protein structures. Picking protein particles in cryo-EM micrographs (images) is a crucial step in the cryo-EM-based structure determination. However, existing methods...

Bibliographic Details
Main Authors: Gyawali, Rajan; Dhakal, Ashwin; Wang, Liguo; Cheng, Jianlin
Format: Online Article Text
Language: English
Published: Cold Spring Harbor Laboratory 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10592924/
https://www.ncbi.nlm.nih.gov/pubmed/37873264
http://dx.doi.org/10.1101/2023.10.02.560572
author Gyawali, Rajan
Dhakal, Ashwin
Wang, Liguo
Cheng, Jianlin
collection PubMed
description Cryo-electron microscopy (cryo-EM) has revolutionized the field of structural biology by enabling the precise determination of large protein structures. Picking protein particles in cryo-EM micrographs (images) is a crucial step in cryo-EM-based structure determination. However, existing methods trained on a limited amount of cryo-EM data still cannot accurately pick protein particles from complex, noisy, and heterogeneous cryo-EM images. General foundational artificial intelligence (AI)-based image segmentation models such as the Segment Anything Model (SAM), trained on huge amounts of general image data, cannot segment protein particles well because their training data do not include cryo-EM images. In this work, we present a novel approach (CryoSegNet) that integrates SAM with an attention-gated U-shaped network (U-Net), an encoder-decoder architecture specially designed and trained for cryo-EM particle picking. The U-Net is first trained on a large cryo-EM image dataset and then used to generate input from original cryo-EM images for SAM to pick particles. CryoSegNet shows both high precision and high recall in segmenting protein particles from cryo-EM micrographs, irrespective of protein type, shape, and size. On several independent datasets of various protein types, CryoSegNet outperforms two top machine learning particle pickers, crYOLO and Topaz, as well as SAM itself. The average resolution of density maps reconstructed from the particles picked by CryoSegNet is 3.05 Å, 15% better than the 3.60 Å of Topaz and 49% better than the 5.96 Å of crYOLO. Therefore, CryoSegNet can be applied to enhance the resolution of protein structures constructed from both existing and new cryo-EM data.
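The abstract does not state how the 15% and 49% figures are computed. A minimal sketch, assuming they are relative reductions in the resolution value (a smaller Å value means a finer map), reproduces both numbers from the three averages quoted above. The function name `relative_improvement` is hypothetical and used only for illustration.

```python
def relative_improvement(baseline: float, new: float) -> float:
    """Percent reduction of `new` relative to `baseline`.

    Assumption (not stated in the abstract): "X% better" means the
    resolution value in angstroms dropped by X% of the baseline value.
    """
    return (baseline - new) / baseline * 100.0


# Average resolutions (in angstroms) quoted in the abstract.
cryosegnet = 3.05
topaz = 3.60
cryolo = 5.96

vs_topaz = relative_improvement(topaz, cryosegnet)
vs_cryolo = relative_improvement(cryolo, cryosegnet)

print(f"CryoSegNet vs Topaz:  {vs_topaz:.1f}% lower resolution value")
print(f"CryoSegNet vs crYOLO: {vs_cryolo:.1f}% lower resolution value")
```

Under this assumption the rounded results match the abstract's 15% and 49%.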
format Online
Article
Text
id pubmed-10592924
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-10592924 2023-10-24 bioRxiv Article
Cold Spring Harbor Laboratory 2023-10-03 /pmc/articles/PMC10592924/ /pubmed/37873264 http://dx.doi.org/10.1101/2023.10.02.560572 Text en https://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/), which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
title Accurate cryo-EM protein particle picking by integrating the foundational AI image segmentation model and specialized U-Net
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10592924/
https://www.ncbi.nlm.nih.gov/pubmed/37873264
http://dx.doi.org/10.1101/2023.10.02.560572