Cargando…

Assessing Fairness of AlphaFold2 Prediction of Protein 3D Structures

AlphaFold2 is reshaping biomedical research by enabling the prediction of a protein’s 3D structure solely based on its amino acid sequence. This breakthrough reduces reliance on labor-intensive experimental methods traditionally used to obtain protein structures, thereby accelerating the pace of sci...

Descripción completa

Detalles Bibliográficos
Autores principales:	Abbas, Usman, Chen, Jin, Shao, Qing
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Cold Spring Harbor Laboratory 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10245900/ https://www.ncbi.nlm.nih.gov/pubmed/37293014 http://dx.doi.org/10.1101/2023.05.23.542006

_version_	1785054942826856448
author	Abbas, Usman Chen, Jin Shao, Qing
author_facet	Abbas, Usman Chen, Jin Shao, Qing
author_sort	Abbas, Usman
collection	PubMed
description	AlphaFold2 is reshaping biomedical research by enabling the prediction of a protein’s 3D structure solely based on its amino acid sequence. This breakthrough reduces reliance on labor-intensive experimental methods traditionally used to obtain protein structures, thereby accelerating the pace of scientific discovery. Despite the bright future, it remains unclear whether AlphaFold2 can uniformly predict the wide spectrum of proteins equally well. Systematic investigation into the fairness and unbiased nature of its predictions is still an area yet to be thoroughly explored. In this paper, we conducted an in-depth analysis of AlphaFold2’s fairness using data comprised of five million reported protein structures from its open-access repository. Specifically, we assessed the variability in the distribution of PLDDT scores, considering factors such as amino acid type, secondary structure, and sequence length. Our findings reveal a systematic discrepancy in AlphaFold2’s predictive reliability, varying across different types of amino acids and secondary structures. Furthermore, we observed that the size of the protein exerts a notable impact on the credibility of the 3D structural prediction. AlphaFold2 demonstrates enhanced prediction power for proteins of medium size compared to those that are either smaller or larger. These systematic biases could potentially stem from inherent biases present in its training data and model architecture. These factors need to be taken into account when expanding the applicability of AlphaFold2.
format	Online Article Text
id	pubmed-10245900
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Cold Spring Harbor Laboratory
record_format	MEDLINE/PubMed
spelling	pubmed-102459002023-06-08 Assessing Fairness of AlphaFold2 Prediction of Protein 3D Structures Abbas, Usman Chen, Jin Shao, Qing bioRxiv Article AlphaFold2 is reshaping biomedical research by enabling the prediction of a protein’s 3D structure solely based on its amino acid sequence. This breakthrough reduces reliance on labor-intensive experimental methods traditionally used to obtain protein structures, thereby accelerating the pace of scientific discovery. Despite the bright future, it remains unclear whether AlphaFold2 can uniformly predict the wide spectrum of proteins equally well. Systematic investigation into the fairness and unbiased nature of its predictions is still an area yet to be thoroughly explored. In this paper, we conducted an in-depth analysis of AlphaFold2’s fairness using data comprised of five million reported protein structures from its open-access repository. Specifically, we assessed the variability in the distribution of PLDDT scores, considering factors such as amino acid type, secondary structure, and sequence length. Our findings reveal a systematic discrepancy in AlphaFold2’s predictive reliability, varying across different types of amino acids and secondary structures. Furthermore, we observed that the size of the protein exerts a notable impact on the credibility of the 3D structural prediction. AlphaFold2 demonstrates enhanced prediction power for proteins of medium size compared to those that are either smaller or larger. These systematic biases could potentially stem from inherent biases present in its training data and model architecture. These factors need to be taken into account when expanding the applicability of AlphaFold2. Cold Spring Harbor Laboratory 2023-05-31 /pmc/articles/PMC10245900/ /pubmed/37293014 http://dx.doi.org/10.1101/2023.05.23.542006 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle	Article Abbas, Usman Chen, Jin Shao, Qing Assessing Fairness of AlphaFold2 Prediction of Protein 3D Structures
title	Assessing Fairness of AlphaFold2 Prediction of Protein 3D Structures
title_full	Assessing Fairness of AlphaFold2 Prediction of Protein 3D Structures
title_fullStr	Assessing Fairness of AlphaFold2 Prediction of Protein 3D Structures
title_full_unstemmed	Assessing Fairness of AlphaFold2 Prediction of Protein 3D Structures
title_short	Assessing Fairness of AlphaFold2 Prediction of Protein 3D Structures
title_sort	assessing fairness of alphafold2 prediction of protein 3d structures
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10245900/ https://www.ncbi.nlm.nih.gov/pubmed/37293014 http://dx.doi.org/10.1101/2023.05.23.542006
work_keys_str_mv	AT abbasusman assessingfairnessofalphafold2predictionofprotein3dstructures AT chenjin assessingfairnessofalphafold2predictionofprotein3dstructures AT shaoqing assessingfairnessofalphafold2predictionofprotein3dstructures

Assessing Fairness of AlphaFold2 Prediction of Protein 3D Structures

Ejemplares similares