AlphaFold Models of Small Proteins Rival the Accuracy of Solution NMR Structures
Recent advances in molecular modeling using deep learning have the potential to revolutionize the field of structural biology. In particular, AlphaFold has been observed to provide models of protein structures with accuracies rivaling medium-resolution X-raycrystal structures, and with excellent atomic coordinate matches to experimental proteinNMR and cryo-electron microscopy structures. Here we assess the hypothesis that AlphaFold models of small, relatively rigid proteins have accuracies (based oncomparison against experimental data) similar to experimental solution NMR structures. We selected six representative small proteins with structures determined by both NMR and X-ray crystallography, and modeled each of them using AlphaFold. Using several structure validation tools integrated under the Protein Structure Validation Softwaresuite (PSVS), we then assessed how well these models fit to experimental NMR data, including NOESY peak lists (RPF-DP scores), comparisons between predicted rigidity andchemical shift data (ANSURR scores), and 15N-1H residual dipolar coupling data (RDC Qfactors) analyzed by software tools integrated in the PSVS suite. Remarkably, the fits to NMR data for the protein structure models predicted with AlphaFold are generally similar,or better, than for the corresponding experimental NMR or X-ray crystal structures. Similarconclusions were reached in comparing AlphaFold2 predictions and NMR structures forthree targets from the Critical Assessment of Protein Structure Prediction (CASP). Theseresults contradict the widely held misperception that AlphaFold cannot accurately model solution NMR structures. They also document the value of PSVS for model vs. data assessment of protein NMR structures, and the potential for using AlphaFold models for guiding analysis of experimental NMR data and more generally in structural biology.