25 Facts About AlphaFold 2

1.

A team that used AlphaFold 2 repeated the first-place finish in the CASP competition in November 2020.

FactSnippet No. 1,628,454
2.

On 15 July 2021, the AlphaFold 2 paper was published in Nature as an advance access publication, alongside open source software and a searchable database of species proteomes.

FactSnippet No. 1,628,455
3.

AlphaFold first competed in CASP in 2018, using an artificial intelligence deep learning technique.

FactSnippet No. 1,628,456
4.

AlphaFold 1 was built on work developed by various teams in the 2010s. That work looked at the large databanks of related DNA sequences now available from many different organisms to find changes at different residues that appeared to be correlated, even though the residues were not consecutive in the main chain.

FactSnippet No. 1,628,457
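The idea of correlated changes at non-consecutive residues can be illustrated with a minimal sketch. It assumes a toy multiple sequence alignment and uses mutual information between alignment columns as the correlation measure. That measure, the function name `column_mutual_information`, and the example sequences are illustrative assumptions, not the method used by AlphaFold or by the earlier teams.

```python
import math
from collections import Counter


def column_mutual_information(msa, i, j):
    """Mutual information (in bits) between columns i and j of an alignment."""
    n = len(msa)
    col_i = [seq[i] for seq in msa]
    col_j = [seq[j] for seq in msa]
    count_i = Counter(col_i)               # residue counts in column i
    count_j = Counter(col_j)               # residue counts in column j
    count_ij = Counter(zip(col_i, col_j))  # joint counts of residue pairs
    mi = 0.0
    for (a, b), count in count_ij.items():
        p_ab = count / n
        p_a = count_i[a] / n
        p_b = count_j[b] / n
        mi += p_ab * math.log2(p_ab / (p_a * p_b))
    return mi


# Hypothetical toy alignment: columns 0 and 3 always change together,
# column 1 is fully conserved, column 2 varies independently of column 0.
toy_msa = ["AKLD", "AKVD", "EKLR", "EKVR"]

mi_coupled = column_mutual_information(toy_msa, 0, 3)
mi_conserved = column_mutual_information(toy_msa, 0, 1)
mi_independent = column_mutual_information(toy_msa, 0, 2)
```

A high value for a pair of columns far apart in the sequence hints that the two residues may be in contact in the folded structure. Real alignments would need corrections for sampling and phylogenetic bias that this sketch omits.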
5.

Central to AlphaFold 1 is a distance map predictor implemented as a very deep residual neural network with 220 residual blocks, processing a representation of dimensionality 64×64×128 that corresponds to input features calculated from two 64-amino-acid fragments.

FactSnippet No. 1,628,458
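To make the shapes concrete, here is a minimal NumPy sketch of a distance map and of a 64×64 crop taken from a pairwise feature tensor with 128 channels. The protein length, the random placeholder data, and the helper names `distance_map` and `crop_pair_features` are assumptions for illustration only; the network itself is not reproduced here.

```python
import numpy as np


def distance_map(coords):
    """Pairwise Euclidean distances for residue coordinates of shape (L, 3)."""
    diff = coords[:, None, :] - coords[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


def crop_pair_features(pair_features, start_i, start_j, size=64):
    """Cut a size x size window from an (L, L, C) pairwise feature tensor."""
    return pair_features[start_i:start_i + size, start_j:start_j + size, :]


rng = np.random.default_rng(0)
length = 200                                   # hypothetical protein length
coords = rng.normal(size=(length, 3)) * 10.0   # placeholder coordinates
dmap = distance_map(coords)                    # shape (200, 200), symmetric

# Placeholder pairwise features with 128 channels, as in the text.
pair_features = rng.normal(size=(length, length, 128))

# Rows cover residues 0..63 and columns cover residues 100..163,
# i.e. one crop relates two 64-residue fragments.
crop = crop_pair_features(pair_features, start_i=0, start_j=100)
```

The crop has shape (64, 64, 128), matching the dimensionality quoted above.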

6.

Alongside a distance map in the form of a very finely grained histogram of distances, AlphaFold 1 predicts Φ and Ψ angles for each residue, which are used to create the initial predicted 3D structure.

FactSnippet No. 1,628,459
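Both quantities can be sketched in a few lines of NumPy: assigning a distance to a histogram bin, and computing a backbone torsion angle from four atom positions. The bin range (2 to 22 Å), the bin count of 64, and the function names are assumptions chosen for illustration and are not taken from the text above.

```python
import numpy as np


def distance_to_bin(distance, d_min=2.0, d_max=22.0, n_bins=64):
    """Index of the histogram bin holding `distance` (range is an assumption)."""
    edges = np.linspace(d_min, d_max, n_bins + 1)
    idx = np.digitize(distance, edges) - 1
    # Distances outside the range fall into the first or last bin.
    return int(np.clip(idx, 0, n_bins - 1))


def dihedral(p0, p1, p2, p3):
    """Torsion angle in degrees defined by four points.

    Phi uses the atoms C(i-1), N(i), CA(i), C(i);
    psi uses the atoms N(i), CA(i), C(i), N(i+1).
    """
    b0 = p0 - p1
    b1 = p2 - p1
    b2 = p3 - p2
    b1 = b1 / np.linalg.norm(b1)
    # Project b0 and b2 onto the plane perpendicular to the central bond.
    v = b0 - np.dot(b0, b1) * b1
    w = b2 - np.dot(b2, b1) * b1
    x = np.dot(v, w)
    y = np.dot(np.cross(b1, v), w)
    return float(np.degrees(np.arctan2(y, x)))


# Four hypothetical points forming a right-angle twist about the z axis.
a = np.array([1.0, 0.0, 0.0])
b = np.array([0.0, 0.0, 0.0])
c = np.array([0.0, 0.0, 1.0])
d = np.array([0.0, 1.0, 1.0])

angle = dihedral(a, b, c, d)
bin_index = distance_to_bin(12.1)
```

A predicted histogram assigns a probability to every bin for each residue pair, which carries more information than a single contact or no-contact label.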
7.

The AlphaFold 1 authors concluded that the depth of the model, its large crop size, the large training set of roughly 29,000 proteins, modern deep learning techniques, and the richness of information from the predicted histogram of distances helped AlphaFold 1 achieve a high contact map prediction precision.

FactSnippet No. 1,628,460
8.

The software design used in AlphaFold 1 contained a number of modules, each trained separately, that were used to produce the guide potential, which was then combined with the physics-based energy potential.

FactSnippet No. 1,628,461
9.

AlphaFold 2 replaced this with a system of sub-networks coupled together into a single differentiable end-to-end model, based entirely on pattern recognition, which was trained in an integrated way as a single structure.

FactSnippet No. 1,628,462
10.

The AlphaFold team stated in November 2020 that they believed AlphaFold 2 could be further developed, with room for further improvements in accuracy.

FactSnippet No. 1,628,463
11.

The October 2021 update, named AlphaFold-Multimer, included protein complexes in its training data.

FactSnippet No. 1,628,464
12.

In December 2018, DeepMind's AlphaFold placed first in the overall rankings of the 13th Critical Assessment of Techniques for Protein Structure Prediction (CASP).

FactSnippet No. 1,628,465
13.

AlphaFold 1 gave the best prediction for 25 out of 43 protein targets in this class, achieving a median score of 58.

FactSnippet No. 1,628,466
14.

As of 5 March 2021, DeepMind had not announced plans to make its code publicly available.

FactSnippet No. 1,628,467
15.

In 2018, AlphaFold 1 had reached this level of accuracy in only two of its predictions.

FactSnippet No. 1,628,468
16.

AlphaFold 2 achieved an accuracy in modelling surface side chains described as "really really extraordinary".

FactSnippet No. 1,628,469
17.

Of the three structures that AlphaFold 2 had the least success in predicting, two had been obtained by protein NMR methods, which define protein structure directly in aqueous solution, whereas AlphaFold 2 was mostly trained on protein structures in crystals.

FactSnippet No. 1,628,470
18.

AlphaFold 2's score of more than 90 on CASP's global distance test (GDT) is considered a significant achievement in computational biology and great progress towards a decades-old grand challenge of biology.

FactSnippet No. 1,628,471
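For readers unfamiliar with the global distance test, the commonly used GDT_TS variant averages the percentage of residues whose predicted position lies within 1, 2, 4 and 8 Å of the experimental position. The sketch below is a simplified illustration: it assumes the two structures are already superimposed, whereas the real procedure searches over many superpositions. The function name `gdt_ts` and the example coordinates are hypothetical.

```python
import numpy as np


def gdt_ts(predicted, reference, thresholds=(1.0, 2.0, 4.0, 8.0)):
    """Simplified GDT_TS for two pre-aligned (N, 3) coordinate arrays."""
    # Per-residue deviation between predicted and reference positions.
    deviations = np.linalg.norm(predicted - reference, axis=1)
    # Fraction of residues within each distance threshold.
    fractions = [(deviations <= t).mean() for t in thresholds]
    return 100.0 * float(np.mean(fractions))


# Hypothetical example: four residues displaced by 0.5, 1.5, 3.0 and 10.0 Å.
reference = np.zeros((4, 3))
predicted = np.array([
    [0.5, 0.0, 0.0],
    [1.5, 0.0, 0.0],
    [3.0, 0.0, 0.0],
    [10.0, 0.0, 0.0],
])

score = gdt_ts(predicted, reference)   # 56.25 for this toy case
perfect = gdt_ts(reference, reference) # 100.0 for identical structures
```

On this scale a score of 100 means every residue is within 1 Å of its experimental position.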
19.

However, it is not yet clear to what extent structure predictions made by AlphaFold 2 will hold up for proteins bound into complexes with other proteins and other molecules.

FactSnippet No. 1,628,472
20.

AlphaFold 2's predictions tended to be least refined and least reliable for proteins that had strong interactions, either with other copies of themselves or with other structures.

FactSnippet No. 1,628,473
21.

However, the authors highlighted that many AlphaFold 2 models were accurate enough to allow for the introduction of post-predictional modifications.

FactSnippet No. 1,628,474
22.

At launch, the database contained AlphaFold 2-predicted models of protein structures for nearly the full UniProt proteome of humans and 20 model organisms, amounting to over 365,000 proteins.

FactSnippet No. 1,628,475
23.

The AlphaFold team planned to add more sequences to the collection, with the initial goal of covering most of the UniRef90 set of more than 100 million proteins.

FactSnippet No. 1,628,476
24.

AlphaFold DB uses a monomeric model similar to the CASP14 version.

FactSnippet No. 1,628,477
25.

AlphaFold 2 has been used to predict structures of proteins of SARS-CoV-2, the causative agent of COVID-19.

FactSnippet No. 1,628,478
