← Back to context

Comment by lamename

2 years ago

How are the predictions validated? Waiting for the old fashioned way for... very difficult crystal structure experiments? Or something else?

Most of them are not, just estimations based on previous results given sequences with known structure.

Every couple years there is a massive competition called CASP where labs submit previously unresolved protein structures derived from experimental EM, x-ray crystallography, or NMR studies and other labs attempt to predict these structures using their software. AlphaFold2 absolutely destroyed the other labs in the main contest (regular monomeric targets, predominantly globular) for structure resolution two years ago, in CASP 14.

https://predictioncenter.org/casp14/zscores_final.cgi

The latest contest, CASP15, is currently underway and expected to end this year. As with all ML, the usual caveats apply to the models Google generated -- the dangers of overfitting to existing structures, artifacts based on the way the problem was modelled, etc

> very difficult crystal structure experiments?

Apart from X-ray crystallography there are other methods for structure determination such as nuclear magnetic resonance (NMR) or cryo-electron microscopy (cryo-EM). The latter has seen a dramatic improvement in resolution over the last decade.

If the predictions are generally good enough, could also skip the validation and directly try to get a desired effect or reaction. That isn't strictly speaking validating the structure, but depending on the use case might be easier to just go for an outcome - really a question of application and cost efficiency.

  • I mean nothing is stopping you from skipping validation with pre-alphafold techniques and say for drug discovery to already do drug screening using the predicted structure. It's just the drug screening software is already error prone so you are still going to have to do some validation. However having an idea on a potential structure means that you can do other techniques that are simpler to validate it that are less expensive/time consuming (I'm thinking of things similar to FRET).

    Another idea is these may come into play for anti-verification, so if you are drug screening against a known structure. You could potentially use these more flawed structures of proteins you don't want to target but may be similar, and try to reduce the drug's efficacy at binding them. Or something to that effect. All of that is fun ideas that are currently being explored in that space but we'll see where it takes us.

For a lot of X-ray crystallography cases, some of the difficulty is working out with no prior information, the actual structure from the collected data. This makes a lot of that... much easier because with https://en.wikipedia.org/wiki/Molecular_replacement something that is "close, but not correct" can be used to bootstrap the actual structure from.