Corante

About this Author
DBL%20Hendrix%20small.png College chemistry, 1983

Derek Lowe The 2002 Model

Dbl%20new%20portrait%20B%26W.png After 10 years of blogging. . .

Derek Lowe, an Arkansan by birth, got his BA from Hendrix College and his PhD in organic chemistry from Duke before spending time in Germany on a Humboldt Fellowship on his post-doc. He's worked for several major pharmaceutical companies since 1989 on drug discovery projects against schizophrenia, Alzheimer's, diabetes, osteoporosis and other diseases. To contact Derek email him directly: derekb.lowe@gmail.com Twitter: Dereklowe

Chemistry and Drug Data: Drugbank
Emolecules
ChemSpider
Chempedia Lab
Synthetic Pages
Organic Chemistry Portal
PubChem
Not Voodoo
DailyMed
Druglib
Clinicaltrials.gov

Chemistry and Pharma Blogs:
Org Prep Daily
The Haystack
Kilomentor
A New Merck, Reviewed
Liberal Arts Chemistry
Electron Pusher
All Things Metathesis
C&E News Blogs
Chemiotics II
Chemical Space
Noel O'Blog
In Vivo Blog
Terra Sigilatta
BBSRC/Douglas Kell
ChemBark
Realizations in Biostatistics
Chemjobber
Pharmalot
ChemSpider Blog
Pharmagossip
Med-Chemist
Organic Chem - Education & Industry
Pharma Strategy Blog
No Name No Slogan
Practical Fragments
SimBioSys
The Curious Wavefunction
Natural Product Man
Fragment Literature
Chemistry World Blog
Synthetic Nature
Chemistry Blog
Synthesizing Ideas
Business|Bytes|Genes|Molecules
Eye on FDA
Chemical Forums
Depth-First
Symyx Blog
Sceptical Chymist
Lamentations on Chemistry
Computational Organic Chemistry
Mining Drugs
Henry Rzepa


Science Blogs and News:
Bad Science
The Loom
Uncertain Principles
Fierce Biotech
Blogs for Industry
Omics! Omics!
Young Female Scientist
Notional Slurry
Nobel Intent
SciTech Daily
Science Blog
FuturePundit
Aetiology
Gene Expression (I)
Gene Expression (II)
Sciencebase
Pharyngula
Adventures in Ethics and Science
Transterrestrial Musings
Slashdot Science
Cosmic Variance
Biology News Net


Medical Blogs
DB's Medical Rants
Science-Based Medicine
GruntDoc
Respectful Insolence
Diabetes Mine


Economics and Business
Marginal Revolution
The Volokh Conspiracy
Knowledge Problem


Politics / Current Events
Virginia Postrel
Instapundit
Belmont Club
Mickey Kaus


Belles Lettres
Uncouth Reflections
Arts and Letters Daily
In the Pipeline: Don't miss Derek Lowe's excellent commentary on drug discovery and the pharma industry in general at In the Pipeline

In the Pipeline

« Reactive Oxygen Species Are Your Friends! | Main | Another Reactive Oxygen Paper »

January 14, 2013

Virtual Screening, The Good Parts and the Bad Ones

Email This Entry

Posted by Derek

Virtual screening is what many people outside the field are thinking of when they talk about the use of computational models in drug discovery. There are many other places where modeling can pitch in, but one of the dreams has always been to take a given protein target and a long list of chemical structures, hit the button, and come back to a sorted list of which ones are going to bind well. That list could be as long as "every compound in our screening deck", or "every available compound in the commercial catalogs", or "everything that our chemists can think of and draw on a whiteboard, whether it's ever been made or not". So these virtual collections can get rather large, but that's what computer power is for, right?

Despite what some people might think, we're not exactly there yet. But we're not exactly not there, either, if you know what I mean. Like much of drug discovery, it's in that awkward age. Virtual screening is certainly real, and it can be useful, but it can also waste your time if you're not careful. And that's where this paper comes in - it's a fine overview of the issue that you need to think about if you're interested in trying this technique.

For one thing, you need to decide if you're going to be taking a drug target whose structure you know pretty well and modeling a bunch of small compounds into it, or if you're taking a bunch of small molecules whose activities you know pretty well and trying to find more compounds like them. These two approaches call for some different methods, and have different potential problems. The second one, especially in the older literature, often goes under the name of QSAR, for quantitative structure-activity relationship. But as the authors point out, "virtual screening" as a name has some advantages, because many people have been burned by things labeled "QSAR" over the years. They're also being used for different purposes, which is probably a good thing:

A fundamental assumption inherent in QSAR and pharmacophore-based VS is the “similar property principle”, that is, the general observation that molecules with similar structure are likely to have similar properties. While this assumption holds true in many cases, there are many counter-examples in the field of QSAR which lead to erroneous predictions and can shake the confidence of the experimental community in the prospective utility of QSAR modeling. Interestingly, this has not yet (or not to the same extent) been the case with VS. The difference is that QSAR is typically employed to evaluate a limited number of synthetic candidates, where errors are more noticeable and costly. However, when these techniques are applied on a massive scale to screen large chemical libraries, errors are much more easily tolerated as the objective is to increase the number and diversity of hits over what would have been otherwise a random selection.

The authors extensively cover the previous literature on computational screening - successful examples, warnings of trouble, theoretical predictions both optimistic and pessimistic. It would take you quite a while to assemble this list on your own, so that by itself recommends this paper to anyone interested in the area. But they go on to codify the various pitfalls to look out for.

"Such as expecting it to work", the cynics in the audience will remark. I say that sort of thing under my breath for time to time myself - or audibly, as the case may be. But this is the sort of paper that I can really endorse, because it's a completely realistic view of what you can expect with current technology. And that comes down to "Less than you want", but still "More than you might think". You're not going to able to feed the software the complete pile of all the chemical supplier catalogs and come back to find the nanomolar leads printing out. But you can get pointers toward parts of chemical space that you wouldn't have thought about (or wouldn't have been able to physically screen).

One tricky part is that when a virtual screening effort is successful (for whatever value you assign to "success"), it can be hard to tell why, and likewise for failures. There are so many places where things can disconnect - proteins are mobile, and small molecules even more so, and accounting for these conformational ensembles is not trivial. Binding interactions are not always well understood, or well modeled. Water molecules are pesky, but can be vitally important. You might have picked inappropriate controls (positive or negative), or be weighting the various computed factors in the wrong way. Either of those will send your calculation further and further off the rails.

And so on. The paper goes into detail on these possibilities and more; I highly recommend it for anyone getting into virtual screening (or for anyone already doing it, to keep the troubleshooting guide in one handy place).

Comments (2) + TrackBacks (0) | Category: In Silico


COMMENTS

1. Curious Wavefunction on January 14, 2013 10:12 AM writes...

Looks like a useful review. One important caveat to keep in mind is that even positive hits from virtual screening (validated by experiment) may not correspond to correct binding modes, and this may lead you down the wrong path when you are synthesizing analogs.

Permalink to Comment

2. H2L on January 14, 2013 2:38 PM writes...

@Curious Wavefunction: On the flip side, false positives may still have value in detecting aspects of molecular recognition that might be important for binding even if the entire molecule is not active enough to be detected by biochemical assays. A good example of this can be seen in the work of Prof. Bill Jorgensen:

"From Docking False-Positive to Active Anti-HIV Agent"
J. Med. Chem., 2007, 50 (22), pp 5324–5329
http://pubs.acs.org/doi/abs/10.1021/jm070683u

Permalink to Comment

POST A COMMENT




Remember Me?



EMAIL THIS ENTRY TO A FRIEND

Email this entry to:

Your email address:

Message (optional):




RELATED ENTRIES
Scripps Update
What If Drug Patents Were Written Like Software Patents?
Stem Cells: The Center of "Right to Try"
Speaking of Polyphenols. . .
Dark Biology And Small Molecules
How Polyphenols Work, Perhaps?
More On Automated Medicinal Chemistry
Scripps Merging With USC?