Now here's a paper, packed to the edges with data, on what kinds of drug candidate compounds different companies produce. The authors assembled their list via the best method available to outsiders: they looked at what compounds are exemplified in patent filings
What they find is that over the 2000-2010 period that not much change has taken place, on average, in the properties of the molecules that are showing up. Note that we're assuming, for purposes of discussion, that these properties - things like molecular weight, logP, polar surface area, amount of aromaticity - are relevant. I'd have to say that they are. They're not the end of the discussion, because there are plenty of drugs that violate one or more of these criteria. But there are even more that don't, and given the finite amount of time and money we have to work with, you're probably better off approaching a new target with five hundred thousand compounds that are well within the drug-like properties boxes rather than five hundred thousand that aren't. And at the other end of things, you're probably better off with ten clinical candidates that mostly fit versus ten that mostly don't.
But even if overall properties don't seem to be changing much, that doesn't mean that there aren't differences between companies. That's actually the main thrust of the paper: the authors compare Abbott, Amgen, AstraZeneca, Bayer-Schering, Boehringer, Bristol-Myers Squibb, GlaxoSmithKline, J&J, Lilly, Merck, Novartis, Pfizer, Roche, Sanofi, Schering-Plough, Takeda, Wyeth, and Vertex. Of course, these organizations filed different numbers of patents, on different targets, with different numbers of compounds. For the record, Merck and GSK filed the most patents during those ten years (over 1500), while Amgen and Takeda filed the fewest (under 300). Merck and BMS had the largest number of unique compounds (over 70,000), and Takeda and Bayer-Schering had the fewest (in the low 20,000s). I should note that AstraZeneca just missed the top two in both patents and compounds.
If you just look at the raw numbers, ignoring targeting and therapeutic areas, Wyeth, Bayer-Schering, and Novartis come out looking the worst for properties, while Vertex and Pfizer look the best. But what's interesting is that even after you correct for targets and the like, that organizations still differ quite a bit in the sorts of compounds that they turn out. Takeda, Lilly, and Wyeth, for example, were at the top of the cLogP rankings (numberically, "top" meaning the greasiest). Meanwhile, Vertex, Pfizer, and AstraZeneca were at the other end of the scale in cLogP. In molecular weight, Novartis, Boehringer, and Schering-Plough were at the high end (up around 475), while Vertex was at the low end (around 425). I'm showing a radar-style plot from the paper where they cover several different target-unbiased properties (which have been normalized for scale), and you can see that different companies do cover very different sorts of space. (The numbers next to the company names are the total number of shared targets found and the total number of shared-target observations used - see the paper if you need more details on how they compiled the numbers).
Now, it's fair to ask how relevant the whole sweep of patented compounds might be, since only a few ever make it deep into the clinic. And some companies just have different IP approaches, patenting more broadly or narrowly. But there's an interesting comparison near the end of the paper, where the authors take a look at the set of patents that cover only single compounds. Now, those are things that someone has truly found interesting and worth extra layers of IP protection, and they average to significantly lower molecular weights, cLogP values, and number of rotatable bonds than the general run of patented compounds. Which just gets back to the points I was making in the first paragraph - other things being equal, that's where you'd want to spend more of your time and money.
What's odd is that the trends over the last ten years haven't been more pronounced. As the paper puts it:
blockquote>Over the past decade, the mean overall physico-chemical space used by many pharmaceutical companies has not changed substantially, and the overall output remains worryingly at the periphery of historical oral drug chemical space. This is despite the fact that potential candidate drugs, identified in patents protecting single compounds, seem to reflect physiological and developmental pressures, as they have improved drug-like properties relative to the full industry patent portfolio. Given these facts, and the established influence of molecular properties on ADMET risks and pipeline progression, it remains surprising that many organizations are not adjusting their strategies.
The big question that this paper leaves unanswered, because there's no way for them to answer it, is how these inter-organizational differences get going and how they continue. I'll add my speculations in another post - but speculations they will be.