Academic Publication Testing the predictive power of reverse screening to infer drug targets, with the help of machine learning
Research Abstract & Technology Focus
AbstractEstimating protein targets of compounds based on the similarity principle—similar molecules are likely to show comparable bioactivity—is a long-standing strategy in drug research. Having previously quantified this principle, we present here a large-scale evaluation of its predictive power for inferring macromolecular targets by reverse screening an unprecedented vast external test set of more than 300,000 active small molecules against another bioactivity set of more than 500,000 compounds. We show that machine-learning can predict the correct targets, with the highest probability among 2069 proteins, for more than 51% of the external molecules. The strong enrichment thus obtained demonstrates its usefulness in supporting phenotypic screens, polypharmacology, or repurposing. Moreover, we quantified the impact of the bioactivity knowledge available for proteins in terms of number and diversity of actives. Finally, we advise that developers of such approaches follow an application-oriented benchmarking strategy and use large, high-quality, non-overlapping datasets as provided here.
Commercial Realization
Startups and Open Source tools heavily associated with the concepts explored in this paper.
-
GitHubgsd-build/gsd-2
-
GitHubsantifer/career-ops
-
Product HuntFreeCAD 1.1
-
Product HuntBlood Sugar Journal
Associated Media Narrative
- Show HN: Running local OpenClaw together with remote agents in an open network
- Korean startup backed by Samsung and Arm launches rack-sized inference monsters, claims "6x lower power consumption" and up to 75% cheaper acquisition cost compared to Nvidia
- Github Integrates AI to Improve Accessibility Issue Management and Automate Feedback Triage
Market Trends