PROTAC Design

PROTAC Design in CADD

PROTAC design in computer-aided drug discovery (CADD) presents significant challenges for various reasons. The fundamental issue remains the notable lack of 3D structures and reliable models that enable generalization, which means that every molecular modeling approach already carries a lot of uncertainties and inaccuracies before a single keystroke is made.

Yet, this does not mean that rational design is impossible. By breaking down proteolysis targeting chimeras (PROTACs) into their three elements, namely the target protein ligand, the protein degrader ligand (e.g. E3 ligase), and a linker group connecting both, one will face simpler problems that are easier to address.

Elements of PROTACs

Let's discuss the components that make up a PROTAC in detail. First, we have the target ligand, which binds to the target protein that is to be degraded by the ligase. After the binding event, it brings both structures into close proximity, initiating the degradation of the target protein.

Ideally, the target ligand should be a potent and selective binder to the target structure. Often, in-house compounds are already sufficiently optimized or almost display desired physicochemical properties. For planning a PROTAC, it's crucial to identify the appropriate site on the molecule where it can be linked without interfering with the binding functionality (e.g., clashing with the macromoelcule surface).
If any predictions or PDBs are available, this information can be used to identify a suitable solvent-exposed position on the target ligand for attaching a linker. Otherwise, one has no choice but to resort to a trial-and-error approach to find an optimal position that does not significantly alter the ligand's affinity.

If the target ligand is already well-characterized, it can be advantageous to look for commercially available analogs that already contain a functional group that can easily be coupled with a linker. Chemical Spaces are well-suited for both purposes: to search for modified target ligands or for building blocks that can be synthesized into the target ligand through straightforward steps.

Smart Setup of a PROTAC Toolbox

The ligase ligand offers somewhat less flexibility in design possibilities, but there are still options available. In addition to the well-known E3 ubiquitin ligase ligands like thalidomide, lenalidomide, and pomalidomide, there are also other structurally more complex entries.

A linker between the target ligand and the ligase ligand should always be attached at the correct position to avoid impairing the affinity to the binding partner. Once the positions are established, the next question is the type of linker. A PEG linker is commonly chosen due to its favorable formulation properties, increased water solubility, and the ease with which the optimal length can be sampled. The length of the linker is crucial for stabilizing the complex formed by the PROTAC, target, and ligase, which can vary depending on the target, protein class, and binding mode of the ligand.

From both an intellectual property and bioavailability perspective, it may also be beneficial to explore alternative linker modalities. Rigidification of the linker group can positively impact the pharmacological and pharmacokinetic behavior of the PROTAC and, through additional interactions in the complex, enhance the binding affinity of the PROTAC.

Chemical Spaces can be conveniently searched for ligase ligands with suitable functionalities for coupling, or directly for combinations of linkers and ligase ligands. A broader in-house arsenal provides efficient flexibility in projects and accelerates synthesis processes.

Chemical Spaces are Boutiques for PROTACs Design

Online compound catalogs rarely list more than 1,000 analogs of a ligase ligand. While this is not an insignificant number, most researchers would agree that it is all too common to encounter situations where certain building blocks are missing. Additionally, the external synthesis of desired building blocks can be quite costly.

Chemical Spaces address this issue by including all possible structures that can be synthesized from the available building blocks, exponentially expanding the range of options. For example, searching the eXplore database yields 1,339,094,441 structures containing lenalidomide as a substructure, 2,647,921,721 analogs for pomalidomide, and another 2,522,855,058 options for thalidomide.

The comparison between a thousand options and billions clearly highlights the potential that Chemical Spaces have in PROTAC design.
BioSolveIT software for Chemical Space exploration:
  • infiniSee: Chemical Space navigation platform with a graphical user interface.
    You can use infiniSee's Motif Matcher Mode to specifically search for compounds for PROTAC design, covering all three components: target ligand, linker, and ligase ligand. Depending on the desired scope, you can also focus on specific combinations of two of these elements. The R group search is particularly useful for mining analogs at a desired position, allowing for a more refined and targeted approach in selecting compounds for your PROTAC design.
  • infiniSee xREAL: Exclusive platform to screen Enamine's largest compound catalog featuring trillions of compounds.
    infiniSee xREAL contains all features of infiniSee and supports all three Chemical Space exploration search modes.
Command-line tools for Chemical Space exploration:
  • FTrees: Pharmacophore-based similarity screen. Algorithm behind the Scaffold Hopper Mode.
  • SpaceLight: Retrieves close analogs based on molecular fingerprints. Algorithm behind the Analog Hunter Mode.
  • SpaceMACS: Performs maximum common substructure searches, as well as exact substructure mining. Algorithm behind the Motif Matcher Mode.

Excited for more drug discovery solutions?