# Retrosynthesis

{% hint style="info" %}
Pending AI’s various capabilities are powered by artificial intelligence which is an experimental technology and may occasionally be misleading or incorrect.
{% endhint %}

Pending AI's Retrosynthesis Engine is a cutting-edge service designed for ultra-high-throughput synthetic feasibility analysis. By integrating multiple unique deep learning models with a Monte Carlo Tree Search (MCTS) framework, the Engine rapidly identifies complex, multi-step, and economically viable synthetic routes for novel and existing chemical entities.

The synthetic bottleneck is a major impediment in drug discovery. Our engine drastically accelerates the Design-Make-Test-Analyze (DMTA) cycle by providing chemist-first, feasible synthesis routes early in the process.

* <mark style="color:$primary;">**Intelligent Feasibility**</mark>: Combines reaction pattern recognition from deep learning models with the exhaustive search capabilities of MCTS to propose routes that are both practically possible and literature-inspired.
* <mark style="color:$primary;">**Massive Throughput**</mark>: Designed to scale and screen the synthetic accessibility of hundreds of thousands of compounds per day, enabling proactive filtering of synthetically difficult molecules *prior* to the Make stage.
* <mark style="color:$primary;">**Actionable Outputs**</mark>: Provides detailed, step-by-step route trees, including learned scoring and pairwise diversity to guide lab-based synthesis.

See the <mark style="color:$primary;">**PAI Retro**</mark> page for more information about the service.

{% content-ref url="../api-reference/pai-retro" %}
[pai-retro](https://docs.pending.ai/api-reference/pai-retro)
{% endcontent-ref %}

***

## Applications

High-throughput large-scale molecule retrosynthesis has traditionally blocked drug discovery pipelines forcing chemists to perform insufficient workarounds for a target. The [PAI Retro](https://docs.pending.ai/api-reference/pai-retro) service provides an avenue to unlock previously time-restricted applications within the DMTA cycle:

<table data-card-size="large" data-view="cards"><thead><tr><th align="center"></th><th align="center"></th></tr></thead><tbody><tr><td align="center"><i class="fa-memo-circle-check">:memo-circle-check:</i> Assess Synthetic Accessibility</td><td align="center">Proactively filter lead candidates from virtual screening campaigns (or sampled synthetic libraries using <a data-mention href="generative-ai">generative-ai</a>).</td></tr><tr><td align="center"><i class="fa-route">:route:</i> Route Design &#x26; Optimisation</td><td align="center">Identify cheaper, shorter, or more scalable routes for existing commercial  products from specified building block libraries.</td></tr><tr><td align="center"><i class="fa-lock">:lock:</i> Targeted Proprietary Libraries</td><td align="center">Retrosynthesis models can be customised using proprietary datasets for reaction templates or building block libraries.</td></tr><tr><td align="center"><i class="fa-robot">:robot:</i> Automated Synthesis Integration</td><td align="center">Directly feed valid routes into automated synthesis platforms, electronic laboratory notebooks, or AI-driven environment.</td></tr></tbody></table>

***

## Features & Advantages

### 1. Chemically-Aware AI Architecture

Our unique approach integrates three distinct, high-quality AI models trained on chemically diverse datasets within an engine-variant MCTS search framework.

* <mark style="color:$primary;">**Reaction Prediction AI**</mark>: Base models are trained on millions of unique chemical reactions to accurately predict viable precursors in a single retrosynthetic step.
* <mark style="color:$primary;">**Template Extraction**</mark>: Over 75,000 high-quality, chemically relevant reaction templates are extracted and utilised, ensuring outputs reflect real-world reaction space.
* <mark style="color:$primary;">**AI-Driven Expansion Policy**</mark>: A highly optimised AI model (trained on 71 million samples) guides MCTS expansion, prioritising ideal reactions for a given target molecule for faster convergence.

### 2. Scalability and Customisation

We offer flexible solutions designed to meet both the scale of virtual screening and the specificity of proprietary data.

* <mark style="color:$primary;">**Ultra-High Throughput**</mark>: Standard processing capacity of up to 400,000 queries per day, with tiered options (High, Very-High, Ultra-High) available for bespoke campaigns.
* <mark style="color:$primary;">**Custom Data Integration**</mark>: The AI models can be customised and fine-tuned using proprietary datasets (e.g., in-house electronic laboratory notebook data), ensuring generated routes prioritise preferred, reliable, and available chemistry.
* <mark style="color:$primary;">**Building Block Awareness**</mark>: Routes are assessed against a comprehensive set of nearly ten million commercially available building blocks (including a subset of ZINC labelled as in-stock).

### 3. Chemist-First Capabilities

The Retrosynthesis engine is designed to augment, not replace, the medicinal chemist's expertise by providing practical, high-value insights.

* <mark style="color:$primary;">**Route Diversity Analysis**</mark>: MCTS is engineered to explore a rich set of distinct synthetic pathways. Diversity rankings and analysis of routes allows chemists to assess trade-offs in terms of cost, novelty, and other characteristics.
* <mark style="color:$primary;">**Feasibility Scoring**</mark>: Routes and reaction steps are assigned confidence scores allowing the chemist to quickly prioritise higher-confidence pathways and assess risks for lower scoring components.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.pending.ai/capabilities/retrosynthesis.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
