AGNT5 - Runtime for production-ready AI agents

AI workflows are glue.
Glue breaks.

AGNT5 adds the durability, retries, and state needed to handle queues, callbacks, and human handoffs — so your workflows and agents recover instead of fail.

triage_ticket.py
1
# Durable workflow with human approval
2
from agnt5 import durable, Context
3
 
4
@durable('triage_ticket', retries=5)
5
async def triage(ctx: Context, ticket: dict):
6
 
7
# Step 1: Draft a response using LLM
8
payload = {'role': 'user', 'content': ticket['text']}
9
draft = await ctx.step('draft', lambda: ctx.llm.chat(messages=[payload]))
10
 
11
# Step 2: Wait for human approval (can pause for days)
12
decision = await ctx.signal.wait('agent_approval', timeout='24h')
13
if decision != 'approve':
14
return 'held'
15
 
16
# Step 3: Send the email
17
await ctx.step('send', lambda: ctx.email.send(
18
to=ticket['user'],
19
body=draft.text,
20
))
21
return 'sent'
Waiting to start...
[01] THE PROBLEM

The "glue" is harder than you think.

Solving distributed systems challenges in application code is a liability. AGNT5 handles the complexity of the network and infrastructure for you.

A deployment happens while an agent is waiting for an LLM response. In standard code, that process dies and the context is lost forever.

run_agent.py
# WITHOUT AGNT5 (The "Manual State" Nightmare)
async def run_agent(run_id):
    state = db.get_run(run_id)

    if state.step == 'start':
        try:
            # If server crashes here, result is lost
            result = await llm.generate()
            # If DB commit fails here, you might re-run LLM later
            db.update(run_id, step='processed', data=result)
        except SystemExit:
            # Hope your graceful shutdown works...
            db.save_checkpoint(run_id)
> Waiting to start...
[02] PLATFORM PRIMITIVES

Production primitives for AI workflows.

Everything you need to build reliable AI applications, built into the runtime.

Durable Runtime

Survive crashes, deploys, restarts. Your workflow picks up exactly where it left off. No manual checkpointing required.

Retries & Schedules

Declarative retry policies, not scattered try/catch. Cron schedules built-in. Set once, run forever.

Human Handoffs

Wait for approvals as first-class steps. Pause for days, resume instantly. No polling, no webhooks.

Observability & Evals

Traces, logs, metrics, eval hooks. Debug in development, monitor in production. OpenTelemetry built-in.

[03] DEVELOPER WORKFLOW

How AGNT5 fits into your stack.

From local development to production deployment in three steps.

Step 01

Install

terminal
$ pip install agnt5

Install the SDK. Define workflows in your existing Python codebase. No infrastructure to set up.

Step 02

Develop

terminal
$ agnt5 dev

Run locally with full visibility. Test workflows, inspect state, debug with traces.

Step 03

Deploy

terminal
$ agnt5 deploy

Deploy to your infra or AGNT5 Cloud. Same code, same guarantees. Scale automatically.

[04] USE CASES

Built for AI workflows that need to work.

Real applications running in production today.

Customer Support Agents

Triage tickets, draft responses, wait for approval, send. All durable. No lost context between handoffs. Handle escalations that span days without losing state.

human-in-loopmulti-stepasync-approvals

Multi-step RAG Pipelines

Chunk, embed, store, retrieve, generate. Decoupled compute with recoverable steps.

Data Processing Pipelines

ETL, transformations, exactly-once. Never double-process a record.

Approval Workflows

Route to reviewers, collect signatures, handle escalations. Pause for days or seconds, resume instantly. No polling, no webhooks, no manual state management.

Start building reliable AI workflows today.

Get started in minutes. No infrastructure to manage.