Deploy cloud agents that improve in production
Hey there, Roland
Here's a quick look at your managed agents.
The managed cloud for agents, powered by Pi. Define each agent as a recipe, deploy it to commit-pinned infrastructure, and keep it improving in production with a graded loop you control.
Pi recipes
Your agent is a Pi recipe: YAML in a git folder. Any model, any MCP, your tools, identical on your laptop and in our cloud.
GitOps runtimes
Git is the source of truth: every runtime is a commit-pinned deploy. Each change opens a pull request with a preview runtime and live experiment, and any deploy reverts in one click.
Continual learning
An MCP and skill for Claude Code, Codex, and Cursor that runs the loop on your primitives: patterns, judges, experiments.
A directory is a
multi-agent system
A Pi recipe is a git folder of agents, tools, and skills. Run it locally, ship the same commit to our cloud.
Orchestrator-worker research: ingest briefs from Drive, decompose, dispatch parallel subagents, synthesize, then cite.
Incident orchestrator: investigate Datadog and PagerDuty alerts, auto-triage Sentry errors with fix PRs, and draft postmortems on resolve.
Frontline support agent: triage Slack threads, search the Notion KB, draft replies, and escalate when policy requires a human.
GTM orchestrator: qualify inbound leads, research accounts across CRM and calls, draft outbound for Slack approval, and publish account intelligence.
Orchestrator-worker research: ingest briefs from Drive, decompose, dispatch parallel subagents, synthesize, then cite.
1name: agent2description: Lead researcher, multi-agent research orchestrator.3model:4name: anthropic/claude-opus-4-85reasoning_effort: high6tools:7- todo_write8- read9- bash10- write11- WebSearch12- WebFetch13- mcp:google-drive/read_file14- mcp:google-drive/search_files15subagents:16- research17- citations18skills:19- research-process20system_instructions:21mode: append22content: |23# Role: Lead Researcher24Pull source docs from Drive, decompose25the question, delegate parallel research,26synthesize, then run the citation pass.
Incident orchestrator: investigate Datadog and PagerDuty alerts, auto-triage Sentry errors with fix PRs, and draft postmortems on resolve.
1name: agent2description: Incident response orchestrator, alerts, triage, postmortems.3model:4name: anthropic/claude-opus-4-85reasoning_effort: high6tools:7- todo_write8- read9- bash10- edit11- write12- mcp:pagerduty/acknowledge13- mcp:pagerduty/list_incidents14- mcp:datadog/query_logs15- mcp:datadog/query_metrics16- mcp:sentry/get_issue17- mcp:github/create_pull_request18- mcp:slack/post_message19subagents:20- alert-investigator21- sentry-triage22- postmortem-writer23skills:24- incident-response25- hotfix-playbook26system_instructions:27mode: append28content: |29# Role: Incident commander30On alert: ack, pull logs, correlate deploys,31delegate investigation; triage Sentry with fix PR;32on resolve, draft postmortem with timeline + actions.
Frontline support agent: triage Slack threads, search the Notion KB, draft replies, and escalate when policy requires a human.
1name: agent2description: Support orchestrator, classify requests, retrieve KB answers, draft replies.3model:4name: anthropic/claude-sonnet-4-65reasoning_effort: medium6tools:7- read8- bash9- mcp:slack/list_threads10- mcp:slack/post_message11- mcp:notion/search12- mcp:notion/get_page13subagents:14- triage15- responder16- escalation17skills:18- ticket-triage19system_instructions:20mode: append21content: |22# Role: Support lead23Read the customer thread, search Notion for24approved answers, draft a reply, escalate25billing and security issues to a human.
GTM orchestrator: qualify inbound leads, research accounts across CRM and calls, draft outbound for Slack approval, and publish account intelligence.
1name: agent2description: GTM orchestrator, inbound qualification, research, and Slack drafts.3model:4name: openai/gpt-5.55thinking_level: medium6tools:7- shell_command8- apply_patch9- update_plan10- web_search11- mcp:salesforce/get_lead12- mcp:salesforce/get_account13- mcp:gong/search_calls14- mcp:slack/post_draft15- mcp:apollo/enrich_contact16- mcp:bigquery/query17subagents:18- lead-research19- sales-research20- account-intel21skills:22- outbound-playbook23- gtm-workflow24system_instructions:25mode: append26content: |27# Role: GTM orchestrator28Run do-not-send checks, gather CRM + Gong + web29context, draft relationship-aware outbound with30rationale, post to Slack for rep approval.
Everything you need
for production agents
Drop agents into your own product. Each user gets an agent that acts as them, with conversations, files, and memory sealed to that user.
Your users sign in through your IdP (Supabase, Auth0, Okta) and the agent acts as them. Cookie-auth in the browser, no key shipped.
Connect any MCP server or API as a tool. A reverse proxy authorizes every call and injects credentials at the edge, so secrets never reach the sandbox.
Kick off a long-running task, disconnect, and reconnect later, it keeps running and replays where you left off.
Full per-user traces (turns, tool calls, tokens, and cost) queryable over the API.
Per-user files your agents read and write: versioned, durable memory that persists across sessions.
Share any conversation by a revocable link, then fork it into a new task, retry or explore with full history intact.
With primitives
for continual learning
Every deployment becomes a self-improving loop. You set the direction. The infrastructure makes it happen.
A single trace explains one request. A pattern tells you whether the same failure is happening enough to act on, named and counted automatically.
Each pattern becomes a judge: graded code committed beside the recipe, pinned to the commit, running online on live traffic.
Candidates A/B on real users, bucketed per end-user, with P(better), every fix proven on production before you merge it.
Every cycle leaves patterns, judges, and proven changes keyed to your commits: a record of what works that no fresh start can copy.
Deployed on
frontier infrastructure
Your agents run in a single-tenant data plane inside your own cloud. You ship the recipe; the boundary stays yours.
Your agents run in your own cloud: your region, your VPC, encrypted at rest, never co-mingled with other tenants.
Every run executes in a confidential, hardened sandbox in your own infrastructure: strong isolation with no performance hit, torn down on completion so nothing persists.
Agents reach only the hosts you allow, and credentials are injected at the edge, so your keys never enter the sandbox.
Tie every agent, key, and action to your identity provider, gated by scoped role-based access.
Every run is traced end-to-end (turns, tool calls, tokens, and cost), streamed to your own stack and pinned to the exact commit that ran.
Start on managed keys, or bring your own provider accounts: your spend, under your own data agreements.
