This month is, unsurprisingly, Cost Reduction Month.
In our data from the last 3 yrs, we commonly see major cost crunches right after the latest breakthrough.
We'll ship major features to help you cut inference costs at least once a week, starting with today.
Running list 👇
New server tool: Advisor
Let smaller models consult a higher-intelligence "advisor" model.
Helps them escape doom loops, and helps you migrate to cheaper models! đź§µ
Official OpenRouter MCP: help your agents pull live data about model pricing, benchmarks, popularity, and more!
Avoid the accidental bad model slug being written into your codebase.
Introducing the OpenRouter MCP, live model intelligence right inside your agent
Your agent builds and ships, but when it comes to choosing the right model for the right job, it guesses from 6 month old training data
Watch it pick, price, and test the right model:
Open-weight provider meta-benchmarking, run and published continuously by OpenRouter.
Powers our tool-call routing and ensures you receive the highest-quality inference:
Tip: OpenRouter continuously runs GPQA and TAU-Bench on most open-weight models and publishes the results publicly.
This informs our AutoExacto meta-benchmark, used by default when routing tool calls.
Here, @parasail_io and @Zai_org rank first: openrouter.ai/z-ai/glm-5.2#p…
Always stay on the latest Sonnet by using ~anthropic/claude-sonnet-latest in your API calls.
~anthropic/claude-sonnet-latest now routes to Claude Sonnet 5!
Claude Sonnet 5 is rolling out on OpenRouter with a promo price: $2/M in and $10/M out!
It boosts agentic coding and pro workflows w/ flagship intelligence at Sonnet pricing. In early tests, agents were more reliable, faster, and easier to trust with larger tasks than 4.6.
Introducing Claude Sonnet 5, our most agentic Sonnet yet.
It makes plans, uses tools like browsers and terminals, and runs autonomously at a level that just a few months ago required larger and more expensive models.
Tip: OpenRouter continuously runs GPQA and TAU-Bench on most open-weight models and publishes the results publicly.
This informs our AutoExacto meta-benchmark, used by default when routing tool calls.
Here, @parasail_io and @Zai_org rank first: openrouter.ai/z-ai/glm-5.2#p…
Four open-weight models have crossed into territory where they are powering real agentic pipelines.
New post in our Insights blog about why companies are choosing them in June: openrouter.ai/blog/insights/…
TIP đź’ˇ@Zai_org GLM-5.2 providers are working on faster and faster inference! Today's new endpoints include @wafer_ai and @FireworksAI_HQ fast variants.
Set your model to "z-ai/glm-5.2:nitro" to continuously get the fastest provider based on live traffic data.
TIP đź’ˇ@Zai_org GLM-5.2 providers are working on faster and faster inference! Today's new endpoints include @wafer_ai and @FireworksAI_HQ fast variants.
Set your model to "z-ai/glm-5.2:nitro" to continuously get the fastest provider based on live traffic data.
In this OpenRouter MCP demo, your agent finds the best model at design:
1. Pulls the top design models from @Designarena, live, through the MCP
2. Spins up three subagents - GLM-5.2, Opus 4.7, Kimi 2.6 - each designing a self-portrait as a webpage
3. Opens all three for you
diff models are good at diff things, but how many of us actually compare them?
you sign up for each provider separately, load creds, and run the same prompt over and over
it's tedious, so 99% of the time, people just go with what others say is best
there's a better way..
Introducing the OpenRouter MCP, live model intelligence right inside your agent
Your agent builds and ships, but when it comes to choosing the right model for the right job, it guesses from 6 month old training data
Watch it pick, price, and test the right model:
xAI + Zero Data Retention, now live on OpenRouter. đź”’
Available across Grok 4.3, 4.20 and Build 0.1. Turn on ZDR and you're covered.
Browse Grok ZDR models: openrouter.ai/models?zdr=tru…
TIP đź’ˇ@Zai_org GLM-5.2 providers are working on faster and faster inference! Today's new endpoints include @wafer_ai and @FireworksAI_HQ fast variants.
Set your model to "z-ai/glm-5.2:nitro" to continuously get the fastest provider based on live traffic data.
Interface + inference, in one place.
@OpenWebUI now runs on OpenRouter.
Give your team one chat interface, one unified bill, and access to 400+ frontier and open models through a single API.