AI gateway & routing
One OpenAI-compatible endpoint moves across OpenAI, Anthropic, and Gemini. Sticky assignment per user and node keeps your experiments steady.
AI gateway · outcome analytics solution
abto routes your LLM traffic in one place, records the cost and latency of every call, and connects which model or prompt actually drives conversion.
DOM events + abto gateway → analysis
What you spentmeasured as outcome
Cost −37% · evidence for model choice
Cost and outcome
abto joins the user actions captured as DOM events with the call costs recorded at the gateway, then shows which model actually drives conversion.
DROP-IN
It's OpenAI-compatible, so there's no call code to rewrite. Pass identifiers as headers — the gateway reads and consumes them, and never forwards them to the vendor.
import OpenAI from "openai"; const client = new OpenAI({ baseURL: "https://gw.abto.dev/v1", // the only line you change apiKey: process.env.ABTO_API_KEY,}); await client.chat.completions.create( { model, messages }, { headers: { // gateway reads & strips these — never sent to the vendor "x-abto-user-id": user.id, "x-abto-node-id": "checkout_suggest", } },);Point baseURL at the abto gateway. The rest of your call stays exactly the same.
Flow user_id and node_id through x-abto-user-id / x-abto-node-id; the gateway consumes them.
Tenant is identified automatically from the API key — nothing to configure.
The question that matters is elsewhere: which user behavior created which LLM cost, and did that cost come back as revenue and conversion? abto goes beyond ‘who clicked what’ and closes the broken link between spend and outcome.
How it works
Accept an OpenAI-compatible request and convert it to the internal canonical form.
inbound → canonical
Sticky assignment per user and node routes across OpenAI, Anthropic, and Gemini.
sticky · user·node
Cost, latency, and outcome are recorded per call. Delivery is async, so responses never slow down.
OpenTelemetry · async
User behavior → LLM cost → real conversion, joined in a single line.
behavior → cost → conversion
One OpenAI-compatible endpoint moves across OpenAI, Anthropic, and Gemini. Sticky assignment per user and node keeps your experiments steady.
On top of OpenTelemetry, abto records the cost, latency, and outcome of every call. Delivery is async, so it never touches your response time.
Connect user behavior → LLM cost → real outcomes in a single line. Prove with data which model or prompt drives conversion.
Join the reservation list and we'll tell you the moment we launch.
Contact ·contact@abto.app