
The Agent Stack
Quick Answer
The Agent Stack by Vercel AI provides essential building blocks for creating production-grade agents, enabling seamless integration across multiple AI models and secure operations.
Quick Take
The Agent Stack by Vercel AI provides essential building blocks for creating production-grade agents, enabling seamless integration across multiple AI models and secure operations. It features components like AI Gateway for model routing, Workflow SDK for durable execution, and Vercel Connect for scoped access, streamlining agent development and deployment across various platforms.
Key Points
- AI Gateway allows routing across hundreds of models from a single endpoint.
- Workflow SDK ensures durable execution with state persistence and retries.
- Vercel Connect provides short-lived tokens for secure system access.
- Chat SDK integrates agents across multiple platforms with a single codebase.
- Eve framework simplifies agent development with pre-wired functionalities.
Article Content
From source RSS / original summaryAgents are designed to do almost any kind of work, from answering support tickets to writing code. No matter how complex the workload, how long it runs, or how many turns it takes to complete, every agent needs three core capabilities to operate: Implementing these capabilities to build a complete agent forces developers to choose between vendor lock-in with a single provider API, stitching together solutions, or building abstractions themselves.
The Agent Stack gives you all the building blocks you need to create and ship production-grade agents. Agents don't run on a single model. Every task has a different cost, latency, and capability tradeoff, and the right call depends on what the agent is doing. It needs one interface to reach any of them, a way to route between them, and a way to stream back to the user. gives an agent one interface to call any model, and routes across hundreds of them from a single endpoint.
AI SDKAI GatewayEvery lab exposes model calls through their own API. Streaming, tool calls, structured output, and the shape of the request all vary, so every provider you support adds another integration to build and maintain. is a single interface for building AI apps, agents, and frameworks. It is platform, framework, and model agnostic, and allows you to generate text, images, speech, video, and more.
AI SDKTokens are a production dependency now, the way bandwidth is for the web, and agents use different models per task. Integration across labs means separate keys and billing from providers that are expensive, rate-limited, and always changing. is the CDN for tokens, routing them on the global network we have run for over a decade. It routes each call through a single endpoint, fails over when a provider goes down, and tracks cost and usage across all of them.
You pay the provider's price with no markup, and you can use your own keys. AI Gateway runs three models from a single key, sending market analysis to Claude, marketing copy to GPT, and image generation to Gemini. SERHANT. Agents work on tasks sequentially, sometimes for minutes or hours, and those tasks often require them execute code and other operations in a secure environment. makes agent runs durable, and gives agents their own isolated VM.
Workflow SDKVercel SandboxWhen a step fails deep in an agent workflow and there's nothing to resume from, the whole job starts over, re-running every model call you paid for. Solving for durability means building and maintaining retries, state persistence, and orchestration yourself. checkpoints every step of every job, keeps state, retries what fails, and pauses when it needs to wait on a person, a slow API, or a webhook. Runs resume from the last good step, instead of from zero.
Workflow SDK built its creative agent on Workflow SDK, where a single creative session fans out across more than fifty image models. Each step persists and retries on failure, so a long run never loses its state. FLORAAgents read files, run commands, and write code. That freedom is what makes them capable. But without constraints, it's also a risk. The code is unreviewed, a command might be wrong, and one bad step can reach something it should never touch.
gives each agent its own microVM, a full Linux computer with a filesystem, Docker support, and its own kernel, isolated from the host and from every other sandbox. Credentials are injected only when the agent's code calls a service, so it can use what it needs without ever seeing a raw token. Vercel SandboxSandboxes give you and your agents the same primitive behind Vercel's billion preview deployments and six million daily builds. An agent that only talks to models can't do much.
To be useful, it has to access data and external systems, and communicate with the people using it. Both connections have to be secure. gives agents scoped, short-lived access to data and systems. ships agents into the apps where your users already are. Vercel ConnectChat SDKOpening a pull request, updating a record, querying a data warehouse.
Asking an agent to do that work means giving it access to the platforms you use, and today that usually means a long-lived token broad enough to cover anything the agent might ever do. It never expires, and no one can say which user authorized what action the agent took. With, you integrate with each system once. The agent mints a short-lived token for each task, scoped only to the permissions you explicitly grant.
Vercel ConnectEvery action traces from user to agent to service, so the audit log ties every call to the user the agent acted for. Vercel Connect is the newest building block in the Agent Stack, in public beta with support for Slack, GitHub, Snowflake, Salesforce, Notion, and Linear, and any other service through OAuth or API. People don’t work in one tab. They move between Slack, GitHub, Linear, WhatsApp, and Discord, and they expect your agent in each one.
Putting it there yourself means a different API integration, auth flow, and message format for every platform. delivers your agent to all of them. You install Chat SDK once, and it handles each platform's adapter, making your agent available where your users already are. Chat SDK uses Chat SDK to deliver one agent across more than a dozen channels from a single codebase. A conversation can start on Slack and continue in GitHub or Linear, and the agent keeps its context across surfaces.
Reader Mode unavailable (could not extract clean content).
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from Vercel AI
See more →
Opus 4.8 on AI Gateway
Claude Opus 4.8, now available on Vercel AI Gateway, excels in long-horizon agentic execution and complex coding tasks, producing clearer prose for knowledge work. Users can access it via the .anthropic/claude-opus-4.8 model in the AI SDK, benefiting from a unified API with no markup on provider pricing.


