
Pricing

Start for free. Upgrade to get the capacity that exactly matches your team's needs.

Free

Find out how Vellum helps you build powerful agents.

$0
per month
1 user
50 builder credits
Hosted agent apps
Debugging console
Knowledge base (20 docs/m)
Pro

Designed for anyone starting to build agents and test ideas.

$25
per month
1 user
200 builder credits
Hosted agent apps
Debugging console
Knowledge base (1000 docs/m)
Execution history (up to 3GB)
Business

Designed for teams scaling agent deployments across the org.

$79
per user / month
Up to 5 users
500 builder credits
Hosted agent apps
Debugging console
Knowledge base (1000 docs/m)
Execution history (up to 10GB)
Enterprise

Built for large orgs needing flexibility, scale, and governance.

Custom
Unlimited credits & custom setup, plus:
RBAC
SSO
Environments
Prompt management & Evals
VPC install
Dedicated support
Prompt Engineering
Prompt editor
Comparison mode
Chat mode
Function calling
Human review
Image prompting
Collaborative workspace
Versioning & tracking
Support for open and closed source models
Workflows
Workflow development IDE
Out of box templates
HTTP API requests
pip & npm package support
Composability via Subworkflows
Arbitrary Python and Typescript execution
Tight integration with Workflows Python SDK
Evaluation
Evaluate prompt outputs
Evaluate multi-step Workflows
Out of box metrics
LLM-based evaluation
Custom metrics via Python or Typescript
Bulk execution with rate limit guards
Reports to compare between draft & deployed versions
Search
Managed document ingestion
Chunking and embedding
Search API
Custom chunking strategies
Native integration with Workflows
Semantic, keyword and rule-based search
Embedding model comparison
Image/table RAG
Deployments
Release management
Execution history
Actuals feedback
Monitoring dashboard
Chatbot frontend
Configurable data retention policies
External monitoring integrations
Security & Support
Dedicated Slack channel
Custom contracts
BAA
SSO
DPA
Prompt engineering, evaluations and architecture support
Managed deployments (Virtual private cloud + on-prem)
Features                             | Free         | Pro            | Business       | Enterprise
Users                                | 1            | 1              | 5              | Custom
Agent builder interactions (credits) | 50           | 200            | 500            | Unlimited credits
Hosted agent apps                    | 3 agents     | ∞              | ∞              | ∞
Knowledge base                       | 20 documents | 1000 documents | 1000 documents | Custom
Parallel agent runs                  | 1 run        | 3 runs         | 10 runs        | Custom
Execution history logs               | 1 GB         | 3 GB           | 10 GB          | Custom
Max agent runtime                    | 3 min        | 30 min         | 30 min         | Custom
Data retention period                | 7 days       | 30 days        | Up to 1 year   | Custom
Debugging console
Add-ons
Custom enterprise add-ons
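
To make the limits above easier to compare, here is a minimal Python sketch that picks the cheapest self-serve plan for a given set of needs. The numbers are copied from the comparison above; the names `Plan`, `PLANS`, `monthly_cost`, and `cheapest_plan` are hypothetical and are not part of any Vellum API. It assumes the Business price is charged per user, as listed, and it treats anything beyond the Business limits as Enterprise (custom).

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Plan:
    name: str
    price_usd: int        # listed monthly price
    per_user: bool        # True when the listed price is per user
    max_users: int
    credits: int          # builder credits per month
    kb_docs: int          # knowledge base documents
    parallel_runs: int


# Values taken from the pricing comparison; ordered cheapest first.
PLANS = [
    Plan("Free", 0, False, 1, 50, 20, 1),
    Plan("Pro", 25, False, 1, 200, 1000, 3),
    Plan("Business", 79, True, 5, 500, 1000, 10),
]


def monthly_cost(plan: Plan, users: int) -> int:
    """Monthly cost in USD for the given number of users."""
    return plan.price_usd * users if plan.per_user else plan.price_usd


def cheapest_plan(users: int, credits: int, kb_docs: int,
                  parallel_runs: int) -> Optional[Plan]:
    """Return the first (cheapest) plan whose limits cover every requirement.

    Returns None when no self-serve plan fits, which in this sketch
    means the Enterprise (custom) tier.
    """
    for plan in PLANS:
        if (users <= plan.max_users
                and credits <= plan.credits
                and kb_docs <= plan.kb_docs
                and parallel_runs <= plan.parallel_runs):
            return plan
    return None
```

For example, `cheapest_plan(1, 100, 500, 2)` returns the Pro plan, and a full five-user Business team works out to `monthly_cost(PLANS[2], 5)`, which is 395.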
Available enterprise features
Unlimited builder credits*
*Up to 1,000 builder interactions per user per month.
RBAC
Restrict what actions users have in the workspace.
Environments
Maintain proper isolation between development, staging and production for all your agents.
Slack support and SLAs
Priority support in Slack and mutually agreed SLAs.
Custom Contracts, BAA, DPA
Contracts with custom terms and red-lining.
VPC Install
Self host Vellum in your virtual private cloud.
Prompt management
Prompt management library to unit test individual prompts before incorporating them into an entire workflow or agent.
Evaluations
Evals suite to test prompts, workflows, & agents before rolling them out to production environments.
Book a Demo

Frequently asked questions

What are agent builder credits?

Agent builder credits represent how many messages you can send while creating or testing agents in Vellum. Each message in the builder counts as 1 interaction, and your plan defines how many you get per month.

What is the knowledge base used for?

The knowledge base is powered by a built-in vector database that your agents can use for retrieval (RAG). This lets your agents access relevant knowledge during runs. Higher tiers let you store and search more documents for richer, more accurate responses.

What are hosted agent apps?

Hosted agent apps are live, shareable links automatically generated for your agents. Each one gives your agent its own hosted interface; no setup or code required. You can share the link with your team.

What are parallel agent runs?

Parallel agent runs determine how many people can use your hosted agent link at the same time. Higher plans allow more users to interact with your agent concurrently without waiting for others to finish.
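
As a rough illustration of a concurrency cap like this one, the Python sketch below uses a semaphore so that only a fixed number of runs are active at once while the rest wait for a free slot. This is an assumption about the general pattern, not a description of how Vellum schedules runs; `run_agent`, `simulate`, and `peak_concurrency` are hypothetical names, and the short sleep stands in for a real agent run.

```python
import asyncio


async def run_agent(user: str, slots: asyncio.Semaphore, stats: dict) -> str:
    # Wait for a free slot; at most `parallel_runs` coroutines pass this point.
    async with slots:
        stats["active"] += 1
        stats["peak"] = max(stats["peak"], stats["active"])
        await asyncio.sleep(0.01)  # stand-in for the actual agent run
        stats["active"] -= 1
    return f"{user}: done"


async def simulate(num_users: int, parallel_runs: int) -> int:
    """Run one agent call per user and return the peak number of concurrent runs."""
    slots = asyncio.Semaphore(parallel_runs)
    stats = {"active": 0, "peak": 0}
    await asyncio.gather(
        *(run_agent(f"user-{i}", slots, stats) for i in range(num_users))
    )
    return stats["peak"]


def peak_concurrency(num_users: int, parallel_runs: int) -> int:
    return asyncio.run(simulate(num_users, parallel_runs))
```

With five simultaneous users and a limit of 3 (the Pro figure), `peak_concurrency(5, 3)` never exceeds 3; the remaining two users wait until a slot frees up.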

What’s included in the Enterprise plan beyond Business?

Enterprise gives you everything in Business plus features for governance and scale: RBAC, isolated environments (dev/staging/prod), Slack support & SLAs, Prompt Management, Evals, and the option for VPC installation and custom legal terms.

What is the debugging console used for?

The debugging console (the Agent Console) lets you debug and test your agents as you build. You can see detailed traces of each run, view inputs and outputs step by step, and replay executions to understand how your agent made decisions.

What’s the data retention period?

The data retention period defines how long your logs and execution data remain viewable in Vellum. When the period ends, older data will be automatically hidden.

What happens when I reach my storage or data retention limit?

When you hit your storage cap, your newest agent runs still work, but older execution data will automatically be hidden once your retention period ends. Upgrading increases both storage size and how long you can access your data.

Can I monitor how my agents perform after deployment?

Yes. You can review execution history, see performance metrics, and use traces in the console to spot and fix issues quickly, even after your agents are live.

Can I keep development and production agents separate?

Yes. With isolated environments, you can safely test agents in development or staging before promoting them to production, keeping your workspace organized and reducing risk. This feature is available only on enterprise plans.

Features
Startup
Enterprise
Workspace Management
Users: up to 5 users (Startup), custom (Enterprise)
Collaborative Editing
Version History
Multi-Player Configuration
Multiple Workspaces
Environments
Role Based Access Control
Prompts
Prompt Editor (Including Image & Function Calling UI)
Comparison Mode
Human Review
Prompt Optimizer
Workflows
Arbitrary Python/Typescript Execution
pip & npm Package Support
HTTP API Requests
Composability via Subworkflows
In-line Guardrails
Custom Nodes
Custom Docker Images
AI Generated Workflow Descriptions
Natural Language to Workflows (Coming Soon)
Evaluations
Custom Metrics via Python/Typescript
Out Of Box Metrics
LLM Based Evaluation
Bulk Execution With Rate-Limit Guards
Compare Draft & Deployed Versions
Synthetic Test Case Generator (Coming Soon)
Search
Managed Document Ingestion, Chunking, and Embedding
Semantic, Keyword, and Rule-Based Search
Native Integration w/ Workflows
Image and Table RAG
Usage Limits: up to 10,000 pages (Startup), custom (Enterprise)
Deployments
Release Management
Execution History
Trace-Span & Graph View
Actuals Feedback
Monitoring Dashboard
Online Evaluations
Chatbot Frontend
Configurable Data Retention Policies
External Monitoring Integrations (Datadog, Webhook)
Release Reviews
Models
Top Proprietary Models
Top Open-Source Models
Custom Open Source Models
Bring Your Own Models
Support
Dedicated Slack Channel
Workflow Architecture Advice
Prompt Engineering And Evaluations Support
SLA
Procurement
Contracts: standard (Startup), custom (Enterprise)
Billing: Stripe (Startup), custom (Enterprise)
BAA
DPA
Security
Single Sign On
Deployment Options: Vellum SaaS (Startup); your VPC, on-prem, or Vellum SaaS (Enterprise)