We believe your AI costs should be transparent and predictable. Vellum passes through model provider costs at cost. We don't charge any margin or markup on token usage. When you spend $1 worth of credits on LLM tokens, that full dollar goes to the model provider. Our goal is to keep Vellum affordable and aligned with your actual usage, not to profit from the AI calls your assistant makes.
Vellum uses a prepaid credit balance: usage is deducted from your credits as you use the assistant. You can add credits anytime from the Billing page via Stripe Checkout. Applicable taxes may be added during checkout.
In the app, your Billing screen shows a Credit Balance plus a breakdown of settled and pending amounts:
The purchase and use of Vellum Credits is governed by Section 6 of our Terms of Service.
Vellum makes available certain features and functionalities within the Services, as designated by Vellum from time to time, that are accessible exclusively through the use of prepaid credits (“Vellum Credits”). These designated features and functionalities are referred to as “Credit-Eligible Features.” Credit-Eligible Features currently include inference, web search, image creation, and paid third-party APIs accessed through Vellum's managed OAuth (for example, Twitter). No alternative direct-payment method is available for Credit-Eligible Features.
When credits are exhausted, the app will show a “You've run out of credits” message with an Add Credits action that links you to Billing. Assistant actions that require paid usage will pause until credits are added. Enabling Auto-Reload is the easiest way to avoid hitting this state.
You may fund your Vellum Credit balance (“Vellum Balance”) by purchasing Vellum Credits in ten-dollar ($10.00 USD) increments, up to one hundred dollars ($100.00 USD) per top-up, through the payment methods made available in the Services, or at such other amounts as determined by Vellum from time to time.
You can add credits from the app's Billing settings:
One Vellum Credit equals one US dollar. So $10 in checkout adds 10 credits to your balance.
If you'd rather not think about manual top-ups, Auto-Reload purchases more credits automatically whenever your balance drops below a threshold you set. Configure it from Settings → Billing.
You set three values:
Auto-Reload requires a saved payment method, which you can add in the Payment Methods section right below the toggle. If you're close to the monthly cap when the threshold trips, Auto-Reload only adds the amount remaining before the cap.
You can disable Auto-Reload anytime; your saved card stays on file so you can re-enable it later without re-entering details.
Four categories of work consume credits today: LLM inference (the biggest line item by far), web search, image generation, and paid third-party APIs you reach through Vellum's managed OAuth (for example, Twitter).
Inference is itself broken into a set of Actions you'll see attributed in your usage dashboard. Here's what each one does:
Conversation with your assistant
Memory subsystem (mostly background)
Conversation polish (background)
Autonomy (background)
Other
A lot of background work is configurable. You can ask your assistant to disable or reduce the frequency of “heartbeats” and “memory compaction,” or to use a less expensive model for these actions. We're actively working on making background spend more visible and easier to control.
Need help with billing? Contact support at support@vellum.ai.