Web Search

How your assistant looks things up online — the two modes (managed vs bring your own), the four built-in providers, the fallback chain, and how it's billed.

Overview

Web search lets your assistant pull live information from the internet to answer questions, verify claims, and ground its responses in current sources. Whenever a conversation, scheduled task, or skill needs information that isn't in your workspace, the assistant can call out to a search provider, read the results, and cite the sources it used.

You control which provider runs that search and how it's billed. You can use the native search capability bundled with your inference provider, or connect your own API key for a dedicated search engine like Perplexity, Brave Search, or Tavily.

Managed vs Bring Your Own

Web search runs in one of two modes.

Mode | What it does | Who pays
Managed | The assistant uses the native search capability built into your selected inference provider. No separate search vendor receives the query. | Billed through Vellum credits as part of inference.
Bring Your Own (BYOK) | The assistant calls a dedicated search provider with the API key you connect. You pick the provider. | Billed directly by the provider under your account.

Most workspaces start in managed mode and stay there. Bring Your Own makes sense when you want a specific search engine's results, when you already have a Perplexity / Brave / Tavily account, or when you want search costs separated from inference costs.

Providers

Four providers ship with every workspace.

Provider | Kind | API key prefix | Notes
Provider Native | Managed | n/a | Hands the search to the inference provider. Quality and freshness follow whichever LLM you have selected.
Perplexity | BYOK | pplx-... | Synthesized answers with inline citations. Default first choice in the fallback chain. Pulls from the Perplexity Search API.
Brave Search | BYOK | BSA... | Independent index, no tracking, supports freshness filtering. Good default for privacy-leaning workspaces.
Tavily | BYOK | tvly-... | Search API designed for AI agents. Returns scored results with extracted content blocks.

For each BYOK provider you connect, your assistant stores the key locally and uses it for every web search until you disconnect or switch providers. Provider privacy policies: Perplexity, Brave, Tavily.
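
As a rough illustration of how the key prefixes in the table can be used, the sketch below guesses which provider a pasted key belongs to. The function and dictionary names are hypothetical and not part of the product. The prefixes are copied from the table, and a matching prefix is only a hint that the key is in the right field, not proof that the key is valid.

```python
from typing import Optional

# Prefixes copied from the providers table above.
# Assumption: a prefix match is a sanity check only, not key validation.
KEY_PREFIXES = {
    "perplexity": "pplx-",
    "brave": "BSA",
    "tavily": "tvly-",
}


def guess_provider(api_key: str) -> Optional[str]:
    """Return the provider whose documented prefix matches the key, else None."""
    key = api_key.strip()  # tolerate whitespace picked up when pasting
    for provider, prefix in KEY_PREFIXES.items():
        if key.startswith(prefix):
            return provider
    return None
```

A check like this can catch a key pasted into the wrong provider's field before any search is attempted.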

Choosing a provider

The table below compares the four providers across the dimensions that usually matter.

Dimension | Provider Native | Perplexity | Brave | Tavily
Setup | None. Works out of the box. | API key required | API key required | API key required
Billing | Bundled with inference (Vellum credits) | Direct to Perplexity | Direct to Brave | Direct to Tavily
Result style | Depends on the LLM you selected | Synthesized answer with inline citations | Raw ranked results (title, URL, snippet) | Scored results with extracted content blocks
Index source | Whatever the LLM provider uses | Aggregates multiple search engines | Independent crawl, not Google or Bing | Aggregates multiple search engines
Freshness control | Not exposed | Not exposed | Supported (day / week / month / year) | Supported
Privacy | Subject to LLM provider terms | Standard SaaS | No query tracking | Standard SaaS
Agent ergonomics | Hidden behind the LLM | Pre-synthesized, agent can quote directly | General-purpose search shape | Built for agents (scores, content blocks, raw content)
Best for | Default setups, low overhead | Cited answers in chat | Privacy-leaning workspaces | Agentic workflows with extraction

If you're unsure, start on Provider Native and switch only when you hit a concrete reason to.

Configuring web search

Web search is configured in Settings → AI → Web Search.

  1. Choose a mode: Managed or Bring Your Own.
  2. In Managed mode, no further setup is required. Searches use the native capability of your selected inference provider.
  3. In Bring Your Own mode, choose a provider (Perplexity, Brave, or Tavily) and paste your API key into the field next to it. Keys are stored locally on the device that runs your assistant.
  4. Optionally connect keys for more than one BYOK provider. The extras become fallbacks if the primary provider fails or runs out of quota. See Fallback behavior below.

Changing the provider takes effect immediately for the next search; in-flight searches finish on the previously configured provider.

Fallback behavior

In Bring Your Own mode, your assistant tries the provider you selected first. If that provider has no key configured or its request fails with a retryable error, the daemon walks the fallback chain in this order:

  1. Perplexity
  2. Brave Search
  3. Tavily

The chain skips any provider that doesn't have a key connected. If none of them have keys, the search returns an error and the assistant tells you it couldn't reach the web. Connecting more than one BYOK key is the simplest way to keep search resilient when an upstream provider has an outage.
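
The fallback rules above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the assistant's actual implementation: the names `RetryableSearchError`, `candidate_providers`, and `search_with_fallback` are hypothetical, each provider client is modeled as a plain function, and the sketch assumes that a provider already tried as the primary is not retried and that a failing fallback hands over to the next provider in the chain.

```python
from typing import Callable, Dict, List

# Fallback order as documented above.
FALLBACK_ORDER = ["perplexity", "brave", "tavily"]


class RetryableSearchError(Exception):
    """Hypothetical marker for failures worth retrying on another provider."""


# A provider client is modeled as: (api_key, query) -> list of result strings.
SearchFn = Callable[[str, str], List[str]]


def candidate_providers(selected: str, keys: Dict[str, str]) -> List[str]:
    """Selected provider first, then the chain, skipping providers without a key."""
    # Assumption: the selected provider is not tried a second time in the chain.
    ordered = [selected] + [p for p in FALLBACK_ORDER if p != selected]
    return [p for p in ordered if keys.get(p)]


def search_with_fallback(
    query: str,
    selected: str,
    keys: Dict[str, str],
    clients: Dict[str, SearchFn],
) -> List[str]:
    """Try each candidate in order; move on only after a retryable failure."""
    last_error = None
    for provider in candidate_providers(selected, keys):
        try:
            return clients[provider](keys[provider], query)
        except RetryableSearchError as exc:
            last_error = exc  # remember the failure and try the next provider
    # No keys connected, or every connected provider failed.
    raise RuntimeError("web search unavailable") from last_error
```

Errors that are not marked retryable propagate immediately in this sketch, which matches the condition that only retryable failures trigger the chain.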

Billing

Web search is one of four chargeable categories in the Vellum platform, alongside LLM inference, image generation, and paid third-party APIs.

  • In Managed mode, search cost is bundled into the inference call that triggered it. You pay Vellum credits the same way you pay for any other LLM request.
  • In Bring Your Own mode, the search provider bills you directly under your account with them. Vellum doesn't mark up or proxy these requests.
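
The two billing rules can be summarized as a small lookup. This is a sketch only: the function name `billed_by` and the mode strings `"managed"` and `"byok"` are hypothetical labels chosen for illustration, not identifiers from the product.

```python
from typing import Optional


def billed_by(mode: str, provider: Optional[str] = None) -> str:
    """Describe who charges for a search, following the two rules above."""
    if mode == "managed":
        # Search cost is bundled into the inference call that triggered it.
        return "Vellum credits (bundled with inference)"
    if mode == "byok":
        if not provider:
            raise ValueError("Bring Your Own mode needs a provider name")
        # The search provider bills the account that owns the API key.
        return f"{provider} (billed directly under your account)"
    raise ValueError(f"unknown mode: {mode!r}")
```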

For pricing details and credit denominations, see the pricing page.
