Docs / Releases

July 2026

Get started
Looking for what's coming next?See the roadmap. Shipping soon, up next, and what we're exploring.

v0.10.4

A feature release headlined by a memory system overhaul and rebuilt Slack rendering, plus plugins are now generally available.

  • Self-improving skills. The assistant now writes its own skills based on memory, edits them, and knows what to do better next time. The retrospective writes and edits its own procedural notes, and recall is sharper for named people and things. The memory worker can be started, stopped, and checked from config.
  • Richer Slack UI. Assistants now use native Slack images and tables instead of just sending plain text. Messages render through a real markdown parser, so images, tables, column alignment, and headings all work, and reactions show as emoji.
  • Heartbeat on by default. A heartbeat is a moment when your assistant quietly checks in on its own, without you asking, to catch reminders, follow-ups, and anything time-sensitive. It's now enabled out of the box, with up to 10 runs a day. The Schedules tab gets shareable links to individual schedules.
  • Improvements to default models. The managed Balanced profile moves to GLM 5.2 with medium reasoning effort. The Frontier profile is retired in favor of Quality. Together AI profiles now show topP and effort controls.
  • Chat improvements. Fixed idle scroll flicker, and thinking and tool drawers now stay open after a turn finishes. Thinking previews now stream live as they generate.
  • New plugins. You can install plugins directly from a GitHub URL, and the Plugins tab now matches the Skills tab. New in the plugin marketplace: Fitness Companion, Reading Pal, and Writing Coach.

v0.10.3

A feature release headlined by the new onboarding flow in the web app, plus MCP server management, plugin publishing, and redesigned subagent and workflow UIs.

  • Guided onboarding in the web app. New users get a guided flow after signup that suggests plugins based on how they plan to use Vellum, connects Google Calendar, and schedules a Day 2 check-in.
  • MCP server management. MCP settings have moved under Integrations. You can now view connected MCP servers (including which tools are registered on each and the token contribution) and remove servers from the UI. Adding OAuth-based servers (with automatic auth flow) and revoking connected OAuth accounts from the web UI are still in development.
  • Plugin publishing. A new assistant plugins publish CLI command lets authors publish to the marketplace directly. The plugins list command gains an -all flag, and installed plugins now activate without restarting the daemon.
  • Advisor streams live. The advisor now streams its guidance as it generates, with search and reasoning activity visible in real time alongside the final advice. It receives full conversation context and runs as a blocking step.
  • Subagent UI redesign. The subagent detail panel is redesigned with a phase-grouped timeline and a new Active Subagents floating overlay showing running subagents at a glance.
  • Computer-use and macOS polish. Selection and cursor key actions are no longer incorrectly flagged as having no visible effect. The macOS app nudge banner now appears 24 hours after first seen instead of immediately. The mac-helper executable gets proper per-environment naming so it displays correctly in Privacy & Security settings.
  • Model profile updates. The managed Speed profile moves to DeepSeek V4 Flash via Fireworks. The managed Balanced profile's reasoning effort is lowered to medium. topP and effort controls are now visible for Together AI profiles.
  • Quote reply is generally available. The quote-reply flow, previously behind a flag, is now on for everyone, you can reply inline to a specific message and the original is quoted as context, no opt-in needed.
  • Model profile picker on schedules. Schedules now expose the model-profile selector, so a scheduled run can be pinned to a specific profile (cost-optimized, balanced, etc.) instead of inheriting the active one at fire time.
  • Channels and gateway. Channel trust floors move inline into Contacts → Channels and are enabled by default. Consent version comparison is now monotonic, stopping cross-client re-consent loops. Approval requests now notify the requester and release interaction on expiry.
  • Workflows UI. The workflow detail panel is redesigned with an active workflows floating overlay pill. Directory-style saved workflows now resolve correctly.

v0.10.2

A maintenance release that fixes a startup migration issue.

  • Database migration no longer gets stuck on startup. Two startup migrations could crash and leave the daemon stuck replaying a failing migration on every launch if the database had already been partially updated by a later migration. Both are now idempotent and safe to re-run regardless of schema state.

v0.10.1

A feature release that refreshes the model profile lineup, adds image support for text-only models and many stability improvements and bug fixes.

  • Quality profile now uses GLM-5.2, plus a new Frontier profile. The Quality profile now points to GLM-5.2. A new Frontier profile has been added for users who want Claude Opus, available as the highest-tier model option.
  • Balanced profile now served via Together AI. The Balanced model profile, which runs Minimax M3, has moved from Fireworks to Together AI as its hosting provider. The model itself is unchanged, only the infrastructure behind it.
  • Better image handling for text-only models. When a tool returns an image, like a screenshot, the assistant now describes it in words so models that can't see images still understand what's there.
  • Fixed errors with newer models. Some of the latest models were rejecting settings that older ones accepted, which caused parts of the assistant to fail. The assistant now sends each model only the settings it supports.
  • Improved dictation UI for Mac. The dictation interface in the Mac app has been redesigned for a cleaner experience. This is an interim improvement, with further polish planned.
  • Faster, smoother startup. The assistant could freeze for a few minutes on startup while it initially booted. It now stays responsive the whole time.
  • Upgrade reminders for hosted installs. Hosted setups now show a banner when an update is available, without nagging or getting in the way.

v0.10.0

A wide release: plugins become a first-class marketplace, your assistant can now pull in a second, more powerful model for hard problems, memory moves to a sharper section-grain v3, and subagents, workflows, Slack, and the Activity page all get upgrades.

  • Plugins are now a first-class marketplace. Browse, install, and update plugins from inside the app, alongside the ones the vellum team ships. Plugins are TypeScript packages with lifecycle hooks (distinct from skills, which are instruction bundles), so they can reshape how the assistant behaves, not just what it knows. Read more here.
  • Privacy policy and consent updates. We have updated the privacy policy and the Share Analytics / Share Diagnostics control. Share Diagnostics now covers crash reports, conversation traces, and session replay data. Because the consent terms changed, you will be asked to review and reconfirm your privacy-related preferences. Read the updates here.
  • Workflows - When your assistant needs to apply the same operation across several things (researching products from different brands, comparing multiple options, summarizing a batch of docs), you can watch each one run in parallel: a progress card in your chat, plus a side panel showing each item start and finish.
  • Your assistant can consult a more powerful model. The new Advisor plugin lets the main agent step back on a complex task and consult a second agent running a stronger model before committing to an approach. It comes installed and enabled by default on new assistants. Self-hosted and BYOK workspaces can point the advisor at one of their own profiles.
  • Memory v3. Memory moves to a section-grain model: instead of one flat page per topic, your assistant retrieves a compact card plus the specific section that matches, so recall is sharper and less bloated. New assistants get v3 from creation. If you are on an existing assistant, ask it to run the "Memory v3 Migration" skill to reshape your current memory from v2 into the new format. It reads v2 pages either way, but retrieval is noticeably better once migrated.
  • Firecrawl joins both web search and web fetch. Your assistant can now use Firecrawl for either searching the web or fetching a specific page, alongside the existing options (Perplexity, Brave, Tavily). Firecrawl returns clean, relevance-ranked results and has a freshness filter.
  • Smarter, tidier subagents. The subagent detail panel is faster and cleaner: the timeline is virtualized, the panel no longer re-renders on every streamed token, and side-panel open is snappier. Spawned subagents now inherit the inference profile of the agent that launched them, so parallel work runs on the right model without extra setup.
  • Balanced profile now runs on MiniMax M3. The managed Balanced profile is repointed to MiniMax M3 on Fireworks (the former Balanced Economy configuration), keeping the same Balanced label while improving the cost-to-quality tradeoff. The separate Balanced Economy profile has been deprecated.
  • Slack improvements. Direct-message replies no longer disappear. Approval cards and reactions clear across every surface when resolved. Access-request verification codes are delivered straight to the requester.
  • A cleaner Activity page. The page formerly known as Home is now Activity, decluttered and polished: Notifications and Schedules tabs, a slide-in detail drawer that respects reduced-motion, persistent accordion state, and system schedules (Heartbeat, Memory Consolidation, Memory Retrospective) restored to the view.
  • OS-Beta profile for open-weight model testing. A new model profile, OS-Beta, lets you opt into early-access open-weight models. It currently points to GLM 5.2 via Fireworks and is gated behind a feature flag.
  • Chrome extension for self-hosted assistants. The extension now works again with self-hosted setups: browser-use is functional, and you can edit the gateway URL or see a clear error when the connection fails.
  • Global status banner. A new banner surfaces your assistant's lifecycle state at the top of the app: upgrading, offline, waking up, and any other connectivity events. You'll always know at a glance whether your assistant is actually running.

v0.9.0

A new Mac app replaces the legacy Swift build, with feature parity across web and iOS plus stability and performance wins.

  • A new Mac app. The Electron-based desktop app replaces the legacy Swift build and keeps feature parity with the web UI and iOS. It is more stable (less prone to app crashes) and snappier across the app.
  • Login security improvements. All clients got a security pass: more secure login and token management across the board.

v0.8.12

Overall polish everyday feel of the assistant: in-progress steps inside chats show a live timer and the assistant's thinking hovers to reveal how long it reasoned; you can search inside a long chat to jump to a specific word or moment; messages stream more smoothly and stay ordered even when the connection stutters; and a new Balanced economy profile gives you a middle-ground option that keeps costs down without giving up much quality.

  • Sign in through your browser. Logging in now hands you off to your normal browser (Safari, Chrome, whatever you use) and brings you back. Small change, but your password manager, saved logins, and two-factor codes all work the way they normally do.
  • See how long things take. In-progress steps show a live timer so you know it's working. Hover over its "thinking" to see how long it spent reasoning. Less guessing about whether it's stuck.
  • Find within a conversation. Search inside a long chat to jump to a specific word or moment.
  • Smoother streaming. Messages hold together more reliably even if your connection stutters. Fewer replies arriving out of order, duplicating, or vanishing.
  • A new budget-friendly profile for keeping costs down without giving up much quality. A middle ground between cheapest and best. We call it Balanced economy.

v0.8.11

Web searches inside chats show as clean, clickable cards you can verify, adds MiniMax M3 as a new model option, and refreshes the skill lineup with a new geo-audit skill, a rebuilt app builder, and document writing that consistently uses the proper editor. Plus, opening the Mac desktop app now installs the Vellum CLI into your shell PATH automatically.

  • MiniMax M3 added as an option.
  • Web searches are easier to follow. When the assistant looks something up, results appear as clean, clickable cards in the conversation. See what it searched and click straight to the sources.
  • New geo-audit skill for location and SEO analysis, now in the catalog.
  • App builder rewrite. The skill that builds dashboards, simple tools, and pages was rebuilt to be more reliable, so what you ask for comes out working more consistently.
  • More reliable document writing. Ask for something substantial and it now reliably uses the proper editor, giving cleaner, better-formatted results.
  • The Mac desktop app now installs the Vellum CLI into your shell PATH automatically, so running vellum from any Terminal window works without a separate install step. The wrapper points at a locator file that the app refreshes on every launch, keeping vellum messagevellum upgrade, and shell completions available and pointing at the live CLI binary.

v0.8.9

Pro subscribers bundle credits directly into their plan, makes the daily heartbeat calmer by default, gives scheduling a friendlier builder with plain-language descriptions and a tidier list that hides past one-time tasks and system jobs, and lets you download all your invoices, including Pro and ad-hoc credit purchases, as a single zip.

  • Pop conversations into their own windows. Open any conversation in its own window so you can keep one chat visible while working in another. There's also a new "Quick Input" window: a small box to dash off a message without opening the full app or losing your place.
  • Keyboard shortcuts. Set your own system-wide hotkeys that work no matter what app you're in. Press it and the assistant pops up, or dictation starts, instantly.
  • Native notifications. Alerts now arrive as proper macOS notifications with buttons you can click to act, without opening the app first.
  • A cleaner icon and lots of polish. A sharper app icon plus many small visual fixes so it looks at home on macOS.
  • Easier scheduling. A friendlier builder, with a plain-language description of when it'll actually run ("every weekday at 9am") so you're not second-guessing the timing.
    • Tidier schedules page. Old one-time tasks are collapsed and system jobs are tucked away, so you see the schedules you care about.
  • Calmer by default. The heartbeat now runs at most twice a day out of the box, and you can pause or turn it off whenever. Less noise, more control.
  • You can now bundle credits as part of your pro plan: Clearer credit purchasing. The Pro credit screen is cleaner and shows your base monthly fee up front. No surprises.
  • Download all invoices at once in a single zip instead of clicking through one at a time. Includes invoices for pro plan and ad hoc credit purchases.

v0.8.8

  • Cross-machine device pairing: vellum pair mints a device-scoped, revocable token on the assistant host and prints a pairing bundle, vellum connect import sets up a persistent paired assistant on another machine, vellum client --token opens an ephemeral session, and vellum devices and vellum unpair list, revoke, and forget pairings.
  • Richer chat surfaces: new work-result, choice, and copy surfaces, plus managed-OAuth connect and approval surfaces so the assistant can hand you structured results and connection prompts inline.
  • Pro plan credit bundles: pick a credit bundle during upgrade and change flows, with the current bundle shown on the billing plan card.
  • In-chat find: Edit menu now has Find (Cmd+F) wired to in-chat search on both macOS and web.
  • Streaming made visible: thinking blocks render live in the chat transcript, a spinning ring wraps your custom avatar while the assistant is working, and the Dock and menu-bar icons reflect the assistant avatar with a live status indicator.
  • Skill browsing: skills carry category metadata and SVG icons in their SKILL.md frontmatter, rendered in the web app with dynamic categories.
  • Library app polish: apps open fullscreen on web and mobile, with an Edit button in the Library app viewer.
  • Cloudflare tunnel provider: new tunnel option for exposing a local assistant.
  • Model profile polish: advanced params shown only after a model is chosen, the un-pinned params now read "Default" instead of "Inherit," and advanced-param spacing tightened up.
  • Web search sources: clamped sources expand behind a "+N more" pill.
  • Connection reliability: SSE reconnects are self-diagnosing with single-cursor resumable replay, and truncated thinking blocks heal on reconnect.

v0.8.7

Highlights

  • Improved reliability of web release builds, ensuring shared packages are correctly installed before the build process runs
  • Fixed a configuration issue that caused web release builds to not properly enable platform mode, resolving potential runtime errors in the web application

Build: 0.8.7 Commit: cfe9507d34 Built at: 2026-06-03 01:43:06 UTC

v0.8.6

A model and reliability update: Claude Opus 4.8 joins the hosted catalog, Gemini 3.5 Flash is now available to cloud users after a missing platform rate card was blocking requests, the sleep policy gains 1-hour and 3-hour idle presets, per-assistant sleep thresholds now survive wake cycles, inbound email is fixed on Resend-registered domains, disk-usage alerts fire an owner email, and API key rotation gets a propagation grace period.

Highlights

  • Claude Opus 4.8 now available: Anthropic's latest Opus model is live in the hosted catalog and selectable across all surfaces
  • Gemini 3.5 Flash on the hosted service: the model was already selectable in the local UI but a missing platform-side rate card returned a 400 for every cloud request; the rate card and billing entry are now wired in so cloud users can select it without hitting a rejection
  • Sleep policy 1-hour and 3-hour presets: two shorter idle-sleep durations join the sleep policy control in Settings alongside the existing 1-day, 3-day, and 7-day options so assistants can be tuned to sleep faster without using the full daily minimum
  • Idle-sleep thresholds persist across wake cycles: per-assistant sleep cutoff settings were being reset to the system default after each wake event; the configured threshold now survives correctly
  • Disk-usage email notifications: when workspace storage crosses the alert threshold the assistant owner now receives an email with remediation links to upgrade storage or ask the assistant to clean up, alongside the existing in-app error notification
  • API key rotation grace period: a short overlap window during key rotation keeps the old key valid while the new key propagates, eliminating the authentication gap that could cause brief request failures on self-hosted instances
  • Email inbound receiving fixed on Resend domains: newly registered Resend domains had receiving silently disabled because the create-domain API call was missing the nested capabilities object; inbound replies were being dropped with no error surfaced

v0.8.5

A billing and reliability release: MiniMax is now an OpenAI-compatible provider, self-hosted assistants route runtime calls directly to their own ingress, billing and resize flows get confirmation steps and clearer copy throughout, email and domain settings gain a live verification chip and confirmation modals, the CLI gains tab management and roadmap CRUD, and the avatar editor learns to generate from a prompt or reset to default.

Highlights

  • MiniMax is now available as an OpenAI-compatible provider: configure MiniMax models through the OpenAI-compatible provider flow with primary and fallback endpoint validation, with macOS thinking UI and conversation title generation updated to recognize MiniMax's <think> reasoning tags so they render in the collapsible thinking surface and never bleed into auto-generated titles
  • Self-hosted assistants route runtime calls directly to their own ingress: when you connect a self-hosted assistant from the web app, message and event calls go straight to your assistant with the platform actor token attached, instead of being proxied through the cloud platform
  • Billing and resize flows are clearer and safer: tier upgrades go through a read-only confirmation step before submitting to Stripe, plan resize triggers a confirmation modal after the Stripe portal step, adjust-plan failures surface inline instead of failing silently, downgrades show resource changes in read-only resource cards with an inline downgrade path, and the Compute & Resources card uses a unified tile layout with contextual resize CTAs
  • Email and domain settings are easier to work with: domain settings show a live verification status chip, domain registration and deletion go through confirmation modals, the managed email upsell auto-opens the upgrade modal, the domain status CLI is wired to the live verification endpoint, and the assistant handle auto-locks when it matches the subdomain
  • Disk pressure and credits banners surface action: disk pressure banners are redesigned with clearer state, and the credits exhausted banner now links directly to auto top-ups
  • Multi-tab management from the CLI: list, open, switch, and close conversation tabs with new assistant tabs subcommands, with each connected client tracking its own tab scope independently
  • Roadmap from the CLI: the assistant roadmap command now supports create, search, and upvote alongside listing, so you can manage roadmap items end-to-end without leaving the terminal
  • Avatar editor gains Generate with AI and Reset to default options, so you can spin up an avatar from a prompt or revert to the original without leaving the editor
  • Composer emoji aliases: :huff and :steam now resolve to 😤 alongside the existing alias set
  • Chat reliability fixes: the thinking indicator no longer survives stale terminal events because send state is now scoped to the active turn, and macOS conversation list pagination keeps the scheduled section from being truncated on large histories
  • Share Feedback dialog polish: cleaner layout, design library components throughout, and contextual tips, so reporting an issue feels less like filling out a form
  • LaunchDarkly references removed from the repo, with the assistant fully moved to push-driven feature flag invalidation via SSE so flag changes take effect immediately instead of waiting on a polling loop

v0.8.4

An identity and model-catalog release: every assistant gets a unique handle that doubles as its subdomain, the document editor gains agent-driven comments and find-and-replace, the model picker adds Gemini 3.5 Flash, Gemini 3.1 Flash-Lite GA, Grok 4.3, and Owl Alpha, OpenAI-compatible providers can finally point at any endpoint on macOS, and a new /clean slash command and a dock-icon fix tighten the day-to-day feel.

Highlights

  • Unique assistant handles and matching subdomains: every assistant now has a unique handle that doubles as its subdomain by default, the handle is editable inline from About Assistant, the subdomain locks to match it, and existing assistants were backfilled so handles are non-nullable across the system
  • Document editor agent tools: the assistant can now leave inline comments on documents and use grep and sed-style find-and-replace tools, so iterative edits run as tool calls instead of full rewrites
  • Custom email address builder in Pro onboarding: the Pro onboarding domain step now includes an inline email-address builder so you can configure a custom email domain end to end during signup, and the new assistant domain register --email-username flag does the same from the CLI
  • Expanded model catalog: Gemini 3.5 Flash and Gemini 3.1 Flash-Lite GA via Google, Grok 4.3 and Owl Alpha via OpenRouter, and Fireworks now respects per-model reasoning_effort ceilings so deep-thinking models stop getting clipped by a global cap
  • OpenAI-compatible providers on macOS: the provider connection editor now exposes Base URL and Models fields so you can point macOS at any OpenAI-compatible endpoint (vLLM, Ollama, LM Studio, llama.cpp) without leaving Settings
  • New /clean slash command: strips runtime injections from the conversation without summarizing the rest of the turn, available in the web chat composer slash picker and on macOS
  • Inline file attachments in assistant message bubbles on macOS: file chips now render between paragraphs in assistant replies instead of all batching at the bottom of the message
  • macOS performance and reliability: dock icon image processing moves off the main thread to eliminate 2s+ AppHang events, self-signed certificate creation no longer duplicates on every build, and the home feed preserves read-row text contrast on dark backgrounds

v0.8.3

A home feed and notifications intelligence release: the macOS home page now separates urgent from routine activity, the CLI gains full schedule management, memory routing is on by default, Fireworks joins the managed provider roster, and the web platform recognizes self-hosted assistants.

Highlights

  • Home feed redesigned with urgency routing: urgent items go to Inbox, routine items to Activity, every row shows timestamps and unread weight, read rows fade back, and the Activity section collapses to keep your Inbox focused
  • Notification urgency is now first-class: urgent signals get OS banners and Inbox placement, routine signals route to Activity silently, and assistant notifications send --urgent lets your own scripts control urgency
  • Full schedule management from the CLI: create, cancel, enable, disable, and delete scheduled tasks with new assistant schedules subcommands, complete CRUD without opening the app
  • Memory v2 router is on by default: the Sonnet-based memory router now activates for managed Anthropic installs with a balanced profile, no feature flag needed, retrospective timestamps reflect your local timezone, and transcripts use participant display names
  • Fireworks is now a managed connection: use any Fireworks model without supplying your own API key, the same way managed Anthropic, OpenAI, and Gemini connections work
  • Self-hosted assistants are now recognized in the web app: if you are running the open-source daemon locally, the platform shows a dedicated screen with a link to settings instead of attempting to start a cloud instance

v0.8.2

A cross-platform polish release: subagents get a cleaner design and iOS gets a critical ChatComposer reliability fix.

Highlights

  • Subagent UI redesigned to match macOS: cleaner layout, full-screen detail panel on mobile, and consistent design language across platforms
  • iOS ChatComposer reliability fix: resolved an infinite update loop triggered by WKWebView re-dispatched input events
  • Usage page redesigned with clearer information hierarchy, descending-sorted stacked-bar tooltips, and improved aesthetics

v0.8.1

A focused release on a redesigned model profiles and provider connections system, Twilio phone calling for platform-hosted assistants, one-click Publish to Vercel from the app builder, iOS push notifications, a smarter memory pass, and a tightened iOS login flow.

Highlights

  • Redesigned model profiles and provider connections: mix managed and bring-your-own connections per profile, edit labels and status inline, save tweaks as new variants, and switch profiles mid-conversation with the new /model slash command
  • Twilio phone calling for platform-hosted assistants: your assistant can now answer and place real phone calls when running on the Vellum platform, so you can reach it from anywhere without opening an app
  • Publish to Vercel from the app builder: when your assistant builds you an app, it can now deploy it to Vercel for you in one step, with complex apps handed off to the assistant for end-to-end deployment and live URLs surfaced in the deploy toast
  • iOS push notifications are live: with the latest staging iOS app, your assistant can send native APNs notifications to your phone for activity completions, scheduled wakeups, and turn-end events you'd otherwise miss on mobile
  • Memory v2.5 under the hood: a background retrospective pass reflects on recent conversations, a Sonnet-based router replaces dynamic context injection for memory recall, and per-conversation memory state forks cleanly with the conversation. No visible UX changes, just memory that feels right more often
  • iOS login flow polish: a single Sign In button on the Capacitor login screen, native OAuth completion now lands through a deep-link router instead of falling back to popups, conversational OAuth opens in SFSafariViewController, invalid_grant errors prompt a reconnect instead of silently failing, and the onboarding name is pre-filled from your authenticated profile
  • Tavily joins the web search provider list, giving you another option alongside the existing providers
  • Quality-of-life polish: a Discord community card and chat banner make it easier to find the Vellum community, assistant changelog brings release notes into the CLI, NOW.md injection is now configurable, the macOS provider editor gained a Save as New button, Google OAuth setup is now single-step with inline credential collection, and gateway logs are pretty-printed with a JSONL sidecar for log-tail

v0.8.0

A major intelligence and cross-platform release: Memory v2 is now live, the web app can reach back to your Mac for local actions, managed integrations expanded, and the iOS experience is smoother across login, settings, attachments, and notifications.

Highlights

  • Memory v2 is live: your assistant now builds richer concept pages instead of isolated fact snippets, retrieves the right context more reliably, consolidates what it learns in the background, and gives you cleaner memory surfaces for browsing and searching what it knows
  • Use your Mac from the web app: when your desktop assistant is connected, web conversations can route local actions back through your Mac for workspace files, terminal commands, browser control, and other host-side work without forcing you to switch clients
  • Live integrations now include Discord, GitHub, Google, HubSpot, Linear, Notion, Twitter, Asana, Outlook / Microsoft, and Todoist, with smoother native OAuth handoff on iOS and desktop
  • Model settings now include Fireworks and OpenRouter in the provider catalog, making it easier to configure assistants that use your preferred model provider
  • iOS got a serious polish pass: Sign in with Apple, native OAuth completion, APNs push notification groundwork, better mobile settings layouts, improved attachment preview and download behavior, file downloads through the native share sheet, and cleaner login screens
  • Everyday reliability improved across notifications, trust-rule chips, document persistence, skill loading, gateway routing, OAuth setup, memory maintenance, and macOS signing and update flows

v0.7.3

Highlights

  • Improved memory retrieval routing for more reliable and efficient background memory lookups
  • Under-the-hood stability improvements to how the assistant accesses and recalls stored information
  • General reliability fixes to the memory system to reduce retrieval inconsistencies

v0.7.2

A broad reliability release focused on the Chrome extension and cloud connection path, macOS workflow polish, contact and communication flows, heartbeat and schedule reliability, and tighter gateway security.

Highlights

  • Chrome extension and cloud reliability improved across the board: requests now send fresh session tokens consistently, platform API calls use the right headers and base URLs, self-hosted pairing is fixed, and the extension is prioritized over the macOS SSE bridge for browser control
  • Gateway and security hardening tightened local and cloud boundaries, including stricter loopback handling, known-origin checks for pairing and CORS, safer token refresh behavior, secret redaction before recall evidence reaches the model, and reduced elevated capabilities in Docker-mode assistant containers
  • Contact, messaging, and document workflows got smoother with contact prompt commands and macOS panels, gateway-owned Slack contact upserts, missed @mention catch-up after socket reconnects, stronger trusted contact checks, and PDF export for Document Writer
  • Conversation workflows are more resilient with reliable notification deep links, Mark as read in the conversation context menu, a cleaner conversation-switch loading skeleton, visible trust-rule save errors, and the new burst-based Worked for X.Ys activity model
  • Open-source visibility improved in the macOS app with a new Settings card for the public GitHub repo and a View on GitHub link in the About panel
  • Scheduling, heartbeat, and operations are more robust: scheduled tasks can retry after failures, heartbeat runs use cron-style timing with missed-run detection, SSE disconnects are detected faster with a heartbeat watchdog, and assistant gateway logs tail makes gateway debugging easier

v0.7.1

A focused polish release with stronger Language Model controls, more reliable trust rule editing, Chrome extension fixes, macOS quality-of-life improvements, and credential infrastructure upgrades for smoother local and cloud assistant setups.

Highlights

  • Language Model controls got more precise: profile-specific context budgets, model-aware max token sliders, refreshed context metadata, and effective context handling across the main agent loop, wake paths, and slash commands make model profiles behave more predictably
  • Your Own mode is more reliable: explicit user saves are now respected, main agent profiles can override static defaults, profile context and slider fallbacks are preserved, and actual provider metadata is stamped correctly when routing picks a non-default provider
  • Trust rules and approvals are easier to understand: rule editor copy now shows natural language instead of structured tool data for non-bash tools, trust badge taps open the right edit flow, and approval provenance is tracked with mode, reason, and risk threshold fields
  • Chrome extension reliability improved with fixes for status probing, activity isolation, privileged tabs, popup icon paths, self-hosted UX, and hostname handling across local and cloud modes
  • macOS polish and workflow fixes: turn-end notifications now fire when the app is unfocused, task progress widgets default inline with a pop-out option, the profile editor is cleaner, toolbar state is better isolated from conversation list updates, and account deletion requests can be started from the app
  • Credential and CLI infrastructure moved forward with credential account key normalization, a direct CES credential management CLI, API key migration work, and a new vellum upgrade --latest flag for pulling the newest available version

v0.7.0

A redesigned trust and permissions system with a new v3 rules engine, Gemini 3 model support, GPT-5.5 as the new default OpenAI model, and a wide range of new CLI commands and a reworked browser extension experience.

Highlights

  • Redesigned trust and permissions system: a new v3 Trust Rules engine replaces the older permissions model, with cleaner presets (Conservative, Relaxed, Autonomous), suggested rules with an Allow & Create Rule button, directory-scoped rules, and a fully redesigned Trust Rules management UI
  • Gemini 3 model support and catalog improvements: Gemini 3 models are now available with correct pricing, thought signature capture, and tool-call metadata, the default OpenAI model is now GPT-5.5, and OpenAI reasoning effort can now be explicitly disabled or set to an extra-high tier
  • Expanded CLI: new commands for inspecting installed skills, managing trust rules, registering and listing webhooks, setting and getting environment variables, verbose exec, and SSH and exec support for managed instances
  • Reworked browser extension: a proper onboarding flow, SSE-based event transport, and direct gateway pairing replacing the legacy native messaging host, for a smoother and more reliable connection between your assistant and Chrome

v0.6.5

The Vellum browser extension is now one-click install from the Chrome Web Store, X/Twitter is generally available as a managed integration, and Slack setup is one click with expanded permissions so your assistant can triage your messages, not just post. Plus a new Voice settings tab on the web platform, a conversation Refresh action, and a wide range of macOS stability and polish fixes.

Highlights

  • Vellum browser extension on the Chrome Web Store: install the Vellum Assistant Browser Relay from the Chrome Web Store in one click, no more loading an unpacked extension in developer mode. The extension bridges your assistant to your live Chrome tabs for reading, clicking, filling, and extracting on any site you are already signed into, and now also supports cloud sign-in directly from the extension
  • X/Twitter integration is generally available: connect X to your assistant through the managed integration without toggling a flag, with expanded scopes so your assistant can read likes and bookmarks in addition to posting and browsing
  • Slack is more capable and easier to set up: creating a Slack app is now a single step with the redirect URL baked into the manifest, and the install flow has been simplified so broken links during app creation are gone. Permissions were also expanded so your assistant can now read your Slack messages for triage and summarization in addition to posting, and the bot automatically joins public channels instead of requiring an invite
  • Voice settings on the web platform: the web platform now has a dedicated Voice settings tab matching the macOS app, with push-to-talk presets, and the Test button in voice settings is always enabled so you can preview voices at any time
  • Conversation Refresh action: a new Refresh action in the conversation menu lets you reload a conversation when its state gets out of sync with the server
  • macOS stability and polish: dedicated network session for background streaming to fix stalled responses, cold-launch avatar caching, fixed scroll white space and push-to-top jitter during streaming, sidebar skeleton that waits for restoration instead of a timer, smoother home panel transitions, and a wide range of reliability improvements across memory compaction, onboarding, credentials, and OAuth

v0.6.4

Claude Opus 4.7 support, a migration to OpenAI's newer Responses API, major macOS performance fixes, a smoother chat scroll experience, configurable log retention, improved Gmail cleanup, and broad stability polish.

Highlights

  • Claude Opus 4.7 support: the newest Anthropic model is now available across the app and is automatically used when your assistant needs its strongest quality-oriented reasoning
  • OpenAI provider moved to the Responses API: upgraded the underlying connection to OpenAI models for better streaming, tool calls, and compatibility with newer features
  • Major macOS performance improvements: resolved multiple two-second-plus app hangs caused by layout, font, avatar, sound, and menu-bar initialization, plus a new inverted scroll architecture for noticeably smoother chat scrolling
  • Configurable LLM log retention: choose how long request logs are kept on your device from Settings > Permissions & Privacy. Options are 1, 7, 30, or 90 days, or never expire
  • Faster, more accurate voice transcription: Google Gemini speech-to-text now streams over the Live API for real-time partial transcripts, with support for speaker labels when the provider offers them
  • Smarter Gmail cleanup: persistent blocklist and safelist preferences, a new cold-outreach workflow with automatic classification and enrichment, and more reliable archiving behavior
  • Conversation archive: archiving and unarchiving conversations now syncs reliably with the server, with archived items sorted by when they were archived
  • Assistant thinking in progress cards: the progress card now includes a thinking sub-row and factors thinking time into the total duration, so you can see where the assistant is spending its time
  • Polish and stability fixes: cleaner thinking block layout within the chat column, Reflections grouped under Background in the sidebar, wider model names in the Usage breakdown, a fixed Web Search API key field, smoother onboarding, and a wide range of reliability improvements across memory, credentials, and OAuth

v0.6.3

Dramatically improved chat performance and stability on macOS, a redesigned onboarding flow, real assistant names throughout the UI, a refreshed integrations panel, and broad UX polish.

Highlights

  • Dramatically improved chat performance and stability on macOS: resolved a wide range of rendering hangs, layout freezes, and scroll issues, including fixes for blank chat on conversation switch, streaming scroll corrections, and smoother message rendering overall
  • Redesigned onboarding flow: the setup experience now focuses on what you want your assistant to do and how it should behave, replacing the previous personality quiz with a more practical, goal-oriented approach
  • Assistant identity improvements: assistants now display real names instead of IDs throughout the app, with random name generation for new assistants and the ability to edit role and description directly from the Identity panel
  • Refreshed integrations panel: the integrations page has been redesigned with a cleaner grid layout and moved into Settings as a dedicated tab, with provider logos for easier recognition
  • Smoother scrolling and chat polish: a new scroll model brings smoother conversation scrolling, better alignment throughout the chat, animated conversation names on hover, a new sidebar groups divider, and archived conversations sorted by archive time
  • Memory and context improvements: memory recall is now more configurable, consolidation is less aggressive to prevent memory loss, and deleted memories are now recoverable
  • Broad stability and UX fixes: simplified permission controls, skill previews now work before installation, Slack gained /new command support, and numerous fixes across modals, attachments, sidebars, and the composer

v0.6.2

Introduces a referral program for earning credits, Linear integration, cleaner thinking blocks, configurable knowledge base injection, improved API key management, a polished chat and sidebar experience, and broad stability fixes.

Highlights

  • Referral program: invite friends with your unique referral link and earn credits when they sign up. Accessible from the Billing tab and the sidebar, with stats tracking and program details built in
  • Linear integration: connect to Linear through the platform without configuring your own OAuth app, enabled by default during onboarding alongside Outlook
  • Thinking blocks rendered as markdown and collapsed by default: the assistant's reasoning is now displayed as clean, formatted text that starts collapsed, making responses easier to read
  • Configurable knowledge base injection: control which knowledge base files are automatically included in conversations, with fixes for duplicate and out-of-order content
  • Improved API key management: a more reliable system for storing and reading API keys across settings and services
  • Chat and sidebar polish: a new animated typing indicator, spell checking in the composer, a Recents group replacing ungrouped conversations, and cleaner sidebar text handling
  • Stability and correctness fixes: numerous fixes across scrolling, history loading, dictation, audio handling, and OAuth flows, reducing crashes and stale data issues

v0.6.1

Introduces the Personal Knowledge Base for reliable fact recall, a refreshed design system, major macOS chat performance fixes, background agents for parallel task execution, and numerous stability improvements.

Highlights

  • Personal Knowledge Base (PKB) introduced: your assistant now has a persistent memory system that files and retrieves important information across conversations, with a fully redesigned Memory inspector showing what your assistant remembers and how confident it is
  • Refreshed design system: updated color palette, new typography, and polished components throughout the app for a cleaner, more modern look
  • Major chat performance improvements: resolved several severe slowdowns on macOS, including multi-minute freezes when switching conversations and layout bottlenecks that caused the app to hang
  • Background agents for parallel work: your assistant can now spawn background agents to handle tasks independently, with different roles for different types of work like research, coding, or planning, and the ability to report results back when finished
  • Stability and rendering fixes: smoother thinking block animations, improved markdown formatting, better sidebar behavior, and more reliable scrolling throughout the app

v0.6.0

The biggest release yet: Vellum goes open source, introduces platform-hosted assistants, a completely revamped memory system with image support, Outlook feature parity, conversation folders, and proactive assistant check-ins.

Highlights

  • Open source launch: the Vellum Assistant repository is now publicly available, inviting the community to explore, contribute, and build on the platform
  • Platform-hosted assistants: assistants can now run fully hosted on the Vellum platform, removing the need for local infrastructure and enabling seamless cloud-based operation
  • Revamped memory system with image support: the memory system has been completely rebuilt with support for image references, smarter search that combines multiple retrieval strategies, and more reliable memory consolidation
  • Outlook Calendar and Email reach full feature parity with Google: Outlook Calendar and Outlook Email integrations are now generally available, matching the functionality previously available only for Google Calendar and Gmail
  • Conversation folders and sidebar improvements: organize conversations into folders with automatic grouping by source, improved icons and count badges, and easy group management
  • Proactive assistant check-ins: your assistant can now periodically review its notes, reflect on recent conversations, and reach out when it has something worth sharing, enabled by default
  • Skills system redesign: skills have been rebuilt with better discovery, easier installation, and more reliable behavior across the board
  • Performance and stability improvements across macOS and iOS: faster conversation loading, smoother scrolling, and reduced memory usage through better caching and background processing

v0.5.16

Major macOS performance and stability improvements, Outlook messaging support, smarter assistant memory, security hardening, and polished UI components.

Highlights

  • Significant macOS performance and stability improvements: fixes for chat scroll freezes, sidebar lag, and app responsiveness issues, resulting in a noticeably smoother experience
  • Outlook messaging support: Vellum can now connect to Microsoft Outlook as a messaging provider, joining the existing Slack integration and expanding where your assistant can be reached
  • Smarter assistant memory: the assistant now remembers its capabilities and available tools at startup, with improved search for finding relevant memories
  • Security hardening: removed the ability to bypass permission prompts, tightened access controls across the app, and added stricter validation for sensitive operations
  • Polished UI components: redesigned skill detail page, improved file browser, better dropdowns and navigation, and a context window indicator showing how much conversation space is left

v0.5.15

Signing key handling improvements and automatic migration for smoother upgrades.

Highlights

  • Improved signing key handling to prevent potential authentication issues during normal operation
  • Automatic key migration when upgrading from older versions, ensuring a smooth upgrade experience without manual steps

v0.5.14

Thinking blocks in chat, overhauled memory and retrieval, /compact command, expanded model support, and collapsible sidebar sections.

Highlights

  • Thinking blocks are now visible in chat: see your assistant's reasoning process inline as collapsible blocks, giving you transparency into how responses are formed, enabled by default
  • Significantly improved memory and retrieval: smarter memory extraction, better search diversity to surface unexpectedly relevant memories, and improved formatting of recalled information
  • New /compact command and context window indicator: manually trigger conversation compaction at any time, with a color-coded bar in the toolbar showing how much context space is remaining
  • Expanded model support: DeepSeek, Qwen, Mistral, Meta, Moonshot, and Amazon models added through OpenRouter; Anthropic 1M context window beta and fast mode now supported
  • Collapsible sidebar sections and channel conversations: Scheduled and Background sidebar sections can now be collapsed, and channel-bound conversations display with a read-only indicator

The Personal AI you were promised

GET STARTED