Model Profiles

Overview

Model profiles control which AI model your assistant uses and how it behaves. You set one profile as the workspace-wide default, and your assistant applies it to everything — conversations, background memory work, scheduled tasks, and more. You can override it per conversation or per action type when you need something different.

Built-in profiles

Every workspace starts with three built-in profiles. You can edit or duplicate them, but you can't delete them.

| Profile | Model | Best for |
| --- | --- | --- |
| Quality | Claude Opus | Deep research, complex reasoning, high-stakes tasks |
| Balanced | Claude Sonnet | Everyday use: capable across the board at reasonable cost |
| Cost Optimized | Claude Haiku | Simple, short, or structural tasks where speed matters more than depth |

Balanced is active by default. Every call your assistant makes — conversation replies, memory filing, title generation — runs on Claude Sonnet unless you change it.

Switching profiles

Open Settings → Inference Profiles and select a different active profile from the dropdown. The change applies workspace-wide immediately, with no restart needed. You can also create custom profiles from this screen if you want to use a provider or model that the built-ins don't cover.

Per-conversation override

Use the profile picker in the chat header to pin a different profile to a single conversation. It only affects that conversation — your workspace default stays untouched. This is the easiest way to run one session on a stronger or lighter model without changing anything globally.
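The fallback described above can be sketched in a few lines of Python. This is an illustrative model only, not the product's API: `Workspace`, `pin`, and `profile_for` are hypothetical names, and the only rule it encodes is the one stated above, that a pinned conversation uses its pinned profile while every other conversation uses the workspace default.

```python
from dataclasses import dataclass, field


@dataclass
class Workspace:
    """Hypothetical model of a workspace; not the product's real API."""
    active_profile: str = "Balanced"  # Balanced is the documented default
    pinned: dict[str, str] = field(default_factory=dict)  # conversation id -> profile

    def pin(self, conversation_id: str, profile: str) -> None:
        # Pinning touches one conversation only; active_profile is unchanged.
        self.pinned[conversation_id] = profile

    def profile_for(self, conversation_id: str) -> str:
        # A pinned conversation uses its pin; all others use the workspace default.
        return self.pinned.get(conversation_id, self.active_profile)


ws = Workspace()
ws.pin("research-session", "Quality")
print(ws.profile_for("research-session"))  # Quality
print(ws.profile_for("daily-chat"))        # Balanced
```

The conversation IDs here are made up for the example; in the app you make the same choice with the profile picker in the chat header.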

Our recommendation

Keep Balanced as your active profile — it covers everyday use well. Then use Action Overrides to selectively upgrade the actions that benefit most from a stronger model, and step down only for tasks where the output is purely structural.

A good rule of thumb: anything that shows up directly in your conversation or drives a decision should stay on at least Balanced (Claude Sonnet). Tasks like generating a title, formatting a notification, or suggesting conversation starters are good candidates for a lighter model: they're short, easy to verify, and quality differences are barely noticeable.

Tip: use the per-conversation profile picker for one-off heavy tasks. Pin Quality to a deep research session and leave your workspace default untouched. That way you pay for Claude Opus only when you explicitly reach for it.

Action overrides

Open Settings → Inference Profiles → Action Overrides to assign a specific profile to individual actions. Each action has a toggle — when off it uses your active profile, when on you pick a profile just for that action. You can search by name and reset everything back to defaults at any time.
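The toggle behavior can be modeled roughly as follows. This is a minimal sketch under stated assumptions, not the product's implementation: `ActionOverride` and `resolve_profile` are hypothetical names, and the sketch covers only the rule described above (toggle off means the active profile, toggle on means the profile picked for that action).

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ActionOverride:
    """Hypothetical stand-in for one row on the Action Overrides screen."""
    enabled: bool = False          # the per-action toggle
    profile: Optional[str] = None  # profile picked when the toggle is on


def resolve_profile(action: str, active_profile: str,
                    overrides: dict[str, ActionOverride]) -> str:
    """Return the profile an action would run on under the toggle rule."""
    override = overrides.get(action)
    # Toggle on with a profile picked: use it. Otherwise use the active profile.
    if override is not None and override.enabled and override.profile:
        return override.profile
    return active_profile


overrides = {
    "Main agent": ActionOverride(enabled=True, profile="Quality"),
    "Conversation title": ActionOverride(),  # toggle left off
}
print(resolve_profile("Main agent", "Balanced", overrides))          # Quality
print(resolve_profile("Conversation title", "Balanced", overrides))  # Balanced
```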

| Group | Action | What it does | Recommended |
| --- | --- | --- | --- |
| Agent loop | Main agent | The primary conversation agent that handles your messages | Quality |
| Agent loop | Subagent spawn | Spawns a subagent to handle a delegated subtask | Quality |
| Agent loop | Heartbeat agent | Runs background tasks and proactive checks on a schedule | Balanced |
| Agent loop | Filing agent | Files memories and updates the knowledge base after conversations | Balanced |
| Agent loop | Analyze conversation | Analyzes conversation content for summaries and insights | Balanced |
| Agent loop | Call agent | Handles voice call conversations | Quality |
| Memory | Memory · Extraction | Pulls facts and preferences out of conversations and stores them | Balanced |
| Memory | Memory · Consolidation | Merges and deduplicates your memory store over time | Balanced |
| Memory | Memory · Retrieval | Searches memory to surface relevant context during conversations | Balanced |
| Memory | Narrative refinement | Refines and polishes stored narrative memory entries | Balanced |
| Memory | Pattern scan | Scans conversation history to detect behavioral patterns | Balanced |
| Memory | Conversation summarization | Summarizes long conversation threads for memory and context | Balanced |
| Memory | Conversation starters | Generates suggested openers for new conversations | Cost Optimized |
| Workspace | Conversation title | Generates a title for each conversation | Cost Optimized |
| Workspace | Commit message generator | Writes git commit messages from staged changes | Balanced |
| UI | Identity intro | Generates your assistant's introductory message on first launch | Balanced |
| UI | Empty-state greeting | Generates the greeting shown on an empty conversation | Cost Optimized |
| Notifications | Notification decision | Decides whether to surface a proactive notification to you | Balanced |
| Notifications | Preference extraction | Learns your communication preferences from how you interact | Balanced |
| Voice | Guardian question copy | Generates spoken prompts during guardian verification flows | Balanced |
| Voice | Watch commentary | Produces live commentary delivered via Apple Watch | Balanced |
| Voice | Watch summary | Generates brief summaries surfaced on Apple Watch | Cost Optimized |
| Utility | Interaction classifier | Classifies the type of each inbound message to route it correctly | Cost Optimized |
| Utility | Style analyzer | Analyzes your writing style to help your assistant match it | Balanced |
| Utility | Invite instruction generator | Generates onboarding instructions for new assistant invites | Balanced |
| Utility | Skill category inference | Automatically categorizes installed skills | Cost Optimized |
| Skills | Meet · Consent monitor | Monitors meeting consent during Google Meet sessions | Balanced |
| Skills | Meet · Chat opportunity | Identifies moments to send a helpful message during meetings | Balanced |
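To see how many toggles the recommendations above actually require, the table can be transcribed into a small Python structure and compared against the active profile. This is a sketch for planning purposes only: `RECOMMENDED` and `overrides_needed` are hypothetical names, the data is copied from the Recommended column, and nothing here talks to the product.

```python
# Recommendations transcribed from the table above, grouped by profile.
RECOMMENDED = {
    "Quality": ["Main agent", "Subagent spawn", "Call agent"],
    "Cost Optimized": [
        "Conversation starters", "Conversation title", "Empty-state greeting",
        "Watch summary", "Interaction classifier", "Skill category inference",
    ],
    "Balanced": [
        "Heartbeat agent", "Filing agent", "Analyze conversation",
        "Memory · Extraction", "Memory · Consolidation", "Memory · Retrieval",
        "Narrative refinement", "Pattern scan", "Conversation summarization",
        "Commit message generator", "Identity intro", "Notification decision",
        "Preference extraction", "Guardian question copy", "Watch commentary",
        "Style analyzer", "Invite instruction generator",
        "Meet · Consent monitor", "Meet · Chat opportunity",
    ],
}


def overrides_needed(active_profile: str) -> dict[str, str]:
    """Return {action: profile} for every action whose recommendation
    differs from the active profile, i.e. the toggles you would switch on."""
    return {
        action: profile
        for profile, actions in RECOMMENDED.items()
        if profile != active_profile
        for action in actions
    }


needed = overrides_needed("Balanced")
for action, profile in sorted(needed.items()):
    print(f"{action}: {profile}")
print(len(needed), "overrides")  # 9 overrides
```

With Balanced as the active profile, only the three Quality actions and the six Cost Optimized actions need a toggle; the remaining nineteen already match the default.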