Additional

Features

Synced from github.com/CoWork-OS/CoWork-OS/docs

Multi-Channel AI Gateway

17 messaging channels with unified operations. See Channel Integrations for setup details, Channel User Guides for per-channel features and best practices, Dedicated Channel Guides for separate channel pages, Using CoWork from WhatsApp and Other Channels for end-user workflows, and Gateway Message Lifecycle for remote command routing, active-task behavior, delivery, and scheduled outputs.

  • WhatsApp: QR code pairing, self-chat mode, markdown support, /new and /new temp task resets, /stop cancellation, editable progress delivery, and hidden temporary scratch workspaces
  • Telegram: Bot commands, streaming responses, workspace selection, group routing modes, and allowed-group allowlists
  • Discord: Slash commands, DM support, guild integration, guild allowlists, embeds/polls/select menus, live message fetch and attachment download
  • Slack: Socket Mode, channel mentions, file uploads, multiple workspace installations in one app profile, and optional curated middle-step progress relays
  • Microsoft Teams: Bot Framework SDK, DM/channel mentions, adaptive cards
  • Google Chat: Service account auth, spaces/DMs, threaded conversations
  • Feishu / Lark: Webhook + app credential gateway support for Lark/Feishu tenants
  • WeCom: Enterprise WeCom webhook and encrypted event routing support
  • iMessage: macOS native integration, pairing codes
  • Signal: End-to-end encrypted messaging via signal-cli
  • Mattermost: WebSocket real-time, REST API
  • Matrix: Federated messaging, room-based, end-to-end encryption ready
  • Twitch: IRC chat integration, multi-channel
  • LINE: Messaging API webhooks, 200M+ users in Asia
  • BlueBubbles: iMessage via Mac server, SMS support
  • Email: IMAP/SMTP, any email provider, threading
  • X (Twitter): Mention-trigger task ingress (do: prefix by default) with allowlist controls and idempotent session keys (guide)
  • Research Channels: Telegram and WhatsApp groups can be marked as link-research channels that auto-generate a structured findings report from posted URLs

Agent Capabilities

  • Profiles & Portability: Separate app profiles isolate their own database, credentials, channels, skills, and sessions. Profiles can be exported/imported as bundles for migration or cloning.

  • Ideas Panel: Curated launch panel accessible from the sidebar above Sessions. Pre-written prompts organized by category let you start common workflows in one click. See Ideas Panel: Supported Capabilities for the full list of tools each prompt uses and their graceful fallbacks.

  • Personal Agentic OS and Everything App: CoWork OS is a local-first personal agentic workspace for coding, web design, research, knowledge work, documents, spreadsheets, presentations, automations, channels, devices, and long-running tasks.

  • Everything Workbench: Generated documents, spreadsheets, presentations, web pages, PDFs, and previews share one artifact model: compact output card, sidebar open, fullscreen artifact workspace, follow-up composer, and refresh after the agent completes requested edits. This reduces the need to switch into separate office apps for generated review and light editing while keeping external app actions available for advanced native workflows. See Everything Workbench.

  • Browser Workbench / Browser V2: live website and local-app testing opens in a visible right-sidebar/fullscreen browser by default. Browser-use tools target the same webview the user can see through Browser V2, with accessibility snapshot refs, CDP-backed actions, tabs, diagnostics, screenshots, annotation, and visible cursor movement during agent actions. See Browser Workbench and Browser V2 Architecture.

  • Task-Based Workflow: Multi-step execution with plan-execute-observe loops

  • Task Overflow Actions: task view title menus expose supported task actions in place: pin/unpin, rename, archive, copy working directory, copy task ID, copy cowork://tasks/<taskId> deeplink, copy Markdown, fork session, view outputs, and create a scheduled automation from the current task. See Task Automations.

  • Managed Agents: versioned managed agents, reusable local environments, and durable managed sessions exposed through the control plane, with backing tasks and team runs still visible in the normal app surfaces. See Managed Agents.

  • Runtime Orchestration: SessionRuntime owns task-session state, session checklists, resume snapshots, recovery state, and task projection while the turn kernel handles each individual step, follow-up, or text turn; metadata-driven tool scheduling, graph-backed delegation, typed worker roles, verifier verdicts, semantic tool-batch summaries, and terminal-state reconciliation keep delegated work coherent across tasks, follow-ups, teams, and ACP runs.

  • Prompt-Aware Tooling: visible tools receive concise prompt-local guidance after policy filtering, and planning plus execution share the same render source for compact tool text and provider-facing tool descriptions.

  • Composer Mentions: type @ in the main composer to choose Agents, configured Integrations, or Files. Integration mentions render as icon+name chips and add soft runtime routing guidance without changing permissions. See Composer Mentions.

  • Message Box Shortcuts: type / in the main composer to search deterministic app commands and skill-backed workflow shortcuts in one picker. App commands include /schedule, /clear, /plan, /cost, /compact, /doctor, and /undo; plugin-pack aliases resolve to their target skills through the existing skills runtime. See Message Box Shortcuts.

  • Sectioned Prompt Stack: execution and follow-up prompts are built from named session- and turn-scoped sections with explicit budgets, memoization of stable sections, provider-aware prompt caching, and truncation/drop reporting when token pressure rises.

  • Provider-Aware Prompt Caching: CoWork keeps stable system blocks cacheable and dynamic turn context uncached, prefers Anthropic automatic caching where supported, uses explicit Claude breakpoints on OpenRouter, and derives stable OpenAI-family cache keys for GPT routes.

  • Session Checklist Primitive: execution-style tasks can create a session-local ordered checklist with task_list_create, maintain it with task_list_update, inspect it with task_list_list, and surface it read-only in the task UI. The runtime can issue a non-blocking verification nudge when implementation items are done but no verification item exists yet.

  • Structured Delegation Briefs: spawn_agent and orchestrate_agents resolve a worker role, package parent-step context plus evidence requirements into a structured brief, and apply the corresponding completion/tool contract to the child.

  • Permission Engine: layered tool approvals combine explicit modes, per-tool/path/command-prefix/MCP-server rules, session grants, workspace-local rules, profile rules, and hard guardrails; dangerous_only adds a lower-friction mode that still prompts on destructive, privacy-sensitive, side-effecting, or ambiguous actions.

  • Live Terminal: Shell commands run in a real-time terminal view — see output as it happens, stop execution, or provide interactive input (e.g. y/n prompts)

  • Dynamic Re-Planning: Agent can revise its plan mid-execution

  • 147 Built-in Skills: GitHub, Slack, Notion, Spotify, Apple Notes, Unity, Unreal, Terraform, Kubernetes, financial analysis, and more. Bundled workflows now include LLM Wiki for persistent research vaults, manim-video for deterministic technical animation, kami for editorial PDFs and slide decks, react-best-practices for React and Next.js implementation work, and taste-skill for high-agency frontend design. Optional CLI-based skills (e.g. aurl for OpenAPI/GraphQL APIs) appear when the binary is installed.

  • Additive Skill Runtime: Skills can still be proactively shortlisted from task semantics, but they now apply as additive context and scoped runtime directives. They never replace the original task prompt. See Skills Runtime Model.

  • Chat Mode: Direct LLM chat with no tools by default, no step timeline, same-session follow-ups, chat-only streaming for supported providers, and a fixed high output budget for explicit executionMode: "chat" sessions. Uploaded PDF turns that need deeper document reading are narrowly promoted into read-only analysis so the document parser can run. See Chat Mode.

  • Document Creation: Excel, Word, PDF, PowerPoint, HTML, and React-style outputs with professional formatting, first-class LaTeX/TikZ .tex -> PDF compilation when a system TeX engine is installed, plus the bundled kami workflow for editorial PDFs, resumes, one-pagers, and slide decks

  • Document Artifact Workbench: task-created Word-style files use compact artifact cards in the task feed. .docx opens directly into a resizable right-sidebar editor with a Google Docs-style toolbar, direct text editing, copy, save, external-open, and folder actions. .doc, .rtf, .odt, .ott, .pages, and related formats are recognized as document artifacts and use best-effort preview or external-app/folder actions depending on parser support. Fullscreen mode expands editable documents across the app and keeps a functional follow-up composer with the main task model picker, voice input, attachments, send behavior, latest-turn/working context, and automatic preview refresh after follow-up edits. See Document Artifacts.

  • Format-Aware File Preview Popup: clicking a file link in chat opens a single in-app preview modal that adapts its layout, header metadata, and per-format affordances to the file type. Supported formats: HTML (sandboxed iframe), Markdown, code with highlight.js syntax highlighting, plain text, JSON / JSONL / GeoJSON (collapsible tree with raw/tree toggle and parse-error fallback), CSV / TSV (RFC-4180 quoted-field parser feeding a sortable-style table), XLSX, DOCX, PDF (with page/native-text/OCR summary and inline document surface), images (with fit/actual-size toggle, dimension readout, and an alpha checkerboard for PNG/SVG/WebP/GIF/ICO), video, audio (mp3/wav/ogg/m4a/flac/aac with duration metadata), and LaTeX. Modal width and padding are driven by a single data-format attribute, so HTML/PDF/image/video get more horizontal room while text/code stay compact and audio renders narrow. The header shows a format-specific subtitle (e.g. PNG · 1920×1080 · 240 KB, PDF · 12 pages · 1.4 MB, CSV · 412 rows · 24 KB) and a unified action bar — Copy path (with copied flash), Show in Finder, Open externally, Close — plus contextual buttons such as the image fit toggle and the JSON tree/raw toggle. Theme tokens replace the previously hardcoded modal background and PDF summary colors, so the popup renders consistently in light and dark themes.

  • Smart PDF Attachments: uploaded PDFs are saved into the workspace and represented in the prompt with stable metadata: filename, workspace-relative path, page count, extraction mode/status, OCR/scan counts, and a compact excerpt. If the user asks about PDF contents beyond that excerpt, the runtime calls parse_document on the attached path instead of inlining the whole PDF. PDF excerpts are marked as untrusted document data so instructions inside the PDF cannot override the user or system prompt.

  • Spreadsheet Artifact Workbench: task-created spreadsheet files use compact artifact cards in the task feed. .xlsx, .xls, .xlsm, .csv, and .tsv open a resizable right-sidebar spreadsheet viewer with sheet tabs, sticky headers, working zoom, cell/range/row/column selection, copy with a short Copied flash, inline editing, add row/column, and save back to the file. .numbers, .gsheet, .ods, and .xlsb are recognized as spreadsheet artifacts and use external-app/folder actions. Fullscreen mode expands editable sheets across the app and keeps a functional follow-up composer with the main task model picker, voice input, attachments, and send behavior. See Spreadsheet Artifacts.

  • Presentation Artifact Workbench: PPTX artifacts render as compact task-feed cards and open by default in the resizable right-sidebar presentation viewer. The viewer includes slide thumbnails, previous/next controls, zoom, a white slide canvas, extracted slide text, speaker notes, fast text-first loading, cached rendered slide images, fullscreen follow-up context, external-app/folder actions, and preview refresh after follow-up edits. Legacy PowerPoint formats are recognized with external actions. See Presentation Artifacts and PPTX Preview.

  • Web Page Artifact Workbench: generated .html / .htm files and built React output entrypoints render as compact task-feed cards and open by default in the resizable right-sidebar web viewer. The viewer uses a sandboxed iframe with local assets inlined where possible, browser/folder/copy actions, fullscreen follow-up context, and preview refresh after matching file/build-output updates. React-style projects without built output show a build-output-needed state instead of auto-starting a dev server. See Web Page Artifacts.

  • Browser Workbench / Browser V2: live browser-use prompts such as "go to this site and test it as a normal user" open a shared in-app browser session beside the task. The agent can navigate, snapshot, click, fill, type, scroll, inspect diagnostics, upload/download files, screenshot, and annotate while the user watches the visible cursor and page state. See Browser Workbench.

  • Persistent Memory: Cross-session context with curated hot memory, searchable archive recall, session transcript recall, topic packs, privacy-aware observation capture, and an optional Supermemory external provider lane

  • Chronicle (Desktop Research Preview): opt-in local recent-screen context for vague prompts like this, that, what is this, latest draft, or why is this failing, with Memory Hub controls, pause/resume, promoted screen_context evidence, and optional linked background memory generation. See Chronicle.

  • Knowledge Graph: SQLite-backed entity/relationship memory with FTS5 search, graph traversal, and auto-extraction

  • Workspace Kit: .cowork/ project kit + markdown indexing with context injection

  • Agent Teams: Multi-agent collaboration with shared checklists, graph-backed coordinated runs, and team management UI

  • Collaborative Mode: Auto-create ephemeral teams where multiple agents work on the same task, sharing thoughts in real-time through the delegated orchestration graph

  • Multi-LLM Mode: Send the same task to multiple LLM providers/models simultaneously, with a judge agent synthesizing the best result

  • Workflow Pipeline: Optional phase-based execution path where decomposed steps run as child tasks with per-phase LLM overrides or capability-based auto-selection

  • Agent Comparison Mode: Compare agent or model outputs side by side

  • External Agent Orchestration: Discover ACP agents, target local or remote assignees from orchestration tools, and invoke A2A-compatible remote endpoints behind the normal approval/policy layer; orchestration now flows through the shared graph engine and graph-backed task state

  • ACP Lifecycle Hardening: ACP task state is persisted locally, survives restarts, supports remote cancel, and enforces scoped task/inbox access for non-operator clients

  • Sub-Task Navigation: Open a delegated sub-task, inspect its timeline, then jump back to the parent task from the main content view

  • Git Worktree Isolation: Tasks run in isolated git worktrees with automatic branch creation, auto-commit, merge, conflict detection, and cleanup

  • Task Pinning: Pin important tasks in the sidebar for quick access

  • Wrap-Up Task: Gracefully wrap up running tasks instead of hard-cancelling

  • Capability Matcher: Auto-select the best agents for a task

  • Completion Output Confidence UX: When tasks finish with file outputs, users get high-signal completion toasts with direct actions (Open file, Show in Finder, View in Files), automatic right-panel focus for the active task, unseen-output badges when reviewing another task/view, and richer completion text composed from semantic summaries plus verifier verdict/report when available.

  • Completion/Resume Coherence: terminal task state is persisted before terminal events are emitted, and approval- or follow-up-driven resume paths re-check canonical persisted status before writing executing, preventing completed tasks from regressing to an in-progress row state.

  • Artifact-First Output Visibility: Artifact-only tasks are treated the same as file-created outputs across progress, timeline, and Files panel surfaces.

  • Paired LaTeX/PDF Outputs: Compiled LaTeX artifacts preserve the editable .tex source as the durable artifact and pair it with the generated PDF in one task workbench, including Summary, source, and PDF tabs.

  • Performance Reviews: Score and review agent-role outcomes with autonomy-level recommendations

  • Vision: Analyze workspace images via the active image-capable model/provider (OpenAI, Anthropic, Azure OpenAI, or Bedrock)

  • Image Attachments: Attach images to tasks and follow-ups for multimodal analysis

  • Image Generation: Multi-provider support (Gemini, OpenAI gpt-image-1/1.5/DALL-E, Azure OpenAI, OpenRouter) with configurable provider ordering

  • Video Generation: Text-to-video and image-to-video via new video generation providers. Configure preferred video model in Settings > LLM. Generated videos render inline in the task feed.

  • Programmatic Technical Animation: The bundled manim-video skill scaffolds Manim CE projects for math explainers, algorithm walkthroughs, architecture animations, and data stories with local project files, dependency preflight, and draft-first render helpers.

  • Editorial Document Design: The bundled kami skill scaffolds workspace-local source projects for resumes, one-pagers, white papers, letters, portfolios, diagrams, and slide decks, with PDF/PPTX render helpers and a preserved editorial design system.

  • React/Next.js Implementation Guidance: The bundled react-best-practices skill applies React and Next.js performance guidance during feature work, enhancements, refactors, reviews, data-fetching changes, bundle-size checks, and rendering-performance fixes.

  • High-Agency Frontend Design: The bundled taste-skill workflow adds a stricter anti-slop frontend option for React/Next.js-style UI work, with stronger layout variance, typography, motion, dependency-check, and responsive-quality rules than the default frontend guidance.

  • Visual Annotation: Iterative image refinement with the Visual Annotator

  • Context Summarization: Automatic context compression surfaced in the task timeline

  • Structured Input Requests: In plan-mode flows, the agent can pause with 1-3 short multiple-choice questions instead of asking ambiguous free-text follow-ups

  • Parallel Tool Timeline: Concurrent read-only tool bursts are grouped into lane-based timeline cards instead of flooding the event feed; screenshot-heavy refinement loops stay more compact in summary mode

  • Renderer Performance: In the CoWork-OS/CoWork-OS repo, the renderer uses @chenglou/pretext for text-heavy sidebar/timeline measurement, with flattened visible sidebar rows and post-render height reconciliation for expanded timeline cards

  • Adaptive Runtime Recovery: Main interactive tasks no longer receive implicit strategy turn caps. They use explicit-only window caps, bounded follow-up recovery, retry-aware turn guidance, and lifetime/emergency safety-stop escalation instead of default hard-window failure

  • Session Snapshot Resume: SessionRuntime prefers session_runtime_v2 checkpoint and event payloads, falls back to legacy conversationHistory payloads or event-derived history, and rewrites legacy resumes to V2 on the next checkpoint

  • Workspace Rule Manager: Settings can list and remove workspace-local permission rules directly, and approval prompts can persist new workspace or profile rules with explicit reasons and scope previews.

  • Path Drift Repair: /workspace/... aliases and drifted relative paths can be normalized back into the active workspace or pinned task root, with strict-fail policies when hard enforcement is preferred

  • Action-First Planning: Agent prioritizes direct action over excessive pre-planning

  • Operator Runtime Visibility: Task completion now surfaces the learning progression, unified recall spans tasks/messages/files/workspace notes/memory/KG, persistent shell sessions keep operator state, and live provider routing/fallback decisions are visible in the task UI and Mission Control.

  • Voice Calls: Outbound phone calls via ElevenLabs Agents

  • Think With Me Mode: Socratic brainstorming mode that helps clarify thinking without executing tools. Activated via toggle or auto-detected from brainstorm/trade-off patterns.

  • Problem Framing Pre-flight: Complex tasks show a structured problem restatement, assumptions, risks, and approach before execution begins

  • Graceful Uncertainty: Agent expresses uncertainty honestly and rates confidence on recommendations. Low-confidence messages display with an amber indicator.

  • AI Playbook: Auto-captures successful patterns (approach, outcome, tools) and lessons from failures with error classification (7 categories: tool failure, wrong approach, missing context, permission denied, timeout, rate limit, user correction). Time-based decay scoring deprioritises stale entries. Proven patterns reinforced on repeated success. Mid-task user corrections automatically detected and captured. Relevant entries injected into system prompts. View in Settings > AI Playbook.

  • Task Result Feedback: Completed task banners show 👍 / 👎 controls, with task-level rejections logged as quality signals for Usage Insights. The shared feedback IPC still supports structured message-level feedback for adaptation and future UI surfaces.

  • Evolving Agent Intelligence: The agent visibly improves over time through a connected set of subsystems — layered memory, retry-aware recovery reuse, adaptive style learning, playbook-to-skill promotion, channel persona adaptation, evolution metrics, and daily operational journaling. See Evolving Agent Intelligence.

LLM Wiki Research Vaults

CoWork includes llm-wiki as a bundled first-class research-vault workflow inspired by Andrej Karpathy's LLM Wiki concept.

  • Persistent vaults: creates workspace-local markdown knowledge bases instead of one-off research outputs
  • Obsidian-friendly structure: durable notes, maps, [[wikilinks]], source-preserving raw/ captures, and filed-back outputs/
  • Deterministic workbench: bundled runtimes handle raw ingest, vault-first search, slide/chart rendering, and graph analysis
  • Deterministic maintenance: bundled analyzer reports topology, orphan pages, broken links, ambiguous links, weakly linked pages, bridge pages, surprising cross-section links, and suggested follow-up questions
  • GUI-first and slash-friendly: natural prompts like Build a persistent research vault for GRPO papers route into llm-wiki, /llm-wiki remains available in desktop and supported gateway channels, and the welcome screen surfaces the default vault for browsing
  • Run artifacts + durable state: each run writes an inspectable manifest and graph report while keeping the vault itself persistent in the workspace

See LLM Wiki for command syntax, layout, modes, and analyzer behavior.

Computer use

Desktop automation for native apps on macOS and Windows when MCP, browser automation, and shell are not enough. Full guide: Computer use.

  • Session lifecycle: One active computer-use session at a time; global Esc abort; cleanup when the task finishes or the session ends.
  • Platform helpers: macOS uses helper-targeted Accessibility and Screen Recording permissions; Windows v1 controls visible, non-minimized windows through a bundled Win32 helper.
  • Built-in tools category: Enable/disable and priority for the computer_use tool family alongside other built-in categories.
  • Policy & planning: Tool availability defers the computer-use lane unless the task signals native/desktop GUI intent; executor guidance treats computer use as last resort after integrations, browser_*, and shell. For native GUI tasks, routing prefers screenshot, click, type_text, keypress, and related tools (plus open_application when launching the app) over run_applescript / fragile shell GUI hacks.
  • Tools: screenshot, click, double_click, move_mouse, drag, scroll, type_text, keypress, wait (with blocklisted dangerous key chords).

Chronicle (Desktop Research Preview)

Chronicle is the desktop-only recent-screen context lane for underspecified on-screen references.

  • Memory Hub-first controls: Chronicle is configured from Settings > Memory Hub > Chronicle, with consent gating, pause/resume, capture scope, Screen Recording / Accessibility / OCR status, and recent-buffer status
  • Dedicated Chronicle tool lane: screen_context_resolve now lives in its own chronicle built-in tool category instead of piggybacking on computer_use
  • Screen-context resolution: screen_context_resolve searches recent local frames first and only falls back to a fresh local screenshot when passive context is weak
  • No second memory system: raw passive frames are ephemeral; only task-used observations are promoted into .cowork/chronicle/, unified recall, and optionally linked screen_context memory entries
  • Mission Control visibility: promoted observations appear as screen_context evidence, a dedicated learning-progress step, and recall hits
  • Task-level control: task creation in the main composer and Devices panel can disable Chronicle for one task without changing the global setting
  • Privacy boundary: passive capture stays local, screen-derived text is marked untrusted, and Chronicle-backed promotion can respect workspace memory privacy/auto-capture settings

See Chronicle for setup, testing, privacy, and contributor details.

Inbox Agent

Local-first inbox workspace for email triage, normal email handling, follow-up, task capture, cross-channel identity, and operator handoff.

  • Classic + Today modes: the familiar three-pane inbox remains available, while Today groups work into Needs action, Happening today, Good to know, and More to browse
  • Live inbox surfaces: Unread, Needs reply, Suggested Actions, and Open Commitments
  • Mailbox views: Inbox, Sent, and All, with Recent and Priority sorting, saved views, account filters, and domain filters
  • Normal email actions: manual reply, reply-all, forward, To/Cc/Bcc, editable subject/body, and provider-backed send
  • AI draft review: generated replies can be edited before sending, and the draft card clears after successful send
  • Ask Inbox: right-sidebar mailbox chat with live agentic steps, final answers, matched email evidence, and click-to-open results
  • Hybrid mailbox search: Ask Inbox searches local FTS, semantic mailbox embeddings, provider-native Gmail/Outlook search when available, and attachment text for statements, invoices, extracts, PDFs, payment notices, and similar evidence
  • @Inbox composer routing: type @Inbox or @inbox ... in the main composer to open Inbox Agent, switch to Ask Inbox, and route the remaining query there
  • Action rail: cleanup, follow-up, reply, forward, mark done, thread prep, todo extraction, scheduling, and intel refresh
  • Thread visibility: received and sent message content are both shown in the thread detail view
  • Cross-channel replies: reply directly via linked Slack, Teams, WhatsApp, Signal, or iMessage targets when email is not the best channel
  • Unified identity: manual search/link in Settings can attach Slack, Teams, WhatsApp, Signal, iMessage, and CRM handles to one contact identity
  • Relationship timeline: the research rail merges email and channel history into one relationship timeline with channel preference hints
  • Mission Control handoff: threads can be turned into company issues, assigned to an operator, and woken from the inbox
  • Sender cleanup: noisy senders are ranked by volume, cleanup candidates, and estimated weekly reduction
  • Inbox automations: rules, reminder cadences, patrol schedules, and Gmail forwarding automations can create tasks, wake agents, and schedule review flows
  • Commitment handling: accepted commitments become real follow-up tasks; already-handled threads can be marked done
  • Event pipeline: mailbox sync, triage, draft, send, and action events feed Knowledge Graph, Heartbeat, triggers, playbooks, and briefing
  • Safer review: sensitive-content warnings, editable drafts, blocked scripts, and provider-permission gates keep outbound actions visible before anything leaves the app
  • Local persistence: cached mail remains visible after restart while background sync refreshes new mail

See the full workflow guide in Inbox Agent and the retrieval/IPC contract in Ask Inbox Architecture.

Managed Devices & Remote Operations

CoWork OS now includes a dedicated Devices tab for running and observing work across multiple machines.

  • Local + remote device inventory: track the current machine alongside saved remote devices in one view
  • Connection-aware remote cards: direct, SSH-tunneled, and Tailscale-backed devices expose connection state, last-seen time, active runs, storage summary, app summary, and attention state
  • Remote task dispatch: start a task on a selected remote device, optionally with shell access, execution mode, or multi-LLM options
  • Remote file picker: browse remote workspaces and attach files directly from the target machine before dispatching a task
  • Remote task feed: filter tasks for the selected device, all devices, or attention states, then open those tasks in a remote session view
  • Device overlays: inspect apps, storage, resource signals, alerts, and observer history without leaving the Devices surface

See Remote Access for connection patterns and Mission Control for the company-level control surface.

Automations Control Center

Automation features are now grouped together in Settings > Automations:

  • Routines: the primary automation abstraction for saved instructions, execution target, triggers, outputs, approval policy, connector policy, and recent runs
  • Workflow Intelligence: Memory, Heartbeat, internal Reflection, and reviewable Suggestions form one always-on runtime owned by automation profiles
  • Task Queue: concurrency, queueing, and background execution policy
  • Workflow Intelligence settings: target-scoped evidence, hypotheses, critique, winner selection, suggestion dispatch, feedback learning, and guarded auto-create policy
  • Scheduled Tasks: recurring time-based task execution; now also used as a compiled backend for routine schedule triggers and as the backend for task-sourced Add automation... flows
  • Webhooks: inbound automation entry points; now also used as a compiled backend for routine API triggers
  • Event Triggers: condition-based actions triggered by channel, webhook, or runtime events; now also used as a compiled backend for routine event triggers
  • Daily Briefing: scheduled summaries with workspace, memory, and evolution context

Ownership model:

  • Mission Control is the cockpit around the core runtime
  • Routines are the main user-facing automation object
  • Scheduled Tasks, Webhooks, and Event Triggers remain low-level or generated infrastructure
  • Triggers are ingress and normalized evidence only
  • Devices are execution routing only
  • Digital Twins are optional persona presets and are not direct cognition owners

The home dashboard also surfaces recent automation runs so background work is visible without opening Settings. See Core Automation.

Task view can also create a low-level scheduled task directly from the selected task with ... > Add automation.... That popup is prefilled from the task title/prompt, appends a source task reference and cowork://tasks/<taskId> deeplink to the saved prompt, and calls the existing cron scheduled-task API. See Task Automations.

Routines

Routines are now the main way to define saved automation in CoWork OS.

Each routine can carry:

  • one saved instruction block
  • one execution target (workspace, worktree, device, or managed_environment)
  • one or more triggers
  • one or more outputs
  • a connector policy (prefer or enforced allowlist)
  • an approval policy
  • durable run history

Supported trigger types include:

  • schedule
  • api
  • connector_event
  • channel_event
  • mailbox_event
  • github_event
  • manual

Current product stance:

  • use Routines first when you want one automation object with observability and policy
  • use Scheduled Tasks, Webhooks, or Event Triggers directly only when you intentionally want the lower-level surface
  • treat Workflow Intelligence as the always-on cognitive runtime, not as a routine trigger

Zero-Human Company Ops

CoWork OS can also be configured as a founder-directed autonomous company shell by composing several existing systems into one operating loop:

  • Venture operator workspace kit: initializes .cowork/ with COMPANY.md, OPERATIONS.md, KPIS.md, PRIORITIES.md, and HEARTBEAT.md
  • Companies control surface: Settings > Companies centralizes company creation, company-graph editing, linked operators, and direct handoff into Digital Twins or Mission Control
  • Operator personas: venture-oriented templates such as Founder Office Operator, Company Planner, Growth Operator, and Customer Ops Lead
  • Automation profiles: always-on ownership for the chosen operator roles
  • Heartbeat v3 follow-up: automation-profile-backed operators can proactively review recurring checks defined in HEARTBEAT.md, while cheap Pulse cycles stay non-LLM until escalation is justified
  • Strategic planner: turns company goals, projects, and stalled work into planner-managed issues and optionally auto-dispatches them into tasks
  • Mission Control ops view: exposes planner config, planner runs, goals, projects, issues, linked tasks, issue comments, and run events
  • Autonomy policy integration: operator roles can carry reusable autonomy presets instead of relying on one global all-or-nothing mode
  • Persisted company-linked operators: venture/operator twins can be assigned to a company so the same operator set stays visible across Companies, Digital Twins, and Mission Control
  • Companies tab as the source of truth: Settings > Companies centralizes company metadata, goals, projects, issues, linked operators, planner state, and handoff into company-scoped Mission Control views

This workflow is designed for "human-directed, agent-operated" execution:

  • humans define business goals, guardrails, and irreversible approval policy
  • agents continuously create, route, and execute operational work
  • Mission Control becomes the monitoring and intervention cockpit

See Zero-Human Company Operations for architecture, setup recipe, monitoring flow, and example operating models.

Workflow Intelligence

Workflow Intelligence is the primary always-on cognition layer in CoWork OS:

  • Memory as source of truth: reflection outputs become memory candidates such as preferences, workflow patterns, open loops, corrections, recurring tasks, and ignored noise
  • Heartbeat as scheduler: Heartbeat decides when accumulated signals justify reflection
  • Reviewable suggestions first: useful outcomes appear as Next actions under the welcome message box, in the automation inbox, and in the Suggestions panel
  • Global coordinator, namespaced targets: one coordinator ranks work globally while each workflow target keeps its own history, winner, backlog, and dispatch stream
  • Stable target identities: supports core-owned targets such as global, workspace, agent_role, code_workspace, and pull_request
  • Fixed reflective pipeline: collect evidence, generate 3-5 hypotheses, critique them, synthesize one winner, write backlog, and create a suggestion by default
  • Feedback learning: act/edit/snooze/dismiss/ignore responses update memory and future scoring
  • Durable artifacts plus SQLite indexing: compatibility files are written under .cowork/subconscious/ and indexed for UI/search/filtering
  • Guarded downstream execution: auto-created tasks require explicit policy, low risk, clear scope, and trusted or repeatedly accepted patterns
  • Recommendation-only success path: if no executor mapping exists or policy does not allow autonomy, the run still completes with a reviewable suggestion or deferred recommendation
  • No maintainer-only enrollment gate: safety remains enforced by the existing executor approval and capability policies

See Workflow Intelligence for the architecture and operational guidance.

Core Harness

The always-on runtime now includes a learning loop around core traces:

  • failure mining
  • recurring failure clustering
  • living eval cases
  • gated experiments
  • promoted learnings

Mission Control exposes this through the Core Harness view. See Core Automation.

Reliability Flywheel

Reliability is built as a continuous loop: capture failures -> replay deterministically -> gate risky completions -> harden nightly/release workflows.

Reliability CapabilityWhat It Does
Eval Corpus (local)Converts failed/partial tasks into replayable eval cases stored in local SQLite
Deterministic ReplayRe-runs eval suites to catch regressions before they reappear in production usage
Risk-Based Review GateScores task risk (low/medium/high) and escalates review/verification only when justified
Policy Modesoff, balanced, and strict review policies for domain-appropriate guard strength
Skill/Prompt HardeningUses modular prompt sections, explicit token budgets, and skill shortlist routing to reduce drift and context overload
PR Reliability ContractEnforces “production failure fix must add/update eval case” in CI
Nightly HardeningRuns eval + battery loops nightly, produces grouped and machine-readable artifacts
Release GateApplies hardening checks before release with a date-based stability-window promotion
Local-Only Data PolicyKeeps reliability artifacts local; no required telemetry upload path

See Reliability Flywheel for architecture, schema, scripts, IPC endpoints, CI workflows, and operational commands.

Mode Picker

The UI exposes a small set of execution modes. Chat mode is separate from task execution and uses the direct conversation path.

ModeBehavior
ChatDirect assistant conversation, no tools, no step timeline, same-session follow-ups, and chat-only streaming for supported providers.
ExecuteFull task execution path with tools, planning, and artifacts.
PlanStructured planning path; can pause for request_user_input and is intended for non-mutating planning/coordination.
AnalyzeRead-only analysis path that stays evidence-focused and blocks mutating tools.
VerifiedExecute-like path that adds external verification checks after steps before completion.

These modes are mutually exclusive. Chat is the conversational path; the others are task execution modes.

Note: Verified mode is strongest when you want execution plus an explicit verification gate. Plan mode shows a confirmation dialog only for structured input requests, not because it bypasses approvals.

Task Toggles

The task creation UI also includes higher-level toggles that change how tasks are orchestrated:

ToggleBehavior
AutonomousAuto-approves all gated actions (shell commands, file deletions, etc.) so the agent runs without pauses. Disables user input prompts.
CollaborativeAuto-creates an ephemeral team of agents that analyze the task from multiple perspectives, then a leader synthesizes the results. Phases: dispatch → think → synthesize → complete.
Multi-LLMSends the same task to multiple LLM providers/models in parallel. A designated judge model synthesizes the best result. Requires 2+ providers configured.
Think With MeSocratic brainstorming mode — agent asks follow-up questions and explores trade-offs without executing tools. Read-only tools only.

Note: Autonomous mode shows a confirmation dialog before enabling, since it bypasses all approval prompts.

Chat Mode

Chat mode is the direct assistant conversation surface. It is designed for normal Q&A, not task execution.

  • No tools by default: the assistant does not plan or call tools in normal chat mode
  • PDF exception: chat turns with uploaded PDF attachment metadata are auto-promoted to read-only analysis when deeper PDF content is needed, so parse_document can read the file without enabling mutating tools
  • No step timeline: chat turns do not render execution steps
  • Same-session follow-ups: later questions stay in the current conversation thread
  • Explicit only: chat behavior is enabled only when executionMode is explicitly set to chat
  • High output budget: explicit chat sessions use a fixed 48K target output cap, clamped to the active provider budget
  • History strategy: long chat sessions use a summary-plus-recent-window prompt strategy with cached summary reuse

See Chat Mode for the full behavior contract.

When the agent is operating in plan-mode execution, it can also use request_user_input to pause for structured multiple-choice decisions. Responses are persisted locally and can be submitted from either the desktop UI or the Control Plane web dashboard.

Guided Decisions & Runtime Recovery

The runtime now includes a set of decision and recovery contracts aimed at keeping tasks convergent without hiding failures:

CapabilityBehavior
Structured input requestsrequest_user_input asks 1-3 concise multiple-choice questions, pauses the task, and resumes after submit/dismiss. Only available in plan mode.
Explicit turn-window recoveryMain interactive tasks run without an implicit turn window. If a caller, managed template, or helper explicitly sets maxTurns or windowTurnCap, the runtime applies the requested turnBudgetPolicy, soft-logs exhausted adaptive windows, reserves space for finalization, and allows bounded follow-up recovery before triggering a safety stop.
Context overflow retryContext-capacity errors trigger compaction plus retry instead of immediate hard failure when the model context window is exceeded.
Workspace alias repairAbsolute alias paths such as /workspace/... can be remapped into the active workspace for file and directory tools, or blocked via strict_fail.
Pinned task-root repairRelative paths that drift outside the task's canonical root can be rewritten back under the pinned root, retried with a bounded budget, or rejected under strict policy.
Parallel tool-lane renderingParallel read-only tool groups are projected into stable lane rows in the timeline so summary mode stays readable.

Digital Twin Personas

Pre-built AI agent templates that create role-specific digital twins for team members. Each twin absorbs cognitively draining work so the human can stay in deep focus.

  • Built-in templates across engineering, management, product, data, operations, and venture/operator roles: including Software Engineer, Engineering Manager, Product Manager, Company Planner, Founder Office Operator, Growth Operator, and Customer Ops Lead
  • Persona-only by default: activation creates a role preset, not a core runtime participant
  • Optional automation pairing: always-on behavior is attached separately through automation profiles in Mission Control
  • 10 cognitive offload categories: context-switching, status-reporting, information-triage, decision-preparation, documentation, review-preparation, dependency-tracking, compliance-checks, knowledge-curation, routine-automation
  • 4 bundled skills: twin-status-report, twin-pr-triage, twin-meeting-prep, twin-decision-prep
  • One-click activation: Browse gallery, customize name, prompt, skills, and company context, then create
  • Enterprise scaling: Activate one twin per team member across the organization

Access from Mission Control > Add Digital Twin. See Digital Twins and Heartbeat v3 for the current runtime model.


Plugin Packs & Customize

Role-specific and workflow bundles that group skills, agent roles, connectors, and slash commands into installable packs. Each pack targets a job function or workflow area and can optionally link to a Digital Twin Persona as an optional role preset.

  • 19 bundled packs: Engineering, Engineering Management, Product Management, DevOps, Mobile Development, Game Development, Data Analysis, QA & Testing, Sales CRM, Customer Support, Content & Marketing, Technical Writing, Equity Research, Financial Analysis, Investment Banking, Private Equity, Wealth Management, Geo SEO, and CoWork Shortcuts
  • 100+ pack skills and shortcuts: Code review prep, sprint health, feature triage, incident response, prospect research, DCF modeling, LBO analysis, /strategy, /batch-rename, /gmail-summary-drive, /multi-source-report, and more
  • Unified Customize panel: Browse, enable/disable packs, view skills/commands/agents, click "Try asking" prompts
  • Search & filter: Real-time sidebar search across pack names, descriptions, categories, and skill names
  • Per-skill toggles: Enable or disable individual skills within a pack without toggling the entire pack
  • Persistent state: Pack and skill toggle states survive app restarts (stored in pack-states.json)
  • Digital Twin integration: 7 packs link to persona templates as optional role presets; always-on automation remains a separate core setup step
  • Recommended connectors: Packs display clickable connector chips that navigate to connector settings
  • Update detection: Background check against the remote registry with orange dot indicators on packs with newer versions
  • "Try asking" in chat: Empty chat state shows randomized prompt suggestions from enabled packs for one-click task creation
  • Message-box slash aliases: Plugin-pack slashCommands appear in the main / picker and invoke their mapped skill IDs. Alias enable/disable state follows pack and per-skill toggles.
  • Plugin Store: In-app marketplace for discovering, installing, and creating packs (from Git repos, URLs, or scaffold)
  • Managed import scanning: Git and URL pack installs are staged and scanned before activation, with install results surfaced as installed, installed with warning, or quarantined
  • Quarantine & report UX: blocked imported packs move into a dedicated quarantine area with stored reports, retry scan, and removal actions in the Customize panel
  • Warning-only local detection: unmanaged local pack folders remain discoverable in v1, but security findings can surface as warning badges and report details
  • Remote Pack Registry: Community-contributed packs catalog with search and category filtering
  • Extensible: Create custom packs with JSON manifests in ~/.cowork/extensions/
  • Active Context sidebar: Always-visible right-panel section showing connected MCP connectors with branded Lucide icons (44 connectors supported) and enabled skills, with scrollable sub-sections and 30-second auto-refresh
  • Skill conflict detection: Warns when multiple packs register the same skill ID, preventing silent overwrites
  • Admin Policies: Organization-level controls for allowed/blocked/required packs, installation permissions, and agent limits

Access from Settings > Customize. See Plugin Packs for pack management and Message Box Shortcuts for the composer shortcut model.


Skill Store & External Skills

CoWork OS supports external skill installation through the desktop GUI, not just bundled skills or CoWork-native packs.

  • CoWork Registry tab: Browse curated skills distributed through CoWork’s own registry flow
  • ClawHub tab: Search ClawHub directly from the app, view live skill stats, and install from result cards
  • Popular ClawHub list: Opening the ClawHub tab with no query shows the top downloaded public ClawHub skills
  • External import field: Install skills from Git repositories, ClawHub page URLs, raw JSON manifests, or raw SKILL.md URLs
  • Managed install path: Imported skills are staged, scanned, and then copied into CoWork’s managed skills directory only when activation is allowed
  • Security outcomes: installs now return installed, installed with warning, or quarantined, with summary text surfaced directly in the Skill Store UI
  • Quarantine & report UX: blocked imported skills move to a dedicated quarantine area with stored reports, retry scan, and removal actions
  • Optional external directories: Add one or more absolute read-only skill folders in Settings so shared team skills can load without being copied into CoWork
  • Warning-only local discovery: optional external skill directories are not auto-quarantined in v1, but CoWork can still surface unscanned or warning-state badges and reports for those bundles
  • Clear precedence: Workspace skills override managed installs, managed installs override external directories, and external directories override bundled defaults
  • Cross-ecosystem support: Other external skill stores are supported when they expose Git repos, raw manifests, or raw SKILL.md bundle entry points
  • Shared runtime contract: Once loaded, external skills follow the same additive execution model as bundled skills. They can add context and scoped directives, but they cannot replace the canonical task prompt.

Access from Settings > Skills > Skill Store. Users can start with bundled global skills such as llm-wiki, kami, react-best-practices, and taste-skill, then add third-party skills through the same runtime model. See Skill Store & External Skills for install/import behavior and Skills Runtime Model for execution semantics.


Admin Policies (Enterprise)

Organization-level policy controls for managing plugin packs, connectors, and agents across teams.

Policy AreaCapabilities
Pack policiesAllow, block, or require specific packs by ID. Whitelist mode restricts to approved packs only.
Connector policiesBlock specific MCP connectors
Agent policiesSet max heartbeat frequency (min 60s) and max concurrent agents per workspace
Installation controlsToggle custom pack creation, Git-based install, URL-based install
Organization directoryDistribute admin-managed packs from a shared directory to all users

Policy enforcement:

  • Blocked packs appear disabled in the Customize panel and cannot be enabled
  • Required packs cannot be disabled by users
  • Installation policies block scaffold, Git install, and URL install at the handler level
  • Organization packs load from a configurable shared directory

Access from Settings > Admin Policies (Power density mode). See Admin Policies for full documentation.


Voice Mode

Talk to your AI assistant with voice input and audio responses.

FeatureDescription
Text-to-SpeechElevenLabs (premium), OpenAI TTS, or local Web Speech API
Speech-to-TextOpenAI Whisper for accurate transcription
Multiple VoicesElevenLabs voices or OpenAI voices (alloy, echo, fable, onyx, nova, shimmer)
Outbound Phone CallsInitiate calls via ElevenLabs Agents
ProviderTTSSTTCost
ElevenLabsYes (premium)Pay-per-character
OpenAIYesYes (Whisper)Pay-per-token
LocalYes (Web Speech)Coming soonFree

Configure in Settings > Voice.


Persistent Memory System

FeatureDescription
Curated Hot MemorySmall prompt-visible memory lane for durable user/workspace facts, constraints, workflow rules, project facts, and active commitments
Wake-Up LayersThe runtime exposes memory as L0 Identity, L1 Essential Story, L2 Topic Packs, and L3 Deep Recall; only L0 + L1 are injected into the live prompt by default
Curated Memory Toolsmemory_curate adds/replaces/removes curated entries, and memory_curated_read inspects the current hot-memory layer with stable entry IDs for deterministic edits
Archive Memorymemory_save persists observations, decisions, errors, and insights into the larger searchable archive lane for cross-session recall
Structured ObservationsArchive memories get inspectable sidecar metadata with title, narrative, facts, concepts, files, tools, source events, privacy state, and deterministic migration status
Progressive Recall Toolsmemory_search_index, memory_timeline, and memory_details let agents search compact metadata first, inspect timeline context second, and fetch full details only for selected IDs
Checkpoint CaptureRuntime-native checkpoints are written before compaction, on non-trivial task completion, and every 12 meaningful exchanges, each carrying both a structured summary and a verbatim evidence packet
Session Recallsearch_sessions searches recent transcript spans and optional checkpoints when the agent needs to recall what happened in a prior run
Verbatim Quote Recallsearch_quotes returns exact spans with provenance from transcripts, task messages, imported memories, and indexed workspace markdown when the agent needs “what was actually said?”
Topic Packsmemory_topics_load loads focused packs from .cowork/memory/topics, and refresh: false performs a true read-only lookup over existing topic files
Memory Hub Preview And InspectorMemory Hub shows the current L0/L1 payload, the L2/L3 layers excluded from default injection, structured observation search, detail/timeline views, token estimates, privacy filters, metadata editing, promotion, redaction, suppression, and explicit metadata rebuild
Supermemory ProviderOptional external provider lane with prompt-time profile injection, explicit supermemory_profile / supermemory_search / supermemory_remember / supermemory_forget tools, and optional mirroring of non-private MemoryService.capture(...) writes
Privacy ProtectionAuto-detects sensitive patterns (API keys, passwords, tokens)
Unified Searchsearch_memories searches archive memory plus indexed .cowork/ markdown with hybrid semantic + BM25 ranking
LLM CompressionSummarizes observations for ~10x token efficiency
Prompt DefaultsL0 Identity and L1 Essential Story are injected by default; archive injection is off by default; L2/L3 recall stays explicit and tool-driven
Temporal Knowledge GraphRelationships can carry valid_from / valid_to, kg_invalidate_edge closes an active fact without deleting history, and historical reads can opt into as_of
ChatGPT History ImportImport your full ChatGPT conversation history — eliminates cold start. All data stored locally and encrypted. Details below
Per-Workspace SettingsEnable/disable, privacy modes, retention policies
Optional External Memory ProviderSupermemory can be enabled from Memory Hub for prompt-time profile injection, explicit external memory tools, and optional mirroring of non-private local memory captures. Guide

Privacy Modes: Normal (auto-detect sensitive data), Strict (all private), Disabled (no capture).

Inline privacy controls are also available during capture: <no-memory> disables automatic capture for that task content, and <private>...</private> redacts the marked segment from captured memory. Redacted and suppressed observations are excluded from prompt recall, and private/redacted/suppressed entries are not mirrored to Supermemory.

Supermemory is additive, not a replacement for local memory. CoWork still keeps the workspace kit, curated hot memory, archive memory, structured observation metadata, transcript recall, and knowledge graph locally. The current integration mirrors local memory captures only when you opt in; it does not yet stream every chat turn into Supermemory conversations. See Structured Memory Observations and Supermemory Integration.

Configure in Settings > Memory Hub.


ChatGPT History Import

Import your full ChatGPT conversation history into CoWork OS's memory system. Instead of starting from scratch, the agent immediately understands your preferences, past projects, communication style, and context from hundreds or thousands of previous conversations.

How It Works

  1. Export from ChatGPT: Go to ChatGPT Settings > Data Controls > Export Data. OpenAI emails you a .zip file containing conversations.json.
  2. Import in CoWork OS: Go to Settings > Memory Hub > Import ChatGPT History and select the exported .zip or conversations.json file.
  3. Processing: Conversations are parsed, deduplicated, and stored as memory entries with full-text search indexing. User messages are captured as context; assistant responses are summarized for token efficiency.

What Gets Imported

DataHow It's Used
Your messagesStored as observations — reveals your interests, projects, preferences, and communication style
Assistant responsesSummarized and stored as insights — captures decisions, recommendations, and solutions you received
Conversation titlesIndexed for semantic search — helps match relevant past context to new tasks
TimestampsPreserved for time-based relevance ranking — recent conversations weighted higher

Security & Privacy

  • Stored locally only — All imported data is written to the local SQLite database on your Mac. Nothing is uploaded, synced, or sent anywhere.
  • Encrypted at rest — The database is protected by the same AES-256 encryption and OS keychain integration used for all CoWork OS data.
  • Privacy filtering — The same auto-detection that filters API keys, passwords, and tokens from regular memories applies to imported history.
  • No provider access — Imported memories are injected into prompts locally. Your ChatGPT history is never sent back to OpenAI or any other provider — only the relevant snippets are included in task context.
  • Deletable — You can clear all imported memories at any time from Settings > Memory Hub.

Why This Matters

Most AI assistants start with zero context about you. Every new tool means re-explaining your preferences, projects, and constraints. ChatGPT history import eliminates this cold-start problem — CoWork OS learns from your existing AI conversations so it can be useful from the first task.


Durable Learning Stack

CoWork OS still keeps a multi-layered learning stack under the reflective loop. These services improve recall, personalization, and future evidence quality across sessions.

LayerServiceWhat It Learns
Task PatternsPlaybookServiceSuccessful approaches, failure categories, error recovery strategies
Core MemoryMemoryServiceObservations, decisions, insights with hybrid semantic + BM25 search
User ProfileUserProfileServiceName, preferences, location, goals, constraints
RelationshipRelationshipMemoryService5-layer context: identity, preferences, context, history, commitments
FeedbackFeedbackServiceRejection patterns, preference corrections, workspace-local MISTAKES.md

Key mechanisms:

  • Error classification: 7 categories for targeted recovery strategies
  • Confidence decay: older playbook entries receive lower relevance scores (30d: 0.8x, 90d: 0.5x)
  • Reinforcement: successful patterns are boosted via reinforcement memories
  • Mid-task correction detection: regex-based detection of user corrections during execution
  • Retry-aware reuse: retries can reuse playbook patterns during planning, recent session recall during planning/execution/follow-ups, and pending verification checklist state instead of restarting cold
  • /learn skill: manually teach the agent insights, corrections, preferences, or rules

These layers feed Workflow Intelligence and the normal task runtime. See Workflow Intelligence for the full architecture guide.

Evolving Agent Intelligence

A set of connected subsystems that make improvement visible and measurable over time.

SubsystemPurpose
Layered Memory RuntimeUses explicit wake-up layers: <cowork_hot_memory> for L0 Identity, <cowork_structured_memory> for L1 Essential Story, and tool-driven L2/L3 recall through memory_topics_load, search_sessions, search_memories, and search_quotes.
Retry-Aware Recovery GuidanceWhen execution retries or resumes, injects retry count, retry reason/classification, pending verification items, and recent session evidence so the agent keeps moving from the last good state instead of restarting blindly. Planning retries can also include compact playbook context.
Adaptive Style EngineObserves message length, emoji usage, technical vocabulary, and structured feedback to gradually shift personality settings (response length, emoji usage, explanation depth). Rate-limited to a configurable number of level-shifts per week.
Playbook-to-Skill PromotionWhen a playbook pattern is reinforced 3+ times, auto-generates a skill_proposal for admin review. No skill is created until explicitly approved.
Channel Persona AdapterApplies channel-appropriate communication directives (Slack = terse/structured, email = formal/greeting+sign-off, WhatsApp = short/emoji, etc.) on top of the core persona without replacing it.
Evolution MetricsComputes 5 on-demand metrics: Correction Rate, Style Adaptations, Knowledge Graph growth, Task Success Rate, and Style Alignment. Produces an overall 0–100 Evolution Score. Surfaced in the Daily Briefing.
Daily Operational LogDailyLogService manages optional per-day raw logs under .cowork/memory/daily/<YYYY-MM-DD>.md. Raw logs are never injected directly into prompts.
Daily Log SummarizerReads pre-written summary files from .cowork/memory/summaries/<YYYY-MM-DD>.md, applies recency decay, and feeds ranked summaries into the structured-memory lane.

Behavior Adaptation controls (Settings > Guardrails > Behavior Adaptation):

  • Adaptive Style toggle — enable/disable style learning (off by default)
  • Max drift per week — maximum one-level style shifts per 7-day window (default: 1)
  • Reset learned style — clears all accumulated style adaptations
  • Channel Persona toggle — enable/disable per-channel communication adaptation (off by default)

See Evolving Agent Intelligence and Behavior Adaptation for full details.


Operator Runtime Visibility

CoWork OS now exposes the learning loop as a visible operator surface instead of leaving it buried inside background services.

SurfaceWhat users see
Task learning progressionA standardized post-task learning card showing memory capture, playbook reinforcement, skill proposal state, evidence links, and next action. The same event stream also feeds activity feeds and Mission Control task details.
Unified recallOne “search everything” surface across tasks, messages, files, workspace notes, memory entries, and knowledge-graph context, with shared ranking/dedup logic for UI and prompt injection.
Persistent shell sessionsLong-lived shell state per task/workspace with retained cwd, env deltas, aliases, reset controls, and one-shot fallback for incompatible commands.
Model routing visibilityLive active provider/model, routing reason, fallback chain, and retry/fallback state in the task UI and settings surfaces.
Applied skills visibilityThe task header keeps the canonical request visible and shows applied skills separately, including trigger/reason metadata from runtime events.

This layer is intentionally additive. It makes learning and routing legible while preserving the desktop control plane, channels, inbox, devices, and governed automation that define CoWork OS.

See Operator Runtime Visibility for the cross-surface implementation summary and Skills Runtime Model for the skill-specific runtime contract.


Knowledge Graph

SQLite-backed structured entity and relationship memory with full-text search and graph traversal.

FeatureDescription
10 built-in entity typesperson, organization, project, technology, concept, file, service, api_endpoint, database_table, environment
15 built-in edge typesuses, depends_on, part_of, created_by, maintained_by, deployed_to, and more
FTS5 searchFull-text search with BM25 ranking over entity names and descriptions
Graph traversalIterative BFS up to 3 hops with edge type filtering
ObservationsAppend-only timestamped fact log per entity
Auto-extractionRegex-based entity extraction from completed task results
Confidence decayAuto-extracted entities decay over time (floor: 0.3)
9 agent toolskg_create_entity, kg_update_entity, kg_delete_entity, kg_create_edge, kg_delete_edge, kg_add_observation, kg_search, kg_get_neighbors, kg_get_subgraph
Context injectionRelevant entities auto-injected into task system prompts

See Knowledge Graph for the full architecture guide.


Workspace Kit (.cowork)

Initialize and maintain a .cowork/ directory inside each workspace for durable, human-edited context, scoped prompt injection, project scaffolding, and workspace health checks.

The workspace kit is contract-driven: every tracked markdown file has a declared title, scope, parser, prompt budget, freshness window, mutability model, and optional special handling.

Root workspace files

FileTitleScopeParserTypical use
AGENTS.mdWorkspace Rulestask, main-sessionsectionedworkspace-wide operating guidance and coordination rules
MEMORY.mdLong-Term Memorytask, main-sessiondecision-logdurable learnings and long-lived constraints
USER.mdUser Profiletask, main-sessionkv-linespreferences, timezone, communication defaults
TOOLS.mdLocal Setup Notestask, main-sessionsectionedenvironment notes, common commands, local conventions
IDENTITY.mdWorkspace Identitytask, main-session, rolekv-lineswho the agent is and what it owns
RULES.mdOperational Rulestask, main-session, role, company-opschecklistmust/must-not behavior and approval defaults
SOUL.mdWorkspace Personatask, main-session, rolesectionedtone, collaboration style, execution philosophy
VIBES.mdCurrent Operating Modetask, rolesectionedwhat to optimize for right now
MISTAKES.mdRecurring Mistakestask, main-session, roledecision-logrecurring failure patterns and corrections
LORE.mdDurable Contexttask, main-sessiondecision-logimportant historical decisions and background context
CROSS_SIGNALS.mdCross-Agent Signalstask, main-session, company-opsdecision-logcontradictions, risks, amplified opportunities
PRIORITIES.mdCurrent Prioritiescompany-ops, taskchecklistcurrent priorities, owners, review dates
COMPANY.mdCompany Contextcompany-opssectionedmission, offer, customer, constraints
OPERATIONS.mdOperating Modelcompany-opssectionedauto-allowed actions, approvals, escalation paths
KPIS.mdBusiness Metricscompany-opssectionedmetrics, targets, and guardrails
BOOTSTRAP.mdBootstrap Instructionsbootstrapchecklistone-time onboarding checklist
HEARTBEAT.mdHeartbeat Checklistheartbeatchecklistrecurring Heartbeat v3 checklist work

Project and role subdirectories

  • Project-specific context lives under .cowork/projects/<projectId>/
  • CONTEXT.md is the project-scoped task brief, decisions, and notes file
  • ACCESS.md is the project-scoped access and boundary file for task and role usage
  • Per-role persona files live under .cowork/agents/<roleId>/
  • The health model also tracks supporting directories such as .cowork/memory/, .cowork/memory/hourly/, .cowork/memory/weekly/, .cowork/projects/, and .cowork/agents/

Frontmatter, parsing, and injection

Tracked files can begin with simple frontmatter:

---
updated: 2026-03-14
---
  • updated is expected on files with freshness windows so the app can mark stale context correctly
  • Bodies are sanitized and redacted before prompt injection
  • Oversized files are truncated to each file's prompt budget and reported with a truncation warning
  • Parsers are file-specific: sectioned for heading-based notes, kv-lines for filled key/value fields, checklist for rules and recurring lists, and decision-log for durable bullet-style history

Special handling

  • BOOTSTRAP.md is onboarding-only context, not a durable memory file
  • When BOOTSTRAP.md is first present, CoWork OS records bootstrapSeededAt in .cowork/workspace-state.json
  • When BOOTSTRAP.md is later removed, CoWork OS records onboardingCompletedAt and does not recreate it during missing-only init flows
  • HEARTBEAT.md is reserved for recurring Heartbeat v3 checklist work and is intentionally separate from general task/session context

Health, linting, and revisions

  • The app surfaces workspace-kit health with missing tracked entries, stale files, warning/error counts, revision counts, and onboarding metadata
  • ACCESS.md and TOOLS.md receive additional secret detection to catch likely credentials or copied tokens
  • Tracked writes keep snapshots under .cowork/**/.history/<file>/ together with revision metadata
  • search_memories indexes .cowork/ markdown alongside the main memory system
  • npm run kit:lint validates the current workspace kit from the command line
  • npm run kit:lint -- --json emits raw status JSON
  • npm run kit:lint -- --strict exits non-zero on warnings or missing tracked entries

Quick-open kit files

The Memory Hub exposes Open USER.md and Open MEMORY.md buttons that open the corresponding .cowork/ file directly in the system editor. If the file does not exist it is created from a default template (with full frontmatter and section scaffolding) before opening.

Configure in Settings > Memory Hub.


Role Profile Files

Define per-role personality and operating guidelines in .cowork/agents/<role-id>/. These files reuse the same contracts, parser rules, and titles as the root workspace kit, and role/task prompts can combine role files with root workspace files when scopes match.

FileTitlePurpose
IDENTITY.mdWorkspace Identityrole identity, ownership boundaries, confirmation rules
RULES.mdOperational Rulesrole-specific must/must-not behavior and safety defaults
SOUL.mdWorkspace Personacollaboration style, tone, execution philosophy
VIBES.mdCurrent Operating Modecurrent emphasis, urgency, and optimization target

Agent Teams

FeatureDescription
Team ManagementCreate and manage teams with multiple agent members
Persistent TeamsMark teams as persistent so they survive across sessions with a default workspace
Shared ChecklistsAgents share checklist items for coordinated task execution
Run TrackingTrack team runs with status, progress, and history
Collaborative ModeEphemeral teams with real-time thought sharing
Multi-LLM ModeDispatch same task to multiple providers with judge-based synthesis
Collaborative ThoughtsReal-time thought panel shows agent reasoning as it happens

Configure in Mission Control > Teams.


Mission Control

Centralized agent orchestration and monitoring dashboard. Access from Settings > Mission Control.

PanelPurpose
AgentsActive agents list with status dots, Pulse/Dispatch state, automation-profile-backed cadence, and manual trigger controls
Mission Queue5-column Kanban board (Inbox → Assigned → In Progress → Review → Done) with drag-and-drop
Feed & DetailsReal-time activity feed with event type and agent filters, plus task detail view with comments and mentions
Core HarnessRuntime traces, failure clusters, evals, experiments, and learnings

Header controls: Agent Teams management, Performance Reviews, Standup Report generation, and workspace selector with live stats (active agents, queued tasks, pending mentions).

All panels update in real-time via event subscriptions — no manual refresh needed.

Dispatched Agents Progress: When you mention agents in a task prompt (e.g. @Security Analyst review this codebase), the parent task's main window shows a live progress panel with:

  • Agent chips showing each dispatched agent's status (working/completed/failed)
  • Phase indicator (Dispatched → Working → Complete)
  • Real-time event stream from all child agent tasks (plans, steps, results)
  • Click any agent chip to jump to that agent's full task view

Cancelling a parent task automatically cascades to all dispatched child tasks.

See Mission Control for the full guide.


Digital Twins (Persona Templates)

Create role-specific AI digital twins from pre-built persona templates. Each twin absorbs cognitively draining tasks so the human stays in flow. Twins are optional persona presets, not direct owners of the always-on runtime. Accessible via the "Add Digital Twin" button in Mission Control's agents panel.

Templates (10 roles, 5 categories)

CategoryTemplates
EngineeringSoftware Engineer, Hardware Engineer, QA/Test Engineer, DevOps/SRE, Technical Writer
ManagementEngineering Manager, Technical Director, VP Engineering
ProductProduct Manager
Data & AnalyticsData Scientist / Analyst

What Each Template Includes

ComponentDescription
System PromptRole-tailored persona with behavior guidelines
CapabilitiesSkill tags (code, review, test, analyze, document, etc.)
Cognitive OffloadCategorized by mental burden relieved: context switching, status reporting, review prep, decision prep, documentation, dependency tracking
Recommended SkillsPre-mapped skills with required/optional flags
Autonomy Levelspecialist (IC roles) or lead (management roles)

Activation Flow

  1. Click "Add Digital Twin" in Mission Control agents panel
  2. Browse the template gallery — filter by category or search by name/tags
  3. Click a template card to open the activation dialog
  4. Customize: twin name, prompt/persona settings, and recommended skills
  5. Click "Create Digital Twin" — creates a new AgentRole with persona defaults

The twin appears in the agents panel as a normal role. If you want it to participate in always-on automation, attach a separate automation profile afterwards.


Build Mode

Dedicated "idea → working prototype" workflow powered by Live Canvas with four phases:

PhaseDescription
ConceptRestate the idea, identify core requirements, choose tech stack
PlanBreak down into components, define file structure, outline implementation
ScaffoldGenerate working code, push to canvas, create checkpoint
IterateRefine based on feedback, add features, polish UI

Each phase creates a named checkpoint. You can revert to any phase, diff between phases, and view the full phase timeline. Build Mode is available as a built-in skill (build-mode).

See Live Canvas for the full guide.


Usage Insights

Dashboard showing task activity, cost trends, agent efficiency, and productivity patterns. Access from Settings > Usage Insights.

Overview

The panel opens with a hero stats row showing four key metrics at a glance:

StatDescription
CompletedTotal tasks completed in the selected period
Success RatePercentage of completed tasks out of total, with a color-coded progress bar (green ≥ 70%, amber ≥ 40%, red < 40%)
FailedTotal failed tasks
Avg TimeAverage completion time across completed tasks

Below the hero row, detailed sections are arranged in a two-column grid for information density.

Workspace Filtering

The workspace dropdown at the top lets you filter insights to a single workspace or view All Workspaces (the default). "All Workspaces" aggregates metrics across every workspace, giving you a global view of your agent usage.

Sections

SectionDescription
Cost & TokensTotal cost, input/output token counts, cached-token totals, cache-read rate when available, and cost breakdown by model
Agent Efficiency (AWU)Agentic Work Unit metrics — see below
Activity by DayTasks per day-of-week with peak day indicator
Activity by HourHourly task histogram with peak hour indicator
Top SkillsMost-used skills ranked by usage count
Skill Usage by PackSkills grouped by their parent plugin pack with aggregated usage counts and mini bar charts
Persona PerformancePer-persona totals, success/failure mix, retry behavior, and cost attribution
Feedback & QualityTask-result satisfaction rate, top rejection reasons, retried-task count, and average attempts

Agentic Work Units (AWU)

Inspired by Salesforce's AWU concept, an Agentic Work Unit represents one successfully completed unit of agent work.

Definition: 1 AWU = 1 task with status = 'completed' AND terminal_status of ok or partial_success.

The AWU section shows:

MetricDescription
AWU CountNumber of completed work units in the period
Tokens per AWUTotal tokens consumed ÷ AWU count (lower is more efficient)
Cost per AWUTotal cost ÷ AWU count (lower is cheaper)
AWUs per DollarAWU count ÷ total cost (higher is better ROI)
Efficiency TrendPercentage change in tokens/AWU and cost/AWU vs the previous period. A downward arrow (green) means improvement; upward (red) means regression

The trend comparison uses the same period length — e.g., if you're viewing a 7-day window, it compares against the prior 7 days. The AWU section is hidden when no tasks were completed in the period.

Period Selection

Supports 7-day, 14-day, and 30-day windows. Per-pack analytics cross-reference skill usage with plugin pack membership, showing which packs drive the most value.


Daily Briefing

Proactive morning briefing combining:

  • Task summary: Completed in last 24 hours, currently in progress, scheduled for today
  • Recent highlights: Key insights and decisions from memory
  • Suggested priorities: Based on user profile goals, or sensible defaults

Configurable as a scheduled task in Settings > Scheduled Tasks with time picker and channel delivery.


Citation Engine

Automatic source attribution for web research. When agents use web_search or web_fetch, the Citation Engine tracks and deduplicates all referenced URLs, assigning sequential citation indices.

FeatureDescription
Auto-trackingIntercepts results from web_search and web_fetch tools
DeduplicationSame URL referenced multiple times gets a single [N] index
System prompt injectionFormatted citation list injected into LLM context so the agent can reference sources
Citation panelUI panel showing all sources with URL, title, domain, snippet, and access timestamp

Citations appear inline in agent responses as [1], [2], etc. and link to the source in the Citation Panel.


Scratchpad Tools

Session-scoped note-taking system for agents during long-running tasks.

ToolDescription
scratchpad_writeWrite or update notes with key-value pairs (max 100-char keys, 10,000-char values)
scratchpad_readRetrieve all notes or a specific note by key

Notes persist to .cowork/scratchpad-{taskId}.json for crash recovery. The scratchpad is ephemeral per task — useful for agents to track intermediate findings, partial results, and working state during complex multi-step tasks.


Workflow Pipeline & Deep Work Mode

Workflow Pipeline

Multi-phase task execution for complex workflows. The Workflow Decomposer detects multi-step prompts (using connectives like "then", "after that", "next", "finally") and splits them into sequential phases.

FeatureDescription
Auto-detectionRegex-based decomposition of multi-phase prompts
5 phase typesresearch, create, deliver, analyze, general
Sequential executionEach phase creates a child task; output pipes into the next phase
LLM fallbackComplex prompts that resist regex decomposition use LLM-powered splitting
Pipeline eventspipeline_started, phase_started, phase_completed, pipeline_completed

Deep Work Mode

Extended execution mode for complex tasks that need sustained focus:

  • Extended timeouts — Deep work tasks get longer execution budgets
  • Progress journaling — Agent records progress notes during execution, visible in the task timeline
  • Memory compression pause — Memory service pauses background compression during active deep work to avoid context disruption

Document Generation Tools

Four dedicated agent tools for generating formatted documents from task context:

ToolOutputDescription
compile_latexPDF + .tex source pairingCompile a workspace .tex file with an installed system engine (tectonic, latexmk, xelatex, lualatex, or pdflatex) and register the PDF with source metadata
generate_documentPDFGenerate PDF documents with markdown content and structured sections
generate_presentationPPTXGenerate PowerPoint presentations with multiple slides
generate_spreadsheetXLSXGenerate Excel spreadsheets with multiple sheets and data

These tools complement the existing document skills (spreadsheet.ts, document.ts, presentation.ts) by providing direct LLM-callable tool interfaces. Generated files are registered as task artifacts with proper MIME types.

Spreadsheet outputs also participate in the artifact workbench. Task completion cards, file_created, and artifact_created events can render the compact spreadsheet card for Excel, delimited, Numbers, Google Sheets shortcut, ODS, and XLSB outputs. Editable spreadsheet formats default to the right sidebar; native/app-owned formats open externally. Fullscreen mode preserves task follow-up context and filters the context panel to the current prompt after a follow-up starts. See Spreadsheet Artifacts.

Word-style document outputs also participate in the artifact workbench. Task completion cards, file_created, file_modified, and artifact_created events can render the compact document card for DOCX, DOCM, DOTX, DOTM, DOC, RTF, ODT, OTT, and Pages outputs. DOCX defaults to the right-sidebar editor and can expand to fullscreen with follow-up context; other document formats use best-effort preview or external-open actions. See Document Artifacts.

For explicit LaTeX, TeX, TikZ, .tex, or "write a paper and compile PDF" requests, the runtime prefers the source-first path: write the editable .tex file, then call compile_latex. CoWork does not bundle a TeX distribution; if no supported engine is installed, the task keeps the .tex source and reports the missing dependency instead of silently falling back to the HTML/markdown PDF generator.

PPTX outputs also participate in the artifact workbench. Task completion cards, file_created, file_modified, artifact_created events, and assistant summaries can show an inline presentation card, and default Open launches the resizable sidebar viewer for .pptx decks. CoWork always extracts slide text and notes first, then renders or reuses cached slide images in the background through the Codex presentation runtime with local converter fallback. Fullscreen mode keeps the functional follow-up composer and refreshes the preview after matching deck updates. See Presentation Artifacts and PPTX Preview.

Web page outputs also participate in the artifact workbench. Task completion cards, file_created, file_modified, artifact_created events, and assistant summaries can show an inline web page card for generated .html / .htm files. Built React/Vite/Next output entrypoints such as dist/index.html, build/index.html, and out/index.html preview in the same sandboxed iframe surface. React-style source projects without built output return a structured build-output-needed state; CoWork does not auto-run dev servers from the artifact viewer. See Web Page Artifacts.

Live website testing uses the Browser Workbench instead of the generated web artifact iframe. Browser-use prompts open a visible right-sidebar/fullscreen webview with a persistent workspace profile, Browser V2 snapshot refs, functional navigation controls, diagnostics, screenshots, annotation, and cursor movement for agent actions. See Browser Workbench.


Event Triggers

Condition-based automation engine that fires actions in response to events.

FeatureDescription
Trigger sourcesChannel gateway messages, cron service, webhooks, and MCP connector/resource events
Action typescreate_task, send_message, wake_agent
Condition logic"all" (AND) evaluation of multiple conditions
CooldownConfigurable cooldown period (default 1 min) to prevent rapid re-firing
Connector content filtersOptional serverId, connectorId, and resourceUri filters let a trigger subscribe to specific MCP-backed content changes
Variable substitutionEvent data can be injected into action prompts/titles
HistoryLast 50 fires per trigger stored for audit

Configure in Settings > Event Triggers.


File Hub

Unified file aggregation service combining local workspace files, task artifacts, and cloud storage into a single searchable interface.

FeatureDescription
Multi-sourceLocal workspace files, task artifacts, connected cloud storage
SearchFilename-based search across all connected sources
Recent filesTracks recently accessed files with timestamps
MIME detection20+ common formats (PDF, images, docs, sheets, slides, code, etc.)
Spreadsheet workbenchWorkbook and CSV/TSV task outputs open in a resizable sidebar or fullscreen editable grid with copy/save/zoom and follow-up composer controls; native spreadsheet formats keep external artifact actions
Presentation workbenchPPTX decks show artifact cards, sidebar/fullscreen preview, thumbnails, text, speaker notes, fast text-first loading, cached rendered slides, and external open/show actions when opened from task artifacts or Files
Web page workbenchGenerated HTML/HTM files and built React output open in a resizable sidebar or fullscreen sandboxed iframe preview with browser/folder/copy actions and follow-up refresh after completion
Source/rendered pairsLaTeX .tex files compiled through compile_latex are paired with their generated PDFs in task artifact surfaces

Access from the File Hub panel in the sidebar.


Web Access

Serve CoWork OS as a web application accessible from any browser on the network.

FeatureDescription
HTTP serverConfigurable host/port with static file serving
AuthenticationBearer token with timing-safe comparison
CORSOrigin whitelisting for cross-origin access
REST APIMaps endpoints to IPC channels (tasks, workspaces, accounts, briefings, suggestions)
WebSocketReal-time event streaming for connected clients
Health checkUnauthenticated /api/health endpoint for monitoring

Configure in Settings > Web Access.


Vision Tools

Multi-provider image and PDF analysis with caching and optimization.

ToolDescription
analyze_imageAnalyze any image with the active non-Gemini vision LLM (OpenAI, Anthropic, Azure OpenAI, Bedrock)
read_pdf_visualConvert PDF pages to images and analyze layout/design
parse_documentExtract text from PDFs and other document formats; this is the preferred path for ordinary PDF summaries, Q&A, extraction, comparison, and transformation
FeatureDescription
Result cachingSHA1-keyed cache (128 entries) prevents redundant vision API calls
Auto-downscalingImages >2MB automatically downscaled to 1600×1200 at 80% quality
Active-provider routingImage analysis uses the active non-Gemini image-capable provider; otherwise the user is asked to switch models
Retry logicTransient errors (429, 5xx, timeouts) trigger single retry
PDF text readingUploaded PDFs use compact prompt excerpts first, then parse_document for deeper content; normal text PDFs use embedded text, while weak/scanned PDFs can fall back to OCR-aware extraction
PDF visual conversionread_pdf_visual uses pdftoppm to convert PDF pages to PNG for layout/design/page-appearance analysis

Adaptive Complexity

Three-tier UI density controlling which features and settings are visible:

TierDescription
FocusedSimplified view — hides Connected Tools, Remote Access, Extensions, Infrastructure. Shows only core settings.
StandardDefault view — all settings visible (default)
PowerFull power-user view with all settings and advanced options

Configure in Settings > Appearance.


Configurable Guardrails

GuardrailDefaultRange
Token Budget100,0001K - 10M
Cost Budget$1.00 (disabled)$0.01 - $100
Iteration Limit505 - 500
Dangerous Command BlockingEnabledOn/Off + custom
Auto-Approve Trusted CommandsDisabledOn/Off + patterns
File Size Limit50 MB1 - 500 MB
Domain AllowlistDisabledOn/Off + domains

Code Tools

Built-in tools for efficient code navigation and editing:

ToolDescription
globFast pattern-based file search (e.g., **/*.ts)
grepRegex content search across files with context lines
edit_fileSurgical file editing with find-and-replace
git_commitCommit changes in the workspace (or worktree)
git_diffView staged/unstaged changes
git_branchList, create, or switch branches

Live Canvas

Agent-driven visual workspace for interactive content creation and data visualization.

  • Interactive Preview: Full browser interaction within the canvas
  • Snapshot Mode: Auto-refresh preview every 2 seconds
  • Canvas Tools: canvas_open_session, canvas_set_state, canvas_eval, canvas_close_session
  • Named Checkpoints: Save, restore, diff, and label canvas states for easy navigation
  • Build Mode: Phased idea-to-prototype workflow (Concept → Plan → Scaffold → Iterate) with per-phase checkpoints
  • Visual Annotation: visual_open_annotator and visual_update_annotator for iterative image refinement
  • Export: HTML, open in browser, or reveal in Finder
  • Snapshot History: Browse previous canvas states
  • Keyboard Shortcuts: Toolbar controls for common actions

Generated web page artifacts use a separate durable-output path. When a task writes .html / .htm or built React output such as dist/index.html, the task feed shows a web artifact card that opens in the sidebar/fullscreen artifact viewer instead of the Live Canvas session surface. See Web Page Artifacts.

See Live Canvas for the full guide.


Browser Automation

Three-tier web interaction stack — from lightweight HTTP fetching to visible in-app Browser V2 automation to anti-bot scraping — all as native agent tools with no external CLI dependencies.

In-App Browser Workbench

Interactive browser-use tasks open inside CoWork OS by default. When a user asks the agent to go to a site and test, use, click through, or inspect it as a normal user, browser_navigate opens a visible browser workbench in the resizable right sidebar. Browser V2 controls the same webview the user can see through a main-process session manager and CDP-backed actions, using a persistent per-workspace browser profile that is isolated from system Chrome.

The browser workbench supports:

  • right-sidebar placement with the same persisted width behavior as document, spreadsheet, presentation, and web artifact workbenches
  • tab strip, URL bar, profile/security indicator, back, forward, reload, fullscreen, and close controls
  • visible routing for navigation, snapshots, ref-aware actions, keyboard/scroll/select actions, content extraction, evaluation, diagnostics, downloads/uploads, dialogs, emulation, traces, and screenshots
  • compact accessibility snapshots with short-lived refs for precise click, fill, type, read, hover, drag, and upload actions
  • visible cursor movement and action pulses for agent clicks, fills, typing, selects, waits, reads, scrolls, and navigation
  • diagnostics drawer and tools for console, network, downloads, storage, and trace state
  • workspace screenshot capture plus in-app screenshot annotation that can be saved or sent back to the agent as an image attachment
  • fullscreen mode with the same follow-up composer and latest-turn/working context frame used by artifact workbenches
  • optional fallback to forced headless Playwright or explicit Chrome DevTools attach for background runs and signed-in system Chrome/Edge sessions

Use web_fetch for reading a known static URL. Use the browser workbench for interactive websites, JavaScript-heavy pages, forms, app testing, and visual checks. See Browser Workbench for user behavior and Browser V2 Architecture for the implementation contract.

Web Search (6 providers, always available)

Multi-provider web search with automatic fallback. DuckDuckGo is built-in and requires no API key, so web_search works out of the box for every user.

ProviderTypesAPI KeyNotes
DuckDuckGoWebNot requiredBuilt-in free fallback, always last in chain
TavilyWeb, NewsRequiredAI-optimized results (recommended)
ExaWeb, NewsRequiredSemantic search and research-heavy retrieval
Brave SearchWeb, News, ImagesRequiredPrivacy-focused
SerpAPIWeb, News, ImagesRequiredGoogle results
Google Custom SearchWeb, ImagesRequiredDirect Google integration

Paid providers are tried first in configured order. DuckDuckGo is automatically appended as the last-resort fallback. Includes retry with exponential backoff, provider cooldowns, and explicit primary/fallback ordering in Settings.

Architecture

Tier 0: web_search                   (multi-provider search — always available)
Tier 1: web_fetch / http_request     (no browser — fastest)
Tier 2: browser_* tools              (visible in-app Browser V2 workbench by default, Playwright/external-CDP fallback)
Tier 3: scrape_* tools               (Scrapling — anti-bot bypass)

The agent auto-selects the appropriate tier: web_search for discovering information, web_fetch for reading known URLs, browser_* when interaction, JS rendering, or visible app testing is needed, and scrape_* for anti-bot-protected sites.

Browser Tools (34 tools — visible Browser V2 workbench + native Playwright/external-CDP fallback)

Browser tools first target the active visible browser workbench for the selected task. If no renderer/webview is available, or the task explicitly requests force_headless, profile, browser_channel, or debugger_url, the tools fall back to native Playwright or explicit external CDP. The legacy headless flag is compatibility-only and does not override visible browser workbench routing for normal site testing.

ToolDescription
browser_attachAttach to existing Chrome/Edge via Chrome DevTools Protocol after explicit real-browser consent. See Chrome DevTools attach below.
browser_act_batchExecute batched actions (click, fill, type, press, wait, scroll) in sequence with optional delays
browser_navigateNavigate to URL with configurable wait states; opens the visible in-app browser workbench by default
browser_snapshotReturn compact accessibility nodes with short-lived refs, focus state, console summary, and network summary
browser_screenshotCapture viewport, full-page, or supported element/ref screenshots
browser_get_contentExtract text, links, and form data from current page
browser_clickClick by Browser V2 ref or legacy selector
browser_hoverMove pointer over an element by ref or selector
browser_dragDrag from one snapshot ref to another
browser_fillFill form fields by ref or selector
browser_typeType text character-by-character by ref or selector
browser_pressPress keyboard keys (Enter, Tab, Escape, shortcuts)
browser_waitWait for element visibility with timeout
browser_scrollScroll page (up, down, top, bottom)
browser_selectSelect dropdown options
browser_get_textExtract text from a specific ref or selector
browser_upload_fileUpload a workspace-readable file into a file input
browser_handle_dialogAccept or dismiss the latest JavaScript dialog
browser_tabsList tabs for the active browser session
browser_switch_tabSwitch to a tab by tab id
browser_close_tabClose a tab by tab id where supported
browser_consoleReturn recent redacted console messages
browser_networkReturn recent redacted network requests, responses, and failures
browser_downloadsReturn recent browser downloads
browser_storageReturn redacted local/session storage for the current page
browser_emulateSet viewport/device emulation for responsive testing
browser_trace_startStart lightweight browser tracing
browser_trace_stopStop lightweight browser tracing
browser_evaluateExecute JavaScript in browser context
browser_backNavigate browser history back
browser_forwardNavigate browser history forward
browser_reloadReload the current page
browser_save_pdfSave page as PDF file
browser_closeClose browser session and free resources

Web Fetch Tools (2 tools)

Lightweight HTTP without browser overhead — preferred for reading known URLs.

ToolDescription
web_fetchFetch URL → HTML-to-Markdown conversion with optional CSS selector filtering
http_requestRaw HTTP requests (GET, POST, PUT, DELETE, PATCH, HEAD, OPTIONS) with custom headers/body

Chrome DevTools Attach Mode

Attach to an existing Chrome/Edge instance to control a signed-in browser session (e.g. Gmail, social media). Uses the Chrome DevTools Protocol and requires explicit real-browser consent before control.

Setup:

  1. Launch Chrome with remote debugging: chrome --remote-debugging-port=9222 (or add --remote-debugging-port=9222 to your Chrome shortcut).
  2. Visit chrome://inspect/#devices to verify the endpoint.
  3. The agent asks for explicit consent showing the target browser/profile/tab/domain.
  4. The agent uses browser_attach with debugger_url: "http://localhost:9222" (or the WebSocket URL from the version endpoint) and confirm_real_browser_control: true.
  5. After attach, browser_navigate and other browser tools operate on the attached session.

See Chrome Remote Debugging for full setup guides.

Profile presets vs attach mode: Use browser_attach with debugger_url when you want to control an already running signed-in Chrome/Edge session after consent. Use profile="user" when you want to launch a new Chrome instance with your system profile — but Chrome must not already be running with that profile (profile lock). For existing sessions, attach mode is the correct choice.

Note: If you close the Chrome window while attached, subsequent browser actions will fail with "Target closed". Re-attach with browser_attach after relaunching Chrome.

Browser Features

FeatureDescription
Multi-BrowserChromium (bundled), Chrome (system), Brave (auto-discovered)
Visible WorkbenchDefault Browser V2 surface inside the task sidebar/fullscreen workbench
Workspace Browser ProfileEmbedded webview uses a persistent workspace partition isolated from system Chrome
Accessibility Snapshot Refsbrowser_snapshot returns compact nodes with short-lived refs used by click/fill/type/read/hover/drag/upload actions
CDP-Backed Workbench ActionsMain-process automation controls the renderer-owned webview through Electron debugger/CDP rather than DOM-script-first control
Diagnostics DrawerConsole, network, downloads, storage, and trace state are visible in-app and available through tools
Visible CursorAgent browser actions render cursor movement and click/action pulses over the in-app webview
Screenshot AnnotationCapture, mark up, save, and send browser screenshots back to the agent as image attachments
Real-Browser ConsentSystem Chrome/Edge profile control requires explicit approval; default workbench never silently reuses system cookies
Profile Presetsuser (launch new Chrome with system profile after consent — fails if Chrome is already running), chrome-relay (extension relay), workspace (workspace default). For existing signed-in sessions, use browser_attach instead.
Persistent ProfilesCookies and storage persist across tasks in .cowork/browser-profiles/
Consent Auto-Dismiss40+ pattern detectors for cookie/GDPR consent popups
Retry Logic2-attempt retry with per-attempt timeout calculation
Failure DiagnosticsScreenshot + page content + URL captured on failure
Domain GuardrailsWhitelist enforcement via GuardrailManager
Headless/HeadedToggle visible browser window for debugging
Configurable TimeoutsPer-tool timeout_ms parameter (default: 90s)

Comparison with ClawHub Agent Browser

CapabilityClawHub Agent BrowserCoWork OS Browser
ArchitectureExternal Rust CLI, commands via Bash shellBrowser V2 session manager with visible Electron-workbench default plus Playwright/external-CDP adapters
PerformanceCLI process spawn per command + JSON serializationPersistent browser session, CDP-backed workbench actions, fallback adapters only when needed
Navigationopen, back, forward, reloadbrowser_navigate, browser_back, browser_forward, browser_reload
Element interaction12 commands (click, fill, type, hover, drag, check, select, etc.)Ref-aware click, fill, type, read, hover, drag, upload, press, scroll, and select tools
Page analysisAccessibility tree snapshots with @ref identifiersbrowser_snapshot compact accessibility refs plus content extraction and element text
Screenshots/PDFScreenshot + full-page + PDF exportbrowser_screenshot viewport/full/element where supported + browser_save_pdf
JavaScripteval "expression"browser_evaluate (full JS execution)
Wait strategiesElement, text, URL, network idle, JS conditionElement visibility plus navigation wait states and snapshot refresh after page updates
Tabs/framesTab management, iframe switchingWorkbench tabs and active-tab tool routing; iframe support follows backend snapshot/action support
State managementstate save/load JSON filesPersistent browser profiles (automatic)
Network interceptionRoute, mock, block requestsNetwork diagnostics exposed; request mutation remains guarded/internal
Video recordingrecord start/stop to WebMLightweight trace tooling; video capture is not a core Browser V2 tool
Device emulationPresets ("iPhone 14"), geolocation, viewportViewport/device metrics through browser_emulate
Cookies/storageManual cookie and localStorage managementWorkspace profile persistence plus redacted storage diagnostics
Anti-bot bypassNoneScrapling integration (TLS fingerprinting, Cloudflare bypass)
Consent popupsNoneAuto-dismissal with 40+ pattern detectors
Retry on failureNone (single attempt)2-attempt retry with diagnostics
Domain guardrailsNoneWhitelist enforcement
Lightweight fetchNone (always launches browser)web_fetch for reads without browser overhead
Multi-browserPlaywright onlyChromium, Chrome, Brave
IntegrationLoose (CLI → Bash → agent)Tight (session manager, visible workbench, IPC, daemon logging, artifact registry, diagnostics drawer)

Key advantage: CoWork OS's Browser V2 approach keeps normal website testing in the visible app surface while using a shared session manager for automation, diagnostics, guardrails, and fallback adapters. The tiered architecture also means the agent does not launch or control a browser when a simple HTTP fetch is enough.


Web Scraping (Scrapling)

Advanced web scraping powered by Scrapling — anti-bot bypass, stealth browsing, adaptive element tracking, and structured data extraction.

FeatureDescription
Anti-Bot BypassTLS fingerprinting impersonates real browsers at the network level
Stealth ModeCloudflare Turnstile bypass, stealth headers, browser fingerprint masking
Playwright FetcherFull browser rendering for JavaScript-heavy sites
Structured ExtractionAuto-detect and extract tables, lists, headings, and metadata
Batch ScrapingScrape up to 20 URLs in a single operation
Persistent SessionsMulti-step workflows with login → navigate → extract
Proxy SupportRoute requests through HTTP/HTTPS/SOCKS5 proxies
Rate LimitingConfigurable requests-per-minute throttling

Agent Tools

ToolDescription
scrape_pageScrape a single URL with fetcher selection, CSS selectors, link/image/table extraction
scrape_multipleBatch scrape multiple URLs with shared config
scrape_extractExtract structured data (tables, lists, headings, meta, or custom selectors)
scrape_sessionMulti-step session with persistent browser state
scraping_statusCheck Scrapling installation and version

Fetcher Modes

ModeBest ForSpeed
DefaultMost sites — fast HTTP with TLS fingerprintingFast
StealthCloudflare-protected sites, anti-bot detectionMedium
PlaywrightJavaScript-rendered SPAs, dynamic contentSlow

Skills

Five scraping-specific skills are included: Web Scraper (general-purpose), Price Tracker (e-commerce), Site Mapper (crawl + structure), Lead Scraper (contact extraction), Content Monitor (change detection + scheduling).

Setup

pip install scrapling
scrapling install   # downloads stealth browsers

Configure in Settings > Web Scraping. Disabled by default — enable to make scraping tools available to agents.


System Tools

  • Screenshots (full screen or specific windows)
  • Clipboard read/write
  • Open applications, URLs, and file paths
  • AppleScript automation
  • Apple Calendar: Create, update, delete events
  • Apple Reminders: Create, complete, update, list reminders

Remote Access

  • Tailscale Serve: Expose to your private tailnet
  • Tailscale Funnel: Public HTTPS endpoint
  • SSH Tunnels: Standard SSH port forwarding
  • WebSocket API: Programmatic task management with LAN access

See Remote Access for details.


MCP (Model Context Protocol)

  • MCP Client: Connect to external MCP servers
  • MCP Host: Expose CoWork's tools as an MCP server
  • MCP Registry: Browse and install servers from a catalog
  • Versioned tool snapshots: Tool discovery tracks a stable catalog hash across native tools and MCP state so status/tool changes invalidate caches immediately

Enterprise MCP Connectors

44 pre-built connectors for enterprise integrations. Install from Settings > Connectors > Browse Registry.

ConnectorTypeNotes
SalesforceCRMOAuth, health, list/search/create/update
JiraIssue TrackingOAuth, health, projects, issues
HubSpotCRMOAuth, contacts, companies, deals
ZendeskSupportOAuth, tickets, search
ServiceNowITSMhealth, list, get, search, create, update
LinearProducthealth, projects, issues
AsanaWork Managementhealth, projects, tasks
OktaIdentityhealth, users, groups
ResendEmailsend, webhooks
DiscordCommunity19 tools: guilds, channels, messages, roles
Google WorkspaceProductivity (OAuth)Calendar, Drive, Gmail
FigmaDesignget file, export
VercelDeployprojects, deployments
MondayWork Managementboards, items
MiroWhiteboardboards, content
SupabaseDatabasequery, tables, auth
ExcalidrawDiagramscreate, update elements
StripePaymentscustomers, payments, products
Hugging FaceMLmodels, inference, Gradio
AhrefsSEOsearch, metrics
Mermaid ChartDiagramsvalidate, render SVG
CloudflareInfrastructureWorkers, KV, D1, R2
MakeAutomationscenarios, modules
Clinical TrialsLegal/Healthsearch studies
SmartsheetSpreadsheetsheets, rows
NetlifyDeploysites, deploy
AirtableDatabasebases, records
PayPalPaymentsinvoices, orders
SquarePaymentstransactions, API
AttioCRMcompanies, notes
HoneycombObservabilitydatasets, queries
Cal.comSchedulingbookings, event types
CloudinaryMediaupload, find assets
TavilyWeb Searchsearch, extract, crawl
tldrawDiagramsread/write .tldr canvases
AmplitudeAnalyticstrack events, users
ClerkAuthusers, sessions, invitations
MemNotesmem_it, notes, collections
GrafanaMonitoringdashboards, datasources
MailtrapEmailsend, templates, sandbox
SocketSecuritydependency scores
MetabaseAnalyticsdashboards, queries
Shadcn UIComponentslist, search, install
GrowthBookFeature Flagsflags, experiments
DraftsNotes (macOS)create, search drafts
FantasticalCalendar (macOS)events, schedule
TombaEmailfinder, verifier, domain search

GitHub and Notion prefer native CoWork integrations first, with MCP as fallback. See Enterprise Connectors for the full catalog and contract.


Chat Integration Setup + Skill Proposals

Two orchestration tools are available for runtime setup and governed expansion:

ToolPurpose
integration_setupChat-native Tier-1 integration management with list, inspect, and configure, including OAuth, health checks, and stale-plan protection via expected_plan_hash
skill_proposalApproval-gated skill proposal lifecycle (create, list, approve, reject) with workspace-local persistence and duplicate cooldown controls

Tier-1 providers currently covered by integration_setup: resend, google-workspace, jira, linear, hubspot, salesforce, zendesk, servicenow.

See Integration Setup, Skill Proposals, and Bootstrap Lifecycle for full request/response contracts and operational examples.


Cloud Integrations

The main composer supports grouped @ mentions for Agents, Integrations, and Files. The Integrations section only shows configured, locally usable integrations. Google Workspace splits into Gmail, Google Drive, and Google Calendar; gateway channels and MCP connectors appear only when connected/configured. Selecting an integration inserts an icon+name chip, preserves clean prompt text such as @Gmail, and sends integrationMentions metadata as soft routing guidance without granting permissions or restricting tools. See Composer Mentions.

ServiceToolActions
Notionnotion_actionSearch, read, create, update, query data sources
Boxbox_actionSearch, read, upload, manage files
OneDriveonedrive_actionSearch, read, upload, manage files
Google Workspacegmail_action, google_drive_action, google_calendar_actionGmail, Drive, Calendar with shared OAuth
Dropboxdropbox_actionList, search, upload, manage files
SharePointsharepoint_actionSearch sites, manage drive items

Configure by clicking any card in Settings > Integrations. Enterprise MCP connectors (Salesforce, Jira, HubSpot, Slack, etc.) are also managed from the same tab.


Infrastructure

Built-in cloud infrastructure tools registered as native agent tools — no MCP subprocess, no external dependency at runtime. The agent can provision cloud resources, manage domains, and make payments directly.

How It Works

Infrastructure tools are registered in the Tool Registry alongside file, shell, and browser tools. When the agent needs cloud resources, it calls these tools directly — no subprocess overhead, no external server. All credentials are stored encrypted in the OS keychain via SecureSettingsRepository.

Benefits

  • Zero latency overhead: Tools execute in-process, no MCP subprocess or network hop
  • Unified approval flow: Payment and registration operations use the same approval dialogs as shell commands and file deletions
  • Encrypted credentials: API keys and wallet private keys stored via OS keychain (macOS Keychain, Windows DPAPI, Linux libsecret)
  • Provider-based architecture: Swap E2B for another sandbox provider, or Namecheap for Cloudflare — each capability is a pluggable provider class

Cloud Sandboxes (E2B)

Spin up isolated Linux VMs for running code, deploying services, or testing in a clean environment.

ToolDescription
cloud_sandbox_createCreate a new sandbox (name, timeout, env vars)
cloud_sandbox_execRun a shell command in a sandbox
cloud_sandbox_write_fileWrite a file into a sandbox
cloud_sandbox_read_fileRead a file from a sandbox
cloud_sandbox_listList all active sandboxes
cloud_sandbox_deleteDelete a sandbox and free resources
cloud_sandbox_urlGet the public URL for an exposed port

Sandboxes auto-expire per E2B tier (5 min default, configurable up to 60 min on free tier). E2B provides $100 free credits with no credit card required.

Domain Registration (Namecheap)

Search, register, and manage domains and DNS records.

ToolDescription
domain_searchSearch available domains across TLDs (.com, .io, .ai, .dev, etc.)
domain_registerRegister a domain (requires user approval)
domain_listList all registered domains
domain_dns_listList DNS records for a domain
domain_dns_addAdd a DNS record (A, AAAA, CNAME, MX, TXT, NS)
domain_dns_deleteDelete a DNS record

Domain registration requires explicit user approval before any purchase is made.

Wallet & Payments

Built-in USDC wallet on Base network for infrastructure payments.

ToolDescription
wallet_infoGet wallet address, network, and USDC balance
wallet_balanceGet current USDC balance
x402_checkCheck if a URL requires x402 payment
x402_fetchFetch a URL with automatic x402 payment (requires approval)

The wallet is auto-generated on first setup, with the private key encrypted in the OS keychain. The wallet address and balance are displayed in the sidebar. x402 is an HTTP-native payment protocol where the agent signs EIP-712 typed data to authorize USDC payments on Base — useful for paying for API access, premium content, or compute resources.

Status & Configuration

ToolDescription
infra_statusGet overall status: provider connections, active sandboxes, wallet state

Configure in Settings > Infrastructure. The settings UI shows:

  • Provider connection status (E2B, Namecheap, Wallet)
  • API key configuration for each provider
  • Wallet address with copy button and balance display
  • Tool category toggles (enable/disable sandbox, domain, or payment tools independently)
  • Coinbase Agentic Wallet remote signer configuration (wallet.provider = coinbase_agentic) — see Coinbase Agentic Signer Contract

Personality System

Customize agent behavior via Settings or conversation:

  • Personalities: Professional, Friendly, Concise, Creative, Technical, Casual
  • Personas: Jarvis, Friday, HAL, Computer, Alfred, Intern, Sensei, Pirate, Noir
  • Response Style: Emoji usage, response length, code comments, explanation depth
  • Quirks: Catchphrases, sign-offs, analogy domains
  • Relationship: Agent remembers your name and tracks interactions

Visual Theme System

Visual StyleDescription
ModernRefined non-terminal UI style with rounded components (default)
TerminalCLI-inspired interface with prompt-style visuals
Color ModeDescription
SystemFollows your macOS light/dark mode preference
LightClean light interface
DarkDark mode for reduced eye strain

Configure in Settings > Appearance.


Scheduled Tasks (Cron Jobs)

Schedule recurring tasks with cron expressions and optional channel delivery.

  • Standard cron syntax with workspace binding
  • Create from an existing task with task view ... > Add automation...
  • Task-sourced jobs preserve a source task title, task ID, and cowork://tasks/<taskId> deeplink in the scheduled prompt/description
  • Run mode presets: Chat for no-shell unattended work, Local for shell-enabled workspace work; task worktree mode is shown disabled until cron jobs can preserve worktree context
  • Channel delivery to any of the 17 channels through the shared gateway delivery path, with idempotency, formatting, chunking, and outbox retry behavior aligned with normal chat replies
  • Conditional delivery (deliverOnlyIfResult)
  • Template variables: {{today}}, {{tomorrow}}, {{week_end}}, {{now}}
  • Chat context variables: {{chat_messages}}, {{chat_since}}, etc.
  • Scheduled task prompts should produce the final result as task output; the scheduler delivers that output to the selected channel
  • Run history with status and duration
ScheduleExpression
Every hour0 * * * *
Daily at 9am0 9 * * *
Weekdays at 6pm0 18 * * 1-5
Weekly on Sunday0 0 * * 0

Parallel Task Queue

Run multiple tasks concurrently with configurable limits (1-10, default: 3). Tasks beyond the limit are queued in FIFO order with auto-start and persistence across restarts.


Built-in Skills (147)

CategorySkills
DeveloperGitHub, GitLab, Linear, Jira, Sentry, Code Reviewer, Multi-PR Review, Developer Growth Analysis
CommunicationSlack, Discord, Telegram, Email, Voice Calls
ProductivityNotion, Obsidian, Todoist, Apple Notes/Reminders/Calendar, PRD Generator, Memory Kit
MediaSpotify, YouTube, SoundCloud
ImageImage Generation (Gemini/OpenAI/Azure), Agentic Image Loop
DocumentsExcel, Word, PDF, PowerPoint
FrontendFrontend Design, React Best Practices, React Native Best Practices, Taste Skill
MobileiOS Development, Android Development
Game DevUnity Development, Unreal Engine Development, Game Performance Optimization
IaC / DevOpsTerraform Operations, Kubernetes Operations, Cloud Migration, Docker Compose Operations
DataSupabase SDK Patterns
SearchLocal Web Search (SearXNG), Bird
FinanceCrypto Trading, Crypto Execution, Trading Foundation, DCF Valuation, Earnings Analyzer, ESG Scorer, Financial Modeling, Market Screener, Portfolio Optimizer, Risk Analyzer, Tax Optimizer
MarketingEmail Marketing Bible
Use CasesBooking Options, Draft Reply, Family Digest, Household Capture, Newsletter Digest, Transaction Scan

Web Browser Mode (Planned)

Access CoWork OS from any web browser — no Electron desktop app required.

AspectDetails
Howcowork-os --serve --port 3000 starts a Node.js server exposing the full React UI over HTTP/WebSocket
ApproachReuses all existing main-process logic (agent, tools, database, gateways). IPC calls are mapped to HTTP/WebSocket endpoints
Desktop featuresSystem tray, desktop screenshots, and AppleScript degrade gracefully. File dialogs use browser-native pickers
SecurityChallenge-response authentication (extends existing control plane auth). HTTPS recommended for production
Existing foundationControl plane already serves a web dashboard at http://127.0.0.1:18789/. Web mode extends this to the full React UI

See Architecture: Web Browser Mode for the implementation plan.


WebSocket Control Plane

Programmatic API for external automation and mobile companion apps.

  • Challenge-response token authentication
  • Full task API (create, list, get, cancel)
  • Real-time event streaming
  • Approval API for remote approval management
  • Channel management API
  • Web dashboard at http://127.0.0.1:18789/
ModeBindingUse Case
Local Only127.0.0.1:18789Desktop automation
LAN Access0.0.0.0:18789Mobile companions

Configure in Settings > Control Plane.