warp oz integration
Warp & Oz Integration Guide
Evaluation Date: 2026-02-11 Status: Research Complete — Integration Planning Priority: Evaluation / Developer Tooling Enhancement
Overview
Warp is an Agentic Development Environment (ADE) — a terminal-first application that replaces your default terminal and integrates AI agents natively. Available on macOS, Windows, and Linux.
Oz is Warp's orchestration platform for cloud agents — the layer that enables running, managing, and scheduling coding agents at scale.
Key Differentiator: Warp is a meta-platform. It can run Claude Code, Codex, and Gemini CLI inside itself, while also providing its own native agents, cloud orchestration, and team collaboration features.
Warp Terminal Features
Core Terminal
- Modern terminal with syntax highlighting, rich completions, block-based output
- Native file editor (syntax highlighting, tabs, find-replace — NOT a full IDE)
- Code Review panel for real-time diffs of agent-generated changes
- Warpify — extends Warp features into SSH sessions and subshells (bash/zsh/fish)
Agent Mode
- Switch between command-line and conversational AI
- Local agents for interactive coding assistance
- Full Terminal Use — agents attach to interactive processes (database shells, debuggers)
- Web search — auto-triggered when models deem it useful
Warp Drive (Cloud Knowledge Base)
- Workflows — parameterized commands (YAML format,
{{arg}}syntax) - Prompts — parameterized natural language queries for Agent Mode
- Notebooks — Markdown documentation
- Environment Variables — DOTENV configuration
- Real-time sync, folder hierarchy, team/direct/link sharing
- Import/export (YAML, Markdown, DOTENV)
Benchmarks (Claimed)
- #1 on Terminal-Bench (52%)
- 75.8% on SWE-bench Verified
Oz Platform Architecture
Three Layers
| Layer | Components | Purpose |
|---|---|---|
| Foundation | Docker containers, git repos, startup commands ("Environments") | Execution context |
| Orchestration | CLI, API, SDK, cloud infra, scheduling | Invocation & control |
| Management | Session sharing, artifact tracking, human handoff | Visibility & collaboration |
Two Agent Types
Local Agents (Interactive):
- Run inside the Warp terminal
- Real-time natural language interaction
- Write/edit code, debug, run commands
- Error recovery (auto-retry with adjustments)
- Full Terminal Use — see live terminal buffer, write to PTY
Cloud Agents (Autonomous, Beta):
- Run in background on Oz cloud or self-hosted infrastructure
- Triggered by events, schedules, or integrations
- Parallel execution at scale
- Team observability through session sharing
- Can create PRs, manage issues, provide live progress updates
Cloud Agent Triggers
| Trigger | Description |
|---|---|
| Scheduled | Cron expressions for recurring tasks |
| CLI | Oz CLI for on-demand terminal invocation |
| API & SDK | Programmatic triggering from custom apps |
| Slack | @Oz in messages/threads, reads conversation context |
| Linear | Tag Oz on issues, reads issue context |
| GitHub Actions | CI/CD pipeline integration |
Cloud Agent Environments
- Define Docker image (glibc-based required — Debian, Ubuntu; Alpine NOT supported)
- Clone GitHub repositories into execution context
- Setup commands (sequential, idempotent)
- Shared across triggers — same env for Slack, Linear, CLI invocations
Self-Hosting (Enterprise Only)
- Run Oz workers on own infrastructure via Docker
- Oz orchestrator still manages lifecycle, observability, management
- Requires: compute machine (Linux recommended), Docker, outbound internet
- Worker:
docker run warpdotdev/oz-agent-worker --worker-id "my-worker" - Route tasks via
--hostparameter
Agent Capabilities
Skills
- Markdown files with YAML frontmatter (
.agents/skills/SKILL.md) - Auto-discovered based on working directory
- Invoked via natural language or
/{skill-name}slash commands - Project-level (
.agents/skills/) or global (~/) - Cross-compatible: Reads
.claude/skills/directories too
Rules
- Reusable guidelines shaping agent behavior
- Two tiers: Global rules + Project rules (
AGENTS.md/WARP.md) - Precedence: Subdirectory > Root > Global
- Cross-compatible: Recognizes
CLAUDE.md,.cursorrules,GEMINI.md,.clinerules,.windsurfrules,.github/copilot-instructions.md
Agent Profiles & Permissions
- Tiered autonomy: Let agent decide / Always prompt / Always allow / Never
- Named profiles (e.g., "YOLO mode" for personal, "Prod mode" for restricted)
- Switchable per context
Model Choice
- OpenAI: GPT-5.2, GPT-5.1, GPT-5 variants
- Anthropic: Claude Opus, Sonnet, Haiku
- Google: Gemini 3 Pro, 2.5 Pro
- Fireworks AI: GLM 4.6
- 3 Auto modes (cost, speed, quality)
- Fallback system for provider outages
- BYOK: Bring your own API keys (OpenAI, Anthropic, Google)
MCP (Model Context Protocol)
- CLI servers (Warp manages startup) or HTTP/SSE servers (remote endpoints)
- Auth: env vars, OAuth browser flow, custom headers
- Pre-built: GitHub, Sentry, Grafana, Linear, Chroma, Figma, Slack, Atlassian, Notion
- Team sharing with auto env-var scrubbing
Planning
/planslash command or natural language- Structured plans with version history
- Execute full plan or specific sections
- Auto-save to Warp Drive, exportable as Markdown
Computer Use (Experimental)
- GUI interaction in isolated cloud sandboxes
- Screenshots, clicks, typing, keyboard shortcuts
- Uses Claude Opus 4.6
- Primary use case: UI testing loop
Codebase Context
- Auto-indexes Git repos on open
- Respects
.gitignore,.warpindexingignore,.cursorignore - 5,000+ files per codebase (higher on paid tiers)
- Embeddings index stays local (never stored on Warp servers)
- Does NOT work in SSH/WSL sessions
Integration Mapping: BlueFly Agent Platform ↔ Warp/Oz
Direct Correspondence
| BlueFly Component | Warp/Oz Equivalent | Integration Opportunity |
|---|---|---|
| BuildKit CLI | Oz CLI | Oz triggers cloud agents; BuildKit wraps Oz commands |
| agent-mesh (coordination) | Oz orchestration layer | Oz for cloud scheduling/triggers; agent-mesh for internal routing |
| GitLab Workspaces (stateless) | Oz Cloud Environments | Both use Docker; Oz adds Slack/Linear triggers |
| platform-agents (orchestrator) | Oz scheduled agents | Oz schedules OSSA agent runs on cron |
| agent-protocol (MCP) | Warp MCP support | Same MCP servers work in both — config is portable |
| agent-router (LLM routing) | Warp model choice + BYOK | Multi-model support; BYOK uses existing API keys |
| workflow-engine (Langflow) | Warp Planning + Task Lists | Complementary — Oz for triggers, workflow-engine for DAGs |
| compliance-engine (Cedar) | Agent Profiles | Warp has basic profiles; Cedar policies are far more sophisticated |
| agent-brain (vector/semantic) | Warp Codebase Context | Warp indexes locally; agent-brain provides cross-project semantic search |
| studio-ui (React) | Warp Code (native editor) | Different surfaces — studio-ui for web, Warp for terminal |
What Oz Adds (Gaps in Current Platform)
- Slack/Linear Triggers —
@Ozin Slack triggers cloud agent. Current path is@bluefly-platformin GitLab only. Oz extends to chat-based triggers. - Parallel Cloud Agent Spawning — Programmatic N agents via CLI/API/SDK. BuildKit swarm runs locally; Oz runs in cloud containers.
- Session Sharing — Auto-generated shareable links with full audit trail. agent-tracer handles observability but lacks live interactive sharing.
- Self-Hosted Workers — Docker workers on Oracle Cloud VM or NAS. Keeps execution on-prem while Oz handles orchestration.
- Computer Use — GUI interaction for UI testing of Drupal modules in sandboxed environments.
- GitHub Actions Integration — Trigger agents from CI/CD (gitlab_components could integrate).
What BlueFly Has That's Superior
| Capability | BlueFly Solution | Why It's Better |
|---|---|---|
| Policy enforcement | Cedar policies (compliance-engine) | Enterprise RBAC/compliance vs basic profiles |
| Agent definitions | OSSA v0.4.1 manifests | Standardized spec with validation vs loose skill format |
| Internal routing | agent-mesh + Tailscale MagicDNS | Private mesh network vs cloud-only routing |
| Vector search/RAG | agent-brain (Qdrant + Neo4j) | Full knowledge graph vs file-level indexing |
| Drupal integration | 20+ custom modules | Deep CMS integration; Warp has zero Drupal awareness |
| GitLab Duo | gitlab-agent_ossa (Go) | Native GitLab agent vs external integration |
| Separation of duties | 4-tier OSSA access model | Formal role conflict matrix vs simple permission toggles |
Architecture Integration
HUMAN DEVELOPMENT LAYER
┌─────────────────────────────┐
│ Warp Terminal (M4/M3) │
│ ├─ Local Agents (interactive)│
│ ├─ MCP Servers (portable) │
│ ├─ AGENTS.md / CLAUDE.md │
│ └─ Codebase Context (local) │
└──────────┬──────────────────┘
│
┌────────────────┼────────────────┐
▼ ▼ ▼
┌─────────────┐ ┌──────────────┐ ┌──────────────┐
│ Oz Cloud │ │ GitLab CI/CD │ │ GitLab Duo │
│ (triggers) │ │ (pipelines) │ │ (@bluefly) │
│ Slack/Linear│ │ gitlab_comps │ │ gitlab_ossa │
│ Cron/API │ │ │ │ │
└──────┬──────┘ └──────┬───────┘ └──────┬───────┘
│ │ │
└────────────────┼─────────────────┘
▼
┌─────────────────────────────┐
│ OSSA Agent Platform │
│ ├─ agent-mesh (coordination)│
│ ├─ agent-router (LLM) │
│ ├─ agent-protocol (MCP) │
│ ├─ compliance-engine (Cedar) │
│ └─ agent-brain (memory) │
└──────────┬──────────────────┘
▼
┌─────────────────────────────┐
│ Execution Environments │
│ ├─ Oracle Cloud (K8s/kagent) │
│ ├─ Oz Self-Hosted Workers │
│ ├─ GitLab Workspaces │
│ └─ Vast.ai (GPU) │
└─────────────────────────────┘
Implementation Roadmap
Phase 1: Developer Experience (Immediate — No Cost)
Goal: Replace default terminal with Warp on M4/M3 workstations.
- Install Warp on M4 (primary) and M3 (secondary)
- Configure BYOK with existing
~/.tokens/anthropicand~/.tokens/openai - Verify
AGENTS.mdandCLAUDE.mdare auto-discovered by Warp rules - Port existing
.claude/skills/to.agents/skills/format (Warp reads both) - Configure existing MCP servers in Warp settings
- Test Warp Codebase Context indexing across key repos (agent-buildkit, platform-agents, etc.)
- Set up Agent Profiles: "Dev" (permissive), "Prod" (restricted), "Drupal" (PHP-aware)
- Evaluate Warp Drive for sharing workflows/prompts across M4/M3
Cost: $0 (Free tier) or $20/mo (Build for 1,500 credits + BYOK)
Phase 2: Cloud Agent Triggers (Short-term — 2-4 weeks)
Goal: Extend agent triggers beyond GitLab to Slack and scheduled tasks.
- Set up Oz Slack integration for on-demand agent requests in team channels
- Create Oz Environments for key repos (agent-buildkit, platform-agents, compliance-engine)
- Map OSSA stateless workers to Oz scheduled agents:
documentation-aggregator→ daily wiki sync cronvulnerability-scanner→ nightly security scan croncost-intelligence-monitor→ 8-hour cost analysis cron
- Test Oz CLI integration:
oz run --prompt "..." --env production - Evaluate self-hosted workers on Oracle Cloud VM (Enterprise plan required)
Cost: Build plan ($20/mo) + Oz compute credits
Phase 3: Orchestration Bridge (Medium-term — 1-2 months)
Goal: Bridge Oz triggers into the OSSA agent platform.
- Build
buildkit oz:triggercommand wrapping Oz CLI - Route Oz cloud agent results through agent-mesh for OSSA compliance validation
- Feed Oz session data into agent-tracer for unified observability
- Add Oz API/SDK integration to workflow-engine for external trigger support
- Create
ai_agents_warpDrupal module for Oz session management - Map Warp Agent Profiles to OSSA access tiers:
- Warp "Dev" → OSSA
tier_3_full_access - Warp "Prod" → OSSA
tier_1_read - Warp "Review" → OSSA
tier_2_write_limited
- Warp "Dev" → OSSA
Cost: Enterprise plan (custom) if self-hosting needed
Phase 4: Full Platform Integration (Long-term)
Goal: Oz as an additional execution environment alongside Oracle K8s and GitLab Workspaces.
- Register Oz cloud agents in platform-agents registry
- Use Oz Computer Use for automated Drupal UI testing
- Implement Oz API as trigger source in agent-mesh gateway
- Create gitlab_components CI template for Oz agent dispatch
- Unified dashboard: agent-tracer + Oz session sharing
Pricing
| Plan | Cost | Credits | Key Features |
|---|---|---|---|
| Free | $0 | 75/mo (150 first 2 months) | Core terminal, limited sharing |
| Build | $20/mo | 1,500/mo | BYOK, 40 repos/100K files indexing, premium models, Reload Credits |
| Business | $50/mo/seat | 1,500/mo + shared | SSO, enforced ZDR, shared credits, admin (up to 50 seats) |
| Enterprise | Custom | Custom | Self-hosting, custom terms, advanced compliance |
1 credit = 1 prompt submission. Cloud agents consume credits for AI + compute.
Reload Credits: ~40-50% cheaper than old overages, roll over monthly (12-month validity), $10-$100 increments.
Recommendation: Start with Build ($20/mo) + BYOK. Existing Anthropic/OpenAI keys mean you only pay Warp for orchestration, not model costs.
Comparison: Warp/Oz vs Claude Code
| Dimension | Warp + Oz | Claude Code |
|---|---|---|
| Core | Terminal replacement + agent platform | CLI agent (runs in any terminal) |
| UI | Native app with file tree, diff viewer, code editor | Terminal-only, no GUI |
| Agent hosting | Local + Cloud (Oz) | Local only |
| Multi-agent | Parallel cloud agents at scale | Sub-agents within single session |
| Team features | Session sharing, team Drive, admin profiles | Individual-focused |
| Models | Multi-provider (OpenAI, Anthropic, Google, Fireworks) | Anthropic models only |
| Triggers | Cron, Slack, Linear, GitHub Actions, API, CLI | Manual invocation only |
| MCP | Full support | Full support |
| Skills/Rules | .agents/skills/, AGENTS.md (cross-compatible) | .claude/skills/, CLAUDE.md |
| Self-hosting | Enterprise Docker workers | N/A |
| Pricing | $20/mo (Build) | Usage-based via Anthropic API |
Bottom Line: They complement each other. Warp is the terminal + orchestration surface. Claude Code is the most capable individual coding agent. Run Claude Code inside Warp for best of both worlds.
Key Risks & Considerations
- Cloud agents are still in beta
- Self-hosting requires Enterprise plan (custom pricing — could be expensive)
- Alpine Docker images not supported (must use glibc-based)
- Codebase context doesn't work in SSH/WSL
- Computer Use is experimental and sandboxed only
- Credit-based pricing requires usage monitoring
- Oz is GitHub-centric (not GitLab) — integration may need adapter work
- No native GitLab CI/CD trigger (GitHub Actions only currently)