No more vibe coding in the blind logo

No more vibe coding in the blind

Summon your army of elves

2026-03-11

Product Introduction

Definition: ELVES (AI Agent Orchestration for Your Codebase) is an open-source, Tauri-based desktop application designed to serve as a sophisticated management layer and development environment for terminal-based AI coding agents. Categorized as an AI Agent Orchestrator, it provides a specialized GUI and runtime environment specifically for tools like Claude Code and Codex, allowing developers to execute complex coding tasks through a structured, isolated, and observable workflow.

Core Value Proposition: The primary mission of ELVES is to eliminate "vibe coding in the blind"—the chaotic and unmanaged process of switching between various AI prompts and local terminal sessions without persistent context or safety guardrails. By integrating git worktree isolation, SQLite-backed persistent memory, and a unified shipping interface, ELVES transforms fragmented AI interactions into a professional-grade Software Development Lifecycle (SDLC) workflow. It enables developers to scale their output by running multiple AI agents in parallel without risking the integrity of their main working directory.

Main Features

1. Worktree-Isolated Workspaces: ELVES leverages the native git worktree command to create dedicated, physically isolated working directories for every AI-assigned task. Unlike traditional branch switching that modifies the current directory state, ELVES spawns a new worktree for each task (e.g., "Add OAuth Login"). This ensures that AI agents operate in a clean environment where they cannot interfere with uncommitted local changes or other ongoing parallel tasks. Once the agent completes its work, the developer can review the diff and "Ship It" (merge) or discard the entire worktree, leaving the primary codebase pristine.

2. Persistent FTS5 SQLite Memory: To combat the "memory loss" often associated with stateless AI agent sessions, ELVES implements a shared memory layer powered by SQLite with FTS5 (Full-Text Search) capabilities. This system captures decisions, technical preferences, and project-specific context across sessions. It features a unique relevance decay algorithm where critical architectural decisions remain searchable and prioritized, while ephemeral "noise" or outdated context fades over time. This provides the AI agent with a "long-term brain" that understands the project's evolution beyond a single prompt window.

3. WebGL-Accelerated PTY Terminal: The platform includes a high-performance, GPU-accelerated terminal rendering engine. This WebGL terminal is wired directly to the underlying AI agent (Claude Code or Codex) and supports advanced features such as Unicode, search overlays, and a built-in toolbar. A critical technical advantage is the session auto-persistence and resume capability, which allows developers to close the app or lose connection without losing the terminal's state or the agent's progress.

4. Advanced Task Analysis and Orchestration: Before an agent is even spawned, ELVES performs an automated task complexity analysis. It classifies the user's plain-English request and selects the optimal operating mode (solo or team-based) and configuration. This pre-flight orchestration ensures that the agent is initialized with the correct environment variables, MCP (Model Context Protocol) servers, and project-specific context required to execute the task successfully.

Problems Solved

1. Context Switching and Workspace Contamination: Pain Point: Developers often struggle with "dirty" working directories when AI agents generate hundreds of lines of code that need to be tested and potentially reverted. Solution: ELVES solves this through physical directory isolation. By utilizing worktrees, the main development branch remains untouched while the AI works in a parallel universe, effectively eliminating the risk of accidental overwrites or dependency conflicts.

2. Fragmented Tooling and Lack of Observability: Pain Point: Switching between different AI engines (Claude vs. Codex) usually requires re-configuring environments and losing previous chat context. Solution: ELVES provides a runtime-agnostic interface. The same memory, same visual dashboard, and same "Ship It" flow apply regardless of the underlying LLM engine, providing a unified developer experience (DX).

3. Target Audience:

  • Full-Stack Developers: Seeking to automate boilerplate tasks like OAuth integration or API scaffolding.
  • DevOps Engineers: Using AI agents to refactor CI/CD scripts or infrastructure-as-code in isolated environments.
  • Open Source Maintainers: Who need to review and merge AI-generated PRs using structured merge strategies (squash, rebase, or merge).

4. Use Cases:

  • Parallel Feature Prototyping: Running three different AI agents simultaneously to explore three different UI library implementations.
  • Automated Bug Fixing: Assigning a task like "Fix all TypeScript linting errors" and letting it run in the background without halting manual coding.
  • Context-Aware Refactoring: Using the persistent memory to ensure the AI follows specific architectural patterns established months prior.

Unique Advantages

1. Differentiation from IDE Extensions: While most AI coding tools are plugins (like GitHub Copilot), ELVES is a standalone orchestrator. It doesn't just suggest code; it manages the git lifecycle of the task. It provides a higher level of abstraction by handling branch creation, worktree cleanup, and usage telemetry, which IDE-integrated chats typically lack.

2. The "Ship It" Workflow: The key innovation is the structured exit strategy. When an agent finishes, ELVES provides a visual "Ship It" flow that includes a merge strategy picker (Merge, Rebase, Squash). Upon shipping, it automatically extracts relevant learnings into the SQLite memory, removes the temporary worktree, and deletes the branch, automating the "janitorial" work of software development.

3. Real-Time Usage Insights: ELVES includes a telemetry dashboard specifically for Claude Code. It tracks token usage by model, daily activity, and project breakdowns. It even generates an AI-powered narrative report of what was accomplished, providing transparency into AI costs and productivity gains.

Frequently Asked Questions (FAQ)

1. What makes ELVES different from using Claude Code directly in the terminal? While Claude Code is a powerful agent, ELVES provides the essential infrastructure around it. It adds git worktree isolation to prevent your local environment from getting messy, a persistent SQLite memory so the agent remembers previous sessions, and a GUI dashboard for tracking token usage and task status across multiple projects.

2. How does the persistent memory system work in ELVES? ELVES uses an SQLite database with FTS5 search to store context and decisions. This is "shared memory," meaning if an agent learns a specific project preference in one task, it can access that memory in a future task. The system also uses relevance decay, ensuring that the most important context stays accessible while irrelevant old data is deprioritized.

3. Does ELVES support Windows or Linux? Currently, ELVES is a macOS-first application available via DMG or Homebrew. However, the roadmap includes support for Linux and Windows in upcoming releases. It is built using Tauri, which facilitates cross-platform compatibility as the development progresses.

Subscribe to Our Newsletter

Get weekly curated tool recommendations and stay updated with the latest product news