Voice-first agent orchestration

Herd Your Agents.

Walk around with smart glasses while your AI coding agents build, test, and deploy. VoxHerd is the voice remote control for Claude Code, Codex, and Gemini CLI.

VoxHerd | Connected

> quantcoach | CLAUDE | done
Refactored pricing engine to ISO 8601 dates.
Task | Web | Term | 3m ago

> aligned-tools | CLAUDE | ✏️ writing
Adding Zod validation schemas for API routes.
src/schemas/api.ts
Task | Web | Term | 3m ago

> homeschool | CLAUDE | done
Set up project structure and initial routes.
Task | Web | Term | 3m ago

Activity
2:31 quantcoach registered
2:31 aligned-tools registered
2:32 homeschool registered
2:33 quantcoach completed

iOS | macOS | Linux | Smart Glasses

Features

Everything you need.

Voice-first control

Speak naturally to your agents. VoxHerd uses on-device speech recognition to parse your intent and route commands to the right project -- no typing required.

"Tell Homeschool to run the tests" | "Switch to QuantCoach" | "Approve it"

Multi-agent orchestration

Run Claude Code, Codex, and Gemini CLI sessions simultaneously across different projects. Each agent works independently while VoxHerd keeps you in the loop on all of them.

quantcoach
aligned-tools
homeschool

Smart glasses ready

Walk around the house, go for a coffee, take the dog out. Your agents report in through Meta Ray-Ban smart glasses via Bluetooth audio -- just listen and respond.

Hands-free | Bluetooth audio | Meta DAT SDK

Intelligent routing

Fuzzy project matching, context-aware routing, and compound utterance parsing. Say 'Switch to Homeschool, now add integration tests' and VoxHerd handles both intents.

intent:switch + command | explicit override | react to last
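
The routing described above can be sketched in a few lines. This is a minimal illustration, not VoxHerd's actual parser: the project list comes from the sessions shown on this page, and `Intent`, `match_project`, `parse`, and the 0.6 similarity cutoff are all hypothetical names and values chosen for the example. Fuzzy matching uses Python's standard `difflib`.

```python
import difflib
import re
from dataclasses import dataclass
from typing import List, Optional

# Hypothetical project names, taken from the sessions shown on this page.
PROJECTS = ["quantcoach", "aligned-tools", "homeschool"]


@dataclass
class Intent:
    kind: str                # "switch" or "command"
    project: Optional[str]   # resolved project, or None if none was named
    text: str = ""           # command text for the agent, if any


def normalize(name: str) -> str:
    """Lowercase and drop everything except letters and digits."""
    return re.sub(r"[^a-z0-9]", "", name.lower())


def match_project(spoken: str, cutoff: float = 0.6) -> Optional[str]:
    """Fuzzy-match a spoken name against the known projects."""
    by_key = {normalize(p): p for p in PROJECTS}
    hits = difflib.get_close_matches(normalize(spoken), list(by_key), n=1, cutoff=cutoff)
    return by_key[hits[0]] if hits else None


def parse(utterance: str) -> List[Intent]:
    """Split 'Switch to X, now do Y' into a switch intent plus a command."""
    m = re.match(
        r"\s*switch to ([\w\s-]+?)(?:[,.]\s*(?:now\s+)?(.*))?$",
        utterance,
        re.IGNORECASE,
    )
    if not m:
        # No project named: the caller decides where the command goes.
        return [Intent("command", None, utterance.strip())]
    project = match_project(m.group(1))
    intents = [Intent("switch", project)]
    rest = (m.group(2) or "").strip()
    if rest:
        intents.append(Intent("command", project, rest))
    return intents
```

With this sketch, `parse("Switch to Homeschool, now add integration tests")` yields a switch intent followed by a command intent, both resolved to `homeschool`, while `parse("Approve it")` yields a single command with no project attached.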

Hook System

Zero-config integration

VoxHerd hooks into each agent's lifecycle events. When a session starts, completes, or needs permission, the bridge server is notified immediately. Haiku generates a concise summary of each event, which is announced via TTS. Claude Code is supported natively through its hooks; Codex and Gemini CLI are supported through tmux integration.

# ~/.claude/settings.json
{
  "hooks": {
    "Stop": [{
      "hooks": [{
        "type": "command",
        "command": "~/.voxherd/hooks/on-stop.sh"
      }]
    }],
    "SessionStart": [{
      "hooks": [{
        "type": "command",
        "command": "~/.voxherd/hooks/on-session-start.sh"
      }]
    }]
  }
}
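
To make the hook step concrete, here is a rough sketch of what a stop hook might do. The bundled hooks are shell scripts; this version is written in Python purely for illustration. It assumes the agent hands the hook a JSON object on stdin containing `session_id` and `cwd`, as Claude Code does, and it posts to the `/api/events` endpoint shown in the quick-validation example on this page. The `project` field and the way it is derived from the working directory are assumptions, not the bridge's confirmed schema.

```python
import json
import os
import urllib.request

# Endpoint shown in the quick-validation example on this page.
BRIDGE_URL = "http://localhost:7777/api/events"


def build_event(event: str, hook_input: dict) -> dict:
    """Turn the JSON an agent hands to a hook into a bridge event."""
    cwd = hook_input.get("cwd", "")
    return {
        "event": event,
        "session_id": hook_input.get("session_id", "unknown"),
        # Hypothetical field: use the working directory name as the project.
        "project": os.path.basename(cwd.rstrip("/")) or "unknown",
    }


def post_event(payload: dict, url: str = BRIDGE_URL, timeout: float = 2.0) -> int:
    """POST the event as JSON and return the HTTP status code."""
    req = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return resp.status
```

A hook script built on this would read its input with `json.load(sys.stdin)` and call `post_event(build_event("stop", hook_input))`. The short timeout keeps a stopped bridge from stalling the agent.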

Architecture

How it works.

Three components, one voice-driven workflow. No cloud services, no accounts, no auth. Everything runs on your local network.

01

Agent Lifecycle Hooks

Mac / Linux

Bash scripts and tmux watchers that fire on session start, stop, and permission requests. They POST events to the bridge server with session context and Haiku-generated summaries. Native hooks for Claude Code, tmux integration for Codex and Gemini CLI.

02

Bridge Server

FastAPI on port 7777

The central nervous system. Receives hook events, manages session state, dispatches voice commands to Claude Code, Codex, or Gemini CLI, and streams everything over WebSocket.
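
The session-state part of the bridge can be pictured as a small in-memory registry. The sketch below uses only the standard library and leaves out FastAPI and WebSocket streaming. The event names `session_start` and `stop` follow the hook events mentioned on this page; the `permission` event name, the status values, and the `SessionStore` class itself are assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional

# Hypothetical mapping from hook event name to session status.
STATUS_FOR_EVENT = {"session_start": "active", "permission": "waiting", "stop": "done"}


@dataclass
class Session:
    session_id: str
    project: str
    status: str = "active"
    summary: str = ""


@dataclass
class SessionStore:
    sessions: Dict[str, Session] = field(default_factory=dict)
    last_reported: Optional[str] = None  # session id of the most recent event

    def apply(self, event: dict) -> Session:
        """Create or update a session from one hook event."""
        kind = event["event"]
        if kind not in STATUS_FOR_EVENT:
            raise ValueError(f"unknown event: {kind}")
        sid = event["session_id"]
        session = self.sessions.get(sid)
        if session is None:
            # First event for this id: register the session.
            session = Session(sid, event.get("project", "unknown"))
            self.sessions[sid] = session
        session.status = STATUS_FOR_EVENT[kind]
        session.summary = event.get("summary", session.summary)
        self.last_reported = sid
        return session

    def by_status(self, status: str) -> List[Session]:
        """Return all sessions currently in the given status."""
        return [s for s in self.sessions.values() if s.status == status]
```

Tracking `last_reported` is what would let a bare reply such as "Approve it" be matched to the agent that just spoke.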

03

iOS App

iPhone / Smart Glasses

A voice remote control. Captures speech, parses intent, sends commands over WebSocket, and announces results via TTS. Never touches code or files directly.

You -> Glasses / Phone mic -> iOS STT -> Intent Parse
|
WebSocket -> Bridge Server :7777 -> Agent CLI dispatch
|
Agent lifecycle hooks -> Bridge Server -> WebSocket -> iOS TTS -> You

AI Agents

Your fleet, your voice.

VoxHerd orchestrates any number of Claude Code, Codex, and Gemini CLI instances across your projects. Manage them all with natural conversation.

Claude Code | Codex | Gemini CLI | Cline | Cursor Agent | Copilot CLI | Aider | Continue | OpenCode | Amp | Goose

Works with any terminal-based AI coding agent via hooks or tmux integration.
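
For agents without native hooks, one plausible way to deliver a spoken command is to type it into the agent's tmux pane. The sketch below builds the `tmux send-keys` invocations for that. It is an assumption about how such an integration could work, not a description of VoxHerd's internals, and the convention of naming each tmux session after its project is hypothetical.

```python
import subprocess
from typing import List


def text_argv(target: str, text: str) -> List[str]:
    """argv that types `text` into a tmux pane.

    -l sends the characters literally, so a word like "Enter" inside the
    command is typed rather than interpreted as a key name.
    """
    return ["tmux", "send-keys", "-t", target, "-l", text]


def enter_argv(target: str) -> List[str]:
    """argv that presses Enter in the same pane to submit the prompt."""
    return ["tmux", "send-keys", "-t", target, "Enter"]


def dispatch(target: str, text: str) -> None:
    """Type a command into the agent's pane, then submit it."""
    subprocess.run(text_argv(target, text), check=True)
    subprocess.run(enter_argv(target), check=True)
```

For example, `dispatch("homeschool", "run the tests")` would type the command into the tmux session named `homeschool` and press Enter. Passing the arguments as a list avoids shell quoting problems with free-form speech.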

Conversational Flow

QuantCoach

Refactored pricing engine to use ISO 8601 dates. All 47 tests passing.

You

Great, now add the currency conversion endpoint.

Aligned Tools

Wants permission to run: npm install zod

You

Approve it.

You

Switch to Homeschool. Add integration tests for the auth module.

Homeschool

Working on it...
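
The exchange above depends on deciding which agent a reply is meant for when no project is named. One simple policy, sketched here under assumptions, is: an explicitly named project wins, otherwise the agent that spoke most recently, otherwise the project selected by the last switch. The `Router` class and the rule that a switch clears the most recent speaker are illustrative choices, not confirmed behavior.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Router:
    active: Optional[str] = None  # project selected by the last "switch to"
    last: Optional[str] = None    # project that most recently reported in

    def on_report(self, project: str) -> None:
        """Record that an agent just announced something."""
        self.last = project

    def on_switch(self, project: str) -> None:
        """Record an explicit switch."""
        self.active = project
        self.last = None  # assumption: a switch outranks older reports

    def route(self, explicit: Optional[str] = None) -> Optional[str]:
        """Pick the target: explicit name, else last speaker, else active."""
        return explicit or self.last or self.active


# Replay of the conversation shown above.
router = Router()
router.on_report("quantcoach")
first = router.route()                # "Great, now add the currency conversion endpoint."
router.on_report("aligned-tools")
second = router.route()               # "Approve it."
router.on_switch("homeschool")
third = router.route("homeschool")    # "Add integration tests for the auth module."
follow_up = router.route()            # a later command with no project named
```

In this replay the first reply goes to `quantcoach`, the approval goes to `aligned-tools`, and everything after the switch goes to `homeschool`.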

Get Started

Up and running in minutes.

VoxHerd runs on your Mac or Linux machine. No cloud, no accounts. Just install and start herding.

1

Install hooks

$ cd hooks && bash install.sh

Deploys hook scripts to ~/.voxherd/hooks/ and patches your agent settings. Supports Claude Code hooks natively, plus tmux-based integration for Codex and Gemini CLI.

2

Start the bridge

$ python -m bridge run --tts

Starts the FastAPI bridge server on port 7777 with text-to-speech announcements.

3

Connect your phone

ws://macbook.local:7777/ws/ios

Point the iOS app at your bridge server. Start talking to your agents.

Quick validation (no iOS app needed)

# Watch bridge events
wscat -c ws://localhost:7777/ws/ios
# Simulate a stop event
curl -X POST http://localhost:7777/api/events \
  -H "Content-Type: application/json" \
  -d '{"event":"stop","session_id":"test-123"}'
# Check sessions
curl http://localhost:7777/api/sessions