Moshi

Definition: Moshi is an iOS terminal application leveraging the Mosh (Mobile Shell) protocol to establish direct, resilient connections between iPhones and development machines. It enables real-time interaction with AI coding agents (e.g., Claude Code) via SSH, bypassing cloud dependencies.
Core Value Proposition: Moshi eliminates the desk-bound requirement for AI pair-programming by providing uninterrupted terminal access during commutes, travel, or offline scenarios. Its UDP-based resilience and on-device processing empower developers to command coding agents from anywhere.

Mosh-Powered Resilience:
- How it works: Uses Mosh’s UDP protocol instead of TCP, dynamically adapting to IP changes and network switches. Sessions persist through Wi-Fi/cellular transitions, device sleep, or brief disconnections.
- Technologies: Mosh (Mobile Shell), UDP with forward error correction, state synchronization.
On-Device Voice Commands:
- How it works: Integrates OpenAI’s Whisper model for local speech-to-text conversion. Spoken commands (e.g., "git push origin main") translate directly to terminal input without cloud processing.
- Technologies: Whisper iOS implementation, CoreML for offline inference.
Biometric SSH Key Security:
- How it works: Stores SSH keys in Apple’s Secure Enclave-backed Keychain. Face ID/Touch ID decrypts keys per session, preventing app-level access.
- Technologies: iOS Keychain Services, Secure Enclave, biometric authentication.
Agent Notification System:
- How it works: Monitors background terminal processes and triggers iOS push alerts when AI agents (e.g., Claude Code) require input or complete tasks.
- Technologies: Background app refresh, local process tracking.
Terminal-Optimized UI:
- How it works: Custom keyboard with hardware-key emulation (Ctrl, Esc, arrows) and Tmux shortcuts. Supports split panes, scrollback, and session restoration.
- Technologies: SwiftUI for adaptive layouts, custom input accessory views.

Pain Point: Developers cannot respond to AI agent prompts (e.g., Claude Code clarification requests) when away from their workstations, halting coding workflows.
Target Audience:
- Remote software engineers using AI pair-programming tools.
- DevOps professionals managing servers via CLI during travel.
- Open-source contributors needing terminal access in low-connectivity areas.
Use Cases:
- Approving AI-generated code changes from a mobile device during commute.
- Debugging production issues via SSH while offline.
- Voice-controlling code deployments from a coffee shop.

Differentiation: Unlike standard SSH clients (e.g., Termius), Moshi prioritizes Mosh’s session resilience and AI-agent interoperability. Competitors lack integrated voice input, biometric key security, and agent-specific notifications.
Key Innovation: Merging on-device Whisper with Mosh creates a zero-latency, offline-ready terminal pipeline—unique in mobile development tools.

Does Moshi work without internet?
Yes, Moshi maintains active terminal sessions during offline periods via Mosh’s local echo and state prediction, syncing when connectivity resumes.
How secure are SSH keys in Moshi?
Keys are stored exclusively in Apple’s Keychain, encrypted by the Secure Enclave. Biometric authentication is required per session, with no cloud backup.
Can Moshi connect to Windows/Linux machines?
Absolutely. Moshi supports any SSH-accessible device (Linux, macOS, Windows WSL) with Mosh server installed.
Is voice command processing done in the cloud?
No. Whisper runs entirely on-device, ensuring voice data never leaves your iPhone.
What happens if my phone sleeps during a task?
Mosh’s UDP protocol preserves session state. Upon wake, Moshi automatically re-syncs with your dev machine without manual reconnection.

the mobile interface for AI coding agents