Audio Tools
Explore the best new Audio tools and products curated by the community.
SFX Stacks is a desktop app for searching large local SFX libraries. Instead of relying only on filenames, metadata, or folder browsing, it lets you describe the sound you need and explore similar sounds, making search faster and improving discovery. Built for sound designers, game audio, and other audio workflows with large local libraries.
Avec is the free AI email app that lets you handle your Gmail inbox in seconds! (1) Smart filtering: Avec surfaces the emails that need your attention first, and learns your preferences over time. (2) Write with your voice: Record a quick voice note and let Avec turn it into a clear email that sounds like you. (3) Clear your inbox: Not every email deserves the same attention. After you’ve handled the important ones, clear the rest with a swipe. Unsubscribe and block spammy senders with one tap.
Google's TTS API with inline audio tags, multi-speaker dialogue, and 70+ language support. For developers building voice agents, dubbing tools, or AI content products via the Gemini API and Vertex AI.
100% private on-device voice models for speech-to-text and meeting transcription on macOS. No cloud APIs, no data leaves your machine without your explicit permission.
YouTube has speed control, captions, auto-translate — but no accent control. Now it does. Free Chrome extension, on-device AI, one toggle.
VoxCPM2 is a 2B open-source TTS model with 30-language support, 48kHz output, voice design from text alone, controllable voice cloning, and real-time streaming fast enough for production voice workflows.
The open-source alternative to WisprFlow is now available on iOS and Android. Type 4x faster by using your voice. Works in any app, and now even on the go. Voquill gives you full control over where your voice data goes: bring your own API key, use our cloud, or stand up your own server. Voquill even works for highly regulated industries, since all data can be kept in-house.
Doing is for AI builders who use voice and screenshots to bring context to Claude Code, Codex, and other AI agents. Tap a hotkey and Doing listens. Optimized over thousands of hours of building with Claude & Codex. Blazing fast, private, local, no account, no subs. Just a quality tool that you own and that works well.
SuperCmd is a macOS launcher: an open-source alternative to Raycast Pro, WisprFlow, and Speechify in one place. It offers Raycast-compatible extensions, unlimited clipboard, notes, snippets, Excalidraw canvases, voice dictation with local models, text-to-speech from any app, a powerful calculator with live currency conversions, and AI via your own key or Ollama. An open-source productivity suite to 10x yourself.
Real-time voice transcription for Mac. Words appear as you speak, with 99% accuracy. AutoPaste into any app and copy to the clipboard automatically. 50+ languages, fully offline. No cloud. No subscription. One payment, yours forever. → Works in any app: Slack, Notion, Word, anywhere → AutoPaste to any text field automatically → Native Mac shortcuts & menu bar → Your voice never leaves your device → History for all your transcriptions, all local on your device
Keeby adds real mechanical keyboard sounds to your MacBook. Every sound is recorded from actual switches, not synthesized. Choose from 11 switch profiles including Gateron Red, Holy Panda, Alps Blue, Box Navy, and more. Spatial audio places left keys in your left speaker and right keys in your right. A reactive visualizer follows your typing in real time. Tone controls let you dial between thock and clack. Runs from your menu bar, fully offline, no data collected.
Google AI Edge Eloquent is a free, offline-first dictation app. Powered by on-device Gemma models, it automatically removes filler words and stumbles. It offers 100% local processing for privacy, with an optional Gemini cloud mode for advanced cleanup.
MAI-Transcribe-1 is Microsoft’s new multilingual speech-to-text model built for real-world audio. It delivers best-in-class accuracy across 25 languages, strong robustness in noisy environments, faster batch transcription, and pricing aimed at production speech workflows.
VoiceOS is the universal voice → action for your computer. Eliminates app-hopping, maximizes focus and productivity. Speak naturally, and VoiceOS instantly executes workflows while keeping you in control with a quick confirmation step. Works system-wide on Mac and Windows.
The native macOS sample manager built for Eurorack and hardware samplers. Organize, validate, and convert your library for Morphagene, Squid Salmple, Digitakt, and more, so you spend less time on file prep and more time making music.
Introducing Lightning V3, Smallest AI's most advanced text-to-speech model. With 100ms latency, a 3.89 WVMOS score, and support for English, Hindi, Spanish, Tamil and 15+ languages, V3 was preferred over OpenAI's GPT-4o-mini-TTS by listeners 76.2% of the time. It outputs audio at 44.1 kHz and powers voice assistants, IVR systems, content creation, and conversational AI with human-like speech. Instant voice cloning from just 10 seconds of audio. Real-time. Expressive. Enterprise-ready.
Veo 3.1 Lite is Google’s most cost-efficient video generation model on the Gemini API. It enables high-volume Text-to-Video and Image-to-Video creation at less than 50% of the cost of Fast, with 720p/1080p output, flexible aspect ratios, and adjustable durations for scalable video apps.
This Easter, turn your voice into something unexpected. On Noiz, crack a voice egg to unlock new AI voices, or create your own with a prompt and image. From playful characters to unique greetings, generate expressive voices in seconds.
SUN creates interactive audio content on demand. Generate podcasts, audiobooks, or courses on any topic, ask questions while listening, and learn in the context of your life. Unlike static platforms, SUN understands your world—from notes, emails, and AI tools—to deliver truly personalized audio experiences. Built for continuous, screen-free learning that helps you grow every day.