Audio Tools
Explore the best new audio tools and products curated by the community.
The best AI voices, now with a face. Create studio-grade talking videos from a script, a voice, and an avatar - all in one place.
Tide turns voice memos into layered sound sketches. Takes stack onto one tape — hum a bassline, beatbox over it, sing the hook. The waveform paints itself as you record. Scrub like vinyl, loop the good part, send it to Choppa or your DAW. No subs, no cloud. Launch month — 50% off until the end of July!
Gemini 3.5 Live Translate brings near real-time, natural speech translation to Google AI Studio, Google Translate and Google Meet.
Most voice translation APIs work great in demos. Then real users show up with background noise, accents, and verification codes that get garbled. We built our technology on a million live contact center calls where accuracy is non-negotiable. 96% accuracy on real calls, zero patient safety incidents, 61+ languages with any-to-any pairs. The translation API is now available self-serve, with 60 minutes of free credit when you sign up for the dev dashboard.
Vaani is a voice-preserving AI dubbing tool to help you dub in 40+ languages, in one go, at a fraction of the cost of a traditional dub session. Where other tools give you a generic AI voice and lips that drift off-beat, Vaani clones your voice, preserves your music, and holds the meaning across languages, with frame-accurate lip sync. Built for anyone creating videos, from creators and brands to media companies, OTTs and studios.
Wave lets you invoke an AI model anywhere on macOS using just your voice. Hold a hotkey, speak, and release—your speech is transcribed, processed, and the result appears exactly where you need it. If you're typing, it replaces or inserts text. If you're reading, it shows a floating answer. Works across all apps with selected text as context.
LocalClicky is a Mac menu bar app that lets you have a real conversation with your computer - completely offline. Say "Computer" to start a session. It stays listening. You chain commands back to back. Say "goodbye" when you're done. Everything runs on your machine: voice transcription, multiple LLM models, VAD, and the macOS say command. No API keys. No subscription. No data leaving your Mac. MIT licensed.
Audex Trace is a Mac mini player for Apple Music that shows what is actually playing in real time: codec, bit depth, sample rate, and output match status. It also shows Playing Next, estimates upcoming queue quality, and warns about tracks that may skip or fail before they interrupt playback.
Command your Mac with your voice. It uses your OpenAI/Anthropic API key, stored on your machine. We own no servers and do not retain any of your data.
Curlo is a privacy-first macOS app for searching, previewing, and organizing large sound libraries. Find SFX or music by describing what you want to hear, search for similar sounds, edit metadata & UCS, manage tags, and keep everything fully local on your Mac.
A macOS menu-bar app that turns any conversation into a clean markdown transcript, with a local speech model running entirely on-device. One global shortcut brings up a small bar at the bottom of your screen. It captures your mic and the system audio as separate tracks, labels who said what, and lets you flag key moments mid-call that sit inline at the right timestamp. No bot joins the call, nothing leaves your Mac, no account, no subscription.
Introducing Parrot: Ringg’s speech-to-text model for production-grade voice agents. Capture Hindi-heavy and noisy real-world conversations with low-latency inference, stronger transcript quality, and Hindi validation built for downstream workflows.
Tweaking knobs is a time-honored tradition in sound design. Chatting with AI is revolutionizing industries. JAMtime.ai embraces both, while keeping the human firmly in the driver's seat. Build and tweak your guitar pedal with phrases as simple or technical as you like, from "brighter" to "comb filter into a plate reverb." The AI writes a real DSP graph, not generated audio. Come fall in love with the JAMtime.ai workflow. Then take it to your DAW with free VST/AU plugins for Mac, Windows, and Linux.
Insta360 Mic Pro is a pro wireless mic with a customizable E-Ink display, 3-mic array, AI noise canceling, directional pickup modes, 32-bit float internal recording, timecode sync, 400m range, and multi-cam creator workflows.
Voiser helps creators, teams, and businesses turn text into the most human-like AI voiceovers. With 140+ languages, 1000+ voices, emotional voice styles, custom instructions, and fast generation, you can create realistic voiceovers for videos, ads, training content, podcasts, and global projects in minutes.
Download 👉 https://github.com/sunapp-ai/sun-to-spotify SUN-to-Spotify is a skill that lets you generate AI podcasts and audiobooks, then publish them directly to your Spotify library for streaming or offline listening. Just describe what you want to hear: startup advice, history deep dives, philosophy, news, or custom learning content, and SUN creates a personalized audio experience in minutes. Built for creators, developers, and curious minds exploring the future of AI-native audio.
A TTS model should give you two things: an Oscar-worthy performance and a verifiable signature to prove it's yours. DramaBox is the first to do both. Describe a scene the way you would to an actor, like 'a talk show host gasps in mock shock, bursts into laughter,' and the model interprets it as performance. Every output is watermarked with Resemble Watermarker. Open source and English-only for now. Find it in your Resemble account or on Hugging Face.
Ready-made creative workflows. Upload your input, pick a template, get a finished asset - product shots, mockups, style transfers, character sheets, and more.
For busy professionals who can't remember important details, Chronicle is a personal AI memory system that lets you voice-record facts, ideas, and information and instantly retrieve them with natural language questions. Unlike journaling apps focused on introspection and mood tracking, Chronicle is designed for total recall with minimum friction—capturing what you need to remember, not how you feel.
Pop makes voice notes first-class in everyday messaging. Amazing transcripts, a magic editor to summarise or clean up, audio editing by editing the transcript, and more.
Every time I jumped between Spotify, Zoom, and YouTube, I had to manually switch audio outputs. It drove me crazy. So I built Sound Warden. It lives in your menu bar and automatically routes each app to the audio device you want. Set it once, forget it forever. ✅ Per-app audio routing ✅ Menu bar, always accessible ✅ Lightweight, no background bloat. Built for anyone who uses multiple audio devices daily.