Audio Tools

Explore the best new audio tools and products curated by the community.

Mutter AI Dictation
Think out loud and get a polished version of your thoughts
Writing, Artificial Intelligence, Audio

Speak the rough thought and Mutter shapes it into finished writing right where you type, about 3x faster than typing. A 100% on-device mode keeps sensitive words on your Mac.

2026-06-19
Narration Room
Turn source text into editable multi-voice scripts
Mac, Artificial Intelligence, Audio

Narration Room is a native Mac app, not just a text-to-speech box. It turns source text into editable multi-voice scripts, then lets creators cast voices, adjust delivery, preview on a visual timeline, and export polished audio. Standouts: source-grounded AI modes, 40+ on-device voices, PDF/Word/Markdown import, dictation mode; offline and local.

2026-06-19
VoiceOS
A voice assistant that's a real JARVIS for your computer
Productivity, Audio

VoiceOS is a universal voice-to-action layer for your computer. It eliminates app-hopping and maximizes focus and productivity. Speak naturally, and VoiceOS instantly executes workflows while keeping you in control with a quick confirmation step. Works system-wide on Mac and Windows.

2026-06-18
Tyto by ai-coustics
Audio insight that predicts voice AI performance
Developer Tools, Artificial Intelligence, Audio

Tyto is a lightweight model that runs on your audio stream and predicts whether the audio reaching your agent will cause downstream failures. It outputs a single score plus a breakdown across six dimensions: noise, speaker reverb, speaker loudness, interfering speech, background media speech, and packet loss. Try it here: https://ai-coustics.github.io/Project-Tyto-Real-Time-Demo/

2026-06-17
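
For readers wondering how a consumer might handle Tyto's "single score plus a breakdown across six dimensions", here is a minimal sketch. Everything in it is an assumption for illustration: the listing does not document Tyto's output format, so the `AudioQualityReport` class, its field names, the 0-to-1 scale, and the "higher is better" convention are hypothetical and only mirror the six dimensions named above.

```python
from dataclasses import dataclass, fields


@dataclass(frozen=True)
class AudioQualityReport:
    """Hypothetical container for one overall score plus six dimensions.

    Assumption: every value lies in [0, 1] and higher means cleaner audio.
    This is not Tyto's actual schema.
    """
    score: float
    noise: float
    speaker_reverb: float
    speaker_loudness: float
    interfering_speech: float
    background_media_speech: float
    packet_loss: float


def worst_dimension(report: AudioQualityReport) -> tuple[str, float]:
    """Return the (name, value) of the lowest-scoring dimension."""
    dims = {
        f.name: getattr(report, f.name)
        for f in fields(report)
        if f.name != "score"  # skip the overall score, keep the breakdown
    }
    return min(dims.items(), key=lambda item: item[1])


report = AudioQualityReport(
    score=0.55,
    noise=0.9,
    speaker_reverb=0.8,
    speaker_loudness=0.7,
    interfering_speech=0.95,
    background_media_speech=0.85,
    packet_loss=0.2,
)
print(worst_dimension(report))
```

A breakdown like this lets a voice agent react to the specific cause of bad audio (for example, asking the caller to reconnect when packet loss dominates) instead of treating one aggregate number as opaque.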
Avatars in ElevenCreative
A dedicated entry point for talking-head video
Audio, Video

The best AI voices, now with a face. Create studio-grade talking videos from a script, a voice, and an avatar, all in one place.

2026-06-13
Tide
Layered voice notes that paint themselves
Music, User Experience, Audio

Tide turns voice memos into layered sound sketches. Takes stack onto one tape: hum a bassline, beatbox over it, sing the hook. The waveform paints itself as you record. Scrub like vinyl, loop the good part, send it to Choppa or your DAW. No subscriptions, no cloud. Launch month: 50% off until the end of July!

2026-06-12
Gemini 3.5 Live Translate
Latest audio model for live speech-to-speech translation
Android, Languages, Audio

Gemini 3.5 Live Translate brings near real-time, natural speech translation to Google AI Studio, Google Translate and Google Meet.

2026-06-10
Krisp Voice Translation API
Real-time speech-to-speech translation built for accuracy
API, Developer Tools, Audio

Most voice translation APIs work great in demos. Then real users show up with background noise, accents, and verification codes that get garbled. We built our technology on a million live contact center calls where accuracy is non-negotiable: 96% accuracy on real calls, zero patient safety incidents, and 61+ languages with any-to-any pairing. The Translation API is now available self-serve, with 60 minutes of free credit when you sign up for the dev dashboard.

2026-06-09
Vaani
Lip-synced AI dubbing for creators, brands and studios
Productivity, Artificial Intelligence, Audio

Vaani is a voice-preserving AI dubbing tool to help you dub in 40+ languages, in one go, at a fraction of the cost of a traditional dub session. Where other tools give you a generic AI voice and lips that drift off-beat, Vaani clones your voice, preserves your music, and holds the meaning across languages, with frame-accurate lip sync. Built for anyone creating videos, from creators and brands to media companies, OTTs and studios.

2026-06-08
Wave
Turn your voice into text — local or cloud, your choice
Productivity, GitHub, Audio

Wave lets you invoke an AI model anywhere on macOS using just your voice. Hold a hotkey, speak, and release—your speech is transcribed, processed, and the result appears exactly where you need it. If you're typing, it replaces or inserts text. If you're reading, it shows a floating answer. Works across all apps with selected text as context.

2026-06-07
LocalClicky
Control your Mac with your voice locally
Open Source, GitHub, Tech, Audio

LocalClicky is a Mac menu bar app that lets you have a real conversation with your computer, completely offline. Say "Computer" to start a session. It stays listening, so you can chain commands back to back. Say "goodbye" when you're done. Everything runs on your machine: voice transcription, multiple LLM models, VAD, and the macOS say command. No API keys. No subscription. No data leaving your Mac. MIT licensed.

2026-06-05
Audex Trace
Trace what Apple Music is actually playing
Mac, Apple, Audio

Audex Trace is a Mac mini player for Apple Music that shows what is actually playing in real time: codec, bit depth, sample rate, and output match status. It also shows Playing Next, estimates upcoming queue quality, and warns about tracks that may skip or fail before they interrupt playback.

2026-06-04
TaskGPT
Voice agent for macOS
Artificial Intelligence, Audio

Command your Mac with your voice. TaskGPT uses your own OpenAI or Anthropic API key, stored on your machine. We own no servers and do not retain any of your data.

2026-06-03
Curlo
Local AI search to find SFX and music by describing it
Productivity, Artificial Intelligence, Audio

Curlo is a privacy-first macOS app for searching, previewing, and organizing large sound libraries. Find SFX or music by describing what you want to hear, search for similar sounds, edit metadata & UCS, manage tags, and keep everything fully local on your Mac.

2026-05-27
Trace
No-frills offline meeting transcripts with context
Notes, Menu Bar Apps, Audio

A macOS menu-bar app that turns any conversation into a clean markdown transcript, with a local speech model running entirely on-device. One global shortcut brings up a small bar at the bottom of your screen. It captures your mic and the system audio as separate tracks, labels who said what, and lets you flag key moments mid-call that sit inline at the right timestamp. No bot joins the call, nothing leaves your Mac, no account, no subscription.

2026-05-26
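
The Trace description mentions two separately captured tracks (mic and system audio), speaker labels, and flags placed inline at the right timestamp. A minimal sketch of how such a merge could produce a markdown transcript follows. It is not Trace's implementation: the `Segment` type, the `to_markdown` function, and the output layout are all hypothetical, and real transcripts would come from a speech model rather than hand-written tuples.

```python
from typing import NamedTuple


class Segment(NamedTuple):
    """Hypothetical transcript segment from one audio track."""
    start: float   # seconds since the recording began
    speaker: str   # label for who said it
    text: str


def stamp(seconds: float) -> str:
    """Format seconds as MM:SS."""
    minutes, secs = divmod(int(seconds), 60)
    return f"{minutes:02d}:{secs:02d}"


def to_markdown(mic, system, flags=()):
    """Interleave two tracks by start time and inline flags where they occur.

    `flags` is an iterable of (seconds, note) pairs. When a flag and a
    segment share a timestamp, the segment is listed first.
    """
    events = [
        (seg.start, 0, f"**{seg.speaker}** [{stamp(seg.start)}]: {seg.text}")
        for seg in (*mic, *system)
    ]
    events += [(t, 1, f"> FLAG [{stamp(t)}]: {note}") for t, note in flags]
    return "\n".join(line for _, _, line in sorted(events))


mic = [Segment(0.0, "Me", "Hi"), Segment(65.0, "Me", "Agreed")]
system = [Segment(3.5, "Them", "Hello")]
transcript = to_markdown(mic, system, flags=[(65.0, "decision")])
print(transcript)
```

Keeping the two tracks separate until this final merge is what makes speaker labeling straightforward in the sketch: each track already belongs to one side of the call.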
Parrot Speech-to-text API
Fast, accurate STT for production-grade voice agents
API, Artificial Intelligence, Audio

Introducing Parrot: Ringg’s speech-to-text model for production-grade voice agents. Capture Hindi-heavy and noisy real-world conversations with low-latency inference, stronger transcript quality, and Hindi validation built for downstream workflows.

2026-05-26
JAMtime.ai
Just tell your guitar pedal how to sound
Artificial Intelligence, Audio, Alpha

Tweaking knobs is a time-honored tradition in sound design. Chatting with AI is revolutionizing industries. JAMtime.ai embraces both, while keeping the human firmly in the driver's seat. Build and tweak your guitar pedal with phrases as simple or technical as you like, from "brighter" to "comb filter into a plate reverb." The AI writes a real DSP graph, not generated audio. Come fall in love with the JAMtime.ai workflow. Then take it to your DAW with free VST/AU plugins for Mac, Windows, Linux.

2026-05-22
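
The JAMtime.ai blurb uses "comb filter" as an example of a technical request. For readers unfamiliar with the term, the sketch below shows a textbook feedforward comb filter, y[n] = x[n] + gain * x[n - delay]. It says nothing about how JAMtime.ai builds its DSP graph; the function name and parameters are hypothetical and chosen only for illustration.

```python
def feedforward_comb(samples, delay, gain):
    """Apply y[n] = x[n] + gain * x[n - delay] to a list of samples.

    `delay` is measured in samples and must be at least 1. Samples before
    the start of the input are treated as silence (0.0).
    """
    if delay < 1:
        raise ValueError("delay must be at least 1 sample")
    out = []
    for n, x in enumerate(samples):
        delayed = samples[n - delay] if n >= delay else 0.0
        out.append(x + gain * delayed)
    return out


# An impulse makes the filter's behavior visible: the original click,
# then a quieter copy `delay` samples later.
impulse = [1.0, 0.0, 0.0, 0.0]
print(feedforward_comb(impulse, delay=2, gain=0.5))
```

Mixing a signal with a delayed copy of itself reinforces some frequencies and cancels others, which produces the evenly spaced notches that give the comb filter its name.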
Insta360 Mic Pro
Pro audio with a customizable color E-Ink face
Hardware, Audio

Insta360 Mic Pro is a pro wireless mic with a customizable E-Ink display, 3-mic array, AI noise canceling, directional pickup modes, 32-bit float internal recording, timecode sync, 400m range, and multi-cam creator workflows.

2026-05-20
Voiser AI
Human-like AI voiceovers in 140+ languages
Android, Education, Artificial Intelligence, Audio

Voiser helps creators, teams, and businesses turn text into human-like AI voiceovers. With 140+ languages, 1000+ voices, emotional voice styles, custom instructions, and fast generation, you can create realistic voiceovers for videos, ads, training content, podcasts, and global projects in minutes.

2026-05-18
SUN-to-Spotify
Generate audio with SUN and send it to your Spotify library
Education, Artificial Intelligence, Audio

Download: https://github.com/sunapp-ai/sun-to-spotify. SUN-to-Spotify is a skill that lets you generate AI podcasts and audiobooks, then publish them directly to your Spotify library for streaming or offline listening. Just describe what you want to hear (startup advice, history deep dives, philosophy, news, or custom learning content) and SUN creates a personalized audio experience in minutes. Built for creators, developers, and curious minds exploring the future of AI-native audio.

2026-05-17
DramaBox by Resemble AI
AI turns scene descriptions into vocal performances
Productivity, Artificial Intelligence, GitHub, Audio

A TTS model should give you two things: an Oscar-worthy performance and a verifiable signature to prove it's yours. DramaBox is the first to do both. Describe a scene the way you would to an actor, like 'a talk show host gasps in mock shock, bursts into laughter,' and the model interprets it as performance. Every output is watermarked with Resemble Watermarker. Open source and English-only for now; find it in your Resemble account or on Hugging Face.

2026-05-15