audio Tools

Explore the best new audio tools and products curated by the community.

Think out loud and get a polished version of your thoughts

Speak the rough thought and Mutter shapes it into finished writing right where you type, about 3x faster than typing. A 100% on-device mode keeps sensitive words on your Mac.

2026-06-19

Narration Room

Turn source text into editable multi-voice scripts

MacArtificial IntelligenceAudio

Narration Room is a native Mac app, not just a text-to-speech box. It turns source text into editable multi-voice scripts, then lets creators cast voices, adjust delivery, preview on a visual timeline, and export polished audio. Standouts: source-grounded AI modes, 40+ on-device voices, PDF/Word/Markdown import, dictation mode; offline and local.

2026-06-19

VoiceOS

A voice assistant that's a real JARVIS for your computer

ProductivityAudio

VoiceOS is the universal voice → action for your computer. Eliminates app-hopping, maximizes focus and productivity. Speak naturally, and VoiceOS instantly executes workflows while keeping you in control with a quick confirmation step. Works system-wide on Mac and Windows.

2026-06-18

Tyto by ai-coustics

Audio insight that predicts voice AI performance

Developer ToolsArtificial IntelligenceAudio

Tyto is a lightweight model that runs on your audio stream and predicts whether the audio reaching your agent will cause downstream failures. It outputs a single score plus a breakdown across six dimensions: noise, speaker reverb, speaker loudness, interfering speech, background media speech, packet loss. Try it here: https://ai-coustics.github.io/Project-Tyto-Real-Time-Demo/

2026-06-17

Avatars in ElevenCreative

A dedicated entry point for talking-head video

AudioVideo

The best AI voices, now with a face. Create studio-grade talking videos from a script, a voice, and an avatar - all in one place.

2026-06-13

Tide

Layered voice notes that paint themselves

MusicUser ExperienceAudio

Tide turns voice memos into layered sound sketches. Takes stack onto one tape — hum a bassline, beatbox over it, sing the hook. The waveform paints itself as you record. Scrub like vinyl, loop the good part, send it to Choppa or your DAW. No subs, no cloud. Launch month — 50% off until the end of July!

2026-06-12

Gemini 3.5 Live Translate

Latest audio model for live speech-to-speech translation

AndroidLanguagesAudio

Gemini 3.5 Live Translate brings near real-time, natural speech translation to Google AI Studio, Google Translate and Google Meet.

2026-06-10

Krisp Voice Translation API

Real-time speech-to-speech translation built for accuracy

APIDeveloper ToolsAudio

Most voice translation APIs work great in demos. Then real users show up with background noise, accents and verification code that gets garbled. We built our technology on a million live contact center calls where accuracy is non negotiable. 96% accuracy on real calls, zero patient safety incidents, 61+ languages with any to any pair. Translation API is now available self-serve with 60 mins free credit upon signup to dev dashboard.

2026-06-09

Vaani

Lip-synced AI dubbing for creators, brands and studios

ProductivityArtificial IntelligenceAudio

Vaani is a voice-preserving AI dubbing tool to help you dub in 40+ languages, in one go, at a fraction of the cost of a traditional dub session. Where other tools give you a generic AI voice and lips that drift off-beat, Vaani clones your voice, preserves your music, and holds the meaning across languages, with frame-accurate lip sync. Built for anyone creating videos, from creators and brands to media companies, OTTs and studios.

2026-06-08

Wave

Turn your voice into text — local or cloud, your choice

ProductivityGitHubAudio

Wave lets you invoke an AI model anywhere on macOS using just your voice. Hold a hotkey, speak, and release—your speech is transcribed, processed, and the result appears exactly where you need it. If you're typing, it replaces or inserts text. If you're reading, it shows a floating answer. Works across all apps with selected text as context.

2026-06-07

LocalClicky

Control your Mac with your voice locally

Open SourceGitHubTechAudio

LocalClicky is a Mac menubar app that lets you have a real conversation with your computer - completely offline. Say "Computer" to start a session. It stays listening. You chain commands back to back. Say "goodbye" when you're done. Everything runs on your machine: voice transcription, LLM multi models, VAD, macOS say No API keys. No subscription. No data leaving your Mac. MIT licensed.

2026-06-05

Audex Trace

Trace what Apple Music is actually playing

MacAppleAudio

Audex Trace is a Mac mini player for Apple Music that shows what is actually playing in real time: codec, bit depth, sample rate, and output match status. It also shows Playing Next, estimates upcoming queue quality, and warns about tracks that may skip or fail before they interrupt playback.

2026-06-04

TaskGPT

Voice agent for MacOS

Artificial IntelligenceAudio

Command your Mac with your Voice, Uses your OpenAPI/Anthropic API key stored on your machine, we own no servers and do not retain any of your data.

2026-06-03

Curlo

Local AI search to find SFX and music by describing it

ProductivityArtificial IntelligenceAudio

Curlo is a privacy-first macOS app for searching, previewing, and organizing large sound libraries. Find SFX or music by describing what you want to hear, search for similar sounds, edit metadata & UCS, manage tags, and keep everything fully local on your Mac.

2026-05-27

Trace

No-frills offline meeting transcripts with context

NotesMenu Bar AppsAudio

A macOS menu-bar app that turns any conversation into a clean markdown transcript, with a local speech model running entirely on-device. One global shortcut brings up a small bar at the bottom of your screen. It captures your mic and the system audio as separate tracks, labels who said what, and lets you flag key moments mid-call that sit inline at the right timestamp. No bot joins the call, nothing leaves your Mac, no account, no subscription.

2026-05-26

Parrot Speech-to-text API

Fast, accurate STT for production-grade voice agents

APIArtificial IntelligenceAudio

Introducing Parrot: Ringg’s speech-to-text model for production-grade voice agents. Capture Hindi-heavy and noisy real-world conversations with low-latency inference, stronger transcript quality, and Hindi validation built for downstream workflows.

2026-05-26

JAMtime.ai

Just tell your guitar pedal how to sound

Artificial IntelligenceAudioAlpha

Tweaking knobs is a time-honored tradition in sound design. Chatting with AI is revolutionizing industries. JAMtime.ai embraces both, while keeping the human firmly in the driver's seat. Build and tweak your guitar pedal with phrases as simple or technical as you like, from "brighter" to "comb filter into a plate reverb." The AI writes a real DSP graph, not generated audio. Come fall in love with the JAMtime.ai workflow. Then take it to your DAW with free VST/AU plugins for Mac, Windows, Linux.

2026-05-22

Insta360 Mic Pro

Pro audio with a customizable color E-Ink face

HardwareAudio

Insta360 Mic Pro is a pro wireless mic with a customizable E-Ink display, 3-mic array, AI noise canceling, directional pickup modes, 32-bit float internal recording, timecode sync, 400m range, and multi-cam creator workflows.

2026-05-20

Voiser AI

Human-like AI voiceovers in 140+ languages

AndroidEducationArtificial IntelligenceAudio

Voiser helps creators, teams, and businesses turn text into the most human like AI voiceovers. With 140+ languages, 1000+ voices, emotional voice styles, custom instructions, and fast generation, you can create realistic voiceovers for videos, ads, training content, podcasts, and global projects in minutes.

2026-05-18

SUN-to-Spotify

Generate audio with SUN and send it to your Spotify library

EducationArtificial IntelligenceAudio

Download 👉 https://github.com/sunapp-ai/sun-to-spotify SUN-to-Spotify is a skill that lets you generate AI podcasts, audiobooks, and then publish them directly to your Spotify library for streaming or offline listening. Just describe what you want to hear: startup advice, history deep dives, philosophy, news, or custom learning content, and SUN creates a personalized audio experience in minutes. Built for creators, developers, and curious minds exploring the future of AI native audio.

2026-05-17

DramaBox by Resemble AI

AI turns scene descriptions into vocal performances

ProductivityArtificial IntelligenceGitHubAudio

A TTS model should give you two things: an oscar-worthy performance and a verifiable signature to prove it's yours. DramaBox is the first to do both. Describe a scene the way you would to an actor, like 'a talk show host gasps in mock shock, bursts into laughter,' and the model interprets it as performance. Every output is watermarked with Resemble Watermarker. Open source, English-only for now, find it in your Resemble account or on Hugging Face.

2026-05-15

audio Tools

Subscribe to Our Newsletter