ElevenLabs text-to-speech CLI for speaking text, browsing voices, and saving audio from the terminal.
$ brew install steipete/tap/sag # auto-taps steipete/tap
AI Analysis
Unofficial ElevenLabs text-to-speech CLI that mimics macOS say, with voice lookup, optional speaker playback, and audio file output from the terminal.
What It Enables
- Generate spoken audio from text arguments, stdin, or input files and either play it immediately or save MP3 or WAV output.
- Browse available ElevenLabs voices, filter them by name, labels, or semantic query text, and preview samples before choosing one.
- Tune model, rate, speed, latency, and voice settings so scripted voiceovers, alerts, or narration steps can be repeated from the shell.
Agent Fit
- Commands are flag-driven and non-interactive once credentials and a voice are set, so agents can call it directly in media-generation workflows.
- Machine readability is weak: voice listings are printed as tables, synthesis produces audio streams or files, and the only structured data is the internal API traffic, which the CLI does not expose as output.
- Best fit for a narrow step inside a larger automation, such as rendering narration or audible alerts, rather than inspecting or mutating complex service state.
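Because the voice listing is tabular rather than structured, an agent that needs to pick a voice programmatically would have to parse that table itself. The following is a minimal Python sketch of one way to do that, assuming columns are separated by two or more spaces or by tabs. The sample table, its column names, and the `parse_table` helper are hypothetical illustrations, not sag's actual output format or API.

```python
import re

# Columns are assumed to be separated by tabs or runs of 2+ spaces,
# so single spaces inside a cell (e.g. a two-word voice name) survive.
_COLUMN_SPLIT = re.compile(r"\t|\s{2,}")


def parse_table(text: str) -> list[dict[str, str]]:
    """Turn a whitespace-aligned table into a list of row dicts keyed by header."""
    lines = [ln for ln in text.splitlines() if ln.strip()]
    if not lines:
        return []
    headers = [h.lower() for h in _COLUMN_SPLIT.split(lines[0].strip())]
    rows = []
    for ln in lines[1:]:
        cells = _COLUMN_SPLIT.split(ln.strip())
        # Pad short rows so every dict carries every header key.
        cells += [""] * (len(headers) - len(cells))
        rows.append(dict(zip(headers, cells)))
    return rows


# Hypothetical listing, used only to exercise the parser.
SAMPLE = """\
ID        NAME      CATEGORY
abc123    Rachel    premade
def456    My Clone  cloned
"""

if __name__ == "__main__":
    for row in parse_table(SAMPLE):
        print(row["id"], row["name"], row["category"])
```

A parser like this is brittle by nature: it breaks if the real column layout differs from the assumption, so it is best treated as a stopgap for a tool that offers no structured output.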
Caveats
- Requires an ElevenLabs account and API key before most commands work.
- Much of the product's value lies in human-facing playback and voice selection, so its usefulness for unattended automation is narrower than the README feature list suggests.