Media & Creative

Voice

Record a message, transcribe speech, hear your AI talk back.

Talk to MoClaw in voice and get a voice reply. Record a message in any chat surface, MoClaw transcribes it, generates a response, and (if you want) reads the response back in a chosen voice. Works in the web app, the iOS and Android Telegram bot, and any Slack DM.

How it works

3 steps to wire up Voice, no engineering required.

  1. 1

    Tap the mic in any chat

    On web, hold space-bar. On Telegram, send a voice note. On Slack, attach an audio file.

  2. 2

    MoClaw transcribes and replies

    Whisper-class transcription with multi-language support. The reply comes back as text by default.

  3. 3

    Or have it read back to you

    Toggle 'voice reply' in Settings to get audio replies in a voice you pick (ElevenLabs, OpenAI TTS, or system voices).

Try saying

Real prompts you can paste into Voice.

  • Voice memo: 'Remind me to follow up with the design team about the new homepage by EOD Wednesday.'
  • While driving: 'Read me the headlines from Hacker News, top 5.'
  • Send me a daily morning brief as a 90-second voice note instead of text.

Step by step demo

What actually happens when you send the prompt.

Prompt 01 4 steps

“Voice memo from the car: 'Email Sarah saying we're a yes on the Q2 partnership and ask her to send the contract.'”

What MoClaw does

  1. 1 Transcribes the voice note using Whisper.
  2. 2 Looks up Sarah in your Gmail contacts (most recent thread).
  3. 3 Drafts the email in your voice — short, decisive, asks for the contract.
  4. 4 Asks for confirmation in chat (since it's a real outbound email).
Result

Reply pops up: 'Drafted to sarah@partner.co. Subject: Q2 partnership — yes from us. Body: Sarah, we're a yes on the Q2 partnership. Can you send the contract this week? Best, [you]. Send?' You tap send while still driving.

Voice integration for busy teams and founders

Teams that commonly use Voice with MoClaw workflows.

FAQ

Quick answers about pricing, privacy, and limits.

What languages does transcription support?
100+ via Whisper. Auto-detected per recording. Mixing languages in the same recording works too.
Which voices can I pick?
On free, OpenAI TTS voices (Alloy, Echo, Fable, Onyx, Nova, Shimmer). On paid, the full ElevenLabs catalog plus voice cloning from a 30-second sample.
Does it work hands-free?
On the iOS Telegram client, yes — voice in, voice out, no taps. On the web, you need to start each turn with space-bar (browser security restriction).
Where does my audio go?
Encrypted in flight, transcribed on a hosted Whisper instance, audio deleted within 60 seconds. Transcripts stay in your chat history (encrypted, deletable from Settings).

Try MoClaw free.

1,000 credits a month, or bring your own key for unlimited usage.

Cancel anytime