logo

Voice Input (Speech to Text)

Voice input on TypingMind allows you to convert spoken language into written text, making it accessible and user-friendly for all users.
We offer 2 speech-to-text providers:
  • Web API (Free)
  • OpenAI Whisper (Whisper AI - pay as you go)
Let’s see how to set them up.

1. Web API (Free)

You can use the Web API speech for free. To enable this:
  • Click the β€œmicrophone” icon on the bottom right of the app, next to the message area
  • Choose β€œWeb API” speech-to-text provider from the drop-down list
  • Start recording your voice, which will be automatically transcribed into text so you can easily feed the AI model
Image without caption

2. OpenAI Whisper

To enable OpenAI Whisper, you must enter your OpenAI API key:
  • Go to API key
Image without caption
Then follow the steps below to enable it:
  • Click the β€œmicrophone” icon on the bottom right of the app, next to the message area
  • Choose β€œOpenAI Whisper” speech-to-text provider from the drop-down list
  • Start recording your voice, which will be automatically transcribed into text so you can easily feed the AI model
πŸ’‘
You can also upload your recorded meetings/webinars with OpenAI Whisper to get the transcription. Please note that the uploaded file can not be exceeded 25MB
Image without caption