Speech-To-Text

Tip: DeepGram is available in Voice Wizard Pro only. Subscribe on Patreon or Ko-fi to unlock it.

Convert speech to text and send it through OSC (to VRChat or anywhere else).
Change the speech-to-text method under Settings > Audio > Speech to Text.
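
For readers curious what "sending through OSC" looks like on the wire, here is a minimal sketch. It is not the application's own code. It assumes VRChat's documented chatbox address `/chatbox/input` and default OSC input port 9000 on the local machine; the function names (`osc_string`, `build_chatbox_message`, `send_to_vrchat`) are made up for illustration.

```python
import socket


def osc_string(text: str) -> bytes:
    """Encode a string the OSC way: null-terminated, padded to a multiple of 4 bytes."""
    # Assumption: the receiver accepts UTF-8 (strict OSC 1.0 strings are ASCII).
    data = text.encode("utf-8") + b"\x00"
    return data + b"\x00" * (-len(data) % 4)


def build_chatbox_message(text: str, send_now: bool = True, notify: bool = False) -> bytes:
    """Build an OSC packet for /chatbox/input carrying a string and two booleans."""
    # OSC booleans have no payload; they appear only as T or F in the type tag string.
    tags = ",s" + ("T" if send_now else "F") + ("T" if notify else "F")
    return osc_string("/chatbox/input") + osc_string(tags) + osc_string(text)


def send_to_vrchat(text: str, host: str = "127.0.0.1", port: int = 9000) -> None:
    """Send recognized text as a single UDP datagram (host and port are assumed defaults)."""
    packet = build_chatbox_message(text)
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.sendto(packet, (host, port))


if __name__ == "__main__":
    send_to_vrchat("Hello from speech to text")
```

Because OSC runs over UDP, the send succeeds even if nothing is listening, so a missing chatbox message usually means OSC is disabled in the receiving application or the port differs.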

Each of these methods requires some setup (except System Speech). Click the name of a Speech-to-Text method to open its wiki page for more information.

STT Methods List

Speech-to-Text Method	Description	Free Tier	Continuous Recognition
System Speech	The default method, with the worst recognition quality, although it can be improved with training and by editing the speech dictionary.	Unlimited	yes
Azure	Great recognition quality without sacrificing computational resources. Built-in translation.	5 speech recognition hours + 5 speech translation hours. This goes much further than it seems when not using continuous recognition. (For example, once your recognition hours run out you can translate from English to English, for 10 hours in total.)	both
Vosk	OK recognition quality at the cost of computational resources (CPU and RAM). Can have higher recognition quality than Web Captioner depending on the model used. (Does not work on the x86 version.)	Unlimited	yes
Web Captioner	OK recognition quality using the "Web Speech API" through Web Captioner. Only available in Google Chrome. Multi-language support.	Unlimited	yes
Whisper	AMAZING recognition quality at the cost of computational resources (GPU and RAM). Can have higher recognition accuracy than Azure depending on the model used. (Experimental implementation.) (Does not work on the x86 version.)	Unlimited	yes
DeepGram	Similar quality to Azure recognition.	Only available with Voice Wizard Pro; limits vary with the selected tier.	both
