Main Content

Speech Transcription and Synthesis

Use third-party APIs for text-to-speech and speech-to-text

Audio Toolbox™ provides examples for small-vocabulary recognition and sound synthesis. To perform general text-to-speech and speech-to-text, Audio Toolbox provides interfaces to popular third-party APIs. Supported APIs include Google® Speech, IBM® Watson Speech, and Microsoft® Azure Speech. To use this functionality, you must download the Audio Toolbox extended functionality for text2speech and speech2text from File Exchange.

Once you install the speech-to-text functionality, you can interact with it graphically in the Signal Labeler app to quickly label regions of speech.


Signal LabelerLabel signal attributes, regions, and points of interest, and extract features