speech2text
Syntax
Description
transcribes speech in the input audio signal to text using a pretrained wav2vec 2.0
model.transcript
= speech2text(audioIn
,fs
)
Using wav2vec 2.0 requires Deep Learning Toolbox™ and installing the pretrained model.
specifies options using one or more name-value arguments. For example,
transcript
= speech2text(audioIn
,fs
,Name=Value
)speech2text(x,fs,Language="es")
transcribes a signal containing
Spanish-language speech.
[
also returns the unprocessed server output from the third-party speech service.transcript
,rawOutput
] = speech2text(___)
Examples
Input Arguments
Name-Value Arguments
Output Arguments
References
[1] Baevski, Alexei, Henry Zhou, Abdelrahman Mohamed, and Michael Auli. “Wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations,” 2020. https://doi.org/10.48550/ARXIV.2006.11477.
Extended Capabilities
Version History
Introduced in R2022b