torchaudio.models.wavlm_base¶
- torchaudio.models.wavlm_base(encoder_projection_dropout: float = 0.1, encoder_attention_dropout: float = 0.1, encoder_ff_interm_dropout: float = 0.1, encoder_dropout: float = 0.1, encoder_layer_drop: float = 0.1, aux_num_out: Optional[int] = None) Wav2Vec2Model [source]¶
构建“base” WaveLM 模型 [Chen 等人, 2022]。该架构与 Wav2Vec2 模型 [Baevski 等人, 2020] 兼容,因此输出类为
Wav2Vec2Model
。- 参数:
encoder_projection_dropout (float) – 参阅
wav2vec2_model()
。encoder_attention_dropout (float) – 参阅
wav2vec2_model()
。encoder_ff_interm_dropout (float) – 参阅
wav2vec2_model()
。encoder_dropout (float) – 参阅
wav2vec2_model()
。encoder_layer_drop (float) – 参阅
wav2vec2_model()
。aux_num_out (int, optional) – 参阅
wav2vec2_model()
。
- 返回:
生成的模型。
- 返回类型: