torchaudio.models.hubert_xlarge
- torchaudio.models.hubert_xlarge(encoder_projection_dropout: float = 0.0, encoder_attention_dropout: float = 0.0, encoder_ff_interm_dropout: float = 0.0, encoder_dropout: float = 0.0, encoder_layer_drop: float = 0.0, aux_num_out: Optional[int] = None) -> Wav2Vec2Model
Builds the "extra large" HuBERT model from HuBERT [Hsu et al., 2021].
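A minimal sketch of how the factory might be called, using only the keyword names from the signature above. The dropout values, the `aux_num_out` value of 32, the 16 kHz sample rate, and the names `HUBERT_XLARGE_KWARGS` and `build_and_run` are illustrative assumptions, not recommended settings or part of torchaudio.

```python
# Keyword arguments named exactly as in the hubert_xlarge signature.
# The values are illustrative assumptions, not tuned or recommended settings.
HUBERT_XLARGE_KWARGS = {
    "encoder_projection_dropout": 0.1,
    "encoder_attention_dropout": 0.1,
    "encoder_ff_interm_dropout": 0.0,
    "encoder_dropout": 0.1,
    "encoder_layer_drop": 0.05,
    "aux_num_out": 32,  # hypothetical label count for an auxiliary output layer
}


def build_and_run(kwargs=HUBERT_XLARGE_KWARGS):
    """Build the model with random weights and run one forward pass.

    Hypothetical helper for illustration only.
    """
    # Imported lazily because constructing this model is memory-intensive;
    # nothing heavy happens until the function is actually called.
    import torch
    import torchaudio

    model = torchaudio.models.hubert_xlarge(**kwargs)
    model.eval()  # disable dropout and layer drop for inference

    # One second of random audio, shaped (batch, time); 16 kHz is assumed here.
    waveform = torch.randn(1, 16000)

    with torch.inference_mode():
        # forward() returns (output, lengths); lengths is None when no
        # input lengths are passed.
        output, _ = model(waveform)

    # With aux_num_out set, the last dimension equals aux_num_out.
    return output.shape
```

The returned model has randomly initialized weights, so its outputs are only meaningful after training or after loading a checkpoint. Calling `build_and_run()` allocates the full model, so expect it to need several gigabytes of memory.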
- Parameters:
encoder_projection_dropout (float) – See wav2vec2_model().
encoder_attention_dropout (float) – See wav2vec2_model().
encoder_ff_interm_dropout (float) – See wav2vec2_model().
encoder_dropout (float) – See wav2vec2_model().
encoder_layer_drop (float) – See wav2vec2_model().
aux_num_out (int or None, optional) – See wav2vec2_model().
- Returns:
The resulting model.
- Return type: