torchaudio.prototype.pipelines

The pipelines subpackage contains APIs for models that ship with pretrained weights and the utilities needed to use them.

RNN-T Streaming/Non-Streaming ASR

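`EMFORMER_RNNT_BASE_MUSTC`	Pretrained Emformer-RNNT-based ASR pipeline capable of performing both streaming and non-streaming inference.
`EMFORMER_RNNT_BASE_TEDLIUM3`	Pretrained Emformer-RNNT-based ASR pipeline capable of performing both streaming and non-streaming inference.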
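
The non-streaming path can be sketched as follows. This is a minimal sketch, assuming both bundles expose the same interface as `torchaudio.pipelines.RNNTBundle` (`get_feature_extractor()`, `get_decoder()`, `get_token_processor()`). The helper name `transcribe` and its `beam_width` default are hypothetical and not part of torchaudio.

```python
def transcribe(bundle, waveform, beam_width=10):
    """Run non-streaming ASR with an RNN-T bundle (hypothetical helper).

    Assumptions: `bundle` follows the RNNTBundle interface, and `waveform`
    is a mono 1-D tensor already resampled to `bundle.sample_rate`.
    """
    # Build the three pipeline stages from the bundle.
    feature_extractor = bundle.get_feature_extractor()
    decoder = bundle.get_decoder()
    token_processor = bundle.get_token_processor()

    # Waveform -> features and their length.
    features, length = feature_extractor(waveform)
    # Beam search returns hypotheses, best first; element 0 of a hypothesis
    # holds its token IDs.
    hypotheses = decoder(features, length, beam_width)
    # Token IDs -> text.
    return token_processor(hypotheses[0][0])
```

Under these assumptions, a call might look like `transcribe(torchaudio.prototype.pipelines.EMFORMER_RNNT_BASE_TEDLIUM3, waveform)`. Expect the first call to download the pretrained weights.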

HiFiGANVocoderBundle defines a HiFiGAN Vocoder pipeline capable of transforming mel spectrograms into waveforms.

Data class that bundles the information needed to use a pretrained HiFiGANVocoder.

HiFiGAN Vocoder pipeline, trained on The LJ Speech Dataset [Ito and Johnson, 2017].
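
To illustrate the mel-spectrogram-to-waveform flow described above, here is a minimal sketch. It assumes a `HiFiGANVocoderBundle` instance provides `get_mel_transform()` and `get_vocoder()`. The helper name `resynthesize` is hypothetical, and the tensor shapes are whatever those two components expect.

```python
def resynthesize(bundle, waveform):
    """Round-trip audio through a HiFiGAN vocoder bundle (hypothetical helper).

    Assumptions: `bundle` is a HiFiGANVocoderBundle-like object, and
    `waveform` is a batched tensor already resampled to `bundle.sample_rate`.
    """
    # Transform that produces mel spectrograms in the format the vocoder
    # was trained on.
    mel_transform = bundle.get_mel_transform()
    # Pretrained vocoder that maps mel spectrograms back to audio.
    vocoder = bundle.get_vocoder()

    mel_spectrogram = mel_transform(waveform)
    return vocoder(mel_spectrogram)
```

In a text-to-speech setting the mel spectrogram would normally come from an acoustic model rather than from `get_mel_transform()`. The round trip here is only a convenient way to check that the vocoder is wired up correctly.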

`VGGishBundle`	VGGish [Hershey et al., 2017] inference pipeline ported from torchvggish and tensorflow-models.
`VGGishBundle.VGGish`	Implementation of the VGGish model [Hershey et al., 2017].
`VGGishBundle.VGGishInputProcessor`	Converts raw waveforms into batched examples to use as inputs to VGGish.

Pretrained VGGish [Hershey et al., 2017] inference pipeline ported from torchvggish and tensorflow-models.
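
The two VGGish components listed above are typically chained: the input processor batches the raw waveform, and the model turns each example into an embedding. The following is a minimal sketch, assuming a `VGGishBundle` instance provides `get_input_processor()` and `get_model()`. The helper name `embed` is hypothetical.

```python
def embed(bundle, waveform):
    """Compute VGGish embeddings for a waveform (hypothetical helper).

    Assumptions: `bundle` is a VGGishBundle-like object, and `waveform` is a
    mono 1-D tensor already resampled to `bundle.sample_rate`.
    """
    # VGGishInputProcessor: raw waveform -> batch of fixed-size examples.
    input_processor = bundle.get_input_processor()
    # VGGish: batch of examples -> one embedding per example.
    model = bundle.get_model()

    examples = input_processor(waveform)
    return model(examples)
```

The bundle is passed in as an argument so that the sketch does not depend on the name of any particular pretrained bundle.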