TACOTRON2_GRIFFINLIM_PHONE_LJSPEECH¶

torchaudio.pipelines.TACOTRON2_GRIFFINLIM_PHONE_LJSPEECH¶

基于音素的 TTS 管线，使用在 LJSpeech [Ito and Johnson, 2017] 数据集上训练了 1,500 个 epoch 的 Tacotron2 模型，并使用 GriffinLim 作为声码器。

文本处理器基于音素对输入文本进行编码。它使用 DeepPhonemizer 将字形转换为音素。该模型 (en_us_cmudict_forward) 是在 CMUDict 上训练的。

您可以在此处找到训练脚本。文本处理器设置为 “english_phonemes”。

示例 - “Hello world! T T S stands for Text to Speech!”

示例 - “The examination and testimony of the experts enabled the Commission to conclude that five shots may have been fired,”

文档