滑动窗口均值归一化¶
- class torchaudio.transforms.SlidingWindowCmn(cmn_window: int = 600, min_cmn_window: int = 100, center: bool = False, norm_vars: bool = False)[source]¶
对每个语音应用滑动窗口倒谱均值(以及可选的方差)归一化。
- 参数:
- 示例
>>> waveform, sample_rate = torchaudio.load("test.wav", normalize=True) >>> transform = transforms.SlidingWindowCmn(cmn_window=1000) >>> cmn_waveform = transform(waveform)