torch.compiler.cudagraph_mark_step_begin¶

torch.compiler.cudagraph_mark_step_begin()[source][source]¶

表示一个新的推理或训练迭代即将开始。

CUDA Graphs 将释放先前迭代中的张量。每当调用 torch.compile 时，只要没有待处理的 backward 调用，就会开始新的迭代。

如果该启发式方法不正确，例如在以下示例中所示，请手动使用此 API 进行标记。

@torch.compile(mode="reduce-overhead")
def rand_foo():
    return torch.rand([4], device="cuda")

for _ in range(5):
    torch.compiler.cudagraph_mark_step_begin()
    rand_foo() + rand_foo()

有关更多详细信息，请参阅 torch.compiler_cudagraph_trees

torch.compiler.cudagraph_mark_step_begin¶

文档

教程

资源