(beta) Using TORCH_LOGS python API with torch.compile

Created On: Jan 24, 2024 | Last Updated: Jan 31, 2024 | Last Verified: Nov 05, 2024

Author: Michael Lazos

import logging

This tutorial introduces the TORCH_LOGS environment variable and its Python API counterpart, torch._logging.set_logs, and demonstrates how to use them to observe the phases of torch.compile.

Note

This tutorial requires PyTorch 2.2.0 or later.

Setup

In this example, we’ll set up a simple Python function that performs an elementwise add and observe its compilation with the TORCH_LOGS Python API.

Note

There is also an environment variable TORCH_LOGS, which can be used to change logging settings at the command line. The equivalent environment variable setting is shown for each example.
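The pairing between a set_logs call and its TORCH_LOGS string can be sketched in plain Python. The helper below, torch_logs_value, is hypothetical and not part of PyTorch. It assumes the prefix convention described in the torch._logging documentation: a leading + requests logging.DEBUG for a component, no prefix requests logging.INFO, a leading - requests logging.ERROR, and artifacts such as graph or output_code are enabled by name. Only the +dynamo, graph, fusion, and output_code spellings are confirmed by the examples in this tutorial.

```python
import logging

# Assumed prefix convention for component log levels in TORCH_LOGS.
# Only "+" (DEBUG) is demonstrated in this tutorial; the rest is an assumption.
_LEVEL_PREFIX = {logging.DEBUG: "+", logging.INFO: "", logging.ERROR: "-"}


def torch_logs_value(**settings):
    """Hypothetical helper: turn set_logs-style keyword arguments into a TORCH_LOGS string."""
    parts = []
    for name, value in settings.items():
        if isinstance(value, bool):
            # Artifacts (graph, fusion, output_code, ...) are switched on by name.
            if value:
                parts.append(name)
        elif value in _LEVEL_PREFIX:
            # Components (dynamo, ...) carry a level, encoded as a prefix.
            parts.append(_LEVEL_PREFIX[value] + name)
        else:
            raise ValueError(f"unsupported setting for {name!r}: {value!r}")
    return ",".join(parts)


print(torch_logs_value(dynamo=logging.DEBUG))
print(torch_logs_value(graph=True, output_code=True))
```

The resulting string would then be passed on the command line, for example as the value of TORCH_LOGS in front of the python invocation.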

import torch

# exit cleanly if no CUDA device is available or the device doesn't support torch.compile
if not torch.cuda.is_available() or torch.cuda.get_device_capability() < (7, 0):
    print("Skipping because torch.compile is not supported on this device.")
else:
    @torch.compile()
    def fn(x, y):
        z = x + y
        return z + 2


    inputs = (torch.ones(2, 2, device="cuda"), torch.zeros(2, 2, device="cuda"))


    # print separator and reset dynamo
    # between each example
    def separator(name):
        print(f"==================={name}=========================")
        torch._dynamo.reset()


    separator("Dynamo Tracing")
    # View dynamo tracing
    # TORCH_LOGS="+dynamo"
    torch._logging.set_logs(dynamo=logging.DEBUG)
    fn(*inputs)

    separator("Traced Graph")
    # View traced graph
    # TORCH_LOGS="graph"
    torch._logging.set_logs(graph=True)
    fn(*inputs)

    separator("Fusion Decisions")
    # View fusion decisions
    # TORCH_LOGS="fusion"
    torch._logging.set_logs(fusion=True)
    fn(*inputs)

    separator("Output Code")
    # View output code generated by inductor
    # TORCH_LOGS="output_code"
    torch._logging.set_logs(output_code=True)
    fn(*inputs)

    separator("")
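The records printed by these calls share a common prefix, which makes them easy to filter after the fact. The sketch below parses that prefix with a regular expression inferred from the output shown in this tutorial rather than from a documented format, so treat the pattern and the field names (level, pid, compile_id, artifact) as assumptions. parse_record is a hypothetical helper, not a PyTorch API.

```python
import re

# Pattern inferred from the sample output in this tutorial (an assumption,
# not a documented format):
#   <level letter><MMDD> <HH:MM:SS.ffffff> <pid> <file>:<line>] [<id>] [<artifact>] <message>
LOG_RECORD = re.compile(
    r"^(?P<level>[A-Z])(?P<date>\d{4}) "
    r"(?P<time>\d{2}:\d{2}:\d{2}\.\d+) "
    r"(?P<pid>\d+) "
    r"(?P<file>[^:\s]+):(?P<lineno>\d+)\]"
    r"(?: \[(?P<compile_id>\d+/\d+)\])?"   # e.g. [0/0]; absent on some records
    r"(?: \[(?P<artifact>__\w+)\])?"       # e.g. [__guards]; absent on some records
    r" ?(?P<message>.*)$"
)


def parse_record(line):
    """Return the prefix fields of one log record as a dict, or None if the line does not match."""
    match = LOG_RECORD.match(line.rstrip("\n"))
    return match.groupdict() if match else None


sample = (
    "V1015 19:14:55.499000 22593 torch/_dynamo/symbolic_convert.py:1341] "
    "[0/0] [__trace_bytecode] TRACE LOAD_FAST x []"
)
record = parse_record(sample)
print(record["level"], record["artifact"], record["message"])
```

Keeping only records whose artifact field equals __guards, for instance, would isolate the guard tree from a dynamo=logging.DEBUG run, while separator lines return None and can be skipped.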

===================Dynamo Tracing=========================
I1015 19:14:55.487000 22593 torch/_dynamo/utils.py:1749] [0/0] ChromiumEventLogger initialized with id 9dfffdfd-9815-4484-a2d0-004db051375c
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0] torchdynamo start compiling fn /var/lib/workspace/recipes_source/torch_logs.py:39, stack (elided 5 frames):
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/local/bin/sphinx-build", line 7, in <module>
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     sys.exit(main())
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/local/lib/python3.10/dist-packages/sphinx/cmd/build.py", line 339, in main
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     return make_main(argv)
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/local/lib/python3.10/dist-packages/sphinx/cmd/build.py", line 213, in make_main
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     return make_mode.run_make_mode(argv[1:])
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/local/lib/python3.10/dist-packages/sphinx/cmd/make_mode.py", line 181, in run_make_mode
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     return make.run_generic_build(args[0])
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/local/lib/python3.10/dist-packages/sphinx/cmd/make_mode.py", line 169, in run_generic_build
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     return build_main(args + opts)
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/local/lib/python3.10/dist-packages/sphinx/cmd/build.py", line 293, in build_main
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     app = Sphinx(args.sourcedir, args.confdir, args.outputdir,
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/local/lib/python3.10/dist-packages/sphinx/application.py", line 272, in __init__
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     self._init_builder()
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/local/lib/python3.10/dist-packages/sphinx/application.py", line 343, in _init_builder
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     self.events.emit('builder-inited')
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/local/lib/python3.10/dist-packages/sphinx/events.py", line 97, in emit
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     results.append(listener.handler(self.app, *args))
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/local/lib/python3.10/dist-packages/sphinx_gallery/gen_gallery.py", line 757, in generate_gallery_rst
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     ) = generate_dir_rst(
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/local/lib/python3.10/dist-packages/sphinx_gallery/gen_rst.py", line 606, in generate_dir_rst
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     results = parallel(
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/local/lib/python3.10/dist-packages/sphinx_gallery/gen_rst.py", line 607, in <genexpr>
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     p_fun(fname, target_dir, src_dir, gallery_conf) for fname in iterator
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/var/lib/workspace/conf.py", line 85, in wrapper
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     p.start()
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/lib/python3.10/multiprocessing/process.py", line 121, in start
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     self._popen = self._Popen(self)
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/lib/python3.10/multiprocessing/context.py", line 224, in _Popen
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     return _default_context.get_context().Process._Popen(process_obj)
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/lib/python3.10/multiprocessing/context.py", line 281, in _Popen
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     return Popen(process_obj)
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/lib/python3.10/multiprocessing/popen_fork.py", line 19, in __init__
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     self._launch(process_obj)
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/lib/python3.10/multiprocessing/popen_fork.py", line 71, in _launch
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     code = process_obj._bootstrap(parent_sentinel=child_r)
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/lib/python3.10/multiprocessing/process.py", line 314, in _bootstrap
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     self.run()
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/lib/python3.10/multiprocessing/process.py", line 108, in run
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     self._target(*self._args, **self._kwargs)
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/var/lib/workspace/conf.py", line 73, in call_fn
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     result = func(*args, **kwargs)
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/local/lib/python3.10/dist-packages/sphinx_gallery/gen_rst.py", line 1374, in generate_file_rst
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     output_blocks, time_elapsed = execute_script(
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/local/lib/python3.10/dist-packages/sphinx_gallery/gen_rst.py", line 1192, in execute_script
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     execute_code_block(
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/local/lib/python3.10/dist-packages/sphinx_gallery/gen_rst.py", line 1048, in execute_code_block
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     is_last_expr, mem_max = _exec_and_get_memory(
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/local/lib/python3.10/dist-packages/sphinx_gallery/gen_rst.py", line 876, in _exec_and_get_memory
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     mem_max, _ = call_memory(
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/local/lib/python3.10/dist-packages/sphinx_gallery/gen_rst.py", line 1725, in _sg_call_memory_noop
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     return 0.0, func()
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/usr/local/lib/python3.10/dist-packages/sphinx_gallery/gen_rst.py", line 794, in __call__
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     exec(self.code, self.fake_main.__dict__)
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]   File "/var/lib/workspace/recipes_source/torch_logs.py", line 59, in <module>
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]     fn(*inputs)
V1015 19:14:55.489000 22593 torch/_dynamo/convert_frame.py:1397] [0/0]
I1015 19:14:55.494000 22593 torch/_dynamo/symbolic_convert.py:3842] [0/0] Step 1: torchdynamo start tracing fn /var/lib/workspace/recipes_source/torch_logs.py:39
I1015 19:14:55.494000 22593 torch/fx/experimental/symbolic_shapes.py:3769] [0/0] create_env
V1015 19:14:55.498000 22593 torch/_dynamo/symbolic_convert.py:1315] [0/0] [__trace_source] TRACE starts_line /var/lib/workspace/recipes_source/torch_logs.py:41 in fn (fn)
V1015 19:14:55.498000 22593 torch/_dynamo/symbolic_convert.py:1315] [0/0] [__trace_source]             z = x + y
V1015 19:14:55.499000 22593 torch/_dynamo/symbolic_convert.py:1341] [0/0] [__trace_bytecode] TRACE LOAD_FAST x []
V1015 19:14:55.499000 22593 torch/_dynamo/symbolic_convert.py:1341] [0/0] [__trace_bytecode] TRACE LOAD_FAST y [LazyVariableTracker()]
V1015 19:14:55.500000 22593 torch/_dynamo/symbolic_convert.py:1341] [0/0] [__trace_bytecode] TRACE BINARY_ADD None [LazyVariableTracker(), LazyVariableTracker()]
V1015 19:14:55.501000 22593 torch/_dynamo/variables/builder.py:3493] [0/0] wrap_to_fake L['x'] (2, 2) StatefulSymbolicContext(dynamic_sizes=[<DimDynamic.STATIC: 2>, <DimDynamic.STATIC: 2>], dynamic_strides=[<DimDynamic.INFER_STRIDE: 4>, <DimDynamic.INFER_STRIDE: 4>], constraint_sizes=[None, None], constraint_strides=[None, None], specialize_on=[[], []], view_base_context=None, tensor_source=LocalSource(local_name='x', is_input=True, dynamism=None, is_derefed_cell_contents=False), shape_env_to_source_to_symbol_cache={}) <class 'torch.Tensor'>
V1015 19:14:55.502000 22593 torch/_dynamo/output_graph.py:2995] [0/0] create_graph_input L_x_ L['x'] FakeTensor(..., device='cuda:0', size=(2, 2)) at debug_level 0 before=False
V1015 19:14:55.504000 22593 torch/_dynamo/variables/builder.py:3493] [0/0] wrap_to_fake L['y'] (2, 2) StatefulSymbolicContext(dynamic_sizes=[<DimDynamic.STATIC: 2>, <DimDynamic.STATIC: 2>], dynamic_strides=[<DimDynamic.INFER_STRIDE: 4>, <DimDynamic.INFER_STRIDE: 4>], constraint_sizes=[None, None], constraint_strides=[None, None], specialize_on=[[], []], view_base_context=None, tensor_source=LocalSource(local_name='y', is_input=True, dynamism=None, is_derefed_cell_contents=False), shape_env_to_source_to_symbol_cache={}) <class 'torch.Tensor'>
V1015 19:14:55.504000 22593 torch/_dynamo/output_graph.py:2995] [0/0] create_graph_input L_y_ L['y'] FakeTensor(..., device='cuda:0', size=(2, 2)) at debug_level 0 before=False
V1015 19:14:55.507000 22593 torch/_dynamo/symbolic_convert.py:1341] [0/0] [__trace_bytecode] TRACE STORE_FAST z [TensorVariable()]
V1015 19:14:55.507000 22593 torch/_dynamo/symbolic_convert.py:1315] [0/0] [__trace_source] TRACE starts_line /var/lib/workspace/recipes_source/torch_logs.py:42 in fn (fn)
V1015 19:14:55.507000 22593 torch/_dynamo/symbolic_convert.py:1315] [0/0] [__trace_source]             return z + 2
V1015 19:14:55.508000 22593 torch/_dynamo/symbolic_convert.py:1341] [0/0] [__trace_bytecode] TRACE LOAD_FAST z []
V1015 19:14:55.508000 22593 torch/_dynamo/symbolic_convert.py:1341] [0/0] [__trace_bytecode] TRACE LOAD_CONST 2 [TensorVariable()]
V1015 19:14:55.508000 22593 torch/_dynamo/symbolic_convert.py:1341] [0/0] [__trace_bytecode] TRACE BINARY_ADD None [TensorVariable(), ConstantVariable(int: 2)]
V1015 19:14:55.509000 22593 torch/_dynamo/symbolic_convert.py:1341] [0/0] [__trace_bytecode] TRACE RETURN_VALUE None [TensorVariable()]
I1015 19:14:55.510000 22593 torch/_dynamo/symbolic_convert.py:4059] [0/0] Step 1: torchdynamo done tracing fn (RETURN_VALUE)
V1015 19:14:55.510000 22593 torch/_dynamo/symbolic_convert.py:4063] [0/0] RETURN_VALUE triggered compile
V1015 19:14:55.510000 22593 torch/_dynamo/output_graph.py:1343] [0/0] COMPILING GRAPH due to GraphCompileReason(reason='return_value', user_stack=[<FrameSummary file /var/lib/workspace/recipes_source/torch_logs.py, line 42 in fn>], graph_break=False)
V1015 19:14:55.513000 22593 torch/_dynamo/output_graph.py:1983] [0/0] [__graph_code] TRACED GRAPH
V1015 19:14:55.513000 22593 torch/_dynamo/output_graph.py:1983] [0/0] [__graph_code]  ===== __compiled_fn_1_fdab5f6b_7aaf_4f80_8c22_fd04afa4e6fb =====
V1015 19:14:55.513000 22593 torch/_dynamo/output_graph.py:1983] [0/0] [__graph_code]  /usr/local/lib/python3.10/dist-packages/torch/fx/_lazy_graph_module.py class GraphModule(torch.nn.Module):
V1015 19:14:55.513000 22593 torch/_dynamo/output_graph.py:1983] [0/0] [__graph_code]     def forward(self, L_x_: "f32[2, 2][2, 1]cuda:0", L_y_: "f32[2, 2][2, 1]cuda:0"):
V1015 19:14:55.513000 22593 torch/_dynamo/output_graph.py:1983] [0/0] [__graph_code]         l_x_ = L_x_
V1015 19:14:55.513000 22593 torch/_dynamo/output_graph.py:1983] [0/0] [__graph_code]         l_y_ = L_y_
V1015 19:14:55.513000 22593 torch/_dynamo/output_graph.py:1983] [0/0] [__graph_code]
V1015 19:14:55.513000 22593 torch/_dynamo/output_graph.py:1983] [0/0] [__graph_code]          # File: /var/lib/workspace/recipes_source/torch_logs.py:41 in fn, code: z = x + y
V1015 19:14:55.513000 22593 torch/_dynamo/output_graph.py:1983] [0/0] [__graph_code]         z: "f32[2, 2][2, 1]cuda:0" = l_x_ + l_y_;  l_x_ = l_y_ = None
V1015 19:14:55.513000 22593 torch/_dynamo/output_graph.py:1983] [0/0] [__graph_code]
V1015 19:14:55.513000 22593 torch/_dynamo/output_graph.py:1983] [0/0] [__graph_code]          # File: /var/lib/workspace/recipes_source/torch_logs.py:42 in fn, code: return z + 2
V1015 19:14:55.513000 22593 torch/_dynamo/output_graph.py:1983] [0/0] [__graph_code]         add_1: "f32[2, 2][2, 1]cuda:0" = z + 2;  z = None
V1015 19:14:55.513000 22593 torch/_dynamo/output_graph.py:1983] [0/0] [__graph_code]         return (add_1,)
V1015 19:14:55.513000 22593 torch/_dynamo/output_graph.py:1983] [0/0] [__graph_code]
V1015 19:14:55.513000 22593 torch/_dynamo/output_graph.py:1983] [0/0] [__graph_code]
I1015 19:14:55.515000 22593 torch/_dynamo/output_graph.py:2167] [0/0] Step 2: calling compiler function inductor
I1015 19:14:56.828000 22593 torch/fx/experimental/symbolic_shapes.py:5242] [0/0] produce_guards
I1015 19:14:56.830000 22593 torch/fx/experimental/symbolic_shapes.py:5242] [0/0] produce_guards
I1015 19:14:56.834000 22593 torch/_dynamo/output_graph.py:2172] [0/0] Step 2: done compiler function inductor
I1015 19:14:56.837000 22593 torch/fx/experimental/symbolic_shapes.py:5242] [0/0] produce_guards
V1015 19:14:56.837000 22593 torch/fx/experimental/symbolic_shapes.py:5462] [0/0] track_symint L['x'].size()[0] 2 None
V1015 19:14:56.837000 22593 torch/fx/experimental/symbolic_shapes.py:5462] [0/0] track_symint L['x'].size()[1] 2 None
V1015 19:14:56.838000 22593 torch/fx/experimental/symbolic_shapes.py:5462] [0/0] track_symint L['x'].stride()[0] 2 None
V1015 19:14:56.838000 22593 torch/fx/experimental/symbolic_shapes.py:5462] [0/0] track_symint L['x'].stride()[1] 1 None
V1015 19:14:56.838000 22593 torch/fx/experimental/symbolic_shapes.py:5462] [0/0] track_symint L['x'].storage_offset() 0 None
V1015 19:14:56.839000 22593 torch/fx/experimental/symbolic_shapes.py:5462] [0/0] track_symint L['y'].size()[0] 2 None
V1015 19:14:56.839000 22593 torch/fx/experimental/symbolic_shapes.py:5462] [0/0] track_symint L['y'].size()[1] 2 None
V1015 19:14:56.839000 22593 torch/fx/experimental/symbolic_shapes.py:5462] [0/0] track_symint L['y'].stride()[0] 2 None
V1015 19:14:56.839000 22593 torch/fx/experimental/symbolic_shapes.py:5462] [0/0] track_symint L['y'].stride()[1] 1 None
V1015 19:14:56.840000 22593 torch/fx/experimental/symbolic_shapes.py:5462] [0/0] track_symint L['y'].storage_offset() 0 None
V1015 19:14:56.840000 22593 torch/fx/experimental/symbolic_shapes.py:5675] [0/0] Skipping guard L['x'].size()[0] == 2
V1015 19:14:56.840000 22593 torch/fx/experimental/symbolic_shapes.py:5675] [0/0] Skipping guard L['x'].size()[1] == 2
V1015 19:14:56.841000 22593 torch/fx/experimental/symbolic_shapes.py:5675] [0/0] Skipping guard L['x'].stride()[0] == 2
V1015 19:14:56.841000 22593 torch/fx/experimental/symbolic_shapes.py:5675] [0/0] Skipping guard L['x'].stride()[1] == 1
V1015 19:14:56.841000 22593 torch/fx/experimental/symbolic_shapes.py:5675] [0/0] Skipping guard L['x'].storage_offset() == 0
V1015 19:14:56.841000 22593 torch/fx/experimental/symbolic_shapes.py:5675] [0/0] Skipping guard L['y'].size()[0] == 2
V1015 19:14:56.842000 22593 torch/fx/experimental/symbolic_shapes.py:5675] [0/0] Skipping guard L['y'].size()[1] == 2
V1015 19:14:56.842000 22593 torch/fx/experimental/symbolic_shapes.py:5675] [0/0] Skipping guard L['y'].stride()[0] == 2
V1015 19:14:56.842000 22593 torch/fx/experimental/symbolic_shapes.py:5675] [0/0] Skipping guard L['y'].stride()[1] == 1
V1015 19:14:56.843000 22593 torch/fx/experimental/symbolic_shapes.py:5675] [0/0] Skipping guard L['y'].storage_offset() == 0
V1015 19:14:56.843000 22593 torch/_dynamo/guards.py:3687] [0/0] [__guards] GUARDS:
V1015 19:14:56.843000 22593 torch/_dynamo/guards.py:3404] [0/0] [__guards]
V1015 19:14:56.843000 22593 torch/_dynamo/guards.py:3404] [0/0] [__guards] TREE_GUARD_MANAGER:
V1015 19:14:56.843000 22593 torch/_dynamo/guards.py:3404] [0/0] [__guards] +- RootGuardManager
V1015 19:14:56.843000 22593 torch/_dynamo/guards.py:3404] [0/0] [__guards] | +- LAMBDA_GUARD: torch._functorch.aot_autograd.utils.top_saved_tensors_hooks ids == None  # _dynamo/output_graph.py:688 in init_ambient_guards
V1015 19:14:56.843000 22593 torch/_dynamo/guards.py:3404] [0/0] [__guards] | +- DEFAULT_DEVICE: utils_device.CURRENT_DEVICE == None                           # _dynamo/output_graph.py:676 in init_ambient_guards
V1015 19:14:56.843000 22593 torch/_dynamo/guards.py:3404] [0/0] [__guards] | +- GLOBAL_STATE: ___check_global_state()
V1015 19:14:56.843000 22593 torch/_dynamo/guards.py:3404] [0/0] [__guards] | +- TORCH_FUNCTION_MODE_STACK: ___check_torch_function_mode_stack()
V1015 19:14:56.843000 22593 torch/_dynamo/guards.py:3404] [0/0] [__guards] | +- GuardManager: source=L['x'], accessed_by=FrameLocalsGuardAccessor(key='x', framelocals_idx=0), type=<class 'torch.Tensor'>, tag_safe=(True, False)
V1015 19:14:56.843000 22593 torch/_dynamo/guards.py:3404] [0/0] [__guards] | | +- TENSOR_MATCH: check_tensor(L['x'], Tensor, DispatchKeySet(CUDA, BackendSelect, ADInplaceOrView, AutogradCUDA), torch.float32, device=0, requires_grad=False, size=[2, 2], stride=[2, 1])  # z = x + y  # ar/lib/workspace/recipes_source/torch_logs.py:41 in fn
V1015 19:14:56.843000 22593 torch/_dynamo/guards.py:3404] [0/0] [__guards] | | +- NO_HASATTR: hasattr(L['x'], '_dynamo_dynamic_indices') == False           # z = x + y  # ar/lib/workspace/recipes_source/torch_logs.py:41 in fn
V1015 19:14:56.843000 22593 torch/_dynamo/guards.py:3404] [0/0] [__guards] | | +- NO_TENSOR_ALIASING: check_no_aliasing(L['x'], L['y'])
V1015 19:14:56.843000 22593 torch/_dynamo/guards.py:3404] [0/0] [__guards] | +- GuardManager: source=L['y'], accessed_by=FrameLocalsGuardAccessor(key='y', framelocals_idx=1), type=<class 'torch.Tensor'>, tag_safe=(True, False)
V1015 19:14:56.843000 22593 torch/_dynamo/guards.py:3404] [0/0] [__guards] | | +- TENSOR_MATCH: check_tensor(L['y'], Tensor, DispatchKeySet(CUDA, BackendSelect, ADInplaceOrView, AutogradCUDA), torch.float32, device=0, requires_grad=False, size=[2, 2], stride=[2, 1])  # z = x + y  # ar/lib/workspace/recipes_source/torch_logs.py:41 in fn
V1015 19:14:56.843000 22593 torch/_dynamo/guards.py:3404] [0/0] [__guards] | | +- NO_HASATTR: hasattr(L['y'], '_dynamo_dynamic_indices') == False           # z = x + y  # ar/lib/workspace/recipes_source/torch_logs.py:41 in fn
V1015 19:14:56.843000 22593 torch/_dynamo/guards.py:3404] [0/0] [__guards] | | +- NO_TENSOR_ALIASING
V1015 19:14:56.843000 22593 torch/_dynamo/guards.py:3404] [0/0] [__guards]
V1015 19:14:56.863000 22593 torch/_dynamo/guards.py:3436] [0/0] [__guards] Guard eval latency = 46.31 us
I1015 19:14:56.864000 22593 torch/_dynamo/pgo.py:893] [0/0] put_code_state: no cache key, skipping
I1015 19:14:56.865000 22593 torch/_dynamo/convert_frame.py:1505] [0/0] run_gc_after_compile: running gc
V1015 19:14:56.869000 22593 torch/_dynamo/convert_frame.py:1832] skipping: inner (reason: in skipfiles, file: /usr/local/lib/python3.10/dist-packages/torch/_compile.py)
V1015 19:14:56.869000 22593 torch/_dynamo/convert_frame.py:1832] skipping: disable (reason: in skipfiles, file: /usr/local/lib/python3.10/dist-packages/torch/_dynamo/decorators.py)
V1015 19:14:56.870000 22593 torch/_dynamo/convert_frame.py:1832] skipping: innermost_fn (reason: in skipfiles, file: /usr/local/lib/python3.10/dist-packages/torch/_dynamo/eval_frame.py)
V1015 19:14:56.870000 22593 torch/_dynamo/convert_frame.py:1832] skipping: __init__ (reason: in skipfiles, file: /usr/local/lib/python3.10/dist-packages/torch/_dynamo/eval_frame.py)
V1015 19:14:56.870000 22593 torch/_dynamo/convert_frame.py:1832] skipping: __init__ (reason: in skipfiles, file: /usr/local/lib/python3.10/dist-packages/torch/_dynamo/eval_frame.py)
V1015 19:14:56.871000 22593 torch/_dynamo/convert_frame.py:1832] skipping: nothing (reason: in skipfiles, file: /usr/local/lib/python3.10/dist-packages/torch/_dynamo/eval_frame.py)
V1015 19:14:56.871000 22593 torch/_dynamo/convert_frame.py:1832] skipping: __call__ (reason: in skipfiles, file: /usr/local/lib/python3.10/dist-packages/torch/_dynamo/eval_frame.py)
V1015 19:14:56.871000 22593 torch/_dynamo/convert_frame.py:1832] skipping: _fn (reason: in skipfiles, file: /usr/local/lib/python3.10/dist-packages/torch/_dynamo/eval_frame.py)
===================Traced Graph=========================
I1015 19:14:56.872000 22593 torch/_dynamo/__init__.py:133] torch._dynamo.reset
I1015 19:14:56.872000 22593 torch/_dynamo/__init__.py:166] torch._dynamo.reset_code_caches
===================Fusion Decisions=========================
===================Output Code=========================
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] Output code:
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] # AOT ID: ['0_inference']
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] from ctypes import c_void_p, c_long, c_int
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] import torch
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] import math
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] import random
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] import os
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] import tempfile
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] from math import inf, nan
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] from cmath import nanj
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] from torch._inductor.hooks import run_intermediate_hooks
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] from torch._inductor.utils import maybe_profile
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] from torch._inductor.codegen.memory_planning import _align as align
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] from torch import device, empty_strided
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] from torch._inductor.async_compile import AsyncCompile
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] from torch._inductor.select_algorithm import extern_kernels
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] import triton
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] import triton.language as tl
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] from torch._inductor.runtime.triton_heuristics import start_graph, end_graph
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] from torch._C import _cuda_getCurrentRawStream as get_raw_stream
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] aten = torch.ops.aten
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] inductor_ops = torch.ops.inductor
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] _quantized = torch.ops._quantized
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] assert_size_stride = torch._C._dynamo.guards.assert_size_stride
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] assert_alignment = torch._C._dynamo.guards.assert_alignment
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] empty_strided_cpu = torch._C._dynamo.guards._empty_strided_cpu
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] empty_strided_cpu_pinned = torch._C._dynamo.guards._empty_strided_cpu_pinned
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] empty_strided_cuda = torch._C._dynamo.guards._empty_strided_cuda
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] empty_strided_xpu = torch._C._dynamo.guards._empty_strided_xpu
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] empty_strided_mtia = torch._C._dynamo.guards._empty_strided_mtia
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] reinterpret_tensor = torch._C._dynamo.guards._reinterpret_tensor
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] alloc_from_pool = torch.ops.inductor._alloc_from_pool
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] async_compile = AsyncCompile()
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] empty_strided_p2p = torch._C._distributed_c10d._SymmetricMemory.empty_strided_p2p
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] # kernel path: /tmp/torchinductor_ci-user/zd/czd6xx2awzntuoqyco474vivsorvrqy4pp7aspz5mvcy3ndlgvkr.py
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] # Topologically Sorted Source Nodes: [z, add_1], Original ATen: [aten.add]
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] # Source node to ATen node mapping:
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] #   add_1 => add_1
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] #   z => add
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] # Graph fragment:
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] #   %arg0_1 : Tensor "f32[2, 2][2, 1]cuda:0" = PlaceHolder[target=arg0_1]
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] #   %arg1_1 : Tensor "f32[2, 2][2, 1]cuda:0" = PlaceHolder[target=arg1_1]
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] #   %add : Tensor "f32[2, 2][2, 1]cuda:0"[num_users=1] = call_function[target=torch.ops.aten.add.Tensor](args = (%arg0_1, %arg1_1), kwargs = {})
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] #   %add_1 : Tensor "f32[2, 2][2, 1]cuda:0"[num_users=1] = call_function[target=torch.ops.aten.add.Tensor](args = (%add, 2), kwargs = {})
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] #   return %add_1
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] triton_poi_fused_add_0 = async_compile.triton('triton_poi_fused_add_0', '''
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] import triton
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] import triton.language as tl
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] from torch._inductor.runtime import triton_helpers, triton_heuristics
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] from torch._inductor.runtime.triton_helpers import libdevice, math as tl_math
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] from torch._inductor.runtime.hints import AutotuneHint, ReductionHint, TileHint, DeviceProperties
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] triton_helpers.set_driver_to_gpu()
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] @triton_heuristics.pointwise(
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     size_hints={'x': 4},
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     filename=__file__,
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     triton_meta={'signature': {'in_ptr0': '*fp32', 'in_ptr1': '*fp32', 'out_ptr0': '*fp32', 'xnumel': 'i32', 'XBLOCK': 'constexpr'}, 'device': DeviceProperties(type='cuda', index=0, multi_processor_count=80, cc=86, major=8, regs_per_multiprocessor=65536, max_threads_per_multi_processor=1536, warp_size=32), 'constants': {}, 'configs': [{(0,): [['tt.divisibility', 16]], (1,): [['tt.divisibility', 16]], (2,): [['tt.divisibility', 16]]}]},
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     inductor_meta={'grid_type': 'Grid1D', 'autotune_hints': set(), 'kernel_name': 'triton_poi_fused_add_0', 'mutated_arg_names': [], 'optimize_mem': True, 'no_x_dim': False, 'num_load': 2, 'num_reduction': 0, 'backend_hash': '5C4E406C711B3861DF9C100323E0EC398E2F633BD8802E2E564CD4776AA7ED44', 'are_deterministic_algorithms_enabled': False, 'assert_indirect_indexing': True, 'autotune_local_cache': True, 'autotune_pointwise': True, 'autotune_remote_cache': None, 'force_disable_caches': False, 'dynamic_scale_rblock': True, 'max_autotune': False, 'max_autotune_pointwise': False, 'min_split_scan_rblock': 256, 'spill_threshold': 16, 'store_cubin': False, 'tiling_scores': {'x': 32}},
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     min_elem_per_thread=0
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] )
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] @triton.jit
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] def triton_poi_fused_add_0(in_ptr0, in_ptr1, out_ptr0, xnumel, XBLOCK : tl.constexpr):
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     xnumel = 4
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     xoffset = tl.program_id(0) * XBLOCK
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     xindex = xoffset + tl.arange(0, XBLOCK)[:]
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     xmask = xindex < xnumel
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     x0 = xindex
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     tmp0 = tl.load(in_ptr0 + (x0), xmask)
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     tmp1 = tl.load(in_ptr1 + (x0), xmask)
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     tmp2 = tmp0 + tmp1
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     tmp3 = 2.0
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     tmp4 = tmp2 + tmp3
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     tl.store(out_ptr0 + (x0), tmp4, xmask)
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] ''', device_str='cuda')
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] async_compile.wait(globals())
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] del async_compile
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] class Runner:
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     def __init__(self, partitions):
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]         self.partitions = partitions
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     def recursively_apply_fns(self, fns):
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]         new_callables = []
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]         for fn, c in zip(fns, self.partitions):
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]             new_callables.append(fn(c))
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]         self.partitions = new_callables
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     def call(self, args):
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]         arg0_1, arg1_1 = args
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]         args.clear()
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]         assert_size_stride(arg0_1, (2, 2), (2, 1))
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]         assert_size_stride(arg1_1, (2, 2), (2, 1))
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]         with torch.cuda._DeviceGuard(0):
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]             torch.cuda.set_device(0)
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]             buf0 = empty_strided_cuda((2, 2), (2, 1), torch.float32)
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]             # Topologically Sorted Source Nodes: [z, add_1], Original ATen: [aten.add]
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]             stream0 = get_raw_stream(0)
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]             triton_poi_fused_add_0.run(arg0_1, arg1_1, buf0, 4, stream=stream0)
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]             del arg0_1
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]             del arg1_1
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]         return (buf0, )
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] runner = Runner(partitions=[])
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] call = runner.call
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] recursively_apply_fns = runner.recursively_apply_fns
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] def benchmark_compiled_module(times=10, repeat=10):
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     from torch._dynamo.testing import rand_strided
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     from torch._inductor.utils import print_performance
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     arg0_1 = rand_strided((2, 2), (2, 1), device='cuda:0', dtype=torch.float32)
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     arg1_1 = rand_strided((2, 2), (2, 1), device='cuda:0', dtype=torch.float32)
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     fn = lambda: call([arg0_1, arg1_1])
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     return print_performance(fn, times=times, repeat=repeat)
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code] if __name__ == "__main__":
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     from torch._inductor.wrapper_benchmark import compiled_module_main
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]     compiled_module_main('None', benchmark_compiled_module)
V1015 19:14:57.002000 22593 torch/_inductor/codecache.py:1231] [0/0] [__output_code]
V1015 19:14:57.008000 22593 torch/_inductor/codecache.py:1232] [0/0] [__output_code] Output code written to: /tmp/torchinductor_ci-user/ni/cni3t3cso5ijqkr6duznfm6smljxqt3tavyv3xlg62ptjgd7kmll.py
============================================
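
Each example above pairs a `torch._logging.set_logs` call with a `TORCH_LOGS` value in the comment above it. The sketch below spells out that correspondence for the settings used in this tutorial. The helper `to_torch_logs` is hypothetical and is not part of PyTorch; it assumes the prefix convention described in the torch._logging documentation (`+name` for DEBUG, `name` for INFO or an enabled artifact, `-name` for ERROR) and handles only those cases.

```python
import logging


def to_torch_logs(**settings):
    """Hypothetical helper (not a PyTorch API): render set_logs-style
    keyword arguments as the equivalent TORCH_LOGS string."""
    parts = []
    for name, value in settings.items():
        if value is None or value is False:
            # Nothing to enable for this entry.
            continue
        if value is True:
            # Artifacts such as graph, fusion, output_code are on/off flags.
            parts.append(name)
        elif value == logging.DEBUG:
            parts.append(f"+{name}")
        elif value == logging.INFO:
            parts.append(name)
        elif value == logging.ERROR:
            parts.append(f"-{name}")
        else:
            raise ValueError(f"unsupported setting for {name!r}: {value!r}")
    # Multiple settings are separated by commas.
    return ",".join(parts)


print(to_torch_logs(dynamo=logging.DEBUG))          # +dynamo
print(to_torch_logs(graph=True, output_code=True))  # graph,output_code
```

The resulting string is what you would place in the environment, for example `TORCH_LOGS="graph,output_code" python script.py`, where `script.py` stands in for your own script.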

Conclusion#

In this tutorial, we introduced the TORCH_LOGS environment variable and the Python API by experimenting with a small number of the available logging options. To view descriptions of all available options, run any Python script that imports torch with TORCH_LOGS set to "help".
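
As a minimal sketch of that last step, the snippet below launches a child Python process that only imports torch, with `TORCH_LOGS` set to `help` in its environment. The function names `help_env` and `show_logging_help` are illustrative, not PyTorch APIs, and the stream on which the descriptions appear (stdout or stderr) may depend on the PyTorch version, so the sketch prints whichever one is non-empty.

```python
import os
import subprocess
import sys


def help_env():
    """Copy the current environment and request the TORCH_LOGS help text."""
    env = dict(os.environ)
    env["TORCH_LOGS"] = "help"
    return env


def show_logging_help():
    """Run a one-line script that imports torch under TORCH_LOGS=help."""
    # Any script that imports torch would do; a one-liner keeps this short.
    return subprocess.run(
        [sys.executable, "-c", "import torch"],
        env=help_env(),
        capture_output=True,
        text=True,
    )


if __name__ == "__main__":
    result = show_logging_help()
    # Print whichever stream carries the option descriptions.
    print(result.stdout or result.stderr)
```

From a shell, the same effect can be had by prefixing the command with the variable, as in `TORCH_LOGS="help" python script.py`, where `script.py` is a placeholder for any script that imports torch.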

Alternatively, you can view the torch._logging documentation to see descriptions of all available logging options.

For more information on torch.compile, see the torch.compile tutorial.

Total running time of the script: (0 minutes 3.071 seconds)
