-
Notifications
You must be signed in to change notification settings - Fork 2
Instrumentation Modes
FindHao edited this page Aug 31, 2025
·
5 revisions
- Emits: opcode ID, warp ID, PC, kernel_launch_id, CTA IDs
- Use for: Proton instruction histogram; lowest overhead
- May be auto-enabled by analyses (see below)
- Emits: per-thread register values (plus unified registers), opcode ID, PC
- Use for: register value tracing and dataflow inspection
- Emits: 32-lane memory addresses for memory-reference instructions
- Use for: memory access pattern analysis
-
CUTRACER_INSTRUMENTaccepts comma-separated values. Analyses may also enable required modes implicitly. -
proton_instr_histogramauto-enablesopcode_only. -
deadlock_detectionauto-enablesreg_trace.
- When an analysis auto-enables a mode, you do not need to repeat it in
CUTRACER_INSTRUMENT. - Enabling additional modes increases overhead and output volume; prefer the minimal set that satisfies your analysis.