Skip to content

AMD backend under-declares VGPR count in the kernel descriptor #109

Description

@Zaneham

The AMD register allocator writes a num_vgprs to the KD lower than what the emitted code actually references (tests/stress.cu declares 8 but uses up to v13), so on hardware the kernel touches registers it never allocated. Likely max_vgpr accounting in emit.c not counting register pairs or post-RA VGPRs.

Metadata

Metadata

Assignees

No one assigned

    Labels

    backend: AMDAMD GFX9/10/11/12 backend (isel, regalloc, ELF encode)bugSomething isn't working

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions