LLVM ISel failure on CDNA 3 (gfx942): Cannot select: i64 = FrameIndex<4> in complex collision kernel #421

@ZJLi2013

Description

Root cause hypothesis: The AMDGPU LLVM backend's scratch memory lowering for CDNA targets uses flat addressing (i64), and the instruction selection for i64 = FrameIndex fails when the kernel has a large stack frame. This code path is likely not exercised by typical HPC/AI workloads. The same kernel may compile successfully on RDNA targets where scratch is accessed via buffer instructions with i32 offsets.
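To make the hypothesis easier to discuss, here is a minimal sketch of the kind of source pattern that tends to force a real stack slot: a large local array indexed with runtime-dependent values. This is not the collision kernel from this report. The names `large_frame_sum` and `kScratchWords` are hypothetical, it is written as plain host C++ so it can be compiled anywhere, and whether an equivalent device function actually reproduces the `FrameIndex` selection failure on gfx942 is an assumption that has not been verified.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical stand-in for a kernel body with a large private (stack) frame.
// 4096 x 4 bytes = 16 KiB of per-thread storage.
constexpr std::size_t kScratchWords = 4096;

std::uint32_t large_frame_sum(std::uint32_t seed, std::size_t probe) {
    // A large local array: too big to be kept in registers, so a compiler
    // will usually give it a stack slot (an alloca in LLVM IR).
    std::uint32_t scratch[kScratchWords];
    for (std::size_t i = 0; i < kScratchWords; ++i) {
        scratch[i] = seed + static_cast<std::uint32_t>(i);
    }

    // Runtime-dependent indices: the element addresses are computed from
    // the base address of the stack slot rather than from constants.
    const std::size_t a = probe % kScratchWords;
    const std::size_t b = (probe * 7) % kScratchWords;
    assert(a < kScratchWords && b < kScratchWords);

    return scratch[a] + scratch[b];
}
```

For example, `large_frame_sum(10, 5)` reads `scratch[5]` and `scratch[35]` and returns 60. On a GPU target, a function shaped like this would be expected to place `scratch` in scratch memory, which is the addressing path the hypothesis above is about.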

Related to Genesis issue Genesis-Embodied-AI/genesis-world#2570.

Thanks for checking
David
