Using attributes to specify workgroup configuration when lowering to GPU

antiagainst · February 10, 2020, 7:22pm

For constant workgroup sizes in gpu.launch it’s easy since we can just fold that into the generated SPIR-V module. For non-constant cases, we should still be able to define workgroup sizes at launch time via specialization constants. Seeing an gpu.launch with non-constant workgroup sizes, I think we should generate a SPIR-V module with specialization constants for the workgroup sizes. Then before launch, which is vkCmdDispatch in Vulkan, the Vulkan runtime should feed in the concrete values for the specialization constants for hardware driver compilers to finally compile the SPIR-V module. This cames at its costs: we need to specialize and recompile the SPIR-V module for every gpu.launch with non-constant workgroup sizes.

For IREE specifically, due to the fact that the runtime side is in IREE and the CodeGen side is in MLIR core, we need extra handshake and contracts to make the above happen. (E.g., the CodeGen need to know the SpecIds for the specialization constants to generate.)

Topic		Replies	Views
Allocating workgroup memory for use by mlir-cuda-runner MLIR	15	950	October 20, 2020
GPU Workgroup/shared memory address space is hard coded MLIR	9	774	December 29, 2022
[MLIR][GPU][Linalg] How to find a valid lowering pipeline for workgroup memory usage via `transform.structured.promote`? MLIR linalg , gpu	2	156	April 2, 2024
How to lower the combination of async gpu ops in `gpu` Dialect MLIR gpu	6	505	August 28, 2023
Running reductions on GPU MLIR	0	38	May 15, 2024

Using attributes to specify workgroup configuration when lowering to GPU

Related Topics