There are currently three separate passes that lower from SCF to GPU dialect
The first two convert
gpu.launch operations, while the last one is from
gpu.launch. The lowering from
gpu.launch predates the latter and has poorer semantics since it makes an assumption that the loops are parallel. This is something worth deprecating in favor of using the lowering from
The intent of this post is to get feedback if anybody is using these conversion before they are removed. The current plan is to remove these in the last week of May (after the current renaming of
loop dialect to
scf dialect is complete and also giving more time for folks to respond).
This would also mean that the following functions would also be removed
since they implement the core functionality of the above passes.