How does the tensorflow-mlir library lower TensorFlow code to run on a GPU?

Is there a specific dialect that is used to lower TensorFlow to cuDNN/MKL-DNN? Also, is the code first lowered to the GPU dialect (https://mlir.llvm.org/docs/Dialects/GPU/) and then converted to either cuDNN or MKL-DNN code?

