
Senior Performance Compiler Engineer - Triton
Work on the open-source Triton compiler to design and implement compiler technology (using MLIR and Triton's Python DSL), optimize high-level kernels to efficient low-level GPU code, and hand-tune critical paths (inline PTX). Collaborate with hardware architects and the CUDA compiler team; experience tuning BLAS/deep-learning kernels, numerics/linear algebra, and ML compilers (TVM/MLIR) is a plus. Base salary range: 184,000 USD - 287,500 USD per year.













