
Engineering Manager, Inference Benchmarking — AIPerf
Lead the engineering team within NVIDIA's Dynamo organization that develops AIPerf, an open-source inference benchmarking platform. Responsibilities include driving core infrastructure (load generation, ZMQ microservices, GPU telemetry, Prometheus, statistical confidence intervals, Kubernetes-native deployment), advising on integrations with vLLM, TRT-LLM, and SGLang, and hiring and mentoring senior engineers. Requires 8+ years of software engineering experience and 3+ years of engineering leadership experience.
