
Senior HPC DevOps Engineer
Senior HPC DevOps and Network Engineer role at NVIDIA responsible for architecting, building, and operating large-scale HPC/AI clusters. Responsibilities include infrastructure-as-code, CI/CD, automation, monitoring/logging, networking (InfiniBand/RoCE/Ethernet), storage, virtualization, troubleshooting, and performance tuning. Requires B.Sc. (or related) and 5+ years' experience; experience with GPUs, Slurm, Kubernetes, and cloud platforms is listed.