
Principal Site Reliability Engineer - Observability and Telemetry Platform
Responsible for designing, implementing and supporting operational and reliability aspects of a large-scale observability and telemetry collection platform; engage across the whole service lifecycle from design through deployment and operation. Requires 15+ years relevant experience, deep knowledge of Linux, networking, containers, infrastructure automation and experience with tools such as Kubernetes, OpenStack, Prometheus, Grafana and OpenTelemetry; experience with Python/Go/Perl/Ruby.