Senior Site Reliability Engineer (Remote) Worldwide

Company: DuckDuckGo

We are a diverse, fully-distributed team from around the world, working toward a shared vision to raise the standard of trust online.

Join us as a Senior Site Reliability Engineer to help build and maintain world-class infrastructure to meet the needs of millions of users.

At DuckDuckGo, we currently serve 80+ million search queries a day (nearly doubling each year), anonymously leverage over 400 upstream sources for results, and serve more than 1PB of proxied traffic per month.

As part of our growing team, you will be dedicated to improving and scaling the reliability of our end-to-end infrastructure. We dive deep into complex operational challenges, including software, systems, automation, and process analysis. We empower our team to be self-directed and self-motivated in their work. If you'd thrive in that environment, and our core values resonate with you -- build trust, question assumptions, and validate direction -- you'll fit right in!

What you will do:

  • Lead projects from proposal through postmortem, assessing vague problems, proposing high-impact solutions, and executing them against a set of success criteria.

  • Develop effective tools, alerts, and responses to identify and address reliability risks.

  • Work closely with search engineers to triage production issues and determine appropriate remediation, including code changes and performance considerations.

  • Participate in our on-call rotation; triage and address reliability issues that come up in production.

  • Help determine the future technical direction of our deployment with an effort to improve reliability and performance.

What we are looking for:

  • At least 7 years of engineering experience, with 5+ years focused on tackling the reliability challenges of large-scale deployments and high-traffic, distributed systems

  • Experience with production troubleshooting, including: distributed systems, code, storage, networking, and operating systems

  • Moderate-to-advanced programming experience, preferably in a high-level language like Perl or Python

  • Experience participating in a 24x7 on-call rotation for a large-scale deployment.

  • Experience configuring and troubleshooting Linux and NGiNX

  • Strong organizational skills, you have an eye for detail and are not afraid to use it!

  • Effective project management skills; you have successfully launched projects from inception to production

  • Strong communication skills: You clearly articulate, in verbal and written communication, your recommendations and decisions

  • Comfortable providing feedback to an array of stakeholders, both internal and external

Other things to know:

  • While we leverage specific job titles for hiring purposes, we do not use them internally. Instead, we follow our own professional levels, with expectations for each level clearly defined across several dimensions.

  • We are a small, remote team distributed across time zones, and we rely on a variety of communication tools throughout the day. You should feel comfortable with the intricacies of this type of work situation.

  • Sometimes we meet up! While all company travel is currently on hold, once it is deemed safe to resume, expect to travel at least two times a year: once for our all-hands meetup and again for a team retreat (each ~4-5 days).

  • We believe in a focused approach to collaboration, where individual team members work on a single top priority at a time, each supporting larger, company-wide objectives. This philosophy serves to impact our vision to raise the standard of trust online.

  • Our work philosophy centers on empowered project management. All team members have opportunities to run projects.

  • Transparency supports individual and team success at DuckDuckGo. We encourage everyone to participate in areas of interest throughout the company. Anyone and everyone can (and should) ask questions and offer feedback about our products and internal projects.

  • We strive to exemplify our values (build trust, question assumptions, and validate direction) in everything we do.

  • While this is a full-time job, we offer a flexible work arrangement with no core hours, expecting an average commitment of 40 hours per week.

  • We support professional development of our team members through career advisory and a learning stipend, reinforcing our culture of growth and skill-building.

Other reasons to love working at DuckDuckGo:

  • Flexible vacation and sick leave practices

  • Flexible work schedule

  • Company-wide hack days

  • Company and team meetups

  • Open participation in company strategy

  • Family leave policy

  • Co-working reimbursement

  • Hardware and office setup benefits

  • Wellness and learning benefits

  • Charitable donation matching

  • US health benefits

  • US 401k

  • "Use good judgment" approach to company policies

DuckDuckGo does not work with any recruiting agencies or services. Instead, we work with each candidate one-on-one throughout a unique hiring process that we've built to reflect our company culture.

DuckDuckGo provides equal work opportunities to all team members and applicants and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws.

If you think you might thrive in this environment, we would love to hear from you. Use the application to apply!

Please note that:

  1. A successful candidate will be subject to a background check.

  2. By applying for this role, you confirm that information submitted is accurate and that you understand falsification is cause for denial of employment or termination.

Vacancy page : http://duckduckgo.applytojob.com/apply/YpJrn8663O/Senior-Site-Reliability-Engineer-Remote