Site Reliability Engineer - Product Resilience



Software Engineering, Product
San Jose, CA, USA
Posted on Thursday, April 25, 2024

** Sponsorship is not available for this position **

What you can expect

As a senior-level Product Resilience SRE, you will define, scope, plan, and schedule Disaster Recovery Testing at Zoom. You will document any gaps identified by our testing and drive technical solutions to address them. You will also produce designs and lead more junior team members through their implementation and deployment to production. You will design, develop, deploy, monitor, and scale DevOps Platform Services. You will own and create our Disaster Recovery documentation, including externally-consumable documents. Finally, you will communicate with stakeholders including security teams, senior managers, and customers.

About the Team

You will be part of a DevOps team responsible for deploying and operating datacenter and cloud software infrastructure. You will be the tech lead for production resilience and disaster recovery, and you will define the roadmap for improvements in this area. Broadly speaking, your charter is to ensure that we are adequately prepared for disaster scenarios, and that external audiences can understand our readiness. This role requires advanced communication skills and the ability to interact with less technical audiences. You must also be technical yourself in order to drive meaningful improvements to our production software systems.

What we’re looking for

  • Experience in SRE or DevOps (8+ years) - this is a senior-level position
  • Experience with at least one programming language, in addition to scripting languages
  • Experience with logging and monitoring tools (e.g. ELK stack, Prometheus, Grafana)
  • Able to translate complex technical solutions into externally-consumable presentations and documents (verbal and written)
  • Able to participate in on-call shifts and incident management as well as work after hours/weekends for application releases and deployments
  • Familiarity with cloud infrastructure technologies (e.g. Kubernetes, Terraform)
  • Experience with system design and distributed computing at scale
  • Familiarity with chaos engineering and fault injection tools (e.g. Chaos Monkey, AWS Fault Injection Service)
  • Bachelor's or Master's degree in CS or a related major (nice to have)

Salary Range or On Target Earnings:





At Zoom, we offer a window of at least 5 days for you to apply because we believe in giving you every opportunity. Below is the potential closing date, just in case you want to mark it on your calendar. We look forward to receiving your application!

Anticipated Position Close Date:


In addition to the base salary and/or OTE listed, Zoom has a Total Direct Compensation philosophy that takes into consideration base salary, bonus, and equity value.

Information about Zoom’s benefits is on our careers page here.

Note: Starting pay will be based on a number of factors and commensurate with qualifications & experience.

We also have a location-based compensation structure; there may be a different range for candidates in this and other locations.

Ways of Working
Our structured hybrid approach is centered around our offices and remote work environments. The work style of each role (Hybrid, Remote, or In-Person) is indicated in the job description/posting.

As part of our award-winning workplace culture and commitment to delivering happiness, our benefits program offers a variety of perks, benefits, and options to help employees maintain their physical, mental, emotional, and financial health; support work-life balance; and contribute to their community in meaningful ways. Click Learn for more information.

About Us
Zoomies help people stay connected so they can get more done together. We set out to build the best collaboration platform for the enterprise, and today help people communicate better with products like Zoom Contact Center, Zoom Phone, Zoom Events, Zoom Apps, Zoom Rooms, and Zoom Webinars.
We’re problem-solvers, working at a fast pace to design solutions with our customers and users in mind. Here, you’ll work across teams to deliver impactful projects that are changing the way people communicate, and you’ll enjoy opportunities to advance your career in a diverse, inclusive environment.

Our Commitment
We believe that the unique contributions of all Zoomies are the driver of our success. To make sure that our products and culture continue to incorporate everyone's perspectives and experience, we never discriminate on the basis of race, religion, national origin, gender identity or expression, sexual orientation, age, or marital, veteran, or disability status. Zoom is proud to be an equal opportunity workplace and is an affirmative action employer. All your information will be kept confidential according to EEO guidelines.

We welcome people of different backgrounds, experiences, abilities, and perspectives, including qualified applicants with arrest and conviction records and any qualified applicants requiring reasonable accommodations in accordance with the law.

If you need assistance navigating the interview process due to a medical disability, please submit an Accommodations Request Form and someone from our team will reach out soon. This form is solely for applicants who require an accommodation due to a qualifying medical disability. Non-accommodation-related requests, such as application follow-ups or technical issues, will not be addressed.