Lead Site Reliability Engineer

ASAPP

Software Engineering
New York, NY, USA
Posted on Thursday, April 13, 2023
Join our team at ASAPP, where we're developing transformative Vertical AI designed to improve customer experience. Recognized by Forbes AI 50, ASAPP designs generative AI solutions that transform the customer engagement practices of Fortune 500 companies. With our automation and simplified work processes, we empower people to reach their full potential and create exceptional experiences for everyone involved. Work with our team of talented researchers, engineers, scientists, and specialists to help solve some of the biggest and most complex problems the world is facing.
Site Reliability Engineers (SREs) are responsible for the overall performance and reliability of ASAPP's infrastructure and products. SREs design and implement the tools that automate building reliable and performant systems. We emphasize building tools over manual processes. We implement, not administer. We’re obsessed with automation, not repetition. Our job is to focus on building reliable infrastructure and tools for our product teams so that they can solve customer problems and deliver new features, not reinvent platforms.

What you'll do

  • Work with product engineering teams on service architecture and implementation
  • Deliver configuration as code and automate everything
  • Direct and implement monitoring and alerting systems to support rapid problem diagnosis
  • Perform Root Cause Analysis and design and deliver resolutions
  • Work on our Kubernetes / AWS infrastructure to support our product engineers
  • Write software to enable secure and performant communication in our production systems

What you'll need

  • 6+ years of relevant experience bringing software to production at high scale
  • Willingness to participate in an on-call rotation, triaging and addressing production issues
  • Obsession with automation and instrumentation
  • Understanding of complex systems and failure scenarios
  • Excellent communication skills
  • Knowledge of AWS services, containers and container management frameworks
  • Familiarity with message-bus-based systems and distributed architectures
  • Proficiency in Python and/or Go

What we'd like to see

  • BS or MS degree in Computer Science, or equivalent hands-on experience
  • Experience in product-oriented environments
  • Scalable distributed applications experience

Benefits

  • Competitive compensation with stock options
  • Comprehensive medical, vision, and dental insurance
  • 401k matching
  • Fitness and wellness stipend
  • Mobile phone reimbursement
  • Mental well-being benefits
  • Professional learning and development stipend
  • Parental leave, including for adoptive and foster parents
  • 3 weeks paid time off (increases with tenure) and unlimited sick leave
ASAPP is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, disability, age, or veteran status. If you have a disability and need assistance with our employment application process, please email us at careers@asapp.com to obtain assistance. #LI-AG1 #LI-Hybrid