Senior Site Reliability Engineer

Posted 2 Days Ago
Remote
Senior level
Software • Automation
The Role
As a Senior Site Reliability Engineer at Rootly, you will manage critical services, define SLOs and error budgets, build tools for processes, and enhance service observability in collaboration with engineering teams. Your role involves participating in on-call rotations and improving infrastructure systems to support rapid growth and reliability in incident management.
Summary Generated by Built In

About Rootly

At Rootly, we are a mission to be the go-to way companies respond when things go wrong, helping every organization be more reliable. We do this by building an industry leading incident management platform that allows companies around the world consistently and quickly resolve incidents. We are not simply transforming an industry, we are carving an entirely new +$B segment ourselves and need incredible talent to achieve this ambitious goal together.

Customers love Rootly. Some of the fastest growing companies around the world such as NVIDIA, Figma, Canva, Tripadvisor, Squarespace and more rely on Rootly to power their critical incident management process. They obsess over our delightful enterprise-ready platform and unique partnership model. See why our customers have reviewed us 5 stars on G2.

Investors love Rootly. We are backed by some of the most respected funds in the world from Y Combinator to operators like the CTO of Dropbox and GitHub. We'd be happy to disclose our entire funding and profitability picture live during the interview. As a culture we relentlessly put transparency first. We conduct monthly financial reviews as a team so everyone has a pulse on the health of the business and publish what we are building in our weekly changelog.

About the role

This is a ground floor opportunity to be an early engineer and first SRE at Rootly and tangibly shape our trajectory. You will experience the balance of being scrappy and operating at scale. What you'll be doing described below one-day could look radically different the next. You will be empowered to identify opportunities that will help us grow and own it. In short, this role is designed for individuals that crave ownership, stimulating technical challenges, love shipping fast, and are mission driven. We won't sugar coat it. This will be hard work but guaranteed the best learning experience.

  • Participate in an on-call rotation to support critical Rootly services, and in some cases be on call with software teams
  • Participate in the definition and management of SLOs and error budgets for the Engineering teams that own services in production
  • Build tools to support our processes
  • Embed with feature delivery software teams to build and enhance observability, reliability, and availability of those services.
  • Work with other teams around Engineering to understand their systems and their challenges at the code level and identify improvements Rootly Infrastructure to improve the services they own (contribute code where possible)

About youMinimum Qualifications

  • You have 5+ years of experience in an SRE or Infrastructure Engineering role
  • 5+ years of experience writing software as a SWE or Software heavy SRE role
  • You have strong technical knowledge of cloud infrastructure, distributed systems, and reliability practices
  • You’ve supported services at web or RPC services at a significant scale

Preferred Qualifications

  • You have experience solving infrastructure problems by writing software
  • You have a big-picture perspective on systems and tools
  • You can collaborate with other Engineering teams to understand their systems and help to improve them

Projects you will work on

  • Moving off Heroku to AWS
  • Creating our CI/CD pipeline
  • Creating developer tools to enable our engineers to ship code fast and in a reliable way

Benefits

  • Comprehensive medical, dental, and vision
  • 3 weeks vacation + unlimited sick/mental health days + company-wide shutdown EOY
  • M2 MacBook Pro of choice
  • $1,000 for health and wellness
  • $1,000 for home office
  • $1,000 for visiting a teammate located in a different geography
  • WeWork membership
  • Weekly happy hour on Friday
  • Learning and advancement budget at your discretion
  • Annual retreat - at least once a year we gather together in person 🏝️
  • Ground floor opportunity to be an early member of a fast growing venture-backed startup


Rootly is an equal opportunity employer. We aim to create an environment where every team member at Rootly feels like they belong so they can have a greater impact on our business and customers. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Top Skills

Sre
Swe
The Company
HQ: San Francisco, California
35 Employees
On-site Workplace

What We Do

Rootly is an incident management platform on Slack that helps automate manual admin work during incidents.

Leading companies such as NVIDIA, Squarespace, Canva, Grammarly, OpenSea, Figma, and countless others trust Rootly to build a consistent incident response process.

See why they rate us 5 stars on G2: https://www.g2.com/products/rootly-manage-incidents-on-slack/reviews

Similar Jobs

Zeta Global Logo Zeta Global

Senior SRE Engineer

AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
Easy Apply
Remote
United States
2194 Employees
Remote
Boston, MA, USA
400 Employees

Thumbtack Logo Thumbtack

Senior Site Reliability Engineer, Systems

eCommerce • Information Technology • On-Demand • Professional Services • Software
Remote
United States
1400 Employees

Atlassian Logo Atlassian

Senior Site Reliability Engineer

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
Remote
San Francisco, CA, USA
11000 Employees

Similar Companies Hiring

General Motors Thumbnail
Transportation • Software • Robotics • Manufacturing • Information Technology • Big Data • Automotive
Detroit, MI
165000 Employees
Chime Thumbnail
Software • Security • Mobile • Machine Learning • Fintech • Data Privacy • Analytics
San Francisco, CA
1336 Employees
Apryse Thumbnail
Software • Productivity • Automation • App development
Denver, CO
542 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account