Rootly Logo

Rootly

Senior Site Reliability Engineer

Posted 8 Days Ago
Remote
3 Locations
Senior level
Remote
3 Locations
Senior level
As a Senior Site Reliability Engineer at Rootly, you will be responsible for supporting critical services, defining SLOs, building tools for process support, and collaborating with engineering teams to enhance service reliability. You will lead efforts to improve infrastructure, build CI/CD pipelines, and solve infrastructure challenges through software development.
The summary above was generated by AI

About Rootly

At Rootly, we are a mission to be the go-to way companies respond when things go wrong, helping every organization be more reliable. We do this by building an industry leading incident management platform that allows companies around the world consistently and quickly resolve incidents. We are not simply transforming an industry, we are carving an entirely new +$B segment ourselves and need incredible talent to achieve this ambitious goal together.

Customers love Rootly. Some of the fastest growing companies around the world such as NVIDIA, Figma, Canva, Tripadvisor, Squarespace and more rely on Rootly to power their critical incident management process. They obsess over our delightful enterprise-ready platform and unique partnership model. See why our customers have reviewed us 5 stars on G2.

Investors love Rootly. We are backed by some of the most respected funds in the world from Y Combinator to operators like the CTO of Dropbox and GitHub. We'd be happy to disclose our entire funding and profitability picture live during the interview. As a culture we relentlessly put transparency first. We conduct monthly financial reviews as a team so everyone has a pulse on the health of the business and publish what we are building in our weekly changelog.

About the role

This is a ground floor opportunity to be an early engineer and first SRE at Rootly and tangibly shape our trajectory. You will experience the balance of being scrappy and operating at scale. What you'll be doing described below one-day could look radically different the next. You will be empowered to identify opportunities that will help us grow and own it. In short, this role is designed for individuals that crave ownership, stimulating technical challenges, love shipping fast, and are mission driven. We won't sugar coat it. This will be hard work but guaranteed the best learning experience.

  • Participate in an on-call rotation to support critical Rootly services, and in some cases be on call with software teams
  • Participate in the definition and management of SLOs and error budgets for the Engineering teams that own services in production
  • Build tools to support our processes
  • Embed with feature delivery software teams to build and enhance observability, reliability, and availability of those services.
  • Work with other teams around Engineering to understand their systems and their challenges at the code level and identify improvements Rootly Infrastructure to improve the services they own (contribute code where possible)

About youMinimum Qualifications

  • You have 5+ years of experience in an SRE or Infrastructure Engineering role
  • 5+ years of experience writing software as a SWE or Software heavy SRE role
  • You have strong technical knowledge of cloud infrastructure, distributed systems, and reliability practices
  • You’ve supported services at web or RPC services at a significant scale

Preferred Qualifications

  • You have experience solving infrastructure problems by writing software
  • You have a big-picture perspective on systems and tools
  • You can collaborate with other Engineering teams to understand their systems and help to improve them

Projects you will work on

  • Moving off Heroku to AWS
  • Creating our CI/CD pipeline
  • Creating developer tools to enable our engineers to ship code fast and in a reliable way

Benefits

  • Comprehensive medical, dental, and vision
  • 3 weeks vacation + unlimited sick/mental health days + company-wide shutdown EOY
  • M2 MacBook Pro of choice
  • $1,000 for health and wellness
  • $1,000 for home office
  • $1,000 for visiting a teammate located in a different geography
  • WeWork membership
  • Weekly happy hour on Friday
  • Learning and advancement budget at your discretion
  • Annual retreat - at least once a year we gather together in person 🏝️
  • Ground floor opportunity to be an early member of a fast growing venture-backed startup


Rootly is an equal opportunity employer. We aim to create an environment where every team member at Rootly feels like they belong so they can have a greater impact on our business and customers. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Top Skills

AWS
Ci/Cd
Sre
Swe

Similar Jobs

Yesterday
Texas, USA
Remote
460 Employees
Senior level
460 Employees
Senior level
Food • Logistics • Mobile • On-Demand • App development
As a Senior Site Reliability Engineer, you will enhance system reliability and performance by managing cloud and configuration processes while collaborating with various engineering teams. Responsibilities include monitoring system performance, automating operational tasks, troubleshooting service disruptions, mentoring team members, and improving team processes.
2 Days Ago
Moline, IL, USA
Remote
Hybrid
69,000 Employees
Senior level
69,000 Employees
Senior level
Artificial Intelligence • Cloud • Internet of Things • Machine Learning • Analytics • Industrial
The Senior Site Reliability Engineer at John Deere will design and maintain scalable infrastructure solutions, collaborate with software teams, automate deployment processes, implement monitoring solutions, and ensure system reliability and security. This role includes participating in incident response and mentoring junior engineers.
An Hour Ago
United States
Remote
2,194 Employees
Senior level
2,194 Employees
Senior level
AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
The Senior Site Reliability Engineer will implement and manage service level objectives, lead postmortems, enhance system reliability, automate operations, and analyze key metrics to drive improvements. They will collaborate with product teams and lead initiatives on capacity and reliability while utilizing technology like OpenTelemetry and AWS.

What you need to know about the Calgary Tech Scene

Employees can spend up to one-third of their life at work, so choosing the right company is crucial, not just for the job itself but for the company culture as well. While startups often offer dynamic culture and growth opportunities, large corporations provide benefits like career development and networking, especially appealing to recent graduates. Fortunately, Calgary stands out as a hub for both, recognized as one of Startup Genome's Top 100 Emerging Ecosystems, while also playing host to a number of multinational enterprises. In Calgary, job seekers can find a wide range of opportunities.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account