Elastic

Platform Observability - Senior Manager, Software Development

Posted 2 Days Ago
Remote
Hiring Remotely in Canada
Senior level
The Senior Manager will lead a globally distributed team of SREs, manage observability tools, mentor staff, and ensure platform reliability. They will collaborate on observability infrastructure and best practices, translating requirements into actionable plans while advocating for employee growth and engagement.
The summary above was generated by AI

Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale — unleashing the potential of businesses and people. The Elastic Search AI Platform, used by more than 50% of the Fortune 500, brings together the precision of search and the intelligence of AI to enable everyone to accelerate the results that matter. By taking advantage of all structured and unstructured data — securing and protecting private information more effectively — Elastic’s complete, cloud-based solutions for search, security, and observability help organizations deliver on the promise of AI.

What Is the Role:

The Platform Observability Team provides critical, scalable, and efficient observability processes and tooling for Elastic's internal developers and engineering teams. We operate 200+ deployments for logs, metrics, and traces, including regional logging, metrics, and monitoring clusters. We build self-service tooling that empowers engineers to instrument, monitor, and troubleshoot their services independently. We're part of Platform Infrastructure, the organization responsible for providing the runtime for all Elastic Cloud-powered products. We are builders at heart!

Are you a leader who puts people first but still loves diving into the technical details of observability and distributed systems? We're looking for a Senior Engineering Manager to lead this globally distributed team of 7 SREs across EMEA, Americas, and APJ.

We need someone who can truly understand the "how" and "why" of observability platforms, ensuring our internal customers have the tools they need to build reliable services at scale. You'll own SLA monitoring infrastructure for Elastic Cloud (ESS and Serverless) and drive adoption of Elastic's own observability stack across the organization.

We value leaders who embrace our SRE culture: go slow to go fast, own problems end-to-end, make sound and timely decisions, and create amazing experiences for both internal and external customers.

What You Will Be Doing:
  • People & Talent Management: You will mentor and lead a globally distributed team of SREs, fostering a culture of ownership, psychological safety, and continuous improvement. You'll be responsible for the full employee lifecycle, from hiring top SRE talent to helping team members reach their next promotion and creating clear career paths.
  • Strategy & Execution: How do we turn observability needs into platform capabilities? You'll partner with engineering teams and Platform SRE leadership to understand requirements, build roadmaps, and translate them into clear deliverables. You'll drive adoption of observability best practices and ensure our platforms meet the needs of internal customers.
  • Technical & Operational Leadership: We take full accountability for our platforms. You'll partner with your team to facilitate technical discussions, navigate trade-offs, and drive delivery of high-quality observability solutions. You'll champion reliability improvements, incident management processes, and blameless postmortems, keeping our platforms production-ready.
What You Bring:
  • Management Experience: 3+ years leading technical teams, with a focus on mentoring and talent development.
  • Technical Foundation: 5+ years in SRE, DevOps, or infrastructure engineering, with enough depth to understand the work and guide senior engineers through complex problems.
  • Observability Expertise: Strong understanding of observability principles: metrics, logs, traces, and APM. You know what good looks like. Experience defining and tracking SLIs, SLOs, and error budgets.
  • SaaS at Scale: Previous success supporting high-scale, multi-tenant global platforms.
  • Distributed Systems Background: Experience operating and scaling systems in cloud environments (AWS, GCP, or Azure) and improving reliability.
  • Distributed Leadership: Experience managing geographically distributed teams across multiple time zones and cultures.
  • Operational Rigor: Experience with incident management, on-call processes, and postmortem practices.
  • Infrastructure as Code: Familiarity with Kubernetes, Terraform, and GitOps practices.
  • Communication: A knack for translating technical strategy and progress for all audiences, technical or not. Strong written communications are important here.
Bonus Points:
  • Elastic Stack Expertise: Experience with Elastic Observability, Elasticsearch, Kibana, Beats, and/or APM.
  • Platform Engineering Background: Experience building internal developer platforms or self-service tooling.
  • You enjoy working with a distributed company and the active, asynchronous communication it requires.
  • You love a diverse environment, working with people all over the world. You believe that a diverse company is a better company.
  • You are willing to listen and give everyone at the table a voice.

Compensation for this role is in the form of base salary. This role does not have a variable compensation component. The typical starting salary range for new hires in this role is listed below.

These ranges represent the lowest to highest salary we reasonably and in good faith believe we would pay for this role at the time of this posting. We may ultimately pay more or less than the posted range, and the ranges may be modified in the future.

An employee's position within the salary range will be based on several factors including, but not limited to, relevant education, qualifications, certifications, experience, skills, geographic location, performance, and business or organizational needs.

Elastic believes that employees should have the opportunity to share in the value that we create together for our shareholders. Therefore, in addition to cash compensation, this role is currently eligible to participate in Elastic's stock program. Our total rewards package also includes a company-matched Registered Retirement Savings Plan (RRSP) with dollar-for-dollar matching up to 6% of eligible earnings, along with a range of other benefits offered with a holistic emphasis on employee well-being.

The typical starting salary range for this role is:
$154,000 - $243,600 CAD
Additional Information - We Take Care of Our People

As a distributed company, diversity drives our identity. Whether you’re looking to launch a new career or grow an existing one, Elastic is the type of company where you can balance great work with great life. Your age is only a number. It doesn’t matter if you’re just out of college or your children are; we need you for what you can do.

We strive to have parity of benefits across regions and while regulations differ from place to place, we believe taking care of our people is the right thing to do.

  • Competitive pay based on the work you do here and not your previous salary
  • Health coverage for you and your family in many locations
  • Ability to craft your calendar with flexible locations and schedules for many roles
  • Generous number of vacation days each year
  • Increase your impact - We match up to $2000 (or local currency equivalent) for financial donations and service
  • Up to 40 hours each year to use toward volunteer projects you love
  • Embracing parenthood with a minimum of 16 weeks of parental leave

Different people approach problems differently. We need that. Elastic is an equal opportunity employer and is committed to creating an inclusive culture that celebrates different perspectives, experiences, and backgrounds. Qualified applicants will receive consideration for employment without regard to race, ethnicity, color, religion, sex, pregnancy, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, disability status, or any other basis protected by federal, state or local law, ordinance or regulation.

We welcome individuals with disabilities and strive to create an accessible and inclusive experience for all individuals. To request an accommodation during the application or the recruiting process, please email [email protected]. We will reply to your request within 24 business hours of submission.

Applicants have rights under Federal Employment Laws, view posters linked below: Family and Medical Leave Act (FMLA) Poster; Pay Transparency Nondiscrimination Provision Poster; Employee Polygraph Protection Act (EPPA) Poster and Know Your Rights (Poster)

Elasticsearch develops and distributes encryption software and technology that is subject to U.S. export controls and licensing requirements for individuals who are located in or are nationals of the following sanctioned countries and regions: Belarus, Cuba, Iran, North Korea, Russia, Syria, the Crimea Region of Ukraine, the Donetsk People’s Republic (“DNR”), and the Luhansk People’s Republic (“LNR”). If you are located in or are a national of one of the listed countries or regions, an export license may be required as a condition of your employment in this role. Please note that national origin and/or nationality do not affect eligibility for employment with Elastic.

Please see here for our Privacy Statement.

Top Skills

APM
AWS
Azure
Beats
DevOps
Elastic Observability
Elasticsearch
GCP
GitOps
Kibana
Kubernetes
SRE
Terraform


