Top Reliability Engineer Jobs in Calgary

Coupa

Lead Database Reliability Engineer - 11606

Reposted 10 Days Ago

In-Office or Remote

Calgary, AB

Senior level

Artificial Intelligence • Fintech • Information Technology • Logistics • Payments • Business Intelligence • Generative AI

Lead design, automation, and maintenance of cloud-based database infrastructure (primarily SQL Server and MySQL). Improve reliability with monitoring, HA/DR, automation, troubleshooting, on-call support, and mentoring of junior engineers while collaborating across teams.

Top Skills: Aurora, AWS, Bash, Failover Clustering, MySQL, New Relic, Orchestrator, Pmm, Python, Rds, Ruby, SQL Server, Vividcortex

Tempus AI

Site Reliability Engineer

Reposted 6 Days Ago

Hybrid

Calgary, AB

Mid level

Artificial Intelligence • Big Data • Healthtech • Machine Learning • Analytics • Biotech • Generative AI

The Site Reliability Engineer will manage cloud infrastructure, automate tasks, collaborate in agile teams, and ensure service reliability and quality.

Top Skills: Aurora Mysql, AWS, Azure, Bash, Docker, GCP, Go, Kubernetes, Postgres, Python, Ruby, Terraform

Enverus

Staff Site Reliability Engineer - 26248

8 Days Ago

In-Office or Remote

Calgary, AB

Senior level

Big Data • Information Technology • Software • Analytics • Energy

Manage and scale Enverus' global AWS infrastructure, automate deployments and CI/CD, ensure high uptime, collaborate with developers to enable zero-downtime releases, participate in on-call rotations, and improve operational practices.

Top Skills: AWS, Azure, C#, Ci/Cd, CloudFormation, Go, Kubernetes, Linux, Python, Terraform, Windows

Coinbase

Staff Software Engineer, Core Reliability

19 Hours Ago

Easy Apply

Remote

Calgary, AB

Expert/Leader

Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3

Lead high-impact reliability projects to improve resiliency, scalability, and deployment safety for thousands of services. Build secure configuration and secrets systems, improve canary-based release systems, partner with critical teams to reduce operational toil, drive observability and reliability best practices, participate in on-call rotations, and communicate architecture decisions to stakeholders.

Top Skills: AWS, Azure, Datadog, GCP, Generative Ai, Go, Kibana, Ruby, Terraform

Block

Senior Site Reliability Engineer

Reposted 3 Days Ago

In-Office or Remote

Calgary, AB

Senior level

Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency

The Senior Site Reliability Engineer will enhance reliability of Block's platform, improve incident response using AI tools, and coordinate incident management. Responsibilities include building reliable systems, standardizing tools, and leading high-severity incidents during on-call rotations.

Top Skills: Amazon Web Services, Datadog, DynamoDB, Grpc, HTTP, Istio, Java, JSON, Kotlin, Kubernetes, Launchdarkly, MySQL, Protocol Buffers, Terraform, Vitess

Affirm

Software Engineer II, Backend (Reliability Platform)

Reposted 5 Days Ago

Easy Apply

Remote

Calgary, AB

Mid level

Big Data • Fintech • Mobile • Payments • Financial Services

Design and build a centralized reliability platform for production systems, integrating distributed-systems engineering with AI-assisted tools. Implement AI agents for incident triage, log/trace summarization, and developer-facing APIs. Own projects end-to-end and collaborate with product, infra, data, and SRE teams to iterate and improve system health and debuggability.

Top Skills: Claude, Cursor, Distributed Systems, Github Copilot, Llms, Python

Stripe

Integration Reliability Engineer, Payments

2 Days Ago

Remote

Calgary, AB

Senior level

Payments • Software

Own and scale payments partner integrations and reporting; automate manual processes; build tooling and anomaly detection; collaborate with engineers, accounting, and partners to trace and reconcile large-scale money flows; troubleshoot and implement code changes.

Top Skills: Git, Java, Ruby, SQL

GitLab

Site Reliability Engineer, Intermediate to Senior Staff — Infrastructure Platforms

13 Days Ago

Easy Apply

Remote

Calgary, AB

Senior level

Cloud • Security • Software • Cybersecurity • Automation

Maintain and improve reliability, scalability, and automation for user-facing production systems. Build infrastructure tooling, operate Kubernetes-based services, write IaC, participate in on-call and incident response, and advance observability and runbooks to reduce toil and improve platform reliability.

Top Skills: AWS, Ci/Cd, GCP, Gitops, Go, Infrastructure As Code (Iac), Kubernetes, Kubernetes Operators/Controllers, Logging, Metrics, Ruby, Slos/Slis, Terraform

OpsMill

Product Reliability Engineer | US

Reposted 13 Days Ago

Remote

Calgary, AB

Mid level

Information Technology • Software • Database • Automation

Own on-prem reliability and escalations: reproduce and resolve L2/L3 issues across heterogeneous Kubernetes environments, build diagnostics and automation, improve CI and e2e test stability, establish performance baselines, harden install/upgrade flows, and write tooling in Python/Go/Rust to reduce repeat incidents.

Top Skills: Benchmarking, Ci, Ci/Cd, Containers, E2E Testing, Go, Health Checks, Helm, Installers, Integration Testing, Kubernetes, Load Generation, Logs, Metrics, Networking, Observability, Packaging, Profiling, Python, Rbac, Rust, Storage, Support Bundles, Traces

Chelsea Avondale

Reliability Engineer

Reposted 17 Days Ago

Remote

Calgary, AB

Junior

Insurance

As a Reliability Engineer, you'll design, implement, and maintain AWS cloud environments, ensuring systems' reliability and performance, while enhancing monitoring and incident response capabilities.

Top Skills: AWS, Nginx, Python, Unix, Windows

Embedding VC

Founding Platform & Reliability Engineer

Reposted 17 Days Ago

In-Office or Remote

Calgary, AB

Senior level

Artificial Intelligence • Software • Generative AI

The Founding Platform & Reliability Engineer will design and operate reliable, scalable infrastructure for an AI storytelling platform, involving hands-on implementation and strategic decision-making.

Top Skills: Amplitude, AWS, Cloud Run, Firebase, GCP, Modal, Next.Js, Node.js, Python, React, Redis, Sentry, Typescript, Upstash

Babylist

Staff Engineer, Site Reliability

24 Days Ago

Easy Apply

Remote or Hybrid

Calgary, AB

Senior level

eCommerce • Healthtech • Kids + Family • Retail • Social Media

Own and evolve Babylist's AWS infrastructure, Terraform IaC, Kubernetes/EKS clusters, CI/CD, and observability for a platform serving millions. Lead incident response, improve developer tooling, and set reliability standards across engineering teams.

Top Skills: AWS, Cdn, CircleCI, Cloud Networking, Cronitor, Datadog, Dns, Eks, Github Actions, Kubernetes, Load Balancers, MySQL, Pagerduty, Rds, Redis, Ruby On Rails, Sentry, Sidekiq, Terraform

AuthZed

Sr. Site Reliability Engineer

Reposted 21 Hours Ago

Remote

Calgary, AB

Senior level

Artificial Intelligence • Information Technology • Software • Database

As a Site Reliability Engineer, you will design, implement, and maintain scalable infrastructure, ensure system reliability, automate processes, and collaborate with engineering teams.

Top Skills: Docker, Elk Stack, Go, Grafana, Java, Kubernetes, Node.js, Prometheus, Pulumi, Python, Ruby, Terraform

Kong

Senior SRE, Managed Gateways

Yesterday

In-Office or Remote

Calgary, AB

Senior level

Artificial Intelligence • Cloud • Information Technology • Software • Big Data Analytics

Lead reliability and implementation for Kong's Managed Gateways: design and operate multi-cloud, Kubernetes-based systems, own incident response and SLOs, automate CI/CD and IaC, mentor SREs, and drive enterprise customer onboarding and technical implementations.

Top Skills: Ansible, AWS, Azure, Cassandra, Ci/Cd, Datadog, Elk, GCP, Go, Grafana, Istio, Kubernetes, Linkerd, Postgres, Prometheus, Terraform

Sapsol Technologies Inc

Reliability Engineer

Reposted 23 Days Ago

Remote

Calgary, AB

Expert/Leader

Information Technology • Software

The Reliability Engineer will develop and implement reliability test plans for IVD medical devices, conduct analyses, and lead cross-functional projects while ensuring compliance with regulatory standards.

Top Skills: Jmp, Matlab, Minitab, Python, R

Blue Origin

Senior Reliability Engineer – Hardware Systems

24 Days Ago

In-Office or Remote

Calgary, AB

Senior level

Aerospace

Lead reliability strategy and environmental validation for satellite user terminal hardware. Design and run HALT/HASS/ESS tests, oversee weatherproofing and thermal/vibration testing, perform failure analysis (X-ray, microscopy, cross-sectioning), own DFMEA and physics-of-failure models to predict MTBF and warranty risk, and collaborate with electrical and mechanical teams and external labs to drive hardware improvements and regulatory compliance.

Top Skills: Cfd, Cross-Sectioning, Dfmea, Electronic Thermal Cycle Systems, Environmental Chambers, Ess, Fea, Halt, Hass, Ip67, Jmp, Microscopy, Minitab, Power Delivery Network (Pdn) Analysis, Reliasoft, Salt Fog Corrosion Testing, Uv Exposure Testing, Vibration/Shaker Tables, Weibull Life-Data Analysis, X-Ray Inspection

OutSystems

Senior Site Reliability Engineer

Reposted Yesterday

In-Office or Remote

Calgary, AB

Senior level

Software

The Senior Site Reliability Engineer will lead service onboarding, maintain SLAs/SLOs, design secure infrastructure, automate operational tasks, and respond to incidents while ensuring system reliability and performance.

Top Skills: AWS, CloudFormation, Elk Stack, Go, Grafana, Hadoop, Kubernetes, Python, Terraform

Cresta

Infrastructure Engineer/SRE

Reposted 3 Days Ago

Remote

Calgary, AB

Senior level

Artificial Intelligence • Other • Sales • Software

Design and advance core infrastructure for engineering, ensure Kubernetes reliability, automate operations, and support AI infrastructure.

Top Skills: AWS, Azure, Ci/Cd, CloudFormation, Gitops, Go, GCP, Helm, Kubernetes, Kustomize, Postgres, Python, Terraform

WEX Inc.

Senior Staff Site Reliability Engineer

Reposted 3 Days Ago

In-Office or Remote

Calgary, AB

Senior level

Fintech • Payments

The Senior Staff SRE leads reliability engineering initiatives, drives operational excellence, mentors staff, and influences architecture to enhance system reliability and performance.

Top Skills: Ai/Ml, AWS, Azure, Docker, Elk Stack, GCP, Grafana, Kubernetes, MySQL, NoSQL, Postgres, Splunk

Kong

Senior Site Reliability Engineer, Kong Konnect

Reposted 5 Days Ago

In-Office or Remote

Calgary, AB

Senior level

Artificial Intelligence • Cloud • Information Technology • Software • Big Data Analytics

The role involves operating and scaling Kong's SaaS platform, building automated infrastructure, optimizing multi-region data layers, enhancing observability, and ensuring reliability across services.

Top Skills: Argocd, AWS, Azure, Bash, Clickhouse, Datadog, Druid, GCP, Go, Grafana, Helm, Kubernetes, Postgres, Prometheus, Python, Redis, Terraform, Terragrunt, Thanos

ClickHouse

Senior Site Reliability Engineer - Remote

Reposted 5 Days Ago

Remote

Calgary, AB

Senior level

Database • Analytics

The Senior Site Reliability Engineer will ensure reliability and scalability of cloud infrastructure, enhance incident management, and optimize operational efficiencies through collaboration with various teams.

Top Skills: Ansible, AWS, Azure, Docker Swarm, Go, Google Cloud Platform, Kubernetes, Puppet, Python, Terraform

Arena (arena.ai)

Site Reliability Engineer

7 Days Ago

Remote or Hybrid

Calgary, AB

Senior level

Artificial Intelligence • Information Technology • Software

Build and operate the core infrastructure for Arena's online evaluation systems: design low-latency, high-reliability APIs and gateways, implement enterprise-grade features (rate limiting, auth, metering, audit logging), instrument observability (tracing, latency, usage tracking), integrate with LLM providers and the evaluation platform, and collaborate with research and product teams to scale and harden systems for bursty, unpredictable traffic.

Top Skills: Anthropic Api, AWS, Distributed Tracing, GCP, Go, Google Llm Apis, Kubernetes, Openai Api, Postgres, Redis, Rust, Terraform

Waabi

Senior / Staff Software Engineer (Observability / SRE)

Reposted 7 Days Ago

Remote or Hybrid

Calgary, AB

Senior level

Transportation

Design and develop Waabi's observability stack, optimize performance, build automation tooling, and support application requirements while leading projects and mentoring teams.

Top Skills: AWS, C/C++, Docker, Go, Grafana, Java, Kubernetes, Opentelemetry, Python, Rust

Hadrian

Site Reliability Engineer, Robotics

Reposted 7 Days Ago

In-Office or Remote

Calgary, AB

Mid level

Aerospace • Hardware • Software • Defense • Manufacturing

As a Site Reliability Engineer, you'll ensure robotics system reliability, build telemetry integration, and develop tools for diagnostics and automation, collaborating with engineering teams for enhanced production reliability.

Top Skills: C++, Datadog, Go, Kubernetes, Opentelemetry, Prometheus, Python, Ros2, Telegraf, Typescript

WorkOS

Site Reliability Engineer

Reposted 7 Days Ago

Remote

Calgary, AB

Mid level

Software

As a Site Reliability Engineer, you'll enhance system reliability, collaborate on production readiness, define SLIs/SLOs, and improve incident response.

Top Skills: AWS, Datadog, Grafana, Kubernetes, Opentelemetry, Prometheus, Typescript