Top Reliability Engineer Jobs in Calgary

Reposted 17 Hours Ago
In-Office or Remote
Calgary, AB
Senior level
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
The Senior Site Reliability Engineer will enhance reliability of Block's platform, improve incident response using AI tools, and coordinate incident management. Responsibilities include building reliable systems, standardizing tools, and leading high-severity incidents during on-call rotations.
Top Skills: Amazon Web Services, Datadog, DynamoDB, gRPC, HTTP, Istio, Java, JSON, Kotlin, Kubernetes, LaunchDarkly, MySQL, Protocol Buffers, Terraform, Vitess
Reposted 2 Days Ago
Easy Apply
Remote
Calgary, AB
Senior level
Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software
The Senior Site Reliability Engineer will manage system incidents, improve monitoring and logging, optimize database infrastructure, and collaborate on scaling systems efficiently.
Top Skills: AWS, ClickHouse, Kubernetes, MySQL, Postgres, Redis
Reposted 8 Days Ago
Easy Apply
In-Office or Remote
Calgary, AB
Mid level
Cloud • Security • Software • Cybersecurity • Automation
As an Intermediate Site Reliability Engineer in Environment Automation, you'll automate operations across many GitLab environments, maintain infrastructure reliability using Kubernetes, and enhance IT practices with Terraform and Ansible, while collaborating with senior engineers.
Top Skills: Ansible, Cloud Services, DevSecOps, GitLab, Go, Infrastructure as Code, Kubernetes, Terraform
Reposted 5 Days Ago
Remote or Hybrid
Calgary, AB
Senior level
Artificial Intelligence • Software
The Lead Infrastructure and Reliability Engineer will enhance GPU operations, define scalability strategies, and develop organizational strengths in a high-demand AI infrastructure setting.
Top Skills: Containers, Distributed Systems, GPU, Kubernetes, Linux, Networking, Orchestration, Storage
Reposted 6 Days Ago
Remote
Calgary, AB
Junior
Insurance
As a Reliability Engineer, you'll design, implement, and maintain AWS cloud environments, ensuring systems' reliability and performance, while enhancing monitoring and incident response capabilities.
Top Skills: AWS, Nginx, Python, Unix, Windows
Reposted 7 Days Ago
In-Office or Remote
Calgary, AB
Senior level
Artificial Intelligence • Software • Generative AI
The Founding Platform & Reliability Engineer will design and operate reliable, scalable infrastructure for an AI storytelling platform, involving hands-on implementation and strategic decision-making.
Top Skills: Amplitude, AWS, Cloud Run, Firebase, GCP, Modal, Next.js, Node.js, Python, React, Redis, Sentry, TypeScript, Upstash
Reposted 15 Days Ago
Remote
Calgary, AB
Senior level
Software
Own reliability, performance, and scalability of PostgreSQL infrastructure. Implement HA, replication, observability, capacity planning, automation, and DR. Support engineering teams with migrations, query optimization, on-call incident response, runbooks, and tooling to enable safe DB operations.
Top Skills: Ansible, Aurora, AWS RDS, Chef, Datadog, DynamoDB, ElastiCache, Go, Grafana, Indexing, MVCC, Patroni, PgBouncer, Postgres, Prometheus, Python, Query Planner, Replication, Ruby, SQL, Terraform, Vacuum Tuning, WAL
3 Days Ago
In-Office
Calgary, AB
Expert/Leader
Utilities
The role involves leading the development and implementation of Alberta Reliability Standards, coordinating stakeholder engagement, and managing timelines. Requires extensive experience in electrical engineering and strong leadership skills.
Top Skills: High Voltage Facilities, Project Management Tools, Protection & Control Systems, Transmission System Operations
Reposted 21 Days Ago
In-Office or Remote
Calgary, AB
Senior level
Software
The HW/SW Reliability Engineer ensures product reliability through detailed reviews of complex designs, FMEA analysis, modeling, and collaboration with design teams.
Top Skills: FMEA, IP Technology, Markov Chains, Microsoft Office Tools, Reliability Modeling, SQL, Telecom Protocols
9 Days Ago
Remote or Hybrid
Calgary, AB
Senior level
eCommerce • Payments • Software
The role focuses on ensuring the reliability, scalability, and operational maturity of production MySQL databases. Responsibilities include managing database operations, improving automation, and collaborating with engineering teams to enhance performance and troubleshoot issues.
Top Skills: Ansible, Bash, CI/CD, Docker, GCP, Go, JavaScript, Kubernetes, MySQL, Python, Terraform
Reposted 16 Hours Ago
In-Office or Remote
Calgary, AB
Senior level
Software
The Senior Site Reliability Engineer will lead service onboarding, maintain SLAs/SLOs, design secure infrastructure, automate operational tasks, and respond to incidents while ensuring system reliability and performance.
Top Skills: AWS, CloudFormation, ELK Stack, Go, Grafana, Hadoop, Kubernetes, Python, Terraform
Reposted 12 Days Ago
In-Office
Calgary, AB
Senior level
Artificial Intelligence • Fintech • HR Tech • Social Impact • Software • Analytics
The Senior Site Reliability Engineer will enhance application reliability and availability, manage Kubernetes applications, and collaborate with developers in a cloud-based environment.
Top Skills: Datadog, GCP, Go, Helm, Kubernetes, Python, Terraform
Reposted 5 Days Ago
In-Office or Remote
Calgary, AB
Mid level
Artificial Intelligence • Cloud • Information Technology • Software
Contribute to the reliability and performance of Mithril's GPU orchestration platform through automation, observability, and infrastructure management. Collaborate with the team to ensure scalability across multi-cloud environments while maintaining systems stability and implementing SLOs.
Top Skills: AWS, Azure, GCP, Go, Grafana, Kubernetes, Linux, OpenTelemetry, Prometheus, Pulumi, Python, TCP/IP, Terraform
Reposted 5 Days Ago
Remote or Hybrid
Calgary, AB
Mid level
Artificial Intelligence • Software
The Data Reliability Engineer will enhance the resilience and scalability of data infrastructure, focusing on automation and reliability. Responsibilities include managing data pipelines, operating Kubernetes clusters, and defining observability standards.
Top Skills: Grafana, Kubernetes, Prometheus, Python, Ray, Terraform
Reposted 5 Days Ago
In-Office or Remote
Calgary, AB
Expert/Leader
Artificial Intelligence • Software
As a Software Engineer in Reliability, you'll architect and manage multi-cloud GPU infrastructure, ensuring performance, security, and scale while debugging complex hardware/software issues.
Top Skills: AMD, AWS, Bash, Go, GPU, InfiniBand, Linux, NVIDIA, OCI, Python, RDMA
Reposted 6 Days Ago
Remote
Calgary, AB
Senior level
Marketing Tech
The Senior Site Reliability Engineer ensures system reliability, performance, and scalability while automating infrastructure and processes. Responsibilities include incident response, monitoring, team collaboration, and continuous improvement in service stability.
Top Skills: Ansible, AWS, Azure, Bash, CI/CD, IaC, Python, Snowflake, Terraform, Terragrunt
Reposted 9 Days Ago
Remote
Calgary, AB
Senior level
Cloud • Software
Operate, maintain and improve the global Tyk Cloud platform: run production Kubernetes clusters, manage cloud infrastructure, automate operations, run on-call incident response, create monitoring and dashboards, conduct post-incident analysis, document SRE processes, and drive reliability, efficiency and multi-region/multi-cloud expansion.
Top Skills: AWS, Azure, Containers, DNS, EKS, GCP, Go, Grafana, Helm, HTTP, Infrastructure as Code (IaC), Kubernetes, Linux, Logging Collection and Analysis Systems, MongoDB, Prometheus, Python, Rancher, Redis, TCP/IP, Terraform, Thanos, TLS, UDP
Reposted 9 Days Ago
Remote
Calgary, AB
Senior level
Artificial Intelligence • Cloud • Social Impact • Software • Wearables
Senior SRE focused on building cloud-native platforms, testable automation, and reliability tooling. Partner with Identity and Security to strengthen authentication/authorization, Okta integrations, and compliance. Design tests, write maintainable code (Go/Python), and improve observability and operational practices.
Top Skills: AKS, APM, AWS, Azure, C#, CI/CD, EKS, Go, IaC, Infrastructure as Code, Java, Kubernetes, Logging, Metrics, Observability Tools, OIDC, Okta, Python, SAML, Secrets Management, Tracing
11 Days Ago
Remote
Calgary, AB
Senior level
Artificial Intelligence • Fintech • Software • Financial Services
The SRE will own reliability for a cloud-native platform, optimizing performance, availability, and observability, while mentoring engineering teams.
Top Skills: AWS, ClickHouse, Go, Kafka, Kubernetes, Pulumi, Python, Terraform
Reposted 12 Days Ago
In-Office or Remote
Calgary, AB
Senior level
Information Technology • Security • Software
The Senior Site Reliability Engineer will manage and ensure uptime, performance, and reliability of cloud services while optimizing resource allocation and leading incident responses.
Top Skills: Ansible, AWS, Azure, Datadog, Jenkins, NetApp, Octopus Deploy, PowerShell, Prometheus, Splunk, Terraform, VMware
Reposted 13 Days Ago
Remote
Calgary, AB
Senior level
Insurance • Cybersecurity
Lead AI enablement at Coalition by developing standards for AI-native tools, driving tooling adoption, and mentoring engineering teams while ensuring reliable production environments.
Top Skills: AWS, Datadog, ECS, GitHub Actions, Go, Kubernetes, Python, Terraform
Reposted 17 Days Ago
In-Office or Remote
Calgary, AB
Senior level
Fintech • Payments
The Senior Staff SRE leads reliability engineering initiatives, drives operational excellence, mentors staff, and influences architecture to enhance system reliability and performance.
Top Skills: AI/ML, AWS, Azure, Docker, ELK Stack, GCP, Grafana, Kubernetes, MySQL, NoSQL, Postgres, Splunk
Reposted 18 Days Ago
Remote
Calgary, AB
Senior level
Artificial Intelligence • Cloud • Social Impact • Software • Wearables
The Site Reliability Engineer II role involves building and maintaining cloud-native services, ensuring high reliability, managing cloud platforms, and fostering collaboration within engineering teams.
Top Skills: APM, AWS, AWS CloudFormation, Azure, C#, CI/CD, Go, Java, Kubernetes, Python, Terraform
Reposted 19 Days Ago
Remote
Calgary, AB
Senior level
Artificial Intelligence • Other • Sales • Software
Design and advance core infrastructure for engineering, ensure Kubernetes reliability, automate operations, and support AI infrastructure.
Top Skills: AWS, Azure, CI/CD, CloudFormation, GitOps, Go, GCP, Helm, Kubernetes, Kustomize, Postgres, Python, Terraform
Reposted 20 Days Ago
In-Office or Remote
Calgary, AB
Senior level
Artificial Intelligence • Cloud • Information Technology • Software • Big Data Analytics
The role involves operating and scaling Kong's SaaS platform, building automated infrastructure, optimizing multi-region data layers, enhancing observability, and ensuring reliability across services.
Top Skills: Argo CD, AWS, Azure, Bash, ClickHouse, Datadog, Druid, GCP, Go, Grafana, Helm, Kubernetes, Postgres, Prometheus, Python, Redis, Terraform, Terragrunt, Thanos