Blackpoint Cyber Logo

Blackpoint Cyber

Site Reliability Engineer

Job Posted 10 Days Ago Posted 10 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in Canada
Mid level
Remote
Hiring Remotely in Canada
Mid level
Responsible for designing and maintaining infrastructure and CI/CD pipelines, ensuring reliability, scalability, and performance while collaborating with teams.
The summary above was generated by AI

Blackpoint Cyber is the leading provider of world-class cybersecurity threat hunting, detection and remediation technology. Founded by former National Security Agency (NSA) cyber operations experts who applied their learnings to bring national security-grade technology solutions to commercial customers around the world, Blackpoint Cyber is in hyper-growth mode,  fueled by a recent $190m series C round. 

Job Overview:

We are seeking an experienced SRE Engineer to join our dynamic team. As a SRE Engineer, you will be responsible for designing, implementing, and maintaining our infrastructure and CI/CD pipelines, with a focus on automation, scalability, and performance. You will collaborate with cross-functional teams to ensure the reliability and efficiency of our systems while fostering a culture of continuous improvement.

Key Responsibilities:

  • Design, build, and maintain highly scalable infrastructure using Terraform and Terragrunt to automate cloud resource provisioning

  • Manage cloud environments, particularly in AWS, ensuring cost optimization, security, and high availability

  • Work with Confluent Cloud and Kafka to manage and scale our data streaming platforms

  • Deploy and manage REDIS instances for caching and real-time data processing

  • Implement and maintain monitoring and alerting solutions using Prometheus, Grafana, Alert Manager, and OpsGenie to ensure system reliability

  • Enable feature flag management and controlled rollouts using LaunchDarkly

  • Manage Kubernetes clusters using Kubernetes, Helm, ArgoCD, Istio, and Kustomize for continuous delivery and infrastructure-as-code practices

  • Collaborate with development teams to ensure seamless integration of new services and features into our infrastructure

  • Troubleshoot and resolve complex system issues, ensuring high performance and uptime

  • Continuously improve automation tools, processes, and methodologies to enhance system scalability and maintainability

  • Stay up-to-date with emerging SRE trends and technologies, ensuring the organization leverages the latest advancements

Skills & Qualifications:

  • 4+ years proven experience as a SRE Engineer or in a similar role with a strong focus on cloud infrastructure and automation

  • Expertise in Infrastructure as Code (IaC) using Terraform and Terragrunt

  • Deep knowledge of AWS cloud services and best practices for designing secure and scalable architectures.

  • Hands-on experience with Confluent Cloud and Kafka for distributed data streaming

  • Strong experience with REDIS for caching and RDS data storage

  • Strong Experience with OpenSearch/ElasticSearch/ ChaosSearch

  • Proficiency in monitoring and alerting using Prometheus, Grafana, Alert Manager, and OpsGenie

  • Experience with LaunchDarkly for feature flag management

  • Extensive experience managing Kubernetes clusters, including package management with Helm, deployment with ArgoCD, and service mesh configurations using Istio

  • Familiarity with Kustomize for Kubernetes resource configuration

  • Excellent problem-solving skills with the ability to troubleshoot complex systems in production

  • Strong communication and collaboration skills, with experience working in agile environments

Nice to Have:

  • Experience with multi-cloud environments (e.g., GCP, Azure).

  • Familiarity with security best practices in cloud and containerized environments

  • Knowledge of serverless architectures and CI/CD tools such as Jenkins and Github Actions

  • Some development experience in NodeJS/Python/GoLang

Blackpoint Cyber welcomes and encourages applications from qualified individuals of all races,  colors, religions, sex, sexual orientation, gender identity or expression, national origin, age, marital  status, or any other legally protected status. We are committed to equality of opportunity in all  aspects of employment.  For eligible employees in the US, Blackpoint offers competitive Health, Vision, Dental, and Life Insurance plans, a robust 401k plan, Discretionary Time Off, and other minor perks.

Top Skills

Alert Manager
Argocd
AWS
Confluent Cloud
Go
Grafana
Helm
Istio
Kafka
Kubernetes
Kustomize
Launchdarkly
Node.js
Opsgenie
Prometheus
Python
Redis
Terraform
Terragrunt

Similar Jobs

10 Days Ago
Easy Apply
Remote
Canada
Easy Apply
Mid level
Mid level
Cloud • Security • Software • Cybersecurity • Automation
As a Site Reliability Engineer, you will build and maintain observability tools, develop systems for monitoring and alerting, and collaborate with engineering teams to enhance reliability and scalability at GitLab.
Top Skills: AnsibleAWSElk StackGCPGitlabGrafanaInfrastructure As CodeKubernetesPrometheusTerraform
3 Days Ago
Remote
Hybrid
Canada
Senior level
Senior level
Digital Media • Fintech • Information Technology • Mobile • Payments • Software • Financial Services
The Site Reliability Engineer will develop compliant, scalable infrastructure, focusing on IAC with Typescript, CI/CD pipelines, and collaboration with architecture teams.
Top Skills: AWSAzure-Devops)C#CdkCi/Cd (GitlabJavaJenkinsKubernetesPythonTeamcityTerraformTypescript
Senior level
Gaming • Software
The Site Reliability Engineer will enhance game delivery systems, ensure scalability and reliability, collaborate with teams on best practices, and maintain operational health while participating in on-call support.
Top Skills: AnsibleBuildkiteC++Ci/CdCloudFormationGitGitlabGitopsGrafanaHelix CoreJavaScriptKotlinLokiPerforcePrometheusPythonTerraform

What you need to know about the Calgary Tech Scene

Employees can spend up to one-third of their life at work, so choosing the right company is crucial, not just for the job itself but for the company culture as well. While startups often offer dynamic culture and growth opportunities, large corporations provide benefits like career development and networking, especially appealing to recent graduates. Fortunately, Calgary stands out as a hub for both, recognized as one of Startup Genome's Top 100 Emerging Ecosystems, while also playing host to a number of multinational enterprises. In Calgary, job seekers can find a wide range of opportunities.
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account