Menlo Security Inc. Logo

Menlo Security Inc.

Platform Infrastructure Engineer (SRE Core)

Posted 15 Days Ago
Be an Early Applicant
In-Office or Remote
Hiring Remotely in Ameer, Minto-Odanah, MB
Mid level
In-Office or Remote
Hiring Remotely in Ameer, Minto-Odanah, MB
Mid level
Build, operate, and automate Menlo Security's multi-region cloud-native infrastructure across GCP and AWS. Manage Kubernetes clusters (GKE), VMs, Terraform IaC, observability with Grafana/Prometheus/OTel, certificate/DNS automation, ingress/service mesh (Cilium), and CI/CD. Partner with product, security, and engineering, eliminate operational toil, and participate in 24x7 on-call incident response and post-incident reviews.
The summary above was generated by AI

Menlo Security's mission is enabling the world to connect, communicate and collaborate securely without compromise. COVID-19 has made our mission all the more real. We support customers across various enterprises including Fortune 500 companies, 9/10 of the largest global banks and the Department of Defense.

The world has fundamentally changed. We are growing from 400 employees into the next phase of our journey, and we need passionate talent filled with empathy and agility. The right candidate for the job is ethical, hyper-organized, fanatical about seeing things through to completion, service-oriented, and humble enough to take feedback and coaching yet confident enough to provide feedback and coaching.

Menlo is well-funded for growth and our investors are second to none. They include Vista Equity Partners (“Vista”), General Catalyst, JPMC, American Express, HSBC, and Ericsson Ventures.

About the Role

Platform Infrastructure Engineering is responsible for building and operating Menlo Security's Infrastructure Platform. Together with the rest of our engineering teams, we enable our customers to connect to the Internet without compromise. Our environment provides services globally. We expect failure, build security in by design, create evolvable systems, and enable multi-tenancy across the infrastructure. Automation is an absolute for us.

We are committed to getting it done properly, the first time.

As a Platform Infrastructure Engineer, you'll join a group of experienced engineers who are part of a globally distributed team responsible for building and managing the company's core infrastructure services and maintaining our constantly growing platform. The team operates a sophisticated cloud-native infrastructure built on Google Kubernetes Engine and VMs spanning multiple environments globally from development to production. We manage infrastructure as code with Terraform and Spacelift orchestration, and deploy services using Helm charts. Our platform emphasizes security-first design, comprehensive observability, and multi-region resilience. Success in this role requires working with a vast VM fleet in AWS and GCP as well as Kubernetes, writing Infrastructure as Code, and a passion for automation and reliability engineering.

Responsibilities

  • Design, deploy, and maintain VM and Kubernetes infrastructure on GCP and AWS across dozens of clusters spanning development, staging, and production environments in multiple regions.

  • Coordinate with your peers in your direct team as well as across teams to ensure that the tasks you’re working on are going to solve the problems that we need them to solve.

  • Build and maintain Infrastructure as Code (IaC) using Terraform modules, managing resources through Spacelift or equivalent Terraform Automation and Collaboration Software (TACOS). Provision cloud infrastructure including networking, compute, storage, and security components primarily on GCP, with secondary AWS support.

  • Implement and manage workflows with sophisticated multi-layer configuration management.

  • Build and maintain comprehensive observability solutions using Grafana Cloud, Prometheus/Mimir, and OTel collectors. Design Grafana dashboards, configure alerting rules, and ensure visibility across all platform components.

  • Manage certificate lifecycle, DNS automation, ingress controllers, and service mesh networking with Cilium.

  • Partner with Engineering, Product, Compliance, and Security teams to design resilient, scalable systems. Consult on capacity planning, disaster recovery, and architectural decisions for cloud-native applications.

  • Identify and eliminate toil through automation. Write scripts, develop tools, and build CI/CD pipelines to improve operational efficiency and reduce manual work.

  • Participate in a 24x7 on-call rotation as part of a globally distributed team, responding to incidents and driving post-incident reviews.

Requirements

  • Bachelor's degree in Computer Science, similar technical field of study, or equivalent practical experience.

  • Proficiency in common programming & scripting languages. We use a lot of python, bash and go.

  • Understanding of network topologies, communication protocols (ie. TCP/IP, HTTP/S, UDP, TLS) and enterprise grade connectivity solutions.

  • Kubernetes expertise including cluster administration, RBAC, networking, workload management, and troubleshooting across production environments.

  • Proven experience with Terraform for infrastructure provisioning and management.

  • Knowledge of Google Cloud Platform services including GKE, VPC networking, Cloud DNS, Artifact Registry, Secret Manager, IAM, Gemini Code Assist, and Workload Identity.

  • Experience with GitOps methodologies and tools.

  • Clear understanding of how to use LLM code assist tools to effectively build software.

Why Menlo?

Our culture is collaborative, inclusive, and fun! We have five core values: Stay Aligned, Get It Done, Customer Empathy, Think Creatively and Help Each Other Out. We believe in open communication, supporting new ideas, and sharing a mutual mindset of what we’re aiming to achieve together. There are tremendous opportunities to take initiative, implement new ideas, and have a hand in building a legacy.

All qualified applicants will receive consideration for employment without regard to race, sex, color, religion, sexual orientation, gender identity, national origin, protected veteran status, or on the basis of disability.

TO ALL AGENCIES: Please, no phone calls or emails to any employee of Menlo Security outside of the Talent organization. Menlo Security’s policy is to only accept resumes from agencies via Ashby (ATS). Agencies must have a valid services agreement executed and must have been assigned by the Talent team to a specific requisition. Any resume submitted outside of this process will be deemed the sole property of Menlo Security. In the event a candidate submitted outside of this policy is hired, no fee or payment will be paid.

Top Skills

Python,Bash,Go,Kubernetes,Google Kubernetes Engine (Gke),Terraform,Spacelift,Helm,Grafana Cloud,Prometheus,Mimir,Opentelemetry (Otel) Collectors,Cilium,Service Mesh,Cloud Dns,Ingress Controllers,Aws,Gcp,Vpc,Artifact Registry,Secret Manager,Iam,Workload Identity,Gemini Code Assist,Gitops,Ci/Cd,Tcp/Ip,Http/S,Udp,Tls

Similar Jobs

16 Hours Ago
Easy Apply
Remote or Hybrid
6 Locations
Easy Apply
Mid level
Mid level
Big Data • Cloud • Software • Database
Join the Networking & Observability team at MongoDB to improve distributed database communication and observability features using C++. Collaborate on projects from design to delivery, focusing on system performance and efficiency.
Top Skills: C++
17 Hours Ago
Easy Apply
Remote
Canada
Easy Apply
Senior level
Senior level
Big Data • Fintech • Mobile • Payments • Financial Services
The Process Excellence Manager enhances business processes, coordinates projects, and ensures operational readiness while collaborating across multiple teams to achieve strategic goals.
Top Skills: Design ThinkingLean Six SigmaProject Management
17 Hours Ago
Easy Apply
Remote
Canada
Easy Apply
Senior level
Senior level
Big Data • Fintech • Mobile • Payments • Financial Services
The Partner Growth Manager at Affirm will expand payment partnerships, execute growth strategies, negotiate agreements, and optimize performance to drive revenue and merchant acquisition.

What you need to know about the Calgary Tech Scene

Employees can spend up to one-third of their life at work, so choosing the right company is crucial, not just for the job itself but for the company culture as well. While startups often offer dynamic culture and growth opportunities, large corporations provide benefits like career development and networking, especially appealing to recent graduates. Fortunately, Calgary stands out as a hub for both, recognized as one of Startup Genome's Top 100 Emerging Ecosystems, while also playing host to a number of multinational enterprises. In Calgary, job seekers can find a wide range of opportunities.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account