Quantiphi

Senior Platform Engineer

Posted Yesterday

In-Office or Remote

Hiring Remotely in Toronto, ON

Senior level

In-Office or Remote

Hiring Remotely in Toronto, ON

Senior level

The Senior Platform Engineer will design and optimize infrastructure for GenAI and LLM workloads, manage GPU resources, and collaborate on deploying AI solutions.

The summary above was generated by AI

While technology is the heart of our business, a global and diverse culture is the heart of our success. We love our people and we take pride in catering them to a culture built on transparency, diversity, integrity, learning and growth.
If working in an environment that encourages you to innovate and excel, not just in professional but personal life, interests you- you would enjoy your career with Quantiphi!

About Quantiphi:

Quantiphi is an award-winning, AI-First digital engineering and consulting company focused on delivering high-impact Services and Solutions that help organizations solve what truly matters. We partner with enterprises to reimagine their businesses through intelligent, scalable, and transformative AI driving measurable outcomes at the very core of their operations.

Since our founding in 2013, Quantiphi has tackled some of the world’s most complex business challenges by combining deep industry expertise, disciplined cloud and data engineering practices, and cutting-edge applied AI research. Our work is rooted in delivering accelerated, quantifiable business value, not just technology for technology’s sake.

Headquartered in Boston, Quantiphi is a global organization with 4,000+ professionals serving clients across key industry verticals, including BFSI, Healthcare & Life Sciences, CPG, MFG, TME etc. As an Elite and Premier partner to leading cloud and AI platforms such as NVIDIA, Google Cloud, AWS, and Snowflake, we build and deliver enterprise-grade AI services and solutions that create real-world impact.

We’ve been recognized with:

17x Google Cloud Partner of the Year awards in the last 8 years.
3x AWS AI/ML award wins.
3x NVIDIA Partner of the Year titles.
2x Snowflake Partner of the Year awards.
We have also garnered top analyst recognitions from Gartner, ISG, and Everest Group.
We offer first-in-class industry solutions across Healthcare, Financial Services, Consumer Goods, Manufacturing, and more, powered by cutting-edge Generative AI and Agentic AI accelerators.
We have been certified as a Great Place to Work for the third year in a row- 2021, 2022, 2023.

Be part of a trailblazing team that’s shaping the future of AI, ML, and cloud innovation.

Your next big opportunity starts here!

For more details, visit: Website or LinkedIn Page.

Role: Senior Platform Engineer

Experience Level: 8+ yrs

Work Location: US East/Canada (Remote)

Role Overview:

We are looking for a highly skilled Senior Platform Engineer to design, optimize, and scale infrastructure for GenAI and LLM workloads. This role is ideal for someone with deep hands-on experience in GPU profiling, distributed training, and high-performance compute environments.

You’ll play a key role in building out GenAI platform foundations, supporting production-grade deployments, and partnering closely with data science, MLOps, and application teams to bring cutting-edge AI solutions to life.

Key Responsibilities:

Design and implement scalable infrastructure for LLM and GenAI workloads across multi-GPU environments
Perform GPU profiling, benchmarking, and performance optimization for distributed training workloads
Manage and schedule compute-intensive jobs using Slurm-based clusters and OpenShift/Kubernetes environments
Enable and optimize the NVIDIA GPU stack (CUDA, cuDNN, NCCL, Triton, RAPIDS, etc.)
Collaborate with cross-functional teams to deploy models in research and production environments
Build and support GenAI pipelines (fine-tuning, RAG, multi-modal inferencing, LLMOps)
Develop reusable infrastructure templates using tools like Terraform and Helm
Contribute to internal innovation (PoCs, workshops) and support client-facing delivery engagements

Basic Qualifications:

Strong experience with Slurm and distributed training environments
Hands-on expertise with Red Hat OpenShift and/or Kubernetes
Deep knowledge of the NVIDIA GPU ecosystem (CUDA, cuDNN, NCCL, Nsight, Triton/TensorRT)
Strong foundation in Linux systems, performance tuning, and multi-GPU optimization
Experience deploying GenAI workloads (LLM fine-tuning, RAG pipelines, multi-modal systems)
Familiarity with Infrastructure-as-Code tools (Terraform, Ansible)
Experience with cloud GPU environments (GCP, Azure, AWS, OCI) and/or on-prem GPU clusters

Other Qualifications (OQs):

Experience with NVIDIA NIMs, DGX systems, or GPU-accelerated containers
Knowledge of LLMOps frameworks and MLOps integration
Familiarity with vector databases and retrieval systems for RAG architectures
Comfortable working in client-facing environments and collaborating with AI solution teams

Healthcare Domain Experience (Nice to Have):

Experience working with FHIR R4, HL7 v2, or SMART on FHIR
Integration with EHR systems (e.g., Epic)
Understanding of HIPAA compliance and healthcare data privacy
Exposure to clinical workflows, CDS Hooks, or patient-facing applications
Experience building clinical decision support systems or healthcare interoperability solutions

What’s in it for YOU at Quantiphi:

Make an impact at one of the world’s fastest-growing AI-first digital engineering companies.
Upskill and discover your potential as you solve complex challenges in cutting-edge areas of technology alongside passionate, talented colleagues.
Work where innovation happens - work with disruptive innovators in a research-focused organization with 60+ patents filed across various disciplines.
Stay ahead of the curve immerse yourself in breakthrough AI, ML, data, and cloud technologies and gain exposure working with Fortune 500 companies.

If you like wild growth and working with happy, enthusiastic over-achievers, you'll enjoy your career with us!

Top Skills

Ansible

AWS

Azure

Cudnn

GCP

Gpu

Kubernetes

Linux

Nccl

Nvidia Cuda

Oci

Rapids

Red Hat Openshift

Slurm

Terraform

Triton

Similar Jobs

Coinbase

Senior Software Engineer

2 Hours Ago

Easy Apply

Remote

Canada

Easy Apply

Senior level

Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3

The Senior Software Engineer will manage data systems, develop scalable pipelines, ensure data security, and build self-service applications for users at Coinbase.

Top Skills: AirflowGoJavaKafkaPythonSparkSQL

Zapier

Senior Full-stack Engineer

22 Days Ago

Remote

Canada

Senior level

Artificial Intelligence • Productivity • Software • Automation

The Senior Fullstack Engineer will build and manage enterprise authentication systems, improve user session management, ensure security compliance, and support internal API usage across services, while enhancing performance and reliability.

Top Skills: Ci/CdDjangoFastifyKubernetesNode.jsPythonTerraform

Cority

Senior Platform Engineer

18 Days Ago

Remote

Canada

Senior level

Healthtech • Software

As a Senior Platform Engineer at Cority, you'll own platform services, design CI/CD pipelines, manage cloud infrastructure, and provide operational insights. You'll drive platform strategy and communicate technical trade-offs effectively.

Top Skills: ArgocdAWSBackstageGCPGitlab Ci/CdKubernetesKustomizeLaunchdarklyOpentelemetryPulumiPythonSonarqube

What you need to know about the Calgary Tech Scene

Employees can spend up to one-third of their life at work, so choosing the right company is crucial, not just for the job itself but for the company culture as well. While startups often offer dynamic culture and growth opportunities, large corporations provide benefits like career development and networking, especially appealing to recent graduates. Fortunately, Calgary stands out as a hub for both, recognized as one of Startup Genome's Top 100 Emerging Ecosystems, while also playing host to a number of multinational enterprises. In Calgary, job seekers can find a wide range of opportunities.