Design and build core backend AI systems and components for a multi-cloud execution platform. Enhance support for AI and batch workloads, collaborate with users and the open-source community, and contribute public-facing content.
SkyPilot is building the future of multi-cloud AI infra. We are the Berkeley founding team commercializing SkyPilot (9.5K+ GitHub stars, 200+ contributors) to enable AI to run on different cloud infrastructures in a portable, cost-optimizing, and highly available way.
SkyPilot is deployed at hundreds of companies, including Fortune 500s and top AI-natives (Shopify, Redis, Abridge, Hippocratic, Applied Compute, etc.). In 2025, adoption grew >600%, and SkyPilot now launches more GPUs per month than the biggest neocloud’s fleet. Currently in stealth, SkyPilot was founded in 2024 by UC Berkeley PhDs and professors (incl. Databricks cofounders). We’re building a top-tier engineering team, with current talent from Databricks, Google, Crusoe, ByteDance, and PingCAP.
You will play a crucial role in shaping the future of Sky Computing and AI infrastructure:
- Design and build core AI systems that realize the Sky Computing vision, making SkyPilot the standard solution for multi-cloud, any-cloud execution for AI.
- Build enhancements and new components to evolve SkyPilot with better support for a wide range of AI and batch workloads.
- Engage with users: Work closely with our users and customers to make their use cases successful, grow our open-source community, and gain visibility for your work via public tutorials, blog posts, and/or talks.
Ideal Candidates
- Strong systems background: 3+ years of industry experience in backend engineering (YOE can be relaxed for exceptional candidates). Bonus: Designed and/or implemented impactful infra platforms & cloud/distributed systems.
- Experience with cloud infra technologies: e.g., gRPC, Protobuf, AWS EC2 / GCP GCE / Azure, object storage, cloud networking, Kubernetes, Terraform, load balancers.
- Experience with Python/Go, or other systems programming languages.
- Bonus: Familiarity with GenAI / DL / ML workloads or related infra frameworks (e.g., Kueue, KAI, KServe).
- Passion for building the future of AI infra and cloud computing.
What We Offer
- Competitive equity and compensation.
- Chance to work with some of the best minds in cloud, distributed, and AI systems.
- Front-row seat at the latest open-source infra startup from Berkeley (prev: Databricks, Anyscale).