Autodesk Logo

Autodesk

Senior Site Reliability Engineer

Reposted 2 Days Ago
Be an Early Applicant
In-Office
Toronto, ON
Senior level
In-Office
Toronto, ON
Senior level
The Senior Site Reliability Engineer manages AWS infrastructure, ensuring reliability and performance. Responsibilities include architecture, cloud automation, CI/CD processes, and operational support.
The summary above was generated by AI

Job Requisition ID #

25WD92369

Position Overview

We are seeking a highly motivated and experienced Senior Site Reliability Engineer (SRE) to manage critical cloud

infrastructure and site reliability operations for Autodesk's global Product Access journey. This pivotal role focuses on ensuring

the highest reliability, availability, and performance of our AWS-hosted cloud infrastructure. Reporting to the Engineering

Manager, you will be leading design and development of resilient and scalable architecture and innovative solutions for the

platform. You will independently manage and deliver end-to-end solutions while engaging with key stakeholders and partners.

Responsibilities

  • Lead architecture, solution design, development and maintenance of cloud infrastructure for microservices architecture

  • Independently manage requirement analysis, solution design, implementation, and release planning

  • Ensure high adherence to trust and security compliance, guidelines and standards

  • Streamline CI/CD processes, improve system reliability, and ensure infrastructure scalability and security

  • Automate infrastructure deployment, scaling, and management using modern DevOps tools and practices

  • Implement and maintain configuration management and infrastructure as code (IaC) using Terraform

  • Lead Disaster Recovery (DR) strategies, failover exercises, gamedays, and period maintenance activities

  • Contribute to critical vulnerability (CVEs) remediation efforts

  • Promote and document security and best practices across all pillars of DevOps/SRE throughout system design

  • Provide real-time operational support and collaborate across functions to resolve system, infrastructure, and CI/CD issues

  • Participate in on-call rotations, providing critical 24x7 support for production systems

Minimum Qualifications

  • Bachelor’s degree or higher in Computer Science, Engineering, or a related field

  • 5+ years of progressive experience in Site Reliability Engineering, DevOps, or a similar field

  • Proficiency with managing AWS resources and understanding of networking and security protocols

  • Expertise in infrastructure as code (IaC) and cloud automation tools such as Terraform, Serverless, and CloudFormation

  • Expertise in defining and building CI/CD processes with tools like Jenkins, GitHub, and Artifactory

  • Experience with container-based technologies like Docker and AWS ECS

  • Experience with monitoring and logging tools such as Dynatrace, Grafana, DataDog, ELK Stack, and CloudWatch

  • Experience in Linux Systems Administration, scripting, and troubleshooting in a production environment

  • Proficiency in programming languages such as UNIX, Python, Go, Bash, Groovy, and Node.js

  • Technology Stack: Java/SpringBoot, AWS (ECS Fargate, Elastic Cache, Lambda, Kinesis, DynamoDB, VPC, IAM policies, API Gateway, NLB/ALB, Route 53, CloudWatch, Kibana, Open Search), Kafka, GoLang, Node.js, Groovy, Python, Jenkins, GitHub, Jira, ServiceNow, and Splunk.

Preferred Qualifications

  • Knowledge in applying AI and ML solutions for engineering processes and/or DevOps automation

  • Knowledge of standardized observability frameworks such as OpenTelemetry

  • Relevant certifications (e.g., AWS Certified DevOps Engineer, AWS Site Reliability Engineer)

  • Broad knowledge of AWS, Redis, server programming, databases, and cloud architectures

  • Broad knowledge with data streaming pipelines like Kinesis, Firehose, and Kafka

  • Knowledge on core Java and SpringBoot concepts in JVM optimization

  • Knowledge on build tools, e.g. Gradle

  • Strong interpersonal and communication skills to effectively collaborate in an Agile/Scrum-oriented environment

  • Self-directed team player and independent contributor, demonstrating accountability and end-to-end ownership

#LI-AD1

Learn More

About Autodesk

Welcome to Autodesk! Amazing things are created every day with our software – from the greenest buildings and cleanest cars to the smartest factories and biggest hit movies. We help innovators turn their ideas into reality, transforming not only how things are made, but what can be made.

We take great pride in our culture here at Autodesk – it’s at the core of everything we do. Our culture guides the way we work and treat each other, informs how we connect with customers and partners, and defines how we show up in the world.

When you’re an Autodesker, you can do meaningful work that helps build a better world designed and made for all. Ready to shape the world and your future? Join us!

Salary transparency

Salary is one part of Autodesk’s competitive compensation package. Offers are based on the candidate’s experience and geographic location. In addition to base salaries, our compensation package may include annual cash bonuses, commissions for sales roles, stock grants, and a comprehensive benefits package.

Diversity & Belonging
We take pride in cultivating a culture of belonging where everyone can thrive. Learn more here: https://www.autodesk.com/company/diversity-and-belonging

Are you an existing contractor or consultant with Autodesk?

Please search for open jobs and apply internally (not on this external site).

Top Skills

AWS
Bash
CloudFormation
Datadog
Docker
Elk Stack
Git
Go
Grafana
Groovy
Java
Jenkins
Kafka
Node.js
Python
Servicenow
Splunk
Spring Boot
Terraform
Unix

Autodesk Calgary, Alberta, CAN Office

107-6227 2 St SE, Calgary, AB, Canada, T2H 1J5

Similar Jobs

2 Days Ago
Hybrid
Toronto, ON, CAN
Mid level
Mid level
Enterprise Web • Fintech • Financial Services
The Senior Site Reliability Engineer will enhance system reliability, lead automation projects, and optimize cloud solutions in a collaborative environment.
Top Skills: Ci/CdCloud-Based SolutionsCloudFormationContainersDevOpsDistributed ApplicationsDockerInfrastructure As CodeMicroservicesPlsqlServerless TechnologySQLTerraform
16 Days Ago
In-Office or Remote
Toronto, ON, CAN
Senior level
Senior level
Insurance
The Senior Site Reliability Engineer at Zensurance will focus on enhancing production systems' reliability, scalability, and performance through automation, best practices, and incident management, while mentoring junior engineers.
Top Skills: AWSDatadogElk StackGithub ActionsGrafanaKubernetesPrometheusSplunkTerraformTypescript
5 Days Ago
In-Office
2 Locations
Senior level
Senior level
Artificial Intelligence • Cloud • Information Technology • Software • Big Data Analytics
The role involves operating and scaling Kong's SaaS platform, building automated infrastructure, optimizing multi-region data layers, enhancing observability, and ensuring reliability across services.
Top Skills: ArgocdAWSAzureBashClickhouseDatadogDruidGCPGoGrafanaHelmKubernetesPostgresPrometheusPythonRedisTerraformTerragruntThanos

What you need to know about the Calgary Tech Scene

Employees can spend up to one-third of their life at work, so choosing the right company is crucial, not just for the job itself but for the company culture as well. While startups often offer dynamic culture and growth opportunities, large corporations provide benefits like career development and networking, especially appealing to recent graduates. Fortunately, Calgary stands out as a hub for both, recognized as one of Startup Genome's Top 100 Emerging Ecosystems, while also playing host to a number of multinational enterprises. In Calgary, job seekers can find a wide range of opportunities.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account