Intact (intactfc.com) Logo

Intact (intactfc.com)

SRE Specialist

Posted 13 Days Ago
Be an Early Applicant
2 Locations
Senior level
2 Locations
Senior level
As an SRE Specialist at Intact Financial Corporation, you will implement and maintain reliability solutions for on-premises and cloud applications, collaborate with teams for application reliability onboarding, automate configurations, develop monitoring profiles, and establish protocols for proactive alerts. You're expected to guide IT teams and produce architecture documentation while staying updated with market trends.
The summary above was generated by AI

Our employees are at the heart of everything we do. Together, we help people, businesses, and society prosper in good times and be resilient in bad times.


Our employee promise represents Intact’s commitment to you in exchange for living our Values, striving to do your best work, being open to change and investing in your career. In return, we promise to provide support, opportunities and performance-led financial rewards at a workplace where you can shape the future, win as a team and grow with us.

About the role

Intact Financial Corporation (IFC) is seeking an experienced SRE specialist to join its SRE Practice team. As a key contributor to the reliability strategy at IFC, you will be responsible for the adoption, integration, and evolution of the Framework and platform, as well as the associated automation and service delivery. You will work collaboratively with various IT teams to enable and guide them in achieving site reliability.

What you'll do here: 

  • Implement and maintain a comprehensive reliability solution for on-premises and cloud applications and services.

  • Work collaboratively with application support, developers, and database teams for application reliability on-boarding.

  • Automate system instrumentalization and configuration.

  • Design and implement self-service models for reliability.

  • Develop reliability requirements working with various IT support and cloud ops teams.

  • Deploy application monitoring profiles that meet requirements.

  • Implement workflow and synthetic transaction monitoring and alerting.

  • Help define key application performance metrics for proactive response to alerts.

  • Establish protocols for proactive monitoring alerts.

  • Plan and support the installation and configuration of monitoring agents and other monitoring components for on-premises and cloud applications and services.

  • Collaborate with projects and application support teams across the enterprise to identify opportunities and provide inputs to address any reliability gaps.

  • Define, capture, analyze, and build reliability solutions as per system requirements.

  • Conduct incident reviews from a reliability perspective and address any gaps in reliability coverage.

  • Prepare environment dashboards and report environment health, performance and availability metrics.

  • Support the reliability platform and its underlying solutions.

  • Ensure leading awareness of market trends and opportunities on the subject matter.

  • Produce and review architecture documentation.

  • Define best practices framework and guidelines.

What you bring to the table: 

  • Solid expertise on the topic of IT reliability

  • Extensive experience with application performance management, IT infrastructure monitoring, and user experience monitoring.

  • Technical leadership experience.

  • Enterprise application, systems, and network monitoring expertise for on-premises and cloud applications.

  • Hands-on experience with Dynatrace, Elastic Search, and ServiceNow in instrumenting applications end-to-end with minimal supervision.

  • Solid knowledge of AI-OPS, anomaly detection, and event correlation solutions.

  • Comfortable with scripting or programming languages (Java, C++, GO, Python)

  • Experience with open telemetry.

  • Good knowledge of infrastructure protocols to gather element-level event data.

  • Good knowledge of open-source monitoring technologies.

  • Proficient with data lifecycles and aggregation, reporting, and web dashboards.

  • Proficient in ITIL event management and good basis in ITIL foundational concepts.

  • Hands-on experience with continuous integration tools.

  • Deep knowledge of reliability and Site Reliability Engineering (SRE).

  • Infrastructure and Networking: The candidate should be familiar with advanced networking tools like F5, Citrix, Cloudflare, etc. and be able to design custom hardware and software networking solutions.

  • Troubleshooting: The candidate should be proficient with advanced log analysis tools like Dynatrace and be able to develop and maintain automated testing and deployment tools.

  • Cloud Computing and Virtualization: The candidate should have hands-on experience with AWS, GCP, Azure, VirtualBox, Docker, Kubernetes and advanced cloud infrastructure tools like Terraform, Puppet, or Chef.

  • Distributed Systems and Scalability: The candidate should have knowledge of advanced distributed systems tools like Kubernetes and service meshes, and advanced distributed systems tools like Cassandra, Hadoop, or Spark.

  • Security and Compliance: The candidate should have knowledge of advanced security tools like HashiCorp Vault, AWS KMS, or Azure Key Vault and security best practices, firewalls, encryption, SSL/TLS.

  • Bilingual (French and English): Need to interact on a regular basis with an English-speaking clientele and colleagues across the country. 

  • No Canadian work experience required however must be eligible to work in Canada 

#LI-Hybrid 

What we offer
 

Our hybrid work model provides the balance between working from home and enjoying meaningful in-person interactions.

Working here means you'll be empowered to be and do your best every day. Here is some of what you can expect as a permanent member of our team:

  • A financial rewards program that recognizes your success

  • An industry leading Employee Share Purchase Plan; we match 50% of net shares purchased

  • An extensive flex pension and benefits package, with access to virtual healthcare

  • Flexible work arrangements

  • Possibility to purchase up to 5 extra days off per year

  • An annual wellness account that promotes an active and healthy lifestyle

  • Access to tools and resources to support physical and mental health, embracing change and connecting with colleagues

  • A dynamic workplace learning ecosystem complete with learning journeys, interactive online content, and inspiring programs

  • Inclusive employee-led networks to educate, inspire, amplify voices, build relationships and provide development opportunities

  • Inspiring leaders and colleagues who will lift you up and help you grow

  • A Community Impact program, because what you care about is a part of what makes you different. And how you contribute to your community should be just as unique.

We are an equal opportunity employer


At Intact, our Value of respect is founded on seeing diversity as a strength. We strive to create an accessible workplace where employees feel valued, included and encouraged to share their unique perspectives.


We encourage applications from individuals who are members of equity-deserving groups, including but not limited to women, Indigenous peoples, persons with disabilities, Black people, and members of the 2SLGBTQI+ community.


As part of Intact’s commitment to reconciliation, we acknowledge that we work, meet and travel across the land currently called Canada, originally inhabited by First Nations, Metis and Inuit people. This history extends through many centuries and continues to evolve today.


We have policies to ensure equal access and participation for people with disabilities, including providing workplace adjustments (accommodations). A copy of applicable policies is available on request.


If we can provide a specific adjustment to make the recruitment process more accessible for you, please let us know when we reach out about a job opportunity. We’ll work with you to meet your needs.


Learn more about our recruitment process and your candidate journey here.


If you are an employee of Intact or belairdirect, please apply for this role on Internal Career Site.

Top Skills

Ai-Ops
Anomaly Detection
Application Performance Management
AWS
Aws Kms
Azure
Azure Key Vault
C++
Cassandra
Chef
Continuous Integration Tools
Docker
Dynatrace
Elastic Search
Event Correlation
GCP
Go
Hadoop
Hashicorp Vault
It Infrastructure Monitoring
It Reliability
Java
Kubernetes
Open Telemetry
Open-Source Monitoring Technologies
Puppet
Python
Servicenow
Spark
Ssl/Tls
Terraform
User Experience Monitoring
Virtualbox

Similar Jobs

Yesterday
Montréal, QC, CAN
Senior level
Senior level
Artificial Intelligence • Software
As a Site Reliability Engineer at Behavox, you will ensure the availability and performance of production systems, implement SRE practices, and work with high-load data processing systems. Responsibilities include deploying and maintaining cloud services, automating operations, and collaborating with DevOps and engineering teams.
Top Skills: AnsibleAWSConsulGCPGitlabGoJavaJenkinsKubernetesNomadPythonSaltstackTerraform
2 Days Ago
Montréal, QC, CAN
Senior level
Senior level
Artificial Intelligence • Software
As a Site Reliability Engineer at Behavox, you'll ensure production systems' availability and performance, automate operations, and support high-load data processing in public cloud environments. Collaborate with DevOps and engineering teams to design effective SRE practices and tackle challenges in large-scale distributed systems.
Top Skills: AnsibleAWSCloud FunctionsConsulGCPGoGoogle Cloud DataflowJavaLinuxNomadPub/SubPythonSaltstackTerraformVault
14 Days Ago
Montréal, QC, CAN
Senior level
Senior level
Cloud • eCommerce • Payments • Sales • Software
The Senior Site Reliability Engineer will collaborate with data teams to design and maintain secure and reliable cloud infrastructure. Responsibilities include implementing Infrastructure as Code, ensuring high availability and disaster recovery, and optimizing observability and monitoring processes.
Top Skills: BashDockerGoGoogle Cloud PlatformKubernetesLinuxMySQLNetworkingPostgresPythonTerraformUnix

What you need to know about the Calgary Tech Scene

Employees can spend up to one-third of their life at work, so choosing the right company is crucial, not just for the job itself but for the company culture as well. While startups often offer dynamic culture and growth opportunities, large corporations provide benefits like career development and networking, especially appealing to recent graduates. Fortunately, Calgary stands out as a hub for both, recognized as one of Startup Genome's Top 100 Emerging Ecosystems, while also playing host to a number of multinational enterprises. In Calgary, job seekers can find a wide range of opportunities.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account