SRE Specialist

Posted 18 Hours Ago
Be an Early Applicant
Circonscription électorale de Bourassa-Sauvé, QC
Mid level
Insurance • Financial Services
The Role
The SRE Specialist will implement and maintain reliability solutions for on-premises and cloud applications, work collaboratively across IT teams, automate system configurations, and establish monitoring protocols. They will also conduct incident reviews, define reliability requirements, and prepare performance dashboards while staying updated on market trends.
Summary Generated by Built In

Our employees are at the heart of what we do best: helping people, businesses and society prosper in good times and be resilient in bad times. When you join our team, you’re bringing this purpose to life alongside a passionate community of experts.

Feel empowered to learn and grow while being valued for who you are– here, diversity is a strength. You have our commitment to support you in reaching your goals with tools, opportunities, and flexibility. It’s our employee promise. 

Our hybrid work model provides the balance between working from home and enjoying meaningful in-person interactions.

Read on to see how you can shape the future, win as a team, and grow with us.

About the role

Our SRE Practice Team is looking for an experienced SRE Specialist!

As a key contributor to the reliability strategy at IFC, you will be responsible for the adoption, integration, and evolution of the Framework and platform, as well as the associated automation and service delivery. You will work collaboratively with various IT teams to enable and guide them in achieving site reliability.

What you'll do here:

  • Implement and maintain a comprehensive reliability solution for on-premises and cloud applications and services.

  • Work collaboratively with application support, developers, and database teams for application reliability on-boarding.

  • Automate system instrumentalization and configuration.

  • Design and implement self-service models for reliability.

  • Develop reliability requirements working with various IT support and cloud ops teams.

  • Deploy application monitoring profiles that meet requirements.

  • Implement workflow and synthetic transaction monitoring and alerting.

  • Help define key application performance metrics for proactive response to alerts.

  • Establish protocols for proactive monitoring alerts.

  • Plan and support the installation and configuration of monitoring agents and other monitoring components for on-premises and cloud applications and services.

  • Collaborate with projects and application support teams across the enterprise to identify opportunities and provide inputs to address any reliability gaps.

  • Define, capture, analyze, and build reliability solutions as per system requirements.

  • Conduct incident reviews from a reliability perspective and address any gaps in reliability coverage.

  • Prepare environment dashboards and report environment health, performance, and availability metrics.

  • Support the reliability platform and its underlying solutions.

  • Ensure leading awareness of market trends and opportunities on the subject matter.

  • Produce and review architecture documentation.

  • Define best practices framework and guidelines.

What you bring to the table:

  • Solid expertise on the topic of IT reliability

  • Extensive experience with application performance management, IT infrastructure monitoring, and user experience monitoring.

  • Technical leadership experience.

  • Enterprise application, systems, and network monitoring expertise for on-premises and cloud applications.

  • Hands-on experience with Dynatrace, Elastic Search, and ServiceNow in instrumenting applications end-to-end with minimal supervision.

  • Solid knowledge of AI-OPS, anomaly detection, and event correlation solutions.

  • Comfortable with scripting or programming languages (Java, C++, GO, Python)

  • Experience with open telemetry.

  • Good knowledge of infrastructure protocols to gather element-level event data.

  • Good knowledge of open-source monitoring technologies.

  • Proficient with data lifecycles and aggregation, reporting, and web dashboards.

  • Proficient in ITIL event management and good basis in ITIL foundational concepts.

  • Hands-on experience with continuous integration tools.

  • Deep knowledge of reliability and Site Reliability Engineering (SRE).

  • Infrastructure and Networking: The candidate should be familiar with advanced networking tools like F5, Citrix, Cloudflare, etc. and be able to design custom hardware and software networking solutions.

  • Troubleshooting: The candidate should be proficient with advanced log analysis tools like Dynatrace and be able to develop and maintain automated testing and deployment tools.

  • Cloud Computing and Virtualization: The candidate should have hands-on experience with AWS, GCP, Azure, VirtualBox, Docker, Kubernetes and advanced cloud infrastructure tools like Terraform, Puppet, or Chef.

  • Distributed Systems and Scalability: The candidate should have knowledge of advanced distributed systems tools like Kubernetes, and service meshes, and advanced distributed systems tools like Cassandra, Hadoop, or Spark.

  • Security and Compliance: The candidate should have knowledge of advanced security tools like HashiCorp Vault, AWS KMS, or Azure Key Vault and security best practices, firewalls, encryption, SSL/TLS.

  • For candidates located in Quebec, bilingualism is required considering the necessity to interact on a regular basis with English speaking colleagues across the country. 

  • No Canadian work experience required however must be eligible to work in Canada 

If you are interested in joining the SRE Practice team and contributing to the reliability strategy at IFC, please apply now.

#LI-Hybrid

What we offer

Working here means you'll be empowered to be and do your best every day. Here is some of what you can expect as a permanent member of our team:

  • A financial rewards program that recognizes your success

  • An industry leading Employee Share Purchase Plan; we match 50% of net shares purchased

  • An extensive flex pension and benefits package, with access to virtual healthcare

  • Flexible work arrangements

  • Possibility to purchase up to 5 extra days off per year

  • An annual wellness account that promotes an active and healthy lifestyle

  • Access to tools and resources to support physical and mental health, embracing change and connecting with colleagues

  • A dynamic workplace learning ecosystem complete with learning journeys, interactive online content, and inspiring programs

  • Inclusive employee-led networks to educate, inspire, amplify voices, build relationships and provide development opportunities

  • Inspiring leaders and colleagues who will lift you up and help you grow

  • A Community Impact program, because what you care about is a part of what makes you different. And how you contribute to your community should be just as unique.

We are an equal opportunity employer

At Intact, we value diversity and strive to create an inclusive, accessible workplace where all individuals feel valued, respected, and heard.

If we can provide a specific adjustment to make the recruitment process more accessible for you, please let us know when we reach out about a job opportunity. We’ll work with you to meet your needs.

Click here to review other important information about the hiring process, including background checks, internal candidates, and eligibility to work in Canada.

If you are an employee of Intact or belairdirect, please apply for this role on Contact People.

Top Skills

C++
Go
Java
Python
The Company
Edmonton, Alberta
18,239 Employees
On-site Workplace
Year Founded: 1809

What We Do

We are here to help people, businesses and society prosper in good times and be resilient in bad times. This is our purpose and the foundation of our company – it drives everything we do and gives meaning to our work.
-
Nous sommes là pour aider les gens, les entreprises et la société à aller de l'avant dans les bons moments et à être résilients dans les moments difficiles. C'est notre raison d'être et l'essence même de notre entreprise

Similar Jobs

Hybrid
Montréal, QC, CAN
550 Employees

DRW Logo DRW

Application Support Specialist

Fintech • Financial Services
Montréal, QC, CAN
1825 Employees
Montréal, QC, CAN
300 Employees

Similar Companies Hiring

MyBambu Thumbnail
Social Impact • Payments • Other • Mobile • Fintech • Financial Services • App development
West Palm Beach, Florida
120 Employees
Energy CX Thumbnail
Utilities • Professional Services • Greentech • Financial Services • Energy • Consulting • Business Intelligence
Chicago, IL
55 Employees
MassMutual India Thumbnail
Insurance • Information Technology • Fintech • Financial Services • Big Data
Hyderabad, Telangana

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account