Site Reliability Engineer

Posted 2 Days Ago
Be an Early Applicant
Toronto, ON
Senior level
Fintech • Insurance
The Role
As a Site Reliability Engineer, you will monitor system health, support large distributed applications, improve reliability and performance, and collaborate with development teams. You will automate processes, manage production support, and contribute to system design and capacity planning.
Summary Generated by Built In

Job Summary

Job Description

What will you do?

    • Run the production environment by monitoring availability and taking a holistic view of system health
    • Build tools to manage platform infrastructure and applications
    • Debug production issues across services and levels of the stack and provide primary operational support and engineering for multiple large distributed software applications
    • Help adopt and drive the creation of tools for health monitoring and alerting applications.
    • Improve reliability, quality, and time-to-market of our suite of software solutions
    • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of application team needs, and innovating to continually improve
    • Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding.
    • Partner with development teams to improve services through rigorous testing and release procedures.
    • Participate in system design consulting, platform management, and capacity planning.
    • Create sustainable systems and services through automation and uplifts.
    • Balance feature development speed and reliability with well-defined service level objectives. 

    Technical Leadership

    • Provide SRE thought leadership on the squad level
    • Perform code and non-functional (performance, security, maintainability) reviews of all production-bound SRE solutions
    • Help drive transformation by continuously looking for ways to automate existing processes
    • Run engineering mindset meetups accelerating breadth and depth of knowledge in the community
    • Manage application assets and users (virtual machines, cloud instances, source code repositories, etc.)
    • Publish technical design for SRE solutions
    • Publish and/or review implementation plans for SRE solutions bound to production
    • Explore new capabilities and technologies to drive innovation (including coding and publishing how-to documentation)

    Production Support + Development

    • Perform production support role, including off-hours support
    • Assist in incident management and problem management for applications in scope
    • Evaluate continuously – what went well, what went wrong, what can be done to improve and prevent in future
    • Maintain technology currency (perform server patching, certificate renewal, etc.) with a keen eye on automating opportunities
    • Ensure availability and uptime of applications in scope, as per service level objectives
    • Ensure compliance with all systems and applications in scope, including maintaining segregation of duties

     

    What Do You Need To Succeed? 
    Must have:

    • Overall, 5- 7 years of support experience in Openshift, Azure & Kubernetes.
    • 3 years of experience as an SRE supporting multiple applications
    • Have very strong programming skills in Java/JavaScript/Typescript/Python
    • SQL database operational experience in the cloud/on-premise and writing/understanding database queries (SQL and/or No-SQL)
    • Object Oriented design and development
    • Exposure to UCD, PCF (Pivotal Cloud Foundry), and GitHub is desirable
    • Having a good overall understanding of networking-related areas like certificates, load balancers etc.
    • Monitoring using Splunk, Dynatrace, RUM, Grafana & other related tools
    • Experience with the operational aspects of software systems such as monitoring, centralized logging, and alerting.
    • Experience in micro-services, public cloud (Azure preferred) & container technologies
    • Working knowledge of Mainframes & JCL is nice to have

    Nice-to-have:

    • Knowledge of public cloud (Microsoft Azure and AWS) and private cloud (OpenShift) platforms and development of applications in multi-cloud, hybrid environments
    • Knowledge of containers and orchestration (e.g: Docker, Kubernetes)

    What's in it for you?

    • A comprehensive Total Rewards Program including bonuses and flexible benefits, competitive compensation, commissions, and stock where applicable.
    • Leaders who support your development through coaching and managing opportunities.
    • Ability to make a difference and lasting impact.
    • Work in a dynamic, collaborative, progressive, and high-performing team.
    • Flexible work/life balance options.
    • Opportunities to do challenging work.
    • Opportunities to take on progressively greater accountabilities.
    • Opportunities to build close relationships with clients.

    #LI-Hybrid #LI-POST #TECHPJ

    Job Skills

    Critical Thinking, Customer Support Systems, Group Problem Solving, Installation Support, IT Service Level Management, IT Service Management (ITSM), IT Standards, Technical Troubleshooting

    Additional Job Details

    Address:

    RBC WATERPARK PLACE, 88 QUEENS QUAY W:TORONTO

    City:

    TORONTO

    Country:

    Canada

    Work hours/week:

    37.5

    Employment Type:

    Full time

    Platform:

    TECHNOLOGY AND OPERATIONS

    Job Type:

    Regular

    Pay Type:

    Salaried

    Posted Date:

    2025-01-23

    Application Deadline:

    2025-02-17

    Note: Applications will be accepted until 11:59 PM on the day prior to the application deadline date above

    Inclusion and Equal Opportunity Employment

    At RBC, we embrace diversity and inclusion for innovation and growth. We are committed to building inclusive teams and an equitable workplace for our employees to bring their true selves to work. We are taking actions to tackle issues of inequity and systemic bias to support our diverse talent, clients and communities.
    ​​​​​​​
    We also strive to provide an accessible candidate experience for our prospective employees with different abilities. Please let us know if you need any accommodations during the recruitment process.

    Join our Talent Community
    Stay in-the-know about great career opportunities at RBC. Sign up and get customized info on our latest jobs, career tips and Recruitment events that matter to you.
    Expand your limits and create a new future together at RBC. Find out how we use our passion and drive to enhance the well-being of our clients and communities at jobs.rbc.com.

    Top Skills

    Java
    JavaScript
    Python
    SQL
    Typescript
    The Company
    HQ: Toronto, Ontario
    88,000 Employees
    On-site Workplace

    What We Do

    Royal Bank of Canada is a global financial institution with a purpose-driven, principles-led approach to delivering leading performance. Our success comes from the 88,000+ employees who leverage their imaginations and insights to bring our vision, values and strategy to life so we can help our clients thrive and communities prosper. As Canada’s biggest bank, and one of the largest in the world based on market capitalization, we have a diversified business model with a focus on innovation and providing exceptional experiences to our 17 million clients in Canada, the U.S. and 27 other countries. Learn more at rbc.com.‎

    We are proud to support a broad range of community initiatives through donations, community investments and employee volunteer activities.

    Similar Jobs

    McCain Foods Logo McCain Foods

    Sr Eng Manager, SRE & Observability

    Food • Retail • Agriculture • Manufacturing
    Toronto, ON, CAN
    20000 Employees

    McCain Foods Logo McCain Foods

    Microsoft Unified Communication SRE

    Food • Retail • Agriculture • Manufacturing
    Toronto, ON, CAN
    20000 Employees
    Hybrid
    Mississauga, ON, CAN
    1557 Employees
    110K-118K Annually

    MongoDB Logo MongoDB

    Staff Site Reliability Engineer, Fabric

    Big Data • Cloud • Software • Database
    Toronto, ON, CAN
    2382 Employees

    Similar Companies Hiring

    EDGE Thumbnail
    Software • Fintech • Financial Services • Analytics
    Chicago, IL
    20 Employees
    Bectran, Inc Thumbnail
    Software • Machine Learning • Information Technology • Fintech • Automation • Artificial Intelligence
    Schaumburg, IL
    51 Employees
    MassMutual India Thumbnail
    Insurance • Information Technology • Fintech • Financial Services • Big Data
    Hyderabad, Telangana

    Sign up now Access later

    Create Free Account

    Please log in or sign up to report this job.

    Create Free Account