Job Summary
Job Description
What will you do?
- Run the production environment by monitoring availability and taking a holistic view of system health
- Build tools to manage platform infrastructure and applications
- Debug production issues across services and levels of the stack and provide primary operational support and engineering for multiple large distributed software applications
- Help adopt and drive the creation of tools for health monitoring and alerting applications.
- Improve reliability, quality, and time-to-market of our suite of software solutions
- Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of application team needs, and innovating to continually improve
- Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding.
- Partner with development teams to improve services through rigorous testing and release procedures.
- Participate in system design consulting, platform management, and capacity planning.
- Create sustainable systems and services through automation and uplifts.
- Balance feature development speed and reliability with well-defined service level objectives.
Technical Leadership
- Provide SRE thought leadership on the squad level
- Perform code and non-functional (performance, security, maintainability) reviews of all production-bound SRE solutions
- Help drive transformation by continuously looking for ways to automate existing processes
- Run engineering mindset meetups accelerating breadth and depth of knowledge in the community
- Manage application assets and users (virtual machines, cloud instances, source code repositories, etc.)
- Publish technical design for SRE solutions
- Publish and/or review implementation plans for SRE solutions bound to production
- Explore new capabilities and technologies to drive innovation (including coding and publishing how-to documentation)
Production Support + Development
- Perform production support role, including off-hours support
- Assist in incident management and problem management for applications in scope
- Evaluate continuously – what went well, what went wrong, what can be done to improve and prevent in future
- Maintain technology currency (perform server patching, certificate renewal, etc.) with a keen eye on automating opportunities
- Ensure availability and uptime of applications in scope, as per service level objectives
- Ensure compliance with all systems and applications in scope, including maintaining segregation of duties
What Do You Need To Succeed?
Must have:
- Overall, 5- 7 years of support experience in Openshift, Azure & Kubernetes.
- 3 years of experience as an SRE supporting multiple applications
- Have very strong programming skills in Java/JavaScript/Typescript/Python
- SQL database operational experience in the cloud/on-premise and writing/understanding database queries (SQL and/or No-SQL)
- Object Oriented design and development
- Exposure to UCD, PCF (Pivotal Cloud Foundry), and GitHub is desirable
- Having a good overall understanding of networking-related areas like certificates, load balancers etc.
- Monitoring using Splunk, Dynatrace, RUM, Grafana & other related tools
- Experience with the operational aspects of software systems such as monitoring, centralized logging, and alerting.
- Experience in micro-services, public cloud (Azure preferred) & container technologies
- Working knowledge of Mainframes & JCL is nice to have
Nice-to-have:
- Knowledge of public cloud (Microsoft Azure and AWS) and private cloud (OpenShift) platforms and development of applications in multi-cloud, hybrid environments
- Knowledge of containers and orchestration (e.g: Docker, Kubernetes)
What's in it for you?
- A comprehensive Total Rewards Program including bonuses and flexible benefits, competitive compensation, commissions, and stock where applicable.
- Leaders who support your development through coaching and managing opportunities.
- Ability to make a difference and lasting impact.
- Work in a dynamic, collaborative, progressive, and high-performing team.
- Flexible work/life balance options.
- Opportunities to do challenging work.
- Opportunities to take on progressively greater accountabilities.
- Opportunities to build close relationships with clients.
#LI-Hybrid #LI-POST #TECHPJ
Job Skills
Critical Thinking, Customer Support Systems, Group Problem Solving, Installation Support, IT Service Level Management, IT Service Management (ITSM), IT Standards, Technical Troubleshooting
Additional Job Details
Address:
RBC WATERPARK PLACE, 88 QUEENS QUAY W:TORONTO
City:
TORONTO
Country:
Canada
Work hours/week:
37.5
Employment Type:
Full time
Platform:
TECHNOLOGY AND OPERATIONS
Job Type:
Regular
Pay Type:
Salaried
Posted Date:
2025-01-23
Application Deadline:
2025-02-17
Note: Applications will be accepted until 11:59 PM on the day prior to the application deadline date above
Inclusion and Equal Opportunity Employment
At RBC, we embrace diversity and inclusion for innovation and growth. We are committed to building inclusive teams and an equitable workplace for our employees to bring their true selves to work. We are taking actions to tackle issues of inequity and systemic bias to support our diverse talent, clients and communities.
We also strive to provide an accessible candidate experience for our prospective employees with different abilities. Please let us know if you need any accommodations during the recruitment process.
Join our Talent Community
Stay in-the-know about great career opportunities at RBC. Sign up and get customized info on our latest jobs, career tips and Recruitment events that matter to you.
Expand your limits and create a new future together at RBC. Find out how we use our passion and drive to enhance the well-being of our clients and communities at jobs.rbc.com.
Top Skills
What We Do
Royal Bank of Canada is a global financial institution with a purpose-driven, principles-led approach to delivering leading performance. Our success comes from the 88,000+ employees who leverage their imaginations and insights to bring our vision, values and strategy to life so we can help our clients thrive and communities prosper. As Canada’s biggest bank, and one of the largest in the world based on market capitalization, we have a diversified business model with a focus on innovation and providing exceptional experiences to our 17 million clients in Canada, the U.S. and 27 other countries. Learn more at rbc.com.
We are proud to support a broad range of community initiatives through donations, community investments and employee volunteer activities.