Join us as a Site Reliability Engineer
- In this key role, you’ll improve, drive, and embed non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services
- You’ll enjoy significant stakeholder interaction, working in collaboration with engineers to ensure a principled approach to deliver change in a safe and secure way
- This is a chance to join an inclusive team with a collaborative ethos and a commitment to innovation and professional development
- We're offering this role at vice president level
What you'll do
As our Site Reliability Engineer, you’ll work closely with our feature team and other colleagues to meet defined service level objectives and continually improve systems and environments. You’ll define error budgets that support finding the right balance between risk and reliability.
You’ll also provide structure and help to our release process, suggesting and making improvements where possible. You’ll scale systems sustainably through mechanisms like automation, evolving them by pushing for changes that improve reliability and velocity. We’ll also look to you to coach and provide guidance to colleagues and the wider team, leading where required.
In addition to this, you’ll:
- Proactively contribute new ideas and innovations to meet short term and longer-term goals
- Continually balance and manage any potential risks
- Be accountable for the day-to-day health of both production and non-production environments and respond to any incidents as required
- Provide technical expertise and input to establish the risk tolerance of products and services
- Communicate incident status updates clearly and frequently to other teams, customers and stakeholders
The skills you'll need
We’re looking for someone with strong knowledge of reliability systems thinking and experience of software engineering. You’ll need at least 11 years experience of using a data driven and scientific approach to fact finding. We’ll also look for financial services knowledge, and the ability to identify wider business impact, risk and opportunity, and make connections across key outputs and processes
We’re also looking for:
- Good knowledge and experience of programming languages like Java and Terraform
- Strong knowledge of deploy and release services, automation, and troubleshooting
- Experience of GCP, databases, observability and monitoring tools like Prometheus, Grafana and Splunk
- Experience of SRE principles and CI/CD pipelines
- Strong communication skills with the ability to proactively engage with a wide range of stakeholders
Hours
45
Job Posting Closing Date:
12/05/2025
What We Do
We’re a business that understands when our customers and people succeed, our communities succeed, and our economy thrives. As part of our purpose, we’re looking at how we can drive change for our communities in enterprise, learning and climate.
As one of the leading supporters of UK business, we’re prioritising enterprise as a force of change. We’re focusing on the people and communities who have traditionally faced the highest barriers to entry and figuring out ways to remove these.
Learning is also key to our continued growth as a company in an ever changing and increasingly digital world. By setting a dynamic and leading learning culture, our people prosper, and our customers are given the tools to continue to improve their financial capability and confidence.
One of the biggest challenges we all face in our future is climate change. That’s why we’ve put it right at the core of our purpose. We want to champion climate solutions with financing and entrepreneurial support, fully embed climate into our culture and decision making, and be climate positive by 2025.
We’re committed to using our purpose to break down barriers, drive change and ultimately create a great place to work.