Team Leader - Site Reliability Engineering (Technical Duty Officers)

Posted 2 Days Ago
San Mateo, CA
Hybrid
Senior level
Cloud • Fintech • Information Technology • Machine Learning • Software
Xero’s online accounting software connects small business owners with their numbers, their bank, and advisors anytime.
The Role
The Team Leader for Site Reliability Engineering at Xero will oversee incident management processes, lead a technical team, drive best practices in SRE, and collaborate with product teams to enhance service reliability. The role requires strong communication skills and hands-on experience with AWS and coding, particularly in Python, to build tools and automation for incident resolution.
Summary Generated by Built In

Xero is a beautiful, easy-to-use platform that helps small businesses and their accounting and bookkeeping advisors grow and thrive. 


At Xero, our purpose is to make life better for people in small business, their advisors, and communities around the world. This purpose sits at the centre of everything we do. We support our people to do the best work of their lives so that they can help small businesses succeed through better tools, information and connections. Because when they succeed they make a difference, and when millions of small businesses are making a difference, the world is a more beautiful place.


About the team


Xero’s Incident and Problem Management team is part of the Site Reliability Engineering (SRE) organization and is responsible for building, delivering and maintaining robust processes and tooling for incident management.


The team is responsible for driving enduring reliability at Xero through robust, consistent and fast response to high severity incidents. They are responsible for building a world-class process and ensuring that process matures as the demands of the business grow.


About the role


This role requires an experienced SRE professional with a strong technical background, deep experience in SRE, a passion for building and delivering robust processes, and extensive experience leading technical teams that respond to high severity cloud issues.


As a seasoned and relentless professional, you will drive best practice across the business and contribute to the ongoing transformation of the Xero SRE culture. As an expert communicator, you will lead technical discussions to identify and track the actions that arise from incidents.


Across our SRE function, we're looking for people who are keen to dig deep into the causes of incidents and to proactively examine the potential causes of future ones, working with engineering teams to remove the risk of each failure scenario. Ultimately, you will build playbooks and automation to ensure quick and effective responses, and provide ongoing training across the business so that the process is well understood and followed.


This Team Leader role will focus on building and leading a team of highly technical site reliability engineers who provide a Technical Duty Officer (TDO) function within the business. TDOs are incident commanders who use SRE skill sets to drive fast mitigation and enduring resolution of impactful events.

What you'll do:

  • Own the incident management process, ensuring it drives enduring reliability across all products and services within Xero.
  • Provide expert leadership during critical outages, coordinating multiple teams to ensure streamlined decision-making and quick resolution.
  • Lead and advocate for the transformation to a world-leading SRE organization, promoting SRE principles within the Engineering Department.
  • Promote a customer-focused approach by addressing and mitigating global customer environment issues, and fostering a culture of continuous learning and technical excellence within the SRE team.
  • Develop and implement scalable process frameworks and observability strategies to ensure rapid problem diagnosis, response, and service reliability.
  • Collaborate with product teams to thoroughly analyze failures and integrate insights to improve service reliability, scalability, and operational efficiency.

What you'll bring:

  • Previous career experience as a Site Reliability Engineer, in an Operations or Engineering environment
  • Proven people & team leadership capabilities, having held a Team Leader/Engineering Manager role at a comparable organisation, with a passion for leading others
  • Hands-on experience troubleshooting AWS hosted services
  • Networking knowledge, including the ability to troubleshoot TCP/IP, SSL/TLS, DNSSEC, IPsec, and BGP issues
  • Coding experience (preferably Python) building tools, scripting, or automation
  • Strong communication (oral & written) skills including the ability to translate technical issues/concepts into agreed actions


Why Xero? 

Diversity of people brings diversity of thought, and we like that. Our human-first culture of respect, fairness, and inclusion is what helps Xeros thrive at work and beyond. We offer very generous paid leave to use however you’d like (plus statutory holidays!), dedicated paid leave to care for your physical and mental wellbeing, and an Employee Assistance Program to access mental health care for you and your family. You’ll also find employee resource groups, wellbeing programming and allowances, medical, dental, vision, and disability insurance, fertility and family forming financial support, 401k contribution matching, 26 weeks of paid parental leave for primary caregivers, an Employee Share Plan, beautiful offices with snacks and break areas, flexible working, and career development. With these and many other benefits that reflect our human value, you’ll do the best work of your life at Xero.

Top Skills

Python

The Company
HQ: Hawthorn, VIC
4,700 Employees
Hybrid Workplace
Year Founded: 2006

What We Do

Xero is a global small business platform with 3.95 million subscribers which includes a core accounting solution, payroll, workforce management, expenses and projects. Xero also has an extensive ecosystem of connected apps and connections to banks and other financial institutions helping small businesses access a range of solutions from within Xero’s open platform to help them run their business and manage their finances.

Why Work With Us

Xero is not like most companies. When you join Xero, you become part of something beautiful: a global community of people who are passionate about making an impact on the world. It’s a place where you can truly be yourself and find success in a way that’s meaningful to you.

Xero Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Join us from home or at one of our beautiful workspaces. Xero has offices in Australia, New Zealand, United Kingdom, United States, Canada, Singapore, and South Africa.

Typical time on-site: Flexible
Melbourne (HQ)
Wellington, NZ (HQ)
Singapore
Auckland, NZ
Brisbane
Denver, CO
London, GB
Napier, NZ
New York, NY
San Mateo, CA
Sydney, NSW
Toronto, Ontario

Similar Jobs

Xero

Head of Engineering, Product SRE

Cloud • Fintech • Information Technology • Machine Learning • Software
Hybrid
San Mateo, CA, USA
4700 Employees
266K-300K Annually

Xero

Technical Duty Officer (Lead/Senior Site Reliability Engineer)

Cloud • Fintech • Information Technology • Machine Learning • Software
Hybrid
San Mateo, CA, USA
4700 Employees
120K-175K Annually

Xero

Senior Site Reliability Engineer

Cloud • Fintech • Information Technology • Machine Learning • Software
Hybrid
San Mateo, CA, USA
4700 Employees
120K-175K Annually

Xero

Principal Engineer, Site Reliability

Cloud • Fintech • Information Technology • Machine Learning • Software
Hybrid
San Mateo, CA, USA
4700 Employees
200K-250K Annually
