Manager, Site Reliability Engineering

Posted 10 Days Ago
Be an Early Applicant
United Kingdom
Senior level
Big Data • Cloud • Information Technology • Software • Business Intelligence • Cybersecurity
We help companies turn technology into a competitive advantage, whether they make it or use it.
The Role
The Manager of Site Reliability Engineering will lead a team dedicated to improving the operational readiness, reliability, performance, and scalability of Flexera's global cloud infrastructure. Responsibilities include team management, engaging with stakeholders, handling incidents, and ensuring high-quality delivery of SRE best practices.
Summary Generated by Built In

Flexera saves customers billions of dollars in wasted technology spend. A pioneer in Hybrid ITAM and FinOps, Flexera provides award-winning, data-oriented SaaS solutions for technology value optimization (TVO), enabling IT, finance, procurement and cloud teams to gain deep insights into cost optimization, compliance and risks for each business service. Flexera One solutions are built on a set of definitive customer, supplier and industry data, powered by our Technology Intelligence Platform, that enables organizations to visualize their Enterprise Technology Blueprint™ in hybrid environments—from on-premises to SaaS to containers to cloud.

We’re transforming the software industry. We’re Flexera. With more than 50,000 customers across the world, were achieving that goal. But we know we can’t do any of that without our team Ready to help us re-imagine the industry during a time of substantial growth and ambitious plans? Come and see why we’re consistently recognized by Gartner, Forrester and IDC as a category leader in the marketplace. Learn more at flexera.com

Build, grow and lead a team that is responsible for implementing the Site Reliability Engineering practices and tools that continually improve the operational readiness, instrumentation, reliability, performance and scalability of Flexera’s Snow Atlas global cloud infrastructure, platform and products. The team is central to the success of Flexera’s SaaS solutions and stakeholders will rely on your knowledge and expertise of SRE and DevOps practices.

Adopting DevOps principles of delivery, the manager is responsible for the deliverables of the central team and works with stakeholders to enable Site Reliability Engineers. The manager will engage with stakeholders to identify and deliver the highest value / priority work that improves SRE capabilities, tools and services. Generation of actionable insights from qualitative and quantitative metrics to continually improve the operational reliability of Snow’s systems.

What you will be doing:

  • Lead, manage and coach a team of Site Reliability Engineers (SREs) responsible for building and maintaining Flexera’s Snow Atlas platform infrastructure and tooling. Manage the day-to-day execution of high-quality, prioritized, deliverables of SRE best practices ensuring the reliability, scalability, instrumentation, automation and performance of Snow’s cloud SaaS products.
  • Being a passionate advocate of the SRE discipline and DevOps principles you will engage, influence, seek feedback, and evangelize best practices with development, operational and support teams to enable stakeholders to support self-service and “you build-it – you run it”.
  • Manage the operational reliability, fault-tolerance, performance, scalability, observability and efficiency of Flexera’s cloud platforms and products across environments.
  • Work on incidents in conjunction with team members and coordinating with wider stakeholders to resolve customer impacting service issues promptly.
  • Partners with security and other “shared services” teams to align, automate, integrate and orchestrate specialist tooling into a common set of SRE best practices that supports the wider Software Delivery Lifecycle and Product Lifecycle.
  • Plan and execute projects in support of the SRE objectives, and ensure projects are delivered with high quality, on time, and within budget
  • Hire, develop and retain a highly skilled SRE team
  • Evaluate hardware and software technologies to improve efficiency and performance

Responsibilities:

  • Manage a team responsible for supporting an international, 24x7, Azure cloud infrastructure powering Flexera’s customer facing service offerings
  • Participate in the design, implementation, and operation of a scalable and reliable systems infrastructure supporting a fast-growth SaaS offering
  • Ensure proper security, monitoring, alerting, and reporting for the infrastructure
  • Troubleshooting and resolving escalated issues
  • Capacity planning for all aspects of the infrastructure
  • Developing and maintaining processes, tools, and documentation in support of the production environment
  • Participate in evaluation of new software, hardware and infrastructure solutions
  • Participation in an on-call rotation and be available 24x7 in an escalation capacity

Required skills and knowledge:

  • Experience as a Site Reliability Engineering in cloud environments
  • Experience managing a team of Site Reliability Engineers
  • Experience managing infrastructure in Azure
  • Experience managing Kubernetes infrastructure in the cloud.
  • Experience in Monitoring & Observability practices in the cloud including tooling, logging, metrics, tracing, and alerting
  • Experience with IaC and Containers to achieve scalable, reliable, performant and secure SaaS platform infrastructure
  • Experience of CI/CD tooling to automate, orchestrate and integrate continuous delivery pipelines

Flexera is proud to be an equal opportunity employer. Qualified applicants will be considered for open roles regardless of age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by local/national laws, policies and/or regulations. 

Flexera understands the value that results from employing a diverse, equitable, and inclusive workforce. We recognize that equity necessitates acknowledging past exclusion and that inclusion requires intentional effort. Our DEI (Diversity, Equity, and Inclusion) council is the driving force behind our commitment to championing policies and practices that foster a welcoming environment for all.

We encourage candidates requiring accommodations to please let us know by emailing [email protected].

Top Skills

DevOps
Site Reliability Engineering
The Company
HQ: Itasca, IL
1,900 Employees
Hybrid Workplace
Year Founded: 1987

What We Do

Flexera delivers SaaS-based IT management solutions that enable enterprises to accelerate digital transformation and multiply the value of their technology investments. We help organizations inform their IT with definitive visibility into complex hybrid IT ecosystems, providing unparalleled IT insights that allow them to seize technology opportunities. And we help them transform their IT with tools that deliver actionable intelligence across an ever-increasing range of dimensions to effectively manage, govern and optimize their hybrid IT estate.

More than 50,000 customers subscribe to our technology value optimization solutions, delivered by 1,300+ passionate team members worldwide. To learn more, visit flexera.com

Why Work With Us

People work here for, well, the people. People stay for the camaraderie with smart, passionate teams who actually like working together and managers who support them. We also offer competitive benefits, hybrid working and unlimited time off.

Our inclusivity scores are in the top benchmark and we are consistently rated a “great place to work.”

Gallery

Gallery

Similar Jobs

Snyk Logo Snyk

Software Engineer (Machine Learning)

Artificial Intelligence • Cloud • Information Technology • Security • Software • Cybersecurity • Data Privacy
Hybrid
London, Greater London, England, GBR
1000 Employees

Snyk Logo Snyk

Manager, Engineering - Orchestration Group

Artificial Intelligence • Cloud • Information Technology • Security • Software • Cybersecurity • Data Privacy
Hybrid
London, Greater London, England, GBR
1000 Employees

Snyk Logo Snyk

Distinguished Engineer - Platform Engineering

Artificial Intelligence • Cloud • Information Technology • Security • Software • Cybersecurity • Data Privacy
London, Greater London, England, GBR
1000 Employees

Snyk Logo Snyk

Senior Software Engineer

Artificial Intelligence • Cloud • Information Technology • Security • Software • Cybersecurity • Data Privacy
London, Greater London, England, GBR
1000 Employees

Similar Companies Hiring

bet365 Thumbnail
Software • Gaming • eSports • Digital Media • Automation
Denver, Colorado
6100 Employees
Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees
InCommodities Thumbnail
Renewable Energy • Machine Learning • Information Technology • Energy • Automation • Analytics
Austin, TX
234 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account