Manager, Site Reliability Engineering - Cloud Platform

Posted Yesterday
Austin, TX
Hybrid
182K-299K Annually
Senior level
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
The Role
As the Manager of Site Reliability Engineering, you will lead a team focused on improving the reliability, efficiency, and scalability of GM's cloud infrastructure. Responsibilities include automating operational processes, implementing monitoring frameworks, handling production incidents, managing service levels, and collaborating with development teams to ensure service reliability and cost efficiency.
Summary Generated by Built In

Description
Hybrid: This role is categorized as hybrid. This means the successful candidate is expected to report to Mountain View, CA, Austin, TX, or Warren, MI three times per week, at minimum.
The rapid adoption of advanced software in vehicles marks a new era for automakers and consumers, bringing both advantages and challenges.
As part of Site Reliability Engineering (SRE) at General Motors, you'll join a dedicated team focused on enhancing the reliability, efficiency, and scalability of our distributed systems. We leverage engineering principles to manage operations effectively and build solutions that enable us to grow without sacrificing performance or quality. Our SREs work closely with software development teams, acting as specialists in reliability and production engineering, with a focus on automation, observability, and shared responsibility.
We are looking for individuals who are passionate about maintaining the health of our infrastructure while optimizing for reliability and cost efficiency. This role involves a blend of software engineering and systems engineering skills to keep our services resilient, robust, and scalable.
This is an Engineering Manager role. As an SRE Engineering Manager, you will be expected to not only lead your team in setting priorities and ensuring alignment with organizational goals but also to be deeply technical. We expect our managers to be able to contribute directly through coding, reviewing code, and mentoring engineers.
While it's unlikely that you'll spend the majority of your time coding, having the capability and willingness to dive into technical details, solve problems hands-on, and support your team's technical decisions is crucial. You'll be a mentor, guide, and partner, helping engineers grow and ensuring the reliability and efficiency of the systems they work on. We set a high bar for engineering managers, who are expected to lead by example in both technical expertise and people leadership.
As part of the Cloud Platform Team, you will help to build and run a self-service, multi-cloud developer platform capable of supporting thousands of services and hundreds of engineering teams, with a focus on reliability and cost efficiency. Our customers are other engineering teams at GM, and our goal is to provide them with self-service capabilities that create and manage public cloud infrastructure that is reliable, resilient to failures, easily monitored and observed, and cost-effective.
Key Responsibilities

  • Automation and Reliability Improvements: Develop tools and software to automate operational processes, improve system reliability, and reduce manual intervention.
  • Observability and Monitoring: Lead, implement, and improve monitoring and observability frameworks, enabling proactive detection and resolution of incidents.
  • Incident Response: Participate in an on-call rotation to diagnose, troubleshoot, and mitigate production incidents, ensuring minimal downtime and swift resolution.
  • Collaboration with Development Teams: Work alongside developers to ensure the quality, scalability, and reliability of our services. Practice shared ownership of services in production, fostering a "You build it, you run it" culture.
  • Service Level Management: Manage Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs) to set reliability expectations effectively.
  • Engineering for Reliability: Apply a strong understanding of common application reliability patterns, backed by hands-on experience implementing them.
  • Failure Analysis and Post-Incident Reviews: Conduct deep-dive analyses of incidents and collaborate on post-incident reviews to derive learnings and prevent recurrence. Champion a culture of continuous improvement.
  • Cost Efficiency: Evaluate system performance and advocate for optimizations that reduce infrastructure costs while maintaining service reliability.


Additional Description
Skills and Qualifications

  • Bachelor's degree in Computer Science, Engineering, or equivalent experience
  • Programming Skills: Proficiency in at least one programming language (e.g., Python, Go, Java) and familiarity with multiple language ecosystems.
  • Systems Knowledge: Solid understanding of operating systems, networking, distributed systems, databases, and storage architectures.
  • Strong Understanding of System Fundamentals: Deep understanding of how code runs on underlying hardware, including operating systems, algorithms, and data structures. Ability to optimize or troubleshoot code by understanding its execution and the impact on system resources.
  • Incident Management: Experience handling production incidents, including root cause analysis, mitigation, and working through complex system failures.
  • Communication and Collaboration: Strong communication skills, with an ability to explain technical concepts to both engineering and business stakeholders. Commitment to collaborative problem solving and shared ownership of services.
  • Automation Focus: Proven experience in automating manual processes, building deployment pipelines, or managing configuration systems.


Preferred Experience

  • Experience with cloud platforms (AWS, GCP, Azure).
  • Familiarity with container orchestration systems like Kubernetes.
  • A track record of managing or developing distributed systems.
  • Prior experience with Java in production
  • Compensation:
    • The expected base compensation for this role is $181,600 - $298,800. Actual base compensation within the identified range will vary based on factors relevant to the position.
    • Bonus Potential: An incentive pay program offers payouts based on company performance, job level, and individual performance.
  • Benefits:
    • GM offers a variety of health and wellbeing benefit programs. Benefit options include medical, dental, vision, Health Savings Account, Flexible Spending Accounts, retirement savings plan, sickness and accident benefits, life insurance, paid vacation & holidays, tuition assistance programs, employee assistance program, GM vehicle discounts and more.


Relocation benefits may be provided.
About GM
Our vision is a world with Zero Crashes, Zero Emissions and Zero Congestion and we embrace the responsibility to lead the change that will make our world better, safer and more equitable for all.
Why Join Us
We aspire to be the most inclusive company in the world. We believe we all must make a choice every day - individually and collectively - to drive meaningful change through our words, our deeds and our culture. Every day, we want every employee, no matter their background, ethnicity, preferences, or location, to feel they belong to one General Motors team.
Total Rewards | Benefits Overview
From day one, we're looking out for your well-being, at work and at home, so you can focus on realizing your ambitions. Learn how GM supports a rewarding career that rewards you personally by visiting Total Rewards resources.
Diversity Information
General Motors is committed to being a workplace that is not only free of unlawful discrimination, but one that genuinely fosters inclusion and belonging. We strongly believe that workforce diversity creates an environment in which our employees can thrive and develop better products for our customers. We encourage interested candidates to review the key responsibilities and qualifications for each role and apply for any positions that match their skills and capabilities. Applicants in the recruitment process may be required, where applicable, to successfully complete a role-related assessment(s) and/or a pre-employment screening prior to beginning employment. To learn more, visit How we Hire
Equal Employment Opportunity Statement (U.S.)
General Motors is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.
Accommodations (U.S. and Canada)
General Motors offers opportunities to all job seekers including individuals with disabilities. If you need a reasonable accommodation to assist with your job search or application for employment, email us at [email protected] or call us at 800-865-7580. In your email, please include a description of the specific accommodation you are requesting as well as the job title and requisition number of the position for which you are applying.

Top Skills

Go
Java
Python

The Company
HQ: Detroit, MI
165,000 Employees
Hybrid Workplace
Year Founded: 1908

What We Do

At General Motors, our vision is to create a world with Zero Crashes, Zero Emissions, and Zero Congestion. We wholeheartedly embrace the responsibility to lead the change that will make our world better, safer, and more equitable for all.

Our industry and company are undergoing a once-in-a-lifetime technological transformation, which is reshaping our approach to technology and innovation. We are expanding our horizons through new technology platforms and driving innovations that deliver exceptional value to our customers.

Why Work With Us

At General Motors, our purpose is to pioneer the innovations that move and connect people to what matters. We’re driving the world forward, together. We’re building vehicle software alongside its hardware, hands-free driving that will lead to autonomy, and EVs that charge your home for an all-electric future.

General Motors Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Roles that are categorized as Hybrid mean that the successful candidate is expected to report onsite to the designated facility at least three times per week or other frequency as dictated by the business.

Typical time on-site: 3 days a week
HQ: Detroit Renaissance Center Global HQ
FR
IL
NO
NZ
Alvear, Santa Fé
Austin IT Innovation Center
Bengaluru, IN
Bogotá, CO
Charlotte Technical Center
Huechuraba, Santiago Metropolitan Region
Indaiatuba, São Paulo
Kapuskasing, Ontario
Langley, British Columbia
Ireland IT Innovation Center
Markham, Ontario
Melbourne, Victoria
Mexico City, MX
Milford, MI
Mountain View Tech Center
Münster, DE
Oshawa, Ontario
Advanced Design and Innovation Campus
Pontiac Engineering Center
Ramos Arizpe, Coahuila
Roswell, GA
São Caetano do Sul, São Paulo
Silao, Guanajuato
Global Technical Center
