Senior Site Reliability Engineer (SRE)

Posted 5 Days Ago
3 Locations
Remote
Senior level
Logistics • Robotics
Reinvent the Warehouse & Reimagine the Supply Chain®
The Role
The Senior Site Reliability Engineer will lead root cause analysis investigations for critical production outages, improving system resiliency. Responsibilities include analyzing metrics, driving RCA processes, leading problem resolutions, and working collaboratively with cross-functional teams. The role involves troubleshooting infrastructure incidents and ensuring the service lifecycle is optimized based on operational lessons learned.
Summary Generated by Built In

Who we are

With its A.I.-powered robotic technology platform, Symbotic is changing the way consumer goods move through the supply chain. Intelligent software orchestrates advanced robots in a high-density, end-to-end system – reinventing warehouse automation for increased efficiency, speed and flexibility.

What we need

We are looking for a passionate, hard-working, and talented Senior Site Reliability Engineer to take the lead on solving some of the toughest operational challenges in some of the most sensitive and mission-critical automated warehouse solutions. The SRE team will drive the stability and sustainability of these next-generation systems and discover innovative ways to scale and operate them reliably as we expand. In this role, you will work with cross functional teams such as Operations, Mechanical Hardware Engineering, IT infrastructure Systems, and Software Engineering teams to identify and address underlying resiliency gaps. 

What we do

The Site Reliability Engineering team is part of the Technology Support organization and is responsible for all Root both industrial and enterprise systems Root Cause Analysis software. We are a passionate cross functional team solving some of the toughest challenges in some of the most sensitive and mission critical automated warehouse solutions.

What you’ll do

  • You will be part of the SRE Team, which is focused on hands-on root cause analysis of all critical production outages to improve resiliency. 

  • You are responsible for analyzing various sources of metric, dashboards, phrasing logs and articulating that to a facts-based actionable Root Cause Analysis investigation to lead a group of Subject Matter Experts teams to find the actual cause 

  • Host RCA calls as a chair and drive the RCA process to conclusion within tight SLAs with customer-facing deliverables 

  • Lead problem tickets and improvements to major software components, systems, and features to improve the availability, scalability, latency, and efficiency of the Symbotic System. 

  • Engage in and improve the service lifecycle from inception and design to deployment, operation, and refinement based on lessons learned through deep dives. 

  • Hands-on troubleshooting of VMware, Kubernetes, Custom Software, and infrastructure performance incidents. 

  • Be a trusted technical advisor who leads complex root cause analysis investigations from beginning to end until maximum improvements are identified 

  • Demonstrate sound knowledge of gathering logs and facilitating a facts-based root cause analysis with cross-functional teams. 

  • Assist internal teams with corrective actions and improvement tickets and influence the completion goals. 

  • Flexibility to work during occasional out of standard hours including weekends may be required depending on the cruciality and workload demands.

  • Ability to travel up to 10%.

What you’ll need

  • Bachelor’s degree in Software Engineering, Information Systems, Computer Science or a related field. 

  • Minimum 8 years of experience working on ITSM tools such as Jira or equivalent tool. (ITIL Problem Management experience is a plus) 

  • Minimum 8 years of infrastructure engineering experience with a record demonstrating the delivery of high-quality, large-scale solutions requiring planning and change control. 

  • Minimum 8 years of experience in operation of production systems including troubleshooting, testing, and automation. 

  • Minimum 5 years of experience leading technical Root Cause Analysis (Software and/or industrial focus is a plus) 

  • Ability to prioritize parallel RCA investigations and tasks by influencing cross-functional teams to complete actions on time with demanding quality. 

  • Experience with executive incident communications, RCA report writing, and written communication skills to non-technical audiences. 

  • Ability to transfer vast technical background to projects through excellent problem-solving and competence to work with other technical teams. Efficiently read and understand Gitlab technical documentation 

  • Experience in the advanced use of tools like Prometheus, Grafana, Logic Monitor, Elastic, VMware and use of CLI (Kube or Linux). 

  • Knowledge of Power BI, Tableau, executive report writing, and presentation skills is a plus. 

Our environment

  • Up to 10% of travel may be required. Employees must have a valid driver’s license and the ability to drive and/or fly to client and other customer locations.  

  • The employee is responsible for owning a credit card and managing expenses personally to be reimbursed bi-weekly. 

 

#LI-SK1

#LI-Remote

About Symbotic

Symbotic is an automation technology leader reimagining the supply chain with its end-to-end, AI-powered robotic and software platform. Symbotic reinvents the warehouse as a strategic asset for the world’s largest retail, wholesale, and food & beverage companies. Applying next-gen technology, high-density storage and machine learning to solve today's complex distribution challenges, Symbotic enables companies to move goods with unmatched speed, agility, accuracy and efficiency. As the backbone of commerce the Symbotic platform transforms the flow of goods and the economics of supply chain for its customers. For more information, visit www.symbotic.com.

 

We are a community of innovators, collaborators and pioneers who embrace our differences, because we know unique perspectives make us stronger and smarter. Every perspective matters. We depend on the collective voices of our employees, customers and community to help guide us as we build a better place to work – for you and the world. That’s why we’re proud to be an equal opportunity employer. 

We do not discriminate based on race, color, ethnicity, ancestry, religion, sex, national origin, sexual orientation, age, citizenship status, marital status, disability, gender identity, gender expression, veteran status, or genetic information. 

Top Skills

Kubernetes
VMware
The Company
HQ: Wilmington, MA
1,200 Employees
Hybrid Workplace
Year Founded: 2007

What We Do

The Symbotic story began in 2007 with a vision and the drive to solve real-world problems. Since then, the company has been on a journey to bring together warehouse operations expertise and groundbreaking technology to revolutionize warehouse automation. We provide end-to-end fully automated supply chain solutions using advanced robotics and A.I. for the world’s largest retailers and wholesalers including Walmart, Albertsons, C&S, UNFI, Target, Giant Tiger in Canada. Our innovation solutions solve complex supply chain challenges.

Why Work With Us

Our robots are responsible for getting food & merchandise to thousands of store shelves across North America. We think of ourselves as a hybrid company, one with the excitement & ambition of a startup and the benefits of a proven company. We do important work for some of the biggest companies in the world; together we are changing entire industries

Gallery

Gallery

Similar Jobs

Remote
United States
400 Employees

Upstart Logo Upstart

Senior Site Reliability Engineer

Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software
Easy Apply
Remote
2 Locations
1500 Employees
160K-222K Annually

Zeta Global Logo Zeta Global

Senior Site Reliability Engineer

AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
Easy Apply
Remote
United States
2194 Employees

Rula Logo Rula

Sr. SRE & DevOps Engineer (Remote)

Healthtech • Other • Social Impact • Software • Telehealth
Remote
Los Angeles, CA, USA
450 Employees

Similar Companies Hiring

Doodle Labs Thumbnail
Wearables • Robotics • Internet of Things • Hardware • Automation • App development • Aerospace
Los Angeles, CA
50 Employees
Cencora Thumbnail
Pharmaceutical • Logistics • Healthtech
Conshohocken, PA
46000 Employees
HERE Thumbnail
Software • Logistics • Information Technology
Amsterdam, NL
9000 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account