Staff Site Reliability Engineer

Posted 8 Days Ago
Hiring Remotely in United States
Remote
148K-204K Annually
Senior level
Information Technology • Security • Cybersecurity
Defeating every attack, every second of every day.
The Role
As a Staff Site Reliability Engineer, you will lead incident management, optimize observability strategies, collaborate with engineering teams on monitoring solutions, develop service level objectives, and mentor fellow engineers in best practices.
Summary Generated by Built In

About Us:

SentinelOne is defining the future of cybersecurity through our XDR platform that automatically prevents, detects, and responds to threats in real-time. Singularity XDR ingests data and leverages our patented AI models to deliver autonomous protection. With SentinelOne, organizations gain full transparency into everything happening across the network at machine speed – to defeat every attack, at every stage of the threat lifecycle. 

We are a values-driven team where names are known, results are rewarded, and friendships are formed. Trust, accountability, relentlessness, ingenuity, and OneSentinel define the pillars of our collaborative and unified global culture. We're looking for people that will drive team success and collaboration across SentinelOne. If you’re enthusiastic about innovative approaches to problem-solving, we would love to speak with you about joining our team!

Due to Federal Government contract requirements, U.S. Citizenship is required for this position. FedRAMP Staff may be subject to customer or third-party background checks up to and including Secret Clearance if required by their role at SentinelOne.

What Are We Looking For?

We are looking for a Staff Site Reliability Engineer (SRE) to join the Site Reliability Engineering team at SentinelOne. This organization’s mission is to keep our uptime promise to our customers by ensuring we meet our SLOs/SLAs, to help our engineering teams ship software to our customers fast and with quality, and to ensure our customers are successful. We are looking to add a senior SRE who has experience running incident post-mortems, automating repetitive operational tasks, improving alerting accuracy, and building and refining processes that reduce downtime. You will work closely with cross-functional teams to lead reliability initiatives and bring best practices to our team.

We value good written communication skills, data-driven decisions, and a keen eye for continuous improvement. You help simplify, have a passion for new ideas, and know how to execute iteratively toward the final goal. We value candor and collaboration.

What Will You Do?

  • Lead and execute incident management for production issues, ensuring rapid recovery and root cause analysis
  • Improve and optimize the observability strategy…..
  • Collaborate with application engineering teams to design and implement monitoring solutions that enhance our alerting capabilities and reduce noise
  • Develop and refine SLOs, SLIs, and SLAs that align with business objectives and customer expectations
  • Conduct post-incident reviews, documenting findings and driving follow-up actions to prevent recurrence.
  • Mentor and support other engineers in incident response, troubleshooting techniques, and reliability best practices.

What Skills and Experience Will You Need?

  • 8+ years of experience in Site Reliability Engineering, DevOps, or a related field in cloud-native environments
  • Strong expertise in incident management processes and the ability to lead complex troubleshooting efforts under pressure.
  • Experience with Kubernetes and container orchestration
  • Experience with industry-standard observability stacks (Prometheus, Grafana, ELK, OpenTelemetry, etc.).
  • Proficiency in Python and Bash scripting to improve operational workflows and incident response
  • Familiarity with modern CI/CD pipelines and DevOps practices
  • Excellent communication skills with demonstrated ability to lead and mentor engineers in reliability practices.

Why Us?

You will work on real-world problems and make an impact by protecting our customers from cyber threats. You will join a cutting-edge project and will be able to influence the architecture, design, and structure of our core platform. You will tackle extraordinary challenges and work with the very best in the industry.

  • Medical, Vision, Dental, 401(k), Commuter, Health and Dependent FSA
  • Unlimited PTO
  • Industry-leading gender-neutral parental leave
  • Paid company holidays
  • Paid sick time
  • Employee stock purchase program
  • Disability and life insurance
  • Employee assistance program
  • Gym membership reimbursement
  • Cell phone reimbursement
  • Numerous company-sponsored events including regular happy hours and team-building events


This U.S. role has a base pay range that will vary based on the location of the candidate. For some locations, a different pay range may apply. If so, this range will be provided to you during the recruiting process. You can also reach out to the recruiter with any questions.

Base Salary Range

$148,000-$204,000 USD

SentinelOne is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.

SentinelOne participates in the E-Verify Program for all U.S. based roles. 

Top Skills

Bash
Python
The Company
HQ: Mountain View, CA
1,050 Employees
Remote Workplace
Year Founded: 2013
