Sr Staff Site Reliability Engineer (Cortex)

Posted 5 Hours Ago
Be an Early Applicant
Santa Clara, CA
126K-204K Annually
Senior level
Cybersecurity
Palo Alto Networks is the global cybersecurity leader.
The Role
The Sr Staff Site Reliability Engineer will operate and enhance a large-scale GCP environment, focusing on observability systems. Responsibilities include improving monitoring processes, incident management, automation, and collaborating with engineering teams to optimize solutions for system performance and health.
Summary Generated by Built In

Company Description

Our Mission

At Palo Alto Networks® everything starts and ends with our mission:

Being the cybersecurity partner of choice, protecting our digital way of life.
Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we’re looking for innovators who are as committed to shaping the future of cybersecurity as we are.

Who We Are

We take our mission of protecting the digital way of life seriously. We are relentless in protecting our customers and we believe that the unique ideas of every member of our team contributes to our collective success. Our values were crowdsourced by employees and are brought to life through each of us everyday - from disruptive innovation and collaboration, to execution. From showing up for each other with integrity to creating an environment where we all feel included.

As a member of our team, you will be shaping the future of cybersecurity. We work fast, value ongoing learning, and we respect each employee as a unique individual. Knowing we all have different needs, our development and personal wellbeing programs are designed to give you choice in how you are supported. This includes our FLEXBenefits wellbeing spending account with over 1,000 eligible items selected by employees, our mental and financial health resources, and our personalized learning opportunities - just to name a few!

Job Description

Your Career

The Cortex team builds and delivers the industry’s most advanced SecOps platform, consisting of XSIAM, XSOAR, and XPANSE. As a member of the Cortex DevOps team, your role involves operating and maintaining a large-scale GCP environment, including the design, implementation, and continuous enhancement of our comprehensive observability systems. To meet the opportunities that such a role provides, you will have a deep knowledge of modern observability and monitoring tools and practices, having managed high cardinality metrics, implemented tracing, and operationalized large scale logging solutions. As part of this role, you will collaborate closely with our engineering teams to develop innovative solutions that provide clear and actionable insights into our systems’ performance and health.

Your Impact

As a Senior Staff SRE with the Cortex Observability team, you will:

  • Cloud Expertise - Utilize your expertise in monitoring cloud platforms, particularly GCP, to optimize our infrastructure leveraging cloud-native technologies
  • Monitoring Expertise - Improve monitoring processes, alerts, and metrics - Work with development teams to ensure that all of our services have the right monitoring and metrics in place so that we detect problems before our customers do
  • Incident Management - Leverage incident management processes to ensure efficient resolution of system issues and minimal impact on services
  • Automation - Automate complex monitoring and alerting tasks by building tools for cloud operations, such as automated remediation of known issues and auto-scaling
  • Continuously Improve - Stay up-to-date with cutting-edge technologies, evaluate their potential impact on our operations, and implement them when appropriate
  • On-Call - Participate with our DevOps team to provide follow-the-sun operational coverage in the production of our SaaS product
  • Collaborate - Work with our Engineering team to influence the operability of the product and ensure the reliability and availability of our services

Qualifications

Your Experience

  • Observability Tools - High proficiency with Prometheus, Grafana, and other monitoring tools - Experience with log management solutions, particularly Splunk and ELK
  • Distributed Tracing - Experience deploying distributed tracing using technologies such as OpenTelemetry
  • Incident and Alerts Management - Clear understanding of incident and alerts management in Site Reliability Engineering
  • DevOps/SRE Expertise - 5+ years of experience as a DevOps/SRE engineer with a passion for technology and a strong motivation for high reliability at the service level
  • Cloud Proficiency - High proficiency in either Google Cloud Platform or Amazon Web Services
  • Kubernetes and Docker - High proficiency with Kubernetes and Docker for container orchestration
  • Scripting and Automation - High proficiency in Python programming and Linux Shell commands - Experience with Terraform for infrastructure as code
  • Communication Skills - Effective communication and interpersonal skills, with the ability to work and coordinate between multiple teams
  • Troubleshooting - Ability to effectively troubleshoot and address emerging and complex problems
  • Independence - Ability to operate independently, make decisions, take action, and take responsibility

Additional Information

The Team

Our engineering team is at the core of our products – connected directly to the mission of preventing cyberattacks. We are constantly innovating – challenging the way we, and the industry, think about cybersecurity. Our engineers don’t shy away from building products to solve problems no one has pursued before.

We define the industry, instead of waiting for directions. We need individuals who feel comfortable in ambiguity, excited by the prospect of a challenge, and empowered by the unknown risks facing our everyday lives that are only enabled by a secure digital environment.and downtime.

Compensation Disclosure 

The compensation offered for this position will depend on qualifications, experience, and work location. For candidates who receive an offer at the posted level, the starting base salary (for non-sales roles) or base salary + commission target (for sales/commissioned roles) is expected to be between $126000 - $203500/YR. The offered compensation may also include restricted stock units and a bonus. A description of our employee benefits may be found here.

Our Commitment
We’re problem solvers that take risks and challenge cybersecurity’s status quo. It’s simple: we can’t accomplish our mission without diverse teams innovating, together.

We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at [email protected].

Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.

All your information will be kept confidential according to EEO guidelines.

Is role eligible for Immigration Sponsorship?: Yes

Top Skills

GCP
The Company
HQ: Santa Clara, CA
15,289 Employees
Hybrid Workplace
Year Founded: 2005

What We Do

Palo Alto Networks, the global cybersecurity leader, is shaping the cloud-centric future with technology that is transforming the way people and organizations operate. Our mission is to be the cybersecurity partner of choice, protecting our digital way of life. We help address the world's greatest security challenges with continuous innovation that seizes the latest breakthroughs in artificial intelligence, analytics, automation, and orchestration. By delivering an integrated platform and empowering a growing ecosystem of partners, we are at the forefront of protecting tens of thousands of organizations across clouds, networks, and mobile devices.

Why Work With Us

We are protecting the digital way of life for billions of people every day. We are united in this mission and the unique ideas it takes stay ahead. This is why we embrace each individual who is part of our team determined to make a difference.

Gallery

Gallery

Similar Jobs

Anduril Logo Anduril

Aircraft RF Systems Integration Engineer

Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
Costa Mesa, CA, USA
1400 Employees
140K-215K Annually

Anduril Logo Anduril

Machine Learning Infrastructure Engineer

Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
Costa Mesa, CA, USA
1400 Employees
168K-252K Annually

Cisco Meraki Logo Cisco Meraki

Technical Deployment Engineer

Hardware • Information Technology • Security • Software • Cybersecurity • Conversational AI
Easy Apply
San Francisco, CA, USA
3000 Employees
150K-214K Annually

Roblox Logo Roblox

Technical Lead, Partner Innovation Studio

Computer Vision • Gaming • Software • Virtual Reality • Web3 • Metaverse
San Mateo, CA, USA
2500 Employees
224K-283K Annually

Similar Companies Hiring

Invoice Home Thumbnail
Software • SEO • Mobile • Information Technology • Fintech • Financial Services • Cybersecurity
Austin, TX
20 Employees
MacPaw Thumbnail
Software • Security • Information Technology • Data Privacy • Cybersecurity • App development
Cambridge, MA
550 Employees
Silverfort Thumbnail
Security • Sales • Information Technology • Cybersecurity • Automation
GB
357 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account