Site Reliability Engineer

Posted 6 Days Ago
Hiring Remotely in San Francisco, CA
Remote
Senior level
Artificial Intelligence • Hardware • Software
The Role
The Site Reliability Engineer will manage infrastructure reliability, automate processes, respond to incidents, and improve system performance in a SaaS environment.
Summary Generated by Built In

Who we are


The problem: Every minute matters in fire response.  As climate change amplifies the intensity of wildfires—with longer fire seasons, dryer fuels, and faster winds—new ignitions spread faster and put more communities at risk. Today, most wildfires are detected by bystanders and reported via 911, meaning it can take hours to detect a fire, verify its exact location and size, and dispatch first responders. Fire authorities need a faster way to detect, confirm, and pinpoint fires so that they can quickly respond—preventing small flare-ups from becoming devastating infernos.


About Pano: Pano is a venture-backed early-stage climate tech startup that is the leader in wildfire early detection, leveraging the latest advancements in IoT, AI, satellites, and SaaS software to deliver actionable intelligence to customers. Pano leverages mountaintop cameras and satellites to detect the first traces of smoke and put real-time fire images in the hands of asset owners and first responders to speed up containment. Pano is already partnering with major utilities, fire authorities, and government agencies in the USA and Australia. Recent media coverage includes being named one of the Top 10 most innovative companies in AI of 2023 by FastCompany and recipient of the Innovative Mobile Service and Application Award at Mobile World Congress through our partnership with T-Mobile.


Pano brings together a diverse team bridging frontline, wildland firefighting experience with best-in-class know-how in operations, logistics, artificial intelligence, and software. Our team is composed of experienced technology professionals from companies such as Apple, Cisco, Nest, DoorDash and Meta. Headquartered in San Francisco with an office and factory in the Mission District, our hybrid team works from locations around the world. Founded in mid-2020, we’ve raised over $40M from leading VC funds. 


The Role 


For the SRE role, we’re looking for somebody to join our Platform team and bridge the gap between development and infra operations, ensuring the reliability, performance, and availability of software systems through automation, monitoring, and proactive problem-solving.


The person in this role will be responsible for ensuring that the underlying infrastructure is running smoothly and that our systems and tools are working as expected.


At Pano, we strongly believe in team members taking ownership of what they do, and our approach to problem-solving relies heavily upon creativity, communication, and collaboration. 


The ideal candidate is humble, hungry, and people-smart.  They have the wisdom and experience to build mature operational processes for the future but are also comfortable with rolling up their sleeves, writing code, and building systems in a growing startup environment.

What you’ll do

  • Implement and maintain monitoring systems to proactively identify and address potential issues before they impact users.
  • Automate repetitive tasks and processes, such as deployments, infrastructure management, and incident response, to improve efficiency and reduce manual effort.
  • Respond to incidents, diagnose problems, and implement solutions to restore service quickly.
  • Improve the performance and scalability of systems and applications, ensuring they can handle peak loads and user traffic.
  • Help plan future capacity needs, ensuring that systems can accommodate growth and evolving requirements, while remaining cost-efficient.
  • Work closely with development teams to understand their needs, guide them, and ensure that systems are designed and deployed reliably.
  • Build tools to codify and automate infrastructure operations.
  • Define and track SLIs and SLOs to measure the performance and reliability of services.
  • Assess and mitigate risks associated with deployments and infrastructure changes.
  • Assist with the release and deployment processes, ensuring that changes are rolled out smoothly and reliably.

What you’ll bring

  • 5+ years of professional experience in a fast-paced SaaS or a similar business environment
  • 3+ years of hands-on experience supporting production systems as a Site Reliability Engineer (SRE) or a DevOps Engineer 
  • 3+ years of hands-on experience with cloud services and technologies (GCP, AWS, Azure, etc.)
  • Experience with containerization and orchestration tools (e.g., Docker, Kubernetes)
  • Proficient in Infrastructure as Code (IaC) tools and methodologies (e.g. Terraform, Pulumi, Puppet, etc.)
  • Proven ability to troubleshoot and resolve complex technical issues in distributed systems
  • Ability to communicate effectively within the team and across the organization while sharing insights and updates and collaborating to achieve project goals

Preferred skills:

  • Advanced working knowledge of GCP Services like GKE, GCS, IAM, etc.
  • Professional experience supporting containerized Java/JVM/Python services
  • Experience with relational databases, particularly PostgreSQL
  • 3+ years of professional experience designing, and implementing and/or administering CI/CD solutions (e.g. Github Actions, Buildkite, Jenkins, etc.)
  • Strong SRE mindset with focus on cloud networking and security best practices
  • Strong software development, particularly with scripting languages (e.g., Python, Bash, etc.)
  • Experience with system administration in general and Linux in particular
  • Familiarity with SOC2 / ISO 27001 security frameworks
  • Preference for someone in the Pacific / Mountain time zone

Pano is an equal opportunity employer committed to recruiting and supporting our team-members regardless of where they come from. We do not discriminate on the basis of race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.

Top Skills

AWS
Azure
Bash
Buildkite
Docker
GCP
Github Actions
Jenkins
Kubernetes
Postgres
Pulumi
Puppet
Python
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
41 Employees
On-site Workplace
Year Founded: 2019

What We Do

Pano AI (Pano) is the leader in early wildfire detection and intelligence, providing government, utilities, insurers, and private landowners with advanced tools and up-to-the-minute intelligence to quickly mitigate wildfire threats while protecting lives, property, and the environment.

Pano combines advanced hardware, artificial intelligence, and software in a single integrated enterprise solution. Leveraging data and satellite feeds, as well as propriety imagery from a network of ultra-high-definition, 360-degree cameras atop high vantage points, Pano’s artificial intelligence model produces a real-time picture of threats in a geographic region and delivers immediate, actionable intelligence.

Pano is based in San Francisco with a team of experienced technology professionals–from companies including Cisco, Tesla, Apple, Salesforce, and Nest–who are on a mission to give firefighters advanced tools to safeguard lives, communities, and the environment. Recently, Pano has been featured on Fox Business News, the San Francisco Chronicle, Morning Brew, and T&D World Magazine. Learn more at https://www.pano.ai/.

Similar Jobs

Chainlink Labs Logo Chainlink Labs

Site Reliability Engineer (SRE), Cloud Efficiency Engineer

Blockchain • Internet of Things • Payments • Cryptocurrency • Web3
Remote
7 Locations
680 Employees

Atlassian Logo Atlassian

Site Reliability Engineer

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
Remote
San Francisco, CA, USA
11000 Employees
117K-187K Annually

Cisco Meraki Logo Cisco Meraki

Lead Site Reliability Engineer, Engineering Enablement - REMOTE

Hardware • Information Technology • Security • Software • Cybersecurity • Conversational AI
Easy Apply
Remote
Hybrid
United States
3000 Employees
139K-215K Annually

Cisco Meraki Logo Cisco Meraki

Lead Site Reliability Engineer, Network - Remote

Hardware • Information Technology • Security • Software • Cybersecurity • Conversational AI
Easy Apply
Remote
Hybrid
2 Locations
3000 Employees
148K-236K Annually

Similar Companies Hiring

True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees
Caliola Engineering Thumbnail
Software • Machine Learning • Hardware • Defense • Data Privacy • App development • Aerospace
Colorado Springs, CO
53 Employees
Red 6 Thumbnail
Virtual Reality • Software • Hardware • Defense • Aerospace
Orlando, Florida
113 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account