Senior Site Reliability Engineer

Posted 12 Hours Ago
Be an Early Applicant
Denver, CO
Senior level
Big Data • Internet of Things • Machine Learning
The Role
The Senior Site Reliability Engineer will focus on production operations, provisioning and managing Kubernetes infrastructure, deploying software, and improving monitoring and alerting. The role requires troubleshooting production issues and contributing to automation and on-call processes.
Summary Generated by Built In

Life at Plume

At Plume, we believe that technology isn't about moving faster, it's about making life’s moments better. Which is why we’ve built the world's first, and only, open and hardware-independent service delivery platform for smart homes, small businesses, enterprises, and beyond. Our SaaS platform uses WiFi, advanced AI, and machine learning to create the future of connected spaces—and human experiences—at massive scale.

We now deliver services to over 50 million locations globally and have managed over 2.5 billion devices on our platform. We’re expanding rapidly, pioneering a new category, and we achieved our Series F funding in just four years. Our customers include many of the world's largest Communications Service Providers (CSPs) who look to Plume to help them evolve their smart home offerings while gleaning insights from their own data. 

With a bias for action and a love for being trailblazers, the team at Plume embodies a combination of relentless curiosity and imaginative innovation. We challenge ourselves to think in ways that other companies don't, work to do what should be done (rather than what can), and if we can’t do it exceptionally well, we don’t do it. It’s how we've assembled a team of world-class builders, thinkers, and doers. And it’s how we’re reinventing what’s possible every day.

Opportunity

We’re looking for infrastructure engineers who are driven, independent and like to learn, exchange and collaborate. Advanced experience on specific Tooling/Tech Stack is not essential, familiarity with standard infrastructure concepts and production troubleshooting will be a differential.

You’ll be driving operational excellence for our largest customers, powering Cloud Services for Mesh Wi-Fi in 10s of Millions of homes per environment. Systems scalability and performance are our focuses.

What You’ll Do 

  • Focus on Production operations/matters and on-call 
  • Provision and scale multi-datacenter Kubernetes Infrastructure and Applications (EKS)
  • Deploy Software in multiple Production Environments
  • Own monitoring and alerting to production systems, improvements and changes
  • Contribute improvements to the current automation 
  • Contribute improvements to our on-call process and alerting

What You’ll Bring

  • Availability to be in on-call rotation for Production issues
  • Availability to work with a distributed team in different timezones

Desired Skill Set

  • 6+ Years of experience with Production Troubleshooting 
  • 1+ years of Kubernetes Knowledge (operate)
  • 1+ years Basic Terraform Knowledge
  • Experience both setting up and utilizing Monitoring and observability tools
    • e.g. New Relic, Nagios/Icinga, Grafana, Prometheus
  • 2+ years of experience Programming/Scripting - one of the following
    • eg. Perl, Python, PHP, GoLang, Java, etc
  • 8+ years of experience with modern Linux Operating systems (Enterprise Linux or Debian based)
  • 6+ years of experience with modern cloud infrastructure, preferably AWS
  • Bachelor’s degree in related field or equivalent experience

Differentiators

  • Troubleshooting production performance/service degradation or outage issues at scale
  • Experience with Infrastructure Troubleshooting in VMs and/or Bare Metal (ssh/Linux)
  • Advanced Kubernetes knowledge
  • Advanced Terraform knowledge
  • Customer Facing experience in previous roles
  • Experience operating Kafka in Production
  • Experience operating NoSQL Databases in Production
  • Experience operating Relational Databases in Production
  • Generic Configuration Management experience

Candidates must be in a commutable distance. We are not offering relocation or visa transfers/sponsorship at this time. 

Total Compensation package would include: anticipated salary range of $136,000 - $160,000  + bonus + equity + benefits.  Benefits include: a 401k plan and a company match, basic life insurance plus unparalleled health, dental, vision and other benefits and perks. Please see here for more details.

An employee’s base salary and its position within the range may depend on a number of factors including job related knowledge, education, skills, experience and other business related considerations. Published ranges are provided in good faith at the time of posting. 

About Plume

As the creator of the only open, hardware-independent, cloud-controlled experience platform for CSPs and their subscribers, Plume partners with over 350 CSP customers, including some of the world’s largest such as Comcast, Charter, Liberty Global, and J:COM. 

Using OpenSync, the most widely supported open-source, silicon-to-cloud framework for smart spaces, Plume’s software-defined network allows CSPs to decouple their service offerings from hardware and rapidly curate and deliver new services over a multi-vendor, open-platform architecture.  

Backed by investors such as Insight Partners and SoftBank Vision Fund 2, Plume is now valued at $2.6B, having added over $500M in funding in 2021 alone.

Plume is an equal opportunity workplace that maintains a continuing policy of nondiscrimination in all employment practices and decisions, ensuring equal employment opportunities for all qualified individuals without regard to race, color, creed, religion, sex, national origin, age, physical or mental disability, sexual orientation, gender identity, marital status, pregnancy, childbirth or related individual conditions, medical conditions (as defined by state law), military or veteran status, or any other characteristic protected by federal, state or local law.

Top Skills

Go
Java
Kubernetes
Perl
PHP
Python
Terraform
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Palo Alto, CA
611 Employees
On-site Workplace
Year Founded: 2015

What We Do

The world’s first SaaS experience platform for Communications Service Providers.

At Plume, we believe that technology isn't about moving faster. It's about making moments better. Which is why we've brought relentless focus to understanding the digital lifestyles people want to live, the spaces where they play out, and innovating ways to make digital experiences blossom. As the only open and hardware-independent solution, Plume enables the rapid delivery of new services for connected homes, small business, and beyond at massive scale. For residential subscribers, Plume delivers self-optimizing WiFi, cyber-security, access controls, and more. Our purpose-built suite of smart services turns small business networks into fully connected, business intelligence platforms. And Service Providers get robust back-end applications for unprecedented visibility and support.

Our goal is simple: to keep you in the flow with any experiences you turn on. And to help you fill the spaces that matter to you with all kinds of wonderful.

With a bias for action and love for breaking molds, the team at Plume embodies a combination of relentless curiosity and imaginative innovation. We constantly challenge ourselves to think in ways that other companies don't, and work to do what should be done (rather than what can). It’s how we've assembled a team of world-class engineers, thinkers, and doers. And it’s how we’re reinventing what’s possible every day.

We believe that the core competitive advantage in the over-the-top era resides in the ability to create new services at high cadence, deploy them at massive scale, and to orchestrate through a common data set. Our flexible, cloud-controlled platform paired with an open-source software stack decouples service creation and delivery from dependence on proprietary hardware.

Similar Jobs

Boulder, CO, USA
14894 Employees
131K-233K Annually
Hybrid
Colorado Springs, CO, USA
450 Employees

Pax8 Logo Pax8

Sr. Site Reliability Engineer I

Cloud • Information Technology • Productivity • Security • Sharing Economy • Software • Infrastructure as a Service (IaaS)
Greenwood Village, CO, USA
1600 Employees

Comcast Logo Comcast

Senior Site Reliability Engineer, Cloud Infrastructure (Eng4)

Digital Media • Gaming • Internet of Things • News + Entertainment • Retail • Business Intelligence • Cybersecurity
6 Locations
68848 Employees
133K-208K Annually

Similar Companies Hiring

HERE Technologies Thumbnail
Software • Logistics • Internet of Things • Information Technology • Computer Vision • Automotive • Artificial Intelligence
Amsterdam, NL
6000 Employees
True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees
Caliola Engineering Thumbnail
Software • Machine Learning • Hardware • Defense • Data Privacy • App development • Aerospace
Colorado Springs, CO
53 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account