Principal Site Reliability Engineer

Posted Yesterday
Be an Early Applicant
Austin, TX
Senior level
Information Technology
Here, entrepreneurs, self-starters, team players, and big thinkers unite behind a common cause.
The Role
As a Principal Site Reliability Engineer, you will be responsible for the reliability, scalability, and performance of systems, leading incident response, managing releases, and improving observability by collaborating with development and operations teams.
Summary Generated by Built In

About Care.com 

Care.com is a consumer tech company with heart. We’re on a mission to solve a human challenge we all face: finding great care for the ones we love. We’re moms and dads and pet parents. We have parents and grandparents, so we understand that everyone, at some point in their lives, could use a helping hand. Our culture and our products reflect that. 

Here, entrepreneurs, self-starters, team players, and big thinkers unite behind a common cause. Here, we’re applying data analytics, AI, and the latest technologies to solve universal problems and connect people in new ways. If you like having autonomy, if you thrive on collaboration and building new things, and if you’re all about using your talent for good, Care.com is the place for you. 

Work Environment: Hybrid (In Office - Monday, Wednesday & Friday) 
Locations: Salt Lake City | Austin | Dallas

What You’ll Be Working On
As a Principal Site Reliability Engineer (SRE), you will be responsible for ensuring the reliability, scalability, and performance of our critical systems. You’ll lead incident response, manage releases, improve observability, and collaborate across development and operations teams to drive continuous improvements.

Key Responsibilities

  • Release Management: Coordinate releases for applications, ensuring efficient deployment and smooth rollbacks.
  • Incident Response: Lead incident management, facilitate root cause analysis, and continuously update response processes.
  • Monitoring & Alerting: Implement proactive monitoring, create dashboards, and set up real-time alerts for critical services.
  • Hypercare: Ensure system stability during critical post-release periods, monitoring performance and preventing incidents.
  • Collaboration with Dev & QA: Work closely with developers and QA teams to ensure performance benchmarks and observability goals are met.
  • SLI/SLA/SLO Management: Define and measure service levels for key workflows and APIs, ensuring alignment with business expectations.
  • Observability Maturity: Continuously assess and improve observability practices across teams, driving data-driven insights.

What You’ll Need to Succeed

  • 6+ years of experience in SRE or DevOps roles with a focus on monoliths and distributed microservices in cloud environments (AWS, GCP).
  • Proficiency in CI/CD tools (Jenkins, Terraform, Ansible).
  • Strong experience with Kubernetes, Docker, and JVM-based monoliths.
  • Expertise in monitoring tools (SignalFX, Splunk, Amplitude) and production incident management.
  • Scripting skills (Python, Bash, or Groovy).
  • Strong understanding of cloud-based systems and containerization.
  • Excellent communication skills and a collaborative approach to working cross-functionally.
  • Experience optimizing large-scale, customer-facing platforms in fast-paced environments.

For a list of our Perks + Benefits, click here!

Care.com supports diverse families and communities and seeks employees who are just as diverse. As an equal opportunity employer, Care.com recognizes the power of a diverse and inclusive workforce and encourages applications from individuals with varied experiences, perspectives, and backgrounds. Care.com is committed to providing reasonable accommodations for qualified individuals with disabilities. If you need assistance or accommodation, please reach out to [email protected].

Company Overview:

Available in 21 countries, Care.com is one of the largest providers of online services for finding family care and care jobs, spanning in-home and in-center care solutions. Since 2007, families have relied on Care.com for an array of care for children, seniors, pets, and the home.  Designed to meet the evolving needs of today’s families and caregivers, the Company also offers customized corporate benefits packages to support working families, household tax and payroll services, and innovations for caregivers to find and book jobs. Care.com is an IAC company (NASDAQ: IAC).

Salary Range: $180,000 to $200,000. 

The base salary range above represents the anticipated low and high end of the national salary range for this position. Actual salaries may vary and may be above or below the range based on various factors including but not limited to work location, experience, and performance. The range listed is just one component of Care.com’s total compensation package for employees. Other rewards may include annual bonuses and short- and long-term incentives. In addition, Care.com provides a variety of benefits to employees, including health insurance coverage, life, and disability insurance, a generous 401K employer matching program, paid holidays, and paid time off (PTO).

Top Skills

AWS
Bash
GCP
Groovy
Python
The Company
Austin, TX
500 Employees
Hybrid Workplace
Year Founded: 2007

What We Do

Care.com is a consumer tech company with heart. We’re on a mission to solve a human challenge we all face: finding great care for the ones we love. We’re moms and dads and pet parents. We have parents and grandparents so we understand that everyone, at some point in their lives, could use a helping hand. Our culture and our products reflect that.

Why Work With Us

Here, we’re applying data analytics, AI, and the latest technologies to solve universal problems and connect people in new ways. If you like having autonomy, if you thrive on collaboration and building new things, and if you’re all about using your talent for good, Care.com is the place for you.

Gallery

Gallery

Similar Jobs

Hybrid
Fort Worth, TX, USA
289097 Employees
Easy Apply
3 Locations
1100 Employees

Capital One Logo Capital One

Senior Software Engineer, DevOps (Cloud Operations Resilience Engineering)

Fintech • Machine Learning • Payments • Software • Financial Services
Plano, TX, USA
55000 Employees

Capital One Logo Capital One

Senior Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering)

Fintech • Machine Learning • Payments • Software • Financial Services
Plano, TX, USA
55000 Employees

Similar Companies Hiring

Silverfort Thumbnail
Security • Sales • Information Technology • Cybersecurity • Automation
GB
357 Employees
Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees
InCommodities Thumbnail
Renewable Energy • Machine Learning • Information Technology • Energy • Automation • Analytics
Austin, TX
234 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account