Staff Software Engineer, Reliability Engineering

Posted 2 Days Ago
Be an Early Applicant
San Francisco, CA
204K-259K Annually
Expert/Leader
Real Estate • Travel • PropTech
The Role
As a Staff Software Engineer in Reliability Engineering at Airbnb, you will develop and maintain tools for service reliability while collaborating with SRE teams. Responsibilities include incident management, system improvements, and mentoring junior staff. Your focus will be on scalable solutions and effective incident response, ensuring the reliability of Airbnb’s services during high-severity incidents.
Summary Generated by Built In

Airbnb was born in 2007 when two Hosts welcomed three guests to their San Francisco home, and has since grown to over 4 million Hosts who have welcomed more than 1 billion guest arrivals in almost every country across the globe. Every day, Hosts offer unique stays and experiences that make it possible for guests to connect with communities in a more authentic way.

The Community You Will Join:

We are looking for a Staff Software Engineer to join our Site Reliability Engineering team. As a Staff Software Engineer in SRE, you will be responsible for developing and maintaining the tools and systems that enable our engineering teams to operate our services reliably and at scale. You will work closely with our SREs and other engineering teams to ensure our services are properly instrumented and able to scale with our growing business.

The Difference You Will Make:

In this role, your expertise in developing and maintaining tools and systems will be instrumental in bolstering our services' reliability and improving how the company manages incidents broadly. By collaborating closely with other engineering teams you will help establish a culture of reliability throughout the organization by providing a comprehensive incident management platform that is being used for instrumentation, operability, and around incidents. Your ability to identify opportunities for improvement and drive their implementation will contribute significantly to our overall operational efficiency and growth, ensuring that our services remain resilient as our business continues to expand.

Additionally, as an essential part of this role, you will serve as an active member of our first responder SRE team, responding to and managing high severity incidents. Your vast technical experience and leadership skills will be invaluable as you step into the role of Incident Commander during these critical events. You will guide cross-functional teams during crisis situations and ensure timely resolution, minimizing the impact on our customers and business. This aspect of your work will require not just strong technical acumen, but also excellent communication and coordination skills, resilience under pressure, and a firm commitment to our culture of blamelessness and continuous learning.

A Typical Day: 

  • Design, implement and maintain the tools and systems that support service reliability, monitoring, and alerting.
  • Collaborate with other engineering teams to ensure services are designed with reliability in mind, and provide guidance on the appropriate use of tooling and automation.
  • Identify opportunities to improve the reliability, scalability, and efficiency of our services and drive their implementation.
  • Work with SREs to understand the challenges they face in operating our services and develop tools and systems to help them manage these challenges.
  • Participate in incident response and post-mortems to identify and address systemic issues.
  • Continuously evaluate new technologies and industry best practices to improve our SRE tooling and incident response procedures.
  • Gain and maintain an intimate understanding of how the critical parts of the site work (services, infrastructure, tooling, and processes)
  • Lead high-urgency incident management and mentor less-experienced team members in effectively handling incidents.
  • Contribute to better incident retrospectives, driving improvements in our overall reliability and incident response time.

Your Expertise:

  • Bachelor's degree in Computer Science or related field.
  • 9+ years of experience in software engineering or SRE roles, with a focus on large scale distributed systems.
  • Strong coding skills in at least one programming language, such as Java, Python, or Go.
  • Experience with distributed systems and service-oriented architectures.
  • Experience with cloud computing platforms such as AWS or Google Cloud Platform.
  • Strong conviction in software development best practices, including version control, automated testing, and continuous integration and delivery.
  • Experience with containerization technologies such as Docker and Kubernetes.
  • Excellent problem-solving and analytical skills, with a strong attention to detail.
  • Ability to work effectively in a fast-paced and dynamic environment.
  • Strong communication and interpersonal skills.

Your Location:

This position is US - Remote Eligible. The role may include occasional work at an Airbnb office or attendance at offsites, as agreed to with your manager. While the position is Remote Eligible, you must live in a state where Airbnb, Inc. has a registered entity. Click here for the up-to-date list of excluded states. This list is continuously evolving, so please check back with us if the state you live in is on the exclusion list. If your position is employed by another Airbnb entity, your recruiter will inform you what states you are eligible to work from.

Our Commitment To Inclusion & Belonging:

Airbnb is committed to working with the broadest talent pool possible. We believe diverse ideas foster innovation and engagement, and allow us to attract creatively-led people, and to develop the best products, services and solutions. All qualified individuals are encouraged to apply.

We strive to also provide a disability inclusive application and interview process. If you are a candidate with a disability and require reasonable accommodation in order to submit an application, please contact us at: [email protected]. Please include your full name, the role you’re applying for and the accommodation necessary to assist you with the recruiting process. 

We ask that you only reach out to us if you are a candidate whose disability prevents you from being able to complete our online application.

How We'll Take Care of You:

Our job titles may span more than one career level. The actual base pay is dependent upon many factors, such as: training, transferable skills, work experience, business needs and market demands. The base pay range is subject to change and may be modified in the future. This role may also be eligible for bonus, equity, benefits, and Employee Travel Credits.  

Pay Range

$204,000$259,000 USD

Top Skills

Go
Java
Python
The Company
HQ: Dublin
14,622 Employees
Hybrid Workplace
Year Founded: 2008

What We Do

Airbnb is a community based on connection and belonging—a community that was born in 2008 when two hosts welcomed three guests to their San Francisco home, and has since grown to 4 million hosts who have welcomed over 800 million guest arrivals to about 100,000 cities in almost every country and region across the globe. Hosts on Airbnb are everyday people who share their worlds to provide guests with the feeling of connection and being at home. At Airbnb, we believe that hosts, guests and the communities where we operate are all stakeholders we have a responsibility to serve, and that by serving them alongside our employees and investors, we will build an enduringly successful company.

Gallery

Gallery

Similar Jobs

Palo Alto, CA, USA
180K-250K Annually

Voltage Park Logo Voltage Park

Senior Full Stack Engineer

Artificial Intelligence • Cloud • Hardware • Machine Learning • Other • Software • Infrastructure as a Service (IaaS)
San Francisco, CA, USA
51 Employees
150K-200K Annually
Hybrid
San Francisco, CA, USA
289097 Employees
Hybrid
San Francisco, CA, USA
289097 Employees

Similar Companies Hiring

Findigs, Inc. Thumbnail
Software • Real Estate • PropTech • Fintech
New York, NY
56 Employees
Digible Thumbnail
Social Media • PropTech • Marketing Tech • Digital Media • Artificial Intelligence • Agency • AdTech
Englewood, CO
120 Employees
Fora Travel Thumbnail
Travel • Software • Sales • Professional Services • On-Demand • Hospitality • Agency
New York, NY
85 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account