Principal Engineer, SRE

Posted 3 Days Ago
Mountain View, CA
Expert/Leader
eCommerce
Building the future of eCommerce. We push the boundaries of what’s possible to solve problems!
The Role
In this role, you will ensure the reliability and performance of Coupang's e-commerce systems. You will lead technical initiatives across teams, mentor junior engineers, and work closely with product development to enhance system reliability. Key responsibilities include architecture reviews, problem-solving for complex technical challenges, and championing SRE principles. A strong background in distributed systems and cloud infrastructure is required.
Summary Generated by Built In

Site Reliability Engineering (SRE) at Coupang is a mission-critical role that combines software and systems engineering to build, run, and scale our complex, large-scale e-commerce systems. As part of the Site Reliability Engineering team, you will be responsible for ensuring all our customer-facing services are healthy, monitored, automated, and designed to scale. As an SRE organization, we take pride in treating operations as an engineering problem, with an automation-first approach. You will use your background to build best-in-class infrastructure automation for areas such as Observability, Incident Management, Disaster Recovery, Load Testing, Capacity Engineering, and many more. In this role, you will work very closely with our product development teams, from the early stages of design all the way through to resolving production incidents, maintaining the SLI/SLA bar for production services, and influencing those teams with SRE principles and best practices. If you take pride in complete ownership, have a passion for solving complex technical challenges in large-scale distributed systems, and have the demeanor to work and communicate effectively across team boundaries, this is the role for you!

 

Key Responsibilities: 

  • Serve as a primary point of responsibility for the reliability, health, and performance of all Coupang customer-facing services.

  • Gain deep knowledge of Coupang's application workflows and dependencies.

  • Spearhead and conceptualize revolutionary designs in critical service architecture.

  • Conduct comprehensive architecture reviews and lead re-architecting initiatives to set industry-leading benchmarks in performance, reliability, and availability.

  • Lead and drive large-scale technical initiatives across multiple engineering teams.  

  • Drive collaboration effectively across organizational boundaries and build strong stakeholder relationships to achieve broad organizational objectives.

  • Identify and implement scalable solutions for complex technical problems. Be the change driver.  

  • Be self-motivated: navigate the ambiguity of large initiatives and find solutions to accomplish the goal.

  • Be the SRE champion/lead working with the rest of the technical leaders across Coupang to define and drive the engineering roadmap.  

  • Contribute towards hiring and building a world-class team. Mentor and coach junior engineers on the team.  

  • Communicate effectively with people at all levels of the organization. 

Essential Qualifications: 

  • 10+ years of industry experience building and operating large-scale distributed systems.  

  • Deep UNIX/Linux systems knowledge and administration background. 

  • Strong programming skills in one or more of Python, Java, Golang, or C++.  

  • Strong problem-solving and analytical skills spanning systems, networks (TCP/IP), and code, with a focus on data-driven decision-making. 

  • Proficient with cloud-based infrastructure, including AWS, Azure, or Google Cloud Platform. 

  • Strong understanding of DevOps and SRE practices, including continuous integration, continuous delivery, and infrastructure as code (IaC). 

  • Proficient with containerization and orchestration technologies, such as Docker and Kubernetes. 

  • Knowledge of the observability ecosystem, including metrics, logging, and tracing, and of tools such as Prometheus, Grafana, Elastic Stack, Datadog, or New Relic.

  • Excellent communication and collaboration skills, with the ability to work with teams across distinct functions and technical domains. 

 

Preferred Qualifications: 

  • Master’s degree in Computer Science, Engineering, or a related technical field.

  • Prior experience working with large-scale web-based Java architectures and JVM configuration. 

  • Professional certifications in cloud platforms, monitoring tools, or related technologies. 

  • Previous experience working on a large-scale e-commerce platform.  

 

Top Skills

C++
Go
Java
Python
The Company
Mountain View, CA
70,000 Employees
Hybrid Workplace
Year Founded: 2010

What We Do

We exist to wow our customers. We know we’re doing the right thing when we hear our customers say, “How did we ever live without Coupang?” Born out of an obsession to make shopping, eating, and living easier than ever, we’re collectively disrupting the multi-billion-dollar e-commerce industry from the ground up. We are one of the fastest-growing e-commerce companies and have established an unparalleled reputation as a dominant and reliable force in South Korean commerce.

We are proud to have the best of both worlds: a startup culture with the resources of a large global public company. This fuels us to continue our growth and launch new services at the speed we have maintained since our inception. We are all entrepreneurial, surrounded by opportunities to drive new initiatives and innovations. At our core, we are bold and ambitious people who like to get our hands dirty and make a hands-on impact. At Coupang, you will see yourself, your colleagues, your team, and the company grow every day.

Our mission to build the future of commerce is real. We push the boundaries of what’s possible to solve problems and break traditional tradeoffs. Join Coupang now to create an epic experience in this always-on, high-tech, and hyper-connected world.
