Lead Site Reliability Engineer

Posted 5 Days Ago
Be an Early Applicant
Marlborough Center, Town of Marlborough, CT
109K Annually
Senior level
Retail
The Role
As a Lead Site Reliability Engineer, you will design and manage infrastructure for the ecommerce platform, utilizing observability tools and scripting to ensure reliability and performance. You'll address potential issues proactively, improve systems, conduct root-cause analyses, and foster a culture of reliability within the organization. You are expected to automate tasks and manage high-availability designs while adhering to SRE principles.
Summary Generated by Built In

Join our team of more than 34,000 team members, supporting our members and communities in our Club Support Center, 235+ clubs and eight distribution centers. BJ’s Wholesale Club offers a collaborative and inclusive environment where all team members can learn, grow and be their authentic selves. Together, we’re committed to providing outstanding service and convenience to our members, helping them save on the products and services they need for their families and homes.

The Benefits of working at BJ’s

•        BJ’s pays weekly

        Eligible for free BJ's Inner Circle and Supplemental membership(s)*

•        Generous time off programs to support busy lifestyles* 

                      o Vacation, Personal, Holiday, Sick, Bereavement Leave, Jury Duty

•        Benefit plans for your changing needs*

                      o Three medical plans**, Health Savings  Account (HSA), two dental plans, vision plan, flexible spending

•        401(k) plan with company match (must be at least 18 years old)

*eligibility requirements vary by position

**medical plans vary by location

As a Lead Site Reliability Engineer (SRE) for BJ’s Wholesale Club’s digital team, you will play a critical role in ensuring the reliability, scalability, and performance of our digital platforms, including bjs.com, mobile apps, and in-club digital capabilities. Supporting the full lifecycle of digital experiences—from order placement to fulfillment—you will lead efforts to enhance system reliability, implement SRE best practices, and foster collaboration across teams.

Key Responsibilities:

  • Ensure Reliability and Scalability: Lead the design, deployment, and management of highly reliable and scalable systems for BJ's digital properties, including e-commerce platforms and mobile applications.
  • Full-Stack Expertise: Responsible for analyzing and fixing issues for Java-based microservices, React applications, and respective backend services
  • Proactive Monitoring and Incident Response: Build and maintain monitoring systems using tools like New Relic, dataset, or similar to identify and resolve issues before they impact members.
  • Collaboration and Leadership: Drive cross-functional collaboration with development and operations teams to align on goals for system performance and reliability.
  • SRE Methodologies: Implement SRE principles to enhance service level objectives (SLOs), service level indicators (SLIs), and ensure a seamless customer experience.
  • Automation and Optimization: Automate repetitive tasks and optimize workflows to improve operational efficiency and reduce toil.
  • Incident Management: Lead root cause analyses for critical production incidents, generating detailed RCA reports and driving actions to prevent recurrence.
  • Change Management: Ensure reliable deployments by following structured change management processes and leveraging version control systems.
  • Security and Compliance: Collaborate with security teams to ensure systems and applications meet stringent security standards.

Qualifications:

  • Educational Background: Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience).
  • Experience: At least 6+ years in SRE or related roles, with experience in large-scale e-commerce or digital platforms.
  • Technical Skills:
    • Strong hands-on knowledge of Java-based microservices and APIs.
    • Proficiency in observability tools (e.g., New Relic, Splunk).
    • Hands-on experience with automation scripts (Bash, Python).
    • Experience with change management tools like terraform etc
    • Knowledge in CI/CD, containerization (e.g., Docker, Kubernetes), and cloud technologies.
  • Soft Skills:
    • Exceptional communication skills to work effectively with cross-functional teams and leadership.
    • Strong analytical and problem-solving abilities.
    • Demonstrated ability to learn and adapt to new tools and technologies quickly.

Job Conditions:

  • Collaborate within a dynamic and innovative team environment.
  • Participate and initiate cross-training and knowledge-sharing initiatives.
  • Be part of an escalation on-call rotation to provide 24/7 support for critical systems and lead team during critical issues.

In accordance with the Pay Transparency requirements, the following represents a good faith estimate of the compensation range for this position. At BJ’s Wholesale Club, we carefully consider a wide range of non-discriminatory factors when determining salary. Actual salaries will vary depending on factors including but not limited to location, education, experience, and qualifications. The pay range for this position is starting from $109,000.00.

Top Skills

Java
Python
The Company
HQ: Westborough, MA
10,308 Employees
On-site Workplace

What We Do

Headquartered in Westborough, Massachusetts, BJ's Wholesale Club is a leading operator of membership warehouse clubs in the Eastern United States. The company currently operates over 215 clubs and more than 145 BJ's Gas® locations in 17 states.

Explore career opportunities at BJ's and join our team today: www.bjs.com/careers

Similar Jobs

Marlborough Center, Town of Marlborough, CT, USA
10308 Employees
109K Annually

Pitney Bowes Inc. Logo Pitney Bowes Inc.

Sr Site Reliability Engineer

Information Technology • Logistics • Financial Services
Shelton, CT, USA
12066 Employees
143K-178K Annually
Hartford, CT, USA
20002 Employees
114K-170K Annually

Pitney Bowes Inc. Logo Pitney Bowes Inc.

Sr Site Reliability Engineer

Information Technology • Logistics • Financial Services
Shelton, CT, USA
12066 Employees
143K-178K Annually

Similar Companies Hiring

McCain Foods Thumbnail
Retail • Manufacturing • Food • Agriculture
Florenceville-Bristol, NB
20000 Employees
Optimum Thumbnail
Software • Retail • Mobile • Marketing Tech • Internet of Things • Digital Media • AdTech
Long Island City, NY
9000 Employees
Grocery TV Thumbnail
Software • Retail • Marketing Tech • Hardware • Digital Media • AdTech
Austin, TX
45 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account