Lead Site Reliability Engineer
Company: BJ's Wholesale Club
Location: Marlborough
Posted on: April 18, 2025
Job Description:
Join our team of more than 34,000 team members, supporting our
members and communities in our Club Support Center, 235+ clubs and
eight distribution centers. BJ's Wholesale Club offers a
collaborative and inclusive environment where all team members can
learn, grow and be their authentic selves. Together, we're
committed to providing outstanding service and convenience to our
members, helping them save on the products and services they need
for their families and homes. The Benefits of working at BJ's ---
BJ's pays weekly --- Eligible for free BJ's Inner Circle and
Supplemental membership(s) --- Generous time off programs to
support busy lifestyles o Vacation, Personal, Holiday, Sick,
Bereavement Leave, Jury Duty --- Benefit plans for your changing
needs o Three medical plans , Health Savings Account (HSA), two
dental plans, vision plan, flexible spending --- 401(k) plan with
company match (must be at least 18 years old) eligibility
requirements vary by position medical plans vary by location As a
Lead Site Reliability Engineer (SRE) for BJ's Wholesale Club's
digital team, you will play a critical role in ensuring the
reliability, scalability, and performance of our digital platforms,
including bjs. com, mobile apps, and in-club digital capabilities.
Supporting the full lifecycle of digital experiences - from order
placement to fulfillment - you will lead efforts to enhance system
reliability, implement SRE best practices, and foster collaboration
across teams. Key Responsibilities:
- Ensure Reliability and Scalability: Lead the design,
deployment, and management of highly reliable and scalable systems
for BJ's digital properties, including e-commerce platforms and
mobile applications.
- Full-Stack Expertise: Responsible for analyzing and fixing
issues for Java-based microservices, React applications, and
respective backend services
- Proactive Monitoring and Incident Response: Build and maintain
monitoring systems using tools like New Relic, dataset, or similar
to identify and resolve issues before they impact members.
- Collaboration and Leadership: Drive cross-functional
collaboration with development and operations teams to align on
goals for system performance and reliability.
- SRE Methodologies: Implement SRE principles to enhance service
level objectives (SLOs), service level indicators (SLIs), and
ensure a seamless customer experience.
- Automation and Optimization: Automate repetitive tasks and
optimize workflows to improve operational efficiency and reduce
toil.
- Incident Management: Lead root cause analyses for critical
production incidents, generating detailed RCA reports and driving
actions to prevent recurrence.
- Change Management: Ensure reliable deployments by following
structured change management processes and leveraging version
control systems.
- Security and Compliance: Collaborate with security teams to
ensure systems and applications meet stringent security standards.
Qualifications:
- Educational Background: Bachelor's degree in Computer Science,
Engineering, or a related field (or equivalent experience).
- Experience: At least 6+ years in SRE or related roles, with
experience in large-scale e-commerce or digital platforms.
- Technical Skills:
- Strong hands-on knowledge of Java-based microservices and
APIs.
- Proficiency in observability tools (e. g., New Relic,
Splunk).
- Hands-on experience with automation scripts (Bash,
Python).
- Experience with change management tools like terraform etc
- Knowledge in CI/CD, containerization (e. g., Docker,
Kubernetes), and cloud technologies.
- Soft Skills:
- Exceptional communication skills to work effectively with
cross-functional teams and leadership.
- Strong analytical and problem-solving abilities.
- Demonstrated ability to learn and adapt to new tools and
technologies quickly. Job Conditions:
- Collaborate within a dynamic and innovative team
environment.
- Participate and initiate cross-training and knowledge-sharing
initiatives.
- Be part of an escalation on-call rotation to provide 24/7
support for critical systems and lead team during critical issues.
This is a hybrid role. Tuesday through Thursday are in-office days
at BJ's Club Support Center in Marlborough, MA and Monday and
Friday are remote days. In accordance with the Pay Transparency
requirements, the following represents a good faith estimate of the
compensation range for this position. At BJ's Wholesale Club, we
carefully consider a wide range of non-discriminatory factors when
determining salary. Actual salaries will vary depending on factors
including but not limited to location, education, experience, and
qualifications. The pay range for this position is starting from
$109,000.00.
Keywords: BJ's Wholesale Club, Cranston , Lead Site Reliability Engineer, Professions , Marlborough, Rhode Island
Didn't find what you're looking for? Search again!
Loading more jobs...