Sr./Lead Site Reliability Engineer

Sorry, this job was removed at 01:14 p.m. (CST) on Tuesday, Dec 03, 2024
San Francisco, CA
114K-222K Annually
Internship
Fintech • Payments • Financial Services
The Role

Company: Federal Reserve Bank of San Francisco

We are the Federal Reserve Bank of San Francisco—public servants with a mission to advance the nation’s monetary, financial, and payment systems to build a stronger economy for all Americans. We are a community-engaged bank, and are committed to understanding and serving the vibrant, expansive communities of the Twelfth District. That means we seek and appreciate new perspectives. We respect people for what they do and for who they are. We build opportunities to learn and grow. When you join the SF Fed, you become part of a diverse team united in its purpose to promote an economy that works for everyone.
As a Sr./Lead Site Reliability Engineer, you will work with the Cash Application Delivery Services (ADS) development, QA, DevOps, and National IT teams to manage the systems that support the Cash ADS application suite, both on-prem and in the cloud. Your main focus will be to ensure that all of our applications operate optimally and that every aspect of each application is monitored, so that issues can be troubleshot and resolved quickly as they arise.
We empower our people to balance their life and work responsibilities. That’s why we offer a flexible hybrid work model that allows you to collaborate with office colleagues on some days, and work from home on others.

Responsibilities:

  • Establish and run playbooks to support the resolution of incidents that occur in production environments.
  • Help design Dashboards for effective monitoring of infrastructure resources in the cloud environments
  • Work with development teams to establish Service-Level Objectives and key Service-Level Indicators
  • Conduct Production Readiness Reviews to ensure services meet accepted standards of operational readiness before going live
  • Ensure infrastructure aligns with security standards, assist in audits, and implement recommended practices to protect data and systems.
  • Facilitate the design and implementation of Disaster Recovery plans, including backups, failover, and recovery mechanisms, with the development and DBA subject matter experts
  • As one of the SREs, drive improvement opportunities in infrastructure, tooling, and workflows using a continuous feedback loop between development and CloudOps
  • Ensure uptime and reliability of cloud-based infrastructure and systems by monitoring system performance and maintaining high availability of cloud-based assets.
  • Participate in incident response and troubleshooting by conducting root cause analysis and implementing solutions to prevent recurrence.
  • Establish thresholds for cloud-based services and capabilities, and set up and maintain monitoring systems to detect issues before they impact users.
  • Configure alerts for system anomalies, develop monitoring dashboards, and monitor resource usage, latency, and error rates
  • Analyze system performance, establish metrics and thresholds, optimize service uptime, reduce latency, and improve customer experience through infrastructure modifications and configuration tuning.
  • Knowledge of technical troubleshooting approaches, tools and techniques, and the ability to anticipate, recognize, and resolve technical (hardware, software, application or operational) problems
  • Working experience in programming and scripting languages
  • Working tooling experience in Ansible, GitLab, Terraform, CloudWatch, Dynatrace, Grafana or equivalent is a must

Qualifications:

  • Bachelor’s degree in Computer Science, Information Systems, Computer Engineering, Systems Analysis or a related field or equivalent work experience
  • The Lead Site Reliability Engineer role typically requires 7+ years of industry experience in building and supporting enterprise-level systems as a platform engineer or equivalent in a production environment. The Senior Site Reliability Engineer role typically requires 5+ years of hands-on experience implementing, supporting, and using the tools and services required for software orchestration and environment monitoring and management (DevOps) best practices. Experience in Ansible, GitLab, Terraform, CloudWatch, Dynatrace, and Grafana is required
  • 2+ years hands-on experience with AWS services - building, deploying, and monitoring using AWS tools and services such as AWS Lambda, AWS CloudWatch, and AWS X-Ray
  • Must be a U.S. Citizen or a Green Card holder with the intent to become a U.S. Citizen

Base Salary Range: Sr. Site Reliability Engineer: Min: $113600 - Mid: $147600 - Max: $181600 (Location: San Francisco)

Base Salary Range: Lead Site Reliability Engineer: Min: $138900 - Mid: $180400 - Max: $221900 (Location: San Francisco)

Final salary and offer will be determined by the applicant’s background, experience, skills, internal equity, and alignment with market data.

We offer a wonderful benefits package including: Medical, Dental, Vision, Pre-tax Flexible Spending Account, Backup Child Care Program, Pre-Tax Day Care Flexible Spending Account, Paid Family Care Leave, Vacation Days, Sick Days, Paid Holidays, Pet Insurance, Matching 401(k), and Retirement/Pension.

We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, perform essential job functions, and receive other benefits and privileges of employment. The SF Fed is an Equal Opportunity Employer.
 


Full Time / Part Time: Full time

Regular / Temporary: Regular

Job Exempt (Yes / No): Yes

Job Category: Information Technology

Work Shift: First (United States of America)

The Federal Reserve Banks believe that diversity and inclusion among our employees is critical to our success as an organization, and we seek to recruit, develop and retain the most talented people from a diverse candidate pool. The Federal Reserve Banks are committed to equal employment opportunity for employees and job applicants in compliance with applicable law and to an environment where employees are valued for their differences.

Always verify and apply to jobs on Federal Reserve System Careers (https://rb.wd5.myworkdayjobs.com/FRS) or through verified Federal Reserve Bank social media channels.


The Company
Kansas City, MO
2,289 Employees
On-site Workplace

What We Do

This page is dedicated to Federal Reserve System career and employment related information only. Comments not pertaining to Fed recruiting will be removed.

The Fed - Make a world of difference in the global economy

OUR BANK has one of the most recognizable brands around the world. The Federal Reserve is the central bank of the United States—one of the world's most influential, trusted and prestigious financial organizations. The Federal Reserve is charged with the important mission of promoting a strong economy and a stable financial system and fulfills this responsibility by formulating national monetary policy, supervising and regulating banks and bank holding companies, and providing financial services for banks and the U.S. government.

OUR PEOPLE are diverse in background and ideas, which allows for ongoing creativity and innovation. Ultimately, they are the ones who push our high-performance, exchange-driven culture forward.

Why Our People Choose Us:

Our reputation precedes us
There will always be room for personal growth
Our people are first
You’ll find the right balance
Your responsibilities will be meaningful

We hope that you will be our future colleague.

Find your preferred locations around the United States and explore the breadth of opportunity available at the Federal Reserve.

Atlanta https://www.frbatlanta.org/
Boston http://www.bostonfed.org/
Chicago https://www.chicagofed.org/
Cleveland https://www.clevelandfed.org/
Dallas http://dallasfed.org/
Kansas City https://www.kansascityfed.org/
Minneapolis https://www.minneapolisfed.org/
New York http://www.newyorkfed.org/
Philadelphia https://www.philadelphiafed.org/
Richmond https://www.richmondfed.org/
San Francisco http://www.frbsf.org/
St. Louis https://www.stlouisfed.org/
Board http://www.federalreserve.gov/

