Site Reliability Engineering Specialist

Posted 2 Days Ago
Atlanta, GA
131K-221K Annually
Mid level
eCommerce • Fintech • Information Technology • Payments • Software
We are the leader in financial technology and services for financial institutions and businesses of all sizes, globally.
The Role
The Site Reliability Engineer will design and maintain monitoring solutions, automate tasks, ensure application reliability and performance, lead incident response, collaborate with security teams, manage deployment processes, and develop disaster recovery plans. The role involves proactive issue detection, resource optimization, and knowledge sharing within the team, while also participating in on-call rotations for critical incidents.
Summary Generated by Built In

Job Description

We are FIS. Our technology powers the world’s economy and our teams bring innovation to life. We champion diversity to deliver the best products and solutions for our colleagues, clients and communities. If you’re ready to start learning, growing and making an impact with a career in fintech, we’d like to know: Are you FIS?
 

Role and Responsibilities 

Reporting to the Head of Cloud Enablement Engineering, the Site Reliability Engineer will play a critical role in driving innovation and growth for the Banking Solutions business.  In this role, the candidate will have the opportunity to make a lasting impact on the company's digital transformation journey, drive customer-centric innovation and automation, and position the organization as a leader in the competitive digital banking landscape. Specifically, the Site Reliability Engineer will be responsible for the following: 

 

  • Design and maintain monitoring solutions and alerting mechanisms for infrastructure, application performance, and user experience metrics, enabling proactive issue detection and mitigation. 

  • Implement automation tools and processes to automate routine tasks, scale infrastructure, and ensure seamless deployments, updates, and rollbacks with minimal user impact. 

  • Ensure the reliability, availability, and performance of applications and services, focusing on minimizing downtime, optimizing response times, and maintaining high availability for users. 

  • Lead incident response efforts, including identification, triage, resolution, and post-incident analysis to prevent recurrence and improve system resilience. 

  • Conduct capacity planning, performance tuning, and resource optimization for environments, collaborating with development and operations teams to meet scalability and performance goals. 

  • Collaborate with security teams to implement security best practices, perform vulnerability assessments, and ensure compliance with security standards and regulatory requirements for applications. 

  • Manage deployment pipelines, release processes, and configuration management for app deployments, ensuring consistency, reliability, and version control across environments. 

  • Identify areas for improvement in reliability, performance, and efficiency through data analysis, root cause analysis, and trend analysis, and drive initiatives to enhance system reliability and operational efficiency. 

  • Create and maintain documentation, runbooks, and knowledge base articles for operational procedures, troubleshooting guides, and best practices, and promote knowledge sharing within the team. 

  • Develop and test disaster recovery plans, backup strategies, and failover mechanisms for app services, ensuring business continuity and data integrity in case of failures or disasters. 

  • Collaborate with development, QA, DevOps, and product teams to ensure alignment on reliability goals, performance metrics, release schedules, and incident response processes. 

  • Participate in on-call rotations and provide 24/7 support for critical incidents, troubleshoot issues, and coordinate with teams for resolution, escalation, and follow-up actions as per defined SLAs. 

 

Professional Qualifications 

  • Proficiency in development technologies, architectures, and platforms (web, API) to understand system complexities and performance considerations. 

  • Experience in cloud platforms (e.g., AWS, Azure, Google Cloud) and infrastructure as code (IaC) tools for managing app infrastructure and deployments. 

  • Knowledge of monitoring tools (e.g., Prometheus, Grafana, DataDog, New Relic) and logging frameworks (e.g., Splunk, SumoLogic, ELK Stack) for real-time visibility into system health, performance metrics, and user experience. 

  • Experience in incident management, including incident response, triage, root cause analysis (RCA), and post-mortem reviews to prevent recurring issues. 

  • Strong troubleshooting skills to diagnose complex technical issues in app environments, infrastructure, networking, and performance bottlenecks. 

  • Proficiency in scripting languages (e.g., Python, Bash) and automation tools (e.g., Terraform, Ansible) for automating routine tasks, deployments, and infrastructure management. 

  • Experience in implementing continuous integration/continuous deployment (CI/CD) pipelines for apps using tools like Jenkins, GitLab CI/CD, or Azure DevOps. 

  • Expertise in setting up monitoring solutions, configuring alerts, and creating dashboards to monitor system performance, application metrics, and user experience. 

  • Familiarity with APM (Application Performance Monitoring) tools to analyze app performance, identify bottlenecks, and optimize resource utilization. 

  • Familiarity with RUM (Real User Monitoring) for tracking and analyzing user interaction and system performance. 

  • Commitment to continuous learning, staying updated with industry trends, new technologies, and best practices in app reliability, performance, and operations. 

  • Adaptability to evolving requirements, technologies, and business needs, with a focus on driving continuous improvement and operational excellence. 

 

Personal Characteristics 

  • Demonstrates judgment and flexibility; thinks about issues and develops solutions that thoughtfully take the broader context into account; deals positively with shifting demands on time and priorities and with rapidly changing environments. 

  • Takes an ownership approach to engineering and product outcomes. 

  • Action-oriented self-starter who can set strategy and drive execution with a “roll up the sleeves” approach. 

  • Excellent interpersonal communication, negotiation and influencing skills to work effectively with all stakeholders (internal & external), making information-based decisions. 

  • Penchant for excellence, both personally and professionally, demonstrated by intellectual curiosity, record of accomplishment, and reputation; shows strong attention to detail and implementation of best practices with an inclination for continuous improvement. 

  • Ability to quickly establish strong credibility with employees, business partners and external resources. 

  • Embodies and delivers the firm's values and culture towards colleagues, clients, and communities: 

      • Win as one team 

      • Lead with integrity 

      • Be the change 

FIS is committed to providing its employees with an exciting career opportunity and competitive compensation. The pay range for this full-time position is $131,290.00 - $220,600.00 and reflects the minimum and maximum target for new hire salaries for this position based on the posted role, level, and location. Within the range, actual individual starting pay is determined by additional factors, including job-related skills, experience, and relevant education or training. Any changes in work location will also impact actual individual starting pay. Please consult with your recruiter about the specific salary range for your preferred location during the hiring process.

Privacy Statement

FIS is committed to protecting the privacy and security of all personal information that we process in order to provide services to our clients. For specific information on how FIS protects personal information online, please see the Online Privacy Notice.

EEOC Statement

FIS is an equal opportunity employer. We evaluate qualified applicants without regard to race, color, religion, sex, sexual orientation, gender identity, marital status, genetic information, national origin, disability, veteran status, and other protected characteristics. The EEO is the Law poster and its supplement document are available online.


For positions located in the US, the following conditions apply. If you are made a conditional offer of employment, you will be required to undergo a drug test. ADA Disclaimer: In developing this job description, care was taken to include all competencies needed to successfully perform in this position. However, for Americans with Disabilities Act (ADA) purposes, the essential functions of the job may or may not have been described for purposes of ADA reasonable accommodation. All reasonable accommodation requests will be reviewed and evaluated on a case-by-case basis.

Sourcing Model

Recruitment at FIS works primarily on a direct sourcing model; a relatively small portion of our hiring is through recruitment agencies. FIS does not accept resumes from recruitment agencies which are not on the preferred supplier list and is not responsible for any related fees for resumes submitted to job postings, our employees, or any other part of our company.

Top Skills

AWS
Azure
GCP
The Company
HQ: Jacksonville, FL
57,000 Employees
Hybrid Workplace
Year Founded: 1968

What We Do

FIS is a leading provider of technology solutions for financial institutions and businesses of all sizes and across any industry, globally. We enable the movement of commerce by unlocking the financial technology that powers the world’s economy. Our employees are dedicated to advancing the way the world pays, banks and invests through our trusted innovation, proven performance and flexible architecture. We help our clients use technology in innovative ways to solve business-critical challenges and deliver superior experiences for their customers. Headquartered in Jacksonville, Florida, FIS ranks #241 on the 2021 Fortune 500 and is a member of Standard & Poor’s 500® Index. To learn more, visit www.fisglobal.com. Follow FIS on Facebook, LinkedIn and Twitter (@FISGlobal).

Why Work With Us

The world of fintech is changing fast. Our core values and behaviors give us the power to have a positive impact on the world we share. We work together, connecting to achieve outcomes with speed. We are inclusive and embrace our diverse strengths. We make things happen and celebrate together.
