Site Reliability Engineer IV

Posted 8 Days Ago
Windward, DE
Senior level
eCommerce • Fintech • Payments
The Role
The Site Reliability Engineer IV is responsible for ensuring system availability, performance, and efficiency while bridging development and operations. This role involves architecture discussions, chaos engineering, system monitoring, and mentorship of other engineers. Responsibilities include leading game day exercises, improving alert systems and documentation, and troubleshooting with the Technical Operations Team.
Summary Generated by Built In

Every day, Global Payments makes it possible for millions of people to move money between buyers and sellers using our payments solutions for credit, debit, prepaid and merchant services. Our worldwide team helps over 3 million companies, more than 1,300 financial institutions and over 600 million cardholders grow with confidence and achieve amazing results. We are driven by our passion for success and we are proud to deliver best-in-class payment technology and software solutions. Join our dynamic team and make your mark on the payments technology landscape of tomorrow.

Summary of This Role

Responsible for availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning. Creates a bridge between development and operations by applying a software engineering mindset to system administration topics. Splits time between operations/on-call duties and developing systems and software that help increase site reliability and performance.

What Are We Looking For in This Role?

  • Participate in architecture and R&D discussions for new technology or processes to increase the performance and reliability of our systems.
  • Chaos engineering - you’re expected to think laterally about how our systems might fail in theory, design tests to demonstrate how they behave in practice, and then formulate and implement remediation plans, as appropriate.
  • Pushing our systems to their limits, and then coming up with designs for how to get them to the next performance tier.
  • Use practices from DevOps and GitOps to improve automation and processes and make self-service possible.
  • Safeguarding reliability. Ensuring that our services are highly available, resilient against disasters, self-monitoring, and self-healing.
  • Running “game days” to test assumptions about reliability and learn what will break before it matters to customers.
  • Reviewing designs with an eye toward increasing the holistic stability of our platform and identifying potential risks.
  • Building systems to proactively monitor the health, performance and security of our production and non-production virtualized infrastructure.
  • Improving our monitoring and alerting systems to make sure engineers get paged when it matters (and don’t get paged when it doesn’t).
  • Troubleshooting systems and network issues, alongside our Technical Operations Team.
  • Mentoring other engineers in reliability-related skills.
  • Evolving our SDLC, practices, and tooling to account for Site Reliability considerations and best practices.
  • Developing runbooks and improving documentation.
     

Minimum Qualifications

  • BS in Computer Science, Information Technology, Business / Management Information Systems or related field
  • Typically have 6+ years of experience with programming in one or more programming languages and 4 years of experience working with Unix/Linux systems internals and administration (e.g. filesystems, inodes, system calls) or networking (e.g. TCP/IP, routing, network topologies and hardware, SDN).

Preferred Qualifications

  • MS in Computer Science, Information Technology, Business / Management Information Systems or related field
  • Typically have 5+ years of experience

What Are Our Desired Skills and Capabilities?

  • Skills / Knowledge - Having wide-ranging experience, uses professional concepts and company objectives to resolve complex issues in creative and effective ways. Some barriers to entry exist at this level (e.g., dept./peer review).
  • Job Complexity - Works on complex issues where analysis of situations or data requires an in-depth evaluation of variable factors. Exercises judgment in selecting methods, techniques and evaluation criteria for obtaining results. Networks with key contacts outside own area of expertise.
  • Supervision - Determines methods and procedures on new assignments and may coordinate activities of other personnel (Team Lead).
  • Experience with public and private clouds, Jenkins, Terraform, Ansible, OpenShift, Kubernetes or AWS EKS
  • Cloud platform: AWS or any cloud knowledge
  • Containerization: Docker, K8s, Red Hat, OpenShift and AWS EKS
  • Development: Python, Java, Groovy, or any scripting language
  • Knowledge of observability/monitoring stacks

Global Payments Inc. is an equal opportunity employer. Global Payments provides equal employment opportunities to all employees and applicants for employment without regard to race, color, religion, sex (including pregnancy), national origin, ancestry, age, marital status, sexual orientation, gender identity or expression, disability, veteran status, genetic information or any other basis protected by law. If you wish to request reasonable accommodations related to applying for employment or provide feedback about the accessibility of this website, please contact [email protected].

Top Skills

Programming Languages
The Company
HQ: Atlanta, GA
24,000 Employees
On-site Workplace

What We Do

Global Payments (NYSE: GPN) is a Fortune 500 payments technology company, delivering the leading complete worldwide commerce ecosystem.

Our unique, connected infrastructure unifies every aspect of commerce, from issuer solutions to payments, and the innovative software that delivers seamless customer experiences.

Headquartered in Atlanta, Georgia, we’re a worldwide team of over 24,000 people—including local experts on the ground in nearly 40 countries. Together, we support thousands of businesses across more than 100 industries. Empowering commerce for everyone.

