Azure Site Reliability Engineer

Posted 2 Days Ago
Las Vegas, NV
Entry level
Gaming • News + Entertainment • Esports
The Role
The Azure Site Reliability Engineer will build resilient infrastructure and ensure high availability through system reliability, observability, and automated response. Responsibilities include establishing reliability metrics, incident management, and collaborating on automation and CI/CD workflows. The role focuses on enhancing network security and platform management, partnering with DevOps and IT leadership.
Summary Generated by Built In

Pavilion Payments enables the world’s gaming entertainment leaders to create amazing consumer experiences and maximize spend across all of their physical and digital properties. Our complete suite of payment solutions enables safe, secure, and trusted cash access at the cage, on the casino floor, or online. Our compliance and security solutions offer additional layers of automation and risk protection. And our analytics solutions enable clients to view performance across all of their gaming properties.

About the Role

As Pavilion Pay’s inaugural Azure Site Reliability Engineer (SRE), you will play a foundational role in building a resilient infrastructure and ensuring high availability across our systems. This position, part of IT Operations, will work closely with Network, Cloud Infrastructure, DevOps, and Cloud Architects to implement best practices in system reliability, observability, and automated response. This role emphasizes reliability, platform management, and network security.

Key Responsibilities:

Reliability and Incident Management

  • Establish and track reliability metrics such as Latency, Traffic, Errors, and Capacity, focusing on uptime across applications and products, with plans to expand monitoring to kiosk and edge networks.
  • Develop and refine monitoring systems using Grafana to ensure comprehensive visibility, focusing on continuous improvements in reliability.
  • Establish robust processes for incident response and root cause analysis, leveraging OpsGenie to ensure timely and structured responses.
  • Work with Tailscale, SUSE, F5, and Palo Alto Networks firewalls to support secure, resilient network connectivity and load balancing.

Platform Management and Service Objectives

  • Collaborate with IT leadership to define and maintain service level objectives (SLOs) and monitor performance against these standards.
  • Structure and optimize platform management with a focus on supporting uptime in our production environment.

Automation, IaC, and CI/CD Pipelines

  • Develop and maintain Terraform configurations for scalable, repeatable infrastructure deployment, focusing on minimizing manual tasks and ensuring resource consistency.
  • Work with DevOps to optimize CI/CD workflows using Azure DevOps, focusing on pipeline automation and deployment efficiency.
  • Automate repetitive tasks and enhance deployment processes within AKS and Azure environments, aiming to reduce potential deployment bottlenecks.

Network and Security Collaboration

  • Partner with network engineers to optimize and maintain F5 load balancers and Palo Alto Networks/Panorama for secure, resilient network operations.
  • Collaborate with security teams to ensure network traffic and access patterns align with security best practices, integrating observability into network operations.

 

Requirements (Must have)

  • Technical Skills: Proficiency with SUSE, AKS, Linux, Azure Cloud, Grafana, Rancher, Terraform, and Azure DevOps pipelines.
  • Monitoring Tools: Strong experience with Grafana for observability and OpsGenie for incident response, with a focus on maintaining uptime and proactive alerts.
  • Automation and Scripting: Proficiency in scripting (e.g., Bash, Python) and experience with TailScale for secure networking solutions.
  • Problem-Solving Mindset: Experience in identifying and remediating performance and security issues, focusing on proactive, long-term solutions.

First 90 Days:

  1. Understand the Product: Deepen familiarity with Pavilion Pay's products and their interdependencies.
  2. Microsoft Azure (Cloud & Networking): Act as a backup for the Cloud Engineer, managing Azure resources and infrastructure monitoring.
  3. Automation Deployment: Design an automation validation process, including testing of Terraform and Infrastructure as Code (IaC), for tasks such as backups, patching, and uptime.
  4. Platform Familiarization: Gain familiarity with platform elements, especially Terraform, CI/CD, and SLO definitions, to support a highly reliable production environment.

Your skills and our values

  • Working with Others - Works with other department employees to accomplish individual and company service levels and goals. Displays enthusiasm and promotes a friendly group working environment.
  • Embracing Change - Remains open-minded, reacts quickly to new information, and adapts to varying department changes.
  • Attention to Detail – Stays alert in a high-paced environment. Follows detailed procedures and ensures accuracy in documentation and data.
  • Decision-Making and Problem-Solving – Recognizes possible system issues that may affect payment entry processing and alerts the supervisor.


****Applicants must be U.S. citizens and authorized to work in the USA, now and in the future.****



Pavilion Payments provides equal employment opportunities to all employees and applicants for employment without regard to race, color, religion, sex (including pregnancy), national origin, ancestry, age, marital status, sexual orientation, gender identity or expression, disability, veteran status, genetic information, or any other basis protected by law. Applicants requiring reasonable accommodation for the application and/or interview process should notify a representative of the Human Resources Department.

Top Skills

AKS
Azure Cloud
Bash
Linux
Python
SUSE
The Company
HQ: Las Vegas, Nevada
104 Employees
On-site Workplace
Year Founded: 1995

Visit our website at pavilionpayments.com to learn more about our solutions for casino debit and credit card cash advance, e-check, ATM, full-service TITO and payment kiosks, Anti-Money Laundering (AML) compliance assistance, layered security, and analytics.
