Site Reliability Engineer

Posted 15 Days Ago
Miami, FL
100K-130K Annually
Entry level
Computer Vision • Information Technology • Software
The Role
As a Site Reliability Engineer at RT², you will enhance the reliability and performance of production systems while monitoring incidents and improving deployment processes. You'll work with infrastructure tools and both on-premises and cloud systems, focusing on automating and optimizing workflows and system performance.
Summary Generated by Built In

Get to Know Us Better

RT² offers the most flexible, cutting-edge Retail Management Solutions that encompass sales,
inventory management, frontline employee management and engagement, payments, business
intelligence, and digital automation tools for the wireless industry. We support Fortune 500
companies, unify their customer experience, and remove pain points across multiple retail touch
points. RT² prides itself on fostering a team-oriented culture and a dynamic work environment,
where team members are set up to make meaningful contributions across the organization.
Our team at RT² is looking for a Site Reliability Engineer (SRE) who will be a key player in maintaining and improving the reliability of our systems. We're seeking SREs with experience in infrastructure tools like Terraform, Bicep, and Ansible, spanning both on-premises and cloud environments, including Azure. Your role will involve enhancing system stability, optimizing performance, and automating deployment processes. If you're a proactive problem solver with a passion for infrastructure and continuous improvement, this position offers an exciting opportunity to make a meaningful impact.
Responsibilities:

  • Help maintain and enhance production monitoring and notifications.
  • Improve reliability and quality of production systems.
  • Measure and help optimize system performance.
  • Work with delivery and other teams to identify potential points of failure, then help improve systems to mitigate them.
  • Participate in capacity planning.
  • Create automation to speed up deployment and testing and to improve the response to operational issues.
  • Work to meet service level objectives.
  • Help build runbooks and other supporting tools to improve incident response.
  • Monitor production systems and help manage incident response.
  • Participate in postmortems and document outages, recovery steps, and future mitigation strategies.
  • Work on both on-premises (data center) and cloud-based infrastructure (Azure).
  • Experience working with server operating systems like Windows, Unix, and Linux.
  • Experience with monitoring tools such as the ELK stack, Grafana, and Azure Application Insights.
  • Experience with Git or other distributed source control systems.

Qualifications:

  • Bachelor’s degree (or equivalent) in computer science or related discipline
  • Experience with tools such as Terraform, Bicep, and Ansible
  • Experience with both on-premises and cloud environments, preferably Azure
  • Experience with Hyper-V and VMware
  • Experience with CI/CD pipelines like Azure Pipelines, GitHub Actions, and Octopus Deploy
  • Experience with scripting languages like PowerShell, Python and Bash
  • Proactive approach to identifying problems, performance bottlenecks, and areas for improvement
  • Experience with observability tools like Grafana, UptimeRobot, ELK, PagerDuty
  • Experience working with Agile methodologies

Salary Range: $100,000 - $130,000/Yr. 
Our pay structure takes into account various geographical markets within the United States. The base salary for this role reflects the typical expected earnings. However, the final compensation package is determined by several factors, such as your location, job-specific expertise, skills, experience, and other relevant job-related considerations.
What We Offer:

  • A unique opportunity to shape the journey of RT²
  • Working within a rapidly growing, game-changing business
  • Remote, flexible working options
  • Competitive compensation
  • Generous STI and LTI provisions
  • Health, Dental and Vision Insurance
  • Paid Annual Leave
  • Paid Sick Leave
  • 401(k), and more


 

Top Skills

Ansible
Azure
Bash
Bicep
ELK
Grafana
Linux
PowerShell
Python
Terraform
Unix
Windows
The Company
Royal Oak, Michigan
8 Employees
On-site Workplace

What We Do

Real Time Technologies Inc is a computer software company based at 1517 N Main St, Royal Oak, Michigan, United States.
