Site Reliability Engineer

Posted 2 Days Ago
Be an Early Applicant
Hiring Remotely in India
Remote
Mid level
Artificial Intelligence • Food • Software
The Role
The Site Reliability Engineer ensures system reliability and availability through monitoring and automation while collaborating with teams for capacity planning and feature design.
Summary Generated by Built In

Description

● Ensure the reliability and availability of production systems and services by monitoring, troubleshooting, and responding to incidents.

● Develop and maintain tools and automation for system monitoring, alerting, and incident response to minimize manual intervention.

● Collaborate with development teams to plan for capacity scaling and performance improvements based on usage patterns and growth forecasts.

● Collaborate with development and product teams to ensure that new features and services are designed with reliability in mind.

● Maintain documentation for operational processes, system configurations, and best practices.

Requirements

● Bachelor's degree in computer science, information technology, or a related field (or equivalent work experience).

● Proven experience in software development and/or system administration.

● Strong scripting and coding skills (e.g., Python, Go, Shell) for automation and tool development.

● Familiarity with containerization and orchestration technologies like Docker and Kubernetes.

● Experience with cloud platforms (e.g., AWS, Azure, GCP) and infrastructure as code tools (e.g., Terraform).

● Proficiency in monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).

● Knowledge of network, security, and database concepts.

● Strong problem-solving skills and the ability to work well under pressure.

● Understanding of agile and DevOps methodologies.

● Excellent communication and collaboration skills.

● Availability to work during US hours till 3 pm ET is essential for this role.

● Candidates must have their own system/work setup for remote work.

Top Skills

AWS
Azure
Docker
Elk Stack
GCP
Go
Grafana
Kubernetes
Prometheus
Python
Shell
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Chicago, IL
299 Employees
On-site Workplace
Year Founded: 2016

What We Do

Checkmate empowers enterprise restaurant brands with powerful ordering solutions and hands-on support. Our scalable technology enables restaurants to drive sales across channels, including custom websites, apps, kiosks, catering, third-party marketplaces, voice AI, and more. With seamless integrations, smarter analytics, and 24/7 service, Checkmate helps brands conquer their digital goals. Restaurants can launch unique ordering experiences, centrally manage menus, recapture revenue, leverage customer data, and continually adapt with new integrations. Regardless of how you want to grow, Checkmate has the tools and guidance to power, manage, and evolve your digital business.

Similar Jobs

Remote
20 Locations
19002 Employees
74K-133K Annually

HighLevel Logo HighLevel

Site Reliability Engineer

Information Technology • Internet of Things • Marketing Tech
Remote
Delhi, Connaught Place, New Delhi, Delhi, IND
974 Employees
Remote
India
7509 Employees

Zoom Logo Zoom

Site Reliability Engineer

Artificial Intelligence • Information Technology • Software
Remote
IND
11053 Employees

Similar Companies Hiring

True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees
Caliola Engineering Thumbnail
Software • Machine Learning • Hardware • Defense • Data Privacy • App development • Aerospace
Colorado Springs, CO
53 Employees
Red 6 Thumbnail
Virtual Reality • Software • Hardware • Defense • Aerospace
Orlando, Florida
113 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account