Senior Site Reliability Engineer

Posted 3 Days Ago
2 Locations
Remote
Hybrid
Mid level
Cloud • Healthtech • Internet of Things • Machine Learning • Software
The Role
The Senior Site Reliability Engineer will maintain scalable and secure cloud infrastructure, implement monitoring and automation, and ensure system reliability while collaborating with various teams to enhance healthcare operations.
Summary Generated by Built In

Kontakt.io is building the platform that care operations run on.


We reduce waste, cut costs, and improve revenue by improving throughput, asset utilization and staff productivity. Our platform uses AI, RTLS, and EHR data to enable self-learning agents to automate workflows, adapt in real-time, and orchestrate all of care delivery operations.


Easy to deploy and scale, it gives a clear picture of spaces, equipment, and people, eliminating inefficiencies and enhancing the patient experience. With measurable 10X ROI and over 20+ use cases, Kontakt.io is the go-to platform for better and faster care delivery operations.


As a Site Reliability Engineer (SRE) at Kontakt.io, you will be responsible for ensuring the scalability, availability, and security of our cloud-based AI-driven healthcare platform. You will collaborate with software, data, and infrastructure teams to build highly resilient and automated systems, allowing hospitals and care facilities to operate seamlessly and without downtime.

Your expertise in cloud infrastructure, automation, monitoring, and performance optimization will directly impact how healthcare organizations leverage real-time data to enhance patient care and operational efficiency.


If you are passionate about highly available systems, automation, and making an impact in healthcare, join Kontakt.io and help us build the future of smart care operations!

Key Responsibilities:

  • Design and maintain highly available, fault-tolerant, and scalable cloud infrastructure.
  • Implement SLOs, SLIs, and SLAs to track system reliability and optimize uptime.
  • Participate in 24/7 on-call rotation
  • Oversee production platform deployments
  • Monitor latency, traffic, errors, and system health using modern observability tools.
  • Conduct root cause analysis (RCA) and post-mortems to continuously improve system resilience.
  • Automate infrastructure provisioning using Terraform, Ansible, or Pulumi.
  • Implement CI/CD pipelines to ensure seamless and safe deployments.
  • Enable self-healing mechanisms using Kubernetes operators, auto-scaling, and fault detection.
  • Ensure compliance with HIPAA, GDPR, and other healthcare data regulations.
  • Define and execute disaster recovery (DR) and business continuity plans.
  • Manage and optimize AWS environments for cost-efficiency and performance.
  • Deploy and manage observability tools and build real-time alerting and response frameworks
  • Establish best practices for logging, debugging, and performance monitoring.
  • Improve incident response automation through runbooks, AI-based anomaly detection, and predictive analytics.

What You Bring

  • 3+ years of experience as an SRE
  • Strong expertise in Kubernetes, Docker, and container orchestration.
  • Experience managing cloud-native environments (AWS).
  • Experience with event-driven architectures, Kafka, or real-time data streaming.
  • Knowledge of machine learning infrastructure.
  • Previous experience in healthcare, compliance (HIPAA), and highly regulated environments.
  • Proficiency in Infrastructure as Code (IaC) using Terraform.
  • Deep knowledge of networking, DNS, load balancing, and security best practices.
  • Experience with CI/CD pipelines (Jenkins, CI, or ArgoCD).
  • Hands-on experience with monitoring and logging tools (Prometheus, Grafana, ELK, OpenTelemetry).
  • Strong programming skills in Python, Golang, or Bash for automation.
  • Knowledge of machine learning infrastructure.

We offer:

  • Work on a mission-driven platform that improves healthcare operations and patient outcomes.
  • B2B contract or an employment agreement
  • Competitive salary and stock option plan
  • Collaborate with top engineers, data scientists, and AI experts.
  • Flexible remote or hybrid work options (office in Krakow)
  • Collaborative and self-organized environment
  • private medical care, cafeteria system

Ready to Build the Future of Healthcare?

Apply now and help scale the platform that care operations run on. 🚀

Top Skills

AI
Ansible
Automation
AWS
Bash
Ci/Cd
Cloud Infrastructure
Docker
Ehr
Elk
Event-Driven Architectures
Go
Grafana
Kafka
Kubernetes
Monitoring
Opentelemetry
Performance Optimization
Prometheus
Python
Rtls
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: New York, NY
100 Employees
Hybrid Workplace
Year Founded: 2013

What We Do

As the leader in Indoor Journey Analytics, Kontakt.io optimizes processes and resources by revealing how customers move through your business. Using AI, IoT, and RTLS, Kontakt.io helps businesses uncover waste, streamline capacity, improve workflows, and help customers and staff feel seen and valued.

Gallery

Gallery

Similar Jobs

CrowdStrike Logo CrowdStrike

Sr. Site Reliability Engineer - GovCloud (Remote)

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote
Hybrid
2 Locations
10000 Employees
95K-160K Annually

DFIN Logo DFIN

Senior Site Reliability Engineer - Cloud (Remote)

Artificial Intelligence • Fintech • Information Technology • Software • Data Privacy
Remote
Hybrid
United States
2600 Employees

GitLab Logo GitLab

Senior Site Reliability Engineer, Database Operations:Clickhouse

Cloud • Security • Software • Cybersecurity • Automation
Easy Apply
Remote
United States
2350 Employees
Remote
U.S.
195 Employees
140K-160K Annually

Similar Companies Hiring

True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees
Caliola Engineering Thumbnail
Software • Machine Learning • Hardware • Defense • Data Privacy • App development • Aerospace
Colorado Springs, CO
53 Employees
Red 6 Thumbnail
Virtual Reality • Software • Hardware • Defense • Aerospace
Orlando, Florida
113 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account