Senior Site Reliability Engineer

Posted 4 Days Ago
Hiring Remotely in Poland, IN
Remote
Senior level
Cloud • Healthtech • Internet of Things • Machine Learning • Software
The Role
As a Senior Site Reliability Engineer at Kontakt.io, you will ensure the scalability, availability, and security of our AI-driven healthcare platform by designing and maintaining cloud infrastructure, implementing monitoring tools, automating processes, and collaborating with various teams to enhance operational efficiency and patient care.
Summary Generated by Built In

Kontakt.io is building the platform that care operations run on.


We reduce waste, cut costs, and improve revenue by improving throughput, asset utilization and staff productivity. Our platform uses AI, RTLS, and EHR data to enable self-learning agents to automate workflows, adapt in real-time, and orchestrate all of care delivery operations.


Easy to deploy and scale, it gives a clear picture of spaces, equipment, and people, eliminating inefficiencies and enhancing the patient experience. With measurable 10X ROI and over 20+ use cases, Kontakt.io is the go-to platform for better and faster care delivery operations.


As a Site Reliability Engineer (SRE) at Kontakt.io, you will be responsible for ensuring the scalability, availability, and security of our cloud-based AI-driven healthcare platform. You will collaborate with software, data, and infrastructure teams to build highly resilient and automated systems, allowing hospitals and care facilities to operate seamlessly and without downtime.

Your expertise in cloud infrastructure, automation, monitoring, and performance optimization will directly impact how healthcare organizations leverage real-time data to enhance patient care and operational efficiency.


If you are passionate about highly available systems, automation, and making an impact in healthcare, join Kontakt.io and help us build the future of smart care operations!

Key Responsibilities:

  • Design and maintain highly available, fault-tolerant, and scalable cloud infrastructure.
  • Implement SLOs, SLIs, and SLAs to track system reliability and optimize uptime.
  • Participate in 24/7 on-call rotation
  • Oversee production platform deployments
  • Monitor latency, traffic, errors, and system health using modern observability tools.
  • Conduct root cause analysis (RCA) and post-mortems to continuously improve system resilience.
  • Automate infrastructure provisioning using Terraform, Ansible, or Pulumi.
  • Implement CI/CD pipelines to ensure seamless and safe deployments.
  • Enable self-healing mechanisms using Kubernetes operators, auto-scaling, and fault detection.
  • Ensure compliance with HIPAA, GDPR, and other healthcare data regulations.
  • Define and execute disaster recovery (DR) and business continuity plans.
  • Manage and optimize AWS environments for cost-efficiency and performance.
  • Deploy and manage observability tools and build real-time alerting and response frameworks
  • Establish best practices for logging, debugging, and performance monitoring.
  • Improve incident response automation through runbooks, AI-based anomaly detection, and predictive analytics.

What You Bring

  • 3+ years of experience as an SRE
  • Strong expertise in Kubernetes, Docker, and container orchestration.
  • Experience managing cloud-native environments (AWS).
  • Experience with event-driven architectures, Kafka, or real-time data streaming.
  • Knowledge of machine learning infrastructure.
  • Previous experience in healthcare, compliance (HIPAA), and highly regulated environments.
  • Proficiency in Infrastructure as Code (IaC) using Terraform.
  • Deep knowledge of networking, DNS, load balancing, and security best practices.
  • Experience with CI/CD pipelines (Jenkins, CI, or ArgoCD).
  • Hands-on experience with monitoring and logging tools (Prometheus, Grafana, ELK, OpenTelemetry).
  • Strong programming skills in Python, Golang, or Bash for automation.
  • Knowledge of machine learning infrastructure.

We offer:

  • Work on a mission-driven platform that improves healthcare operations and patient outcomes.
  • B2B contract or an employment agreement
  • Competitive salary and stock option plan
  • Collaborate with top engineers, data scientists, and AI experts.
  • Flexible remote or hybrid work options (office in Krakow)
  • Collaborative and self-organized environment
  • private medical care, cafeteria system

We Make Things Easy

Easy to Use. Simplicity is harder than complexity. Each of our apps focuses on a single user and a specific problem. We create solutions for everyone to help them get things done.
Easy to Buy. We simplify pricing with a single, per-bed or per-room model that encompasses all the necessary products and services to achieve your desired outcomes.
Easy to Deploy. Using AI, cloud, and mobile technologies, our equipment autonomously communicates and validates itself without the need for human intervention, cutting deployment time from months to weeks or even days.


We Deliver Fast Outcomes.

Industry’s #1 Time To Value. We accelerate your ROI and deliver positive outcomes to users faster than anyone else, thanks to how easy things work with our AI- and cloud-based platform.
Delivered As A Service. Delivering everything from devices to apps to support, our as-a-service model allows you to add new use cases with a simple click. Gain agility and speed like never before.
Outcome Driven. We deliver outcomes, not boxed equipment. From on-site installation to monitoring, all the way to service-level agreements, our approach is uniquely designed to ensure the outcomes you need.


We Ensure Unmatched Scalability

Priced for Scaling. We offer scalable pricing, regardless of your project size. Enabling our customers to create value cost-effectively is a key element of our success.
A Platform for Scaling. Lower TCO, quicker adoption of new use cases, extensive cloud scalability, and future-proofing your IT investments are among the many reasons why Kontakt.io is right for you.
Managed for Scaling. SOC-2 and HIPAA compliant, our platform integrates with your wireless and security infrastructure, allowing you to use your current IT network with confidence and uninterrupted functional

Top Skills

Bash
Go
Python
The Company
HQ: New York, NY
100 Employees
Hybrid Workplace
Year Founded: 2013

What We Do

As the leader in Indoor Journey Analytics, Kontakt.io optimizes processes and resources by revealing how customers move through your business. Using AI, IoT, and RTLS, Kontakt.io helps businesses uncover waste, streamline capacity, improve workflows, and help customers and staff feel seen and valued.

Gallery

Gallery

Similar Jobs

Drata Logo Drata

Senior Site Reliability Engineer II (Remote)

Security • Software • Cybersecurity • Automation
Easy Apply
Remote
United States
500 Employees

Movable Ink Logo Movable Ink

Senior Site Reliability Engineer

Artificial Intelligence • Marketing Tech • Software
Easy Apply
Remote
United States
590 Employees

Cisco Meraki Logo Cisco Meraki

Senior Site Reliability Engineer, Engineering Enablement - REMOTE

Hardware • Information Technology • Security • Software • Cybersecurity • Conversational AI
Easy Apply
Remote
United States
3000 Employees
126K-185K Annually

GitLab Logo GitLab

Senior Site Reliability Engineer, Database Operations:Clickhouse

Cloud • Security • Software • Cybersecurity • Automation
Easy Apply
Remote
United States
2350 Employees
118K-252K Annually

Similar Companies Hiring

Air Space Intelligence Thumbnail
Transportation • Software • Machine Learning • Logistics • Defense • Artificial Intelligence • Aerospace
Boston , Massachusetts
109 Employees
HERE Technologies Thumbnail
Software • Logistics • Internet of Things • Information Technology • Computer Vision • Automotive • Artificial Intelligence
Amsterdam, NL
6000 Employees
True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account