Site Reliability Engineer

Posted 22 Days Ago
Hiring Remotely in Almaty
Remote
Senior level
Design
The Role
The Site Reliability Engineer at Planner 5D will maintain and enhance IT infrastructure, ensuring reliability through monitoring and alerting practices. This role involves using various technologies for container orchestration, database management, and cloud services, alongside infrastructure as code and CI/CD practices. The engineer will also address production issues and oversee the launch of multiple services across global data centers.
Summary Generated by Built In

About us:


Planner 5D is a global hub for home design, uniting over 90 million users in 230 territories around our planet (yes, we have 2 users connecting from Antarctica each month!). Our platform simplifies the home renovation process from concept to execution, and with our cutting-edge software this process has never been more straightforward. You don't need to be a professional designer to enhance your home or workspace. We are dedicated to streamlining the design experience, making the customization of dream homes accessible to everyone.


Our cross-platform presence spans Web, iOS, Android, Windows, and macOS, ensuring seamless synchronization for our users. Planner 5D is more than just software; it's a vibrant community of enthusiastic and product-oriented professionals. Our team comes from diverse backgrounds and collaborates across time zones, fueled by a shared passion to revolutionize the home design market.


We are constantly looking for like-minded talent eager to take ownership and drive innovation within our expanding suite of tools. Join us in shaping the future of home design.


We are searching for an SRE specialist who will join our team to improve and maintain our IT infrastructure, working closely with our tech stack: Docker Swarm, Kubernetes, GitLab, nginx, PHP, MySQL, MariaDB, TiDB, Hazelcast, MongoDB, Redis, RabbitMQ, Ansible, Python, Bash.


Responsibilities:

  • Maintaining and improving infrastructure as code
  • Ensuring and improving reliability of infrastructure
  • Monitoring, alerting, and uptime during both working hours and on-call periods
  • CI/CD in some cases
  • Ensuring system reliability and availability during on-call periods

  • Launch, maintenance, and monitoring of:
      • Front, load balancing and mirrors (nginx, HAProxy, ...)
      • Application servers (PHP)
      • Container orchestration (Docker Swarm and Kubernetes)
      • MariaDB Galera cluster
      • File storage clusters (GlusterFS, Ceph, ...)
      • MongoDB cluster
      • Worker servers (rendering, AI, ...) - using Kubernetes or Ansible + Docker
      • Monitoring and alerting (Grafana, Prometheus, Loki, ...)
      • Cloud services (BigQuery, SES, ...)
      • CDN and DNS
      • Caching and queue servers (Redis, Hazelcast, RabbitMQ, ...)
      • Multiple datacenters around the world
      • Other internal services / servers (VPN, analytics, mailing, ...)
      • Infrastructure as code (Ansible, Docker, Docker Swarm, Kubernetes, GitLab CI/CD, ...)
      • Virtual and dedicated servers

Required knowledge of:

  • Linux (Ubuntu, Debian, CentOS)
  • Python and Bash
  • Docker (build system, Docker Swarm)
  • Kubernetes
  • On-premises infrastructure - how servers and clusters work and how to set them up (database, cache, front, ...)
  • Databases (SQL - MySQL/MariaDB flavor, NoSQL - MongoDB, Redis, ...)
  • Networks, network protocols, settings of network elements
  • Infrastructure and configuration as code (GitLab CI/CD, Ansible, ...)
  • Monitoring / logging - Grafana, Prometheus, Loki, ...
  • Best practices in security, performance, memory management, maintainability, ...

Desirable knowledge:

  • Database administration - SQL (MySQL, PostgreSQL), NoSQL (MongoDB, Redis)
  • Load balancing and server clusters, and how to set up and manage them (HAProxy, SQL-Proxy, Envoy, Galera cluster, LVS)
  • Message broker (RabbitMQ, Kafka, ...)
  • Backup and restore (disaster plan and practice)
  • VPN - WireGuard, OpenVPN

Nice to have:

  • Experience with one of the cloud platforms (AWS, Google Cloud, Azure, AliCloud)
  • Some knowledge of C/C++, Java, PHP, Kotlin, Swift
  • Understanding of Russian or Lithuanian

What we offer:

- Great work environment that balances freedom and responsibility

- Our team is made up of friendly and passionate enthusiasts who are eager to help and share their knowledge

- We believe in work-life balance, so you'll receive 30+ days of paid holidays per year

- Company computer equipment on request

- Educational courses for our team members, with funds allocated toward your training expenses

- English classes on our corporate platform

- Offline company events in Lithuania and online team-building

- Unlimited free Planner 5D subscriptions


Note: We are only considering candidates with a valid residence permit. Unfortunately, we do not provide visa sponsorship or relocation support.

Top Skills

Bash
Python
The Company
Vilnius, Vilnius
81 Employees
On-site Workplace
Year Founded: 2011

What We Do

Make your home a better place! Planner 5D is a popular home improvement and interior design platform used by 77+ million users worldwide...
