Sr HA-Systems Engineer

Posted 22 Days Ago
2 Locations
Remote
61K-109K Annually
Senior level
Software
The Role
The Sr HA-Systems Engineer will design and implement high-availability, fault-tolerant systems and optimize their performance. Responsibilities include leading incident management, mentoring engineers, and ensuring systems meet uptime and reliability standards through robust strategies and automation tools.
Summary Generated by Built In

Datasite is where deals are made. We provide the data rooms and SaaS technology used in M&A and other high-value transactions, to deliver projects in more than 170 countries. Carrying that success into the future is all about you. Your useful skills, your unusual experience, your unique ideas. Everyone here brings something unexpected. What’s yours? Invest your talents in us, and we’ll return the compliment.

Job Description:

If you're passionate about building resilient, high-performance systems and want to make an impact in a growing, innovative company, this might be an opportunity for you.

We are seeking an experienced High Availability Systems Engineer to join our engineering team. This role will be responsible for designing, implementing, and optimizing systems to ensure high availability, reliability, and scalability across our platforms. The ideal candidate will have a deep understanding of distributed systems, cloud infrastructure, and best practices for achieving 99.99% uptime.

We are open to hiring remotely in the US or Canada. **Please note, we are unable to sponsor or take over sponsorship of an employment visa at this time.**

Key Responsibilities:

Design and Architecture:

  • Architect and build highly available, fault-tolerant systems to support mission-critical applications.

  • Collaborate with cross-functional teams to design scalable, robust, and secure cloud-based solutions.

  • Develop strategies for disaster recovery, data replication, and failover processes.

System Performance and Optimization:

  • Analyze system performance, identify bottlenecks, and implement optimizations that maximize uptime and performance.

  • Conduct load testing, capacity planning, and performance tuning to meet high availability requirements.

  • Utilize monitoring tools to proactively detect issues and minimize downtime.

Automation and Infrastructure as Code:

  • Develop and maintain infrastructure as code (IaC) using tools like Terraform and Ansible.

  • Implement automation for deployments, scaling, and configuration management to reduce manual intervention and increase system reliability.

Incident Management and Troubleshooting:

  • Lead incident response and root cause analysis for system outages, ensuring quick resolution and prevention of future incidents.

  • Build and maintain robust monitoring, alerting, and diagnostic systems for proactive issue identification.

Mentorship and Leadership:

  • Provide technical leadership, mentorship, and guidance to junior engineers and other team members.

  • Stay updated on the latest trends in high availability and distributed systems, and share knowledge within the team.

Required Qualifications:

  • Bachelor's in Computer Science, Engineering, or a related field (or equivalent experience).

  • 8+ years of experience in systems engineering, infrastructure architecture, or related fields.

  • Proven track record of designing and implementing highly available, fault-tolerant systems in cloud or on-prem environments.

  • Experience with distributed systems, microservices architecture, and high availability patterns (e.g., active-active, active-passive).

  • Proficient in cloud platforms (Azure, GCP, AWS) or on-prem data centers and cloud-native technologies.

  • Deep knowledge and understanding of Linux systems.

  • Experience using monitoring and observability tools (Prometheus, Grafana, Loki, etc.).

  • Strong coding/scripting skills in Python, Go, or Shell for automation.

  • Excellent problem-solving skills with a focus on resilience and scalability.

  • Strong communication skills with the ability to convey complex technical concepts to diverse stakeholders.

  • Ability to work independently and take ownership of projects from inception to deployment.

Preferred Qualifications:

  • Strong experience with containers and orchestration (Docker, Kubernetes).

  • Familiarity with CI/CD pipelines and DevOps practices.

  • Advanced knowledge of networking, load balancers, and distributed data storage solutions (e.g., Cassandra, Elasticsearch, Kafka).

  • Experience with multi-region deployments and global scaling strategies.

  • Certification in cloud platforms (e.g., AWS Certified Solutions Architect, Google Professional Cloud Architect).

  • Background in security best practices, including compliance frameworks (e.g., SOC 2, ISO 27001).

  • Experience in agile methodologies and DevOps culture.

The base salary range represents the estimated low and high end for this position at the time of this posting. Consistent with applicable law, each candidate’s compensation offer may vary and will be determined based on, but not limited to, your geographic region, skills, qualifications, and experience, along with the requirements of the position. Datasite reserves the right to modify this pay range at any time.

$61,100.00 - $108,700.00

As a global organization, Datasite knows that diverse perspectives are essential to our success. We’re committed to maintaining a diverse workforce to serve our customers around the world. Datasite is an equal opportunity employer (EEO) and furthers the principles of EEO through Affirmative Action.

Top Skills

Agile
Ansible
Automation
AWS
Azure
Cassandra
CI/CD
Cloud Infrastructure
Coding
Disaster Recovery
Distributed Data Storage
Distributed Systems
Docker
Elasticsearch
Fault-Tolerant Systems
GCP
Go
Grafana
High Availability
Infrastructure As Code
Kafka
Kubernetes
Linux
Load Balancers
Loki
Monitoring Tools
Networking
Prometheus
Python
Shell
Terraform

The Company
HQ: Minneapolis, MN
821 Employees
On-site Workplace
Year Founded: 1968

What We Do

Datasite is the maker of the Datasite Diligence virtual data room platform, which helps dealmakers around the world close more deals faster.
