Sr HA-Systems Engineer

Posted 8 Days Ago
2 Locations
Remote
61K-109K Annually
Senior level
Software
The Role
Design and optimize high-availability systems for M&A SaaS technology, focusing on resilience and scalability with mentorship responsibilities.
Summary Generated by Built In

Datasite is where deals are made. We provide the data rooms and SaaS technology used in M&A and other high-value transactions, to deliver projects in more than 170 countries. Carrying that success into the future is all about you. Your useful skills, your unusual experience, your unique ideas. Everyone here brings something unexpected. What’s yours? Invest your talents in us, and we’ll return the compliment.

Job Description:

If you're passionate about building resilient, high-performance systems and want to make an impact in a growing, innovative company, this might be an opportunity for you.

We are seeking an experienced High Availability Systems Engineer to join our engineering team. This role will be responsible for designing, implementing, and optimizing systems to ensure high availability, reliability, and scalability across our platforms. The ideal candidate will have a deep understanding of distributed systems, cloud infrastructure, and best practices for achieving 99.99% uptime.

We are open to hiring remotely in the US or Canada. **Please note, we are unable to sponsor or take over sponsorship of an employment visa at this time.**

Key Responsibilities:

Design and Architecture:

  • Architect and build highly available, fault-tolerant systems to support mission-critical applications.

  • Collaborate with cross-functional teams to design scalable, robust, and secure cloud-based solutions.

  • Develop strategies for disaster recovery, data replication, and failover processes.

System Performance and Optimization:

  • Analyze system performance, identify bottlenecks, and implement optimizations that maximize uptime and performance.

  • Conduct load testing, capacity planning, and performance tuning to meet high availability requirements.

  • Utilize monitoring tools to proactively detect issues and minimize downtime.

Automation and Infrastructure as Code:

  • Develop and maintain infrastructure as code (IaC) using tools like Terraform and Ansible.

  • Implement automation for deployments, scaling, and configuration management to reduce manual intervention and increase system reliability.

Incident Management and Troubleshooting:

  • Lead incident response and root cause analysis for system outages, ensuring quick resolution and prevention of future incidents.

  • Build and maintain robust monitoring, alerting, and diagnostic systems for proactive issue identification.

Mentorship and Leadership:

  • Provide technical leadership, mentorship, and guidance to junior engineers and other team members.

  • Stay updated on the latest trends in high availability and distributed systems, and share knowledge within the team.

Required Qualifications:

  • Bachelor's in Computer Science, Engineering, or a related field (or equivalent experience).

  • 8+ years of experience in systems engineering, infrastructure architecture, or related fields.

  • Proven track record of designing and implementing highly available, fault-tolerant systems in cloud or on-prem environments.

  • Experience with distributed systems, microservices architecture, and high availability patterns (e.g., active-active, active-passive).

  • Proficiency in cloud platforms (Azure, GCP, AWS) or on-prem data centers, and in cloud-native technologies.

  • Deep knowledge and understanding of Linux systems.

  • Experience using monitoring and observability tools (Prometheus, Grafana, Loki, etc.).

  • Strong coding/scripting skills in Python, Go, or Shell for automation.

  • Excellent problem-solving skills with a focus on resilience and scalability.

  • Strong communication skills with the ability to convey complex technical concepts to diverse stakeholders.

  • Ability to work independently and take ownership of projects from inception to deployment.

Preferred Qualifications:

  • Strong experience with containers and orchestration (Docker, Kubernetes).

  • Familiarity with CI/CD pipelines and DevOps practices.

  • Advanced knowledge of networking, load balancers, and distributed data storage solutions (e.g., Cassandra, Elasticsearch, Kafka).

  • Experience with multi-region deployments and global scaling strategies.

  • Certification in cloud platforms (e.g., AWS Certified Solutions Architect, Google Professional Cloud Architect).

  • Background in security best practices, including compliance frameworks (e.g., SOC 2, ISO 27001).

  • Experience in agile methodologies and DevOps culture.

The base salary range represents the estimated low and high end for this position at the time of this posting. Consistent with applicable law, each candidate’s compensation offer may vary and will be determined based on, but not limited to, your geographic region, skills, qualifications, and experience, along with the requirements of the position. Datasite reserves the right to modify this pay range at any time.

$61,100.00 - $108,700.00

As a global organization, Datasite knows that diverse perspectives are essential to our success. We’re committed to maintaining a diverse workforce to serve our customers around the world. Datasite is an equal opportunity employer (EEO) and furthers the principles of EEO through Affirmative Action.

Top Skills

Ansible
AWS
Azure
Cassandra
Docker
Elasticsearch
GCP
Go
Grafana
Kafka
Kubernetes
Linux
Loki
Prometheus
Python
Shell
Terraform

The Company
HQ: Minneapolis, MN
821 Employees
On-site Workplace
Year Founded: 1968

What We Do

Datasite is the maker of the Datasite Diligence virtual data room platform, helping dealmakers around the world close more deals faster.
