DataSite

Sr HA-Systems Engineer

Posted 8 Days Ago

Be an Early Applicant

2 Locations

Remote

61K-109K Annually

Senior level

Software

The Role

Design and optimize high-availability systems for M&A SaaS technology, focusing on resilience and scalability with mentorship responsibilities.

Summary Generated by Built In

Datasite is where deals are made. We provide the data rooms and SaaS technology used in M&A and other high-value transactions, to deliver projects in more than 170 countries. Carrying that success into the future is all about you. Your useful skills, your unusual experience, your unique ideas. Everyone here brings something unexpected. What’s yours? Invest your talents in us, and we’ll return the compliment.

Job Description:

If you're passionate about building resilient, high-performance systems and want to make an impact in a growing, innovative company, this might be an opportunity for you.

We are seeking an experienced High Availability Systems Engineer to join our engineering team. This role will be responsible for designing, implementing, and optimizing systems to ensure high availability, reliability, and scalability across our platforms. The ideal candidate will have a deep understanding of distributed systems, cloud infrastructure, and best practices for achieving 99.99% uptime.

We are open to hiring remote in the US or Canada. **Please note, we are unable to sponsor or take over sponsorship of an employment Visa at this time.**

Key Responsibilities:

Design and Architecture:

Architect and build highly available, fault-tolerant systems to support mission-critical applications.
Collaborate with cross-functional teams to design scalable, robust, and secure cloud-based solutions.
Develop strategies for disaster recovery, data replication, and failover processes.

System Performance and Optimization:

Analyze system performance, identify bottlenecks, and implement optimizations to ensure optimal uptime and performance.
Conduct load testing, capacity planning, and performance tuning to meet high availability requirements.
Utilize monitoring tools to proactively detect issues and minimize downtime.

Automation and Infrastructure as Code:

Develop and maintain infrastructure as code (IaC) using tools like Terraform and Ansible.
Implement automation for deployments, scaling, and configuration management to reduce manual intervention and increase system reliability.

Incident Management and Troubleshooting:

Lead incident response and root cause analysis for system outages, ensuring quick resolution and prevention of future incidents.
Build and maintain robust monitoring, alerting, and diagnostic systems for proactive issue identification.

Mentorship and Leadership

Provide technical leadership, mentorship, and guidance to junior engineers and other team members.
Stay updated on the latest trends in high availability and distributed systems, and share knowledge within the team.

Required Qualifications:

Bachelor's in Computer Science, Engineering, or a related field (or equivalent experience).
8+ years of experience in systems engineering, infrastructure architecture, or related fields.
Proven track record of designing and implementing highly available, fault-tolerant systems in cloud or on-prem environments.
Experience with distributed systems, microservices architecture, and high availability patterns (e.g., active-active, active-passive).
Proficient in cloud platforms (Azure, GCP, AWS) or on-prem data centers and cloud-native technologies.
Deep knowledge and understanding of Linux systems
Experience using monitoring and observability tools (Prometheus, Grafana, Loki, etc.).
Strong coding/scripting skills in Python, Go, or Shell for automation.
Excellent problem-solving skills with a focus on resilience and scalability.
Strong communication skills with the ability to convey complex technical concepts to diverse stakeholders.
Ability to work independently and take ownership of projects from inception to deployment.

Preferred Qualifications:

Strong experience with containers and orchestration (Docker, Kubernetes).
Familiarity with CI/CD pipelines and DevOps practices.
Advanced knowledge of networking, load balancers, and distributed data storage solutions (e.g., Cassandra, Elasticsearch, Kafka).
Experience with multi-region deployments and global scaling strategies.
Certification in cloud platforms (e.g., AWS Certified Solutions Architect, Google Professional Cloud Architect).
Background in security best practices, including compliance frameworks (e.g., SOC 2, ISO 27001).
Experience in agile methodologies and DevOps culture.

The base salary range represents the estimated low and high end for this position at the time of this posting. Consistent with applicable law, each candidate’s compensation offer may vary and will be determined based on but not limited to, your geographic region, skills, qualifications, and experience along with the requirements of the position. Datasite reserves the right to modify this pay range at any time.

$61,100.00 - $108,700.00

As a global organization, Datasite knows that diverse perspectives are essential to our success. We’re committed to maintaining a diverse workforce to serve our customers around the world. Datasite is an equal opportunity employer (EEO) and furthers the principles of EEO through Affirmative Action.