Engineer II - HashiCorp Cloud DR (Hybrid)

Posted 2 Days Ago
Bengaluru, Karnataka
Mid level
Cloud • Information Technology • Security • Software
The Role
As a Site Reliability Engineer II, you will enhance the scalability and reliability of HashiCorp's cloud products by implementing system reliability best practices, conducting load testing, and collaborating with engineering teams to ensure high performance and availability. You will also provide troubleshooting expertise and develop automated tools for incident management and testing.
Summary Generated by Built In

About the team

As an Engineer on the HashiCorp Cloud DR team, you will have a critical role in enhancing the reliability and disaster recovery (DR) governance of HashiCorp's cloud products. You will play a pivotal role in strengthening our operational resilience and maintaining the reliability of our enterprise and cloud-based products. With a focus on DR and overall quality, you will be at the forefront of ensuring high availability and compliance with DR standards across HashiCorp’s offerings.

With at least 3 years of experience in software development in the DR domain, reliability engineering, systems engineering, system testing, or related fields, you will lead efforts to identify, address, and mitigate operational challenges before they impact our customers. Your expertise in system resilience and your understanding of disaster recovery strategies will ensure that our services meet the highest standards of reliability and help achieve operational excellence across our cloud products.

You will provide expert execution of the DR test plans. You will work with a wide variety of tools and explore new avenues to ensure that all products meet the essential operational readiness criteria.


What you’ll do

  • Implement best practices for system reliability and disaster recovery, including proactive identification of potential failure points and the development of automated mitigations.
  • Design and execute comprehensive DR testing strategies to identify bottlenecks and failure points that affect RPO and RTO across our cloud products.
  • Drive initiatives around DR compliance, and implement best practices and technologies that improve system resilience, ensuring high availability and fault tolerance through the chaos testing framework.
  • Work closely with engineering and product teams to integrate operational readiness into the development lifecycle, enhancing product stability and user satisfaction.
  • Build and refine tools and frameworks for automated testing, environment simulation, and incident reproduction, reducing manual effort and increasing test coverage.
  • Conduct mock drills and drive chaos tests in collaboration with partner teams, analyzing test results, documenting findings, and making actionable recommendations for systemic improvements.
  • Share your knowledge and expertise with team members, fostering a culture of learning and continuous improvement.


What you’ll need

  • 3+ years of experience in software development, reliability engineering, systems engineering, or non-functional testing roles, with a focus on disaster recovery or backup and recovery of cloud-based systems.
  • Commitment to exploring a career in the reliability engineering field.
  • Proficiency in Go (Golang) or another programming or scripting language.
  • Hands-on experience with version control tools such as Git and GitLab.
  • Understanding of microservices architecture.
  • Good understanding of CI/CD processes and of maintaining quality pipelines.
  • Exposure to cloud technologies (AWS, Azure, or GCP) and container technologies like Nomad or Kubernetes.
  • Effective communication and collaboration skills, capable of working with cross-functional teams and articulating technical concepts to diverse audiences.


Nice to Have:

  • Exposure to the disaster recovery domain, or experience testing any product for DR.
  • Experience with infrastructure as code (Terraform, CloudFormation).
  • Chaos testing experience.
  • Understanding of compliance frameworks such as ISO/IEC 27031, ISO 22301, and SOC 2.


Top Skills

Go
Java
Python
The Company
HQ: San Francisco, CA
1,200 Employees
Hybrid Workplace
Year Founded: 2012

What We Do

HashiCorp was founded by Mitchell Hashimoto and Armon Dadgar in 2012 with the goal of revolutionizing datacenter management: application development, delivery, and maintenance. The datacenter of today is very different than the datacenter of yesterday, and we think the datacenter of tomorrow is just around the corner.

