Cloud Engineering at TRACTIAN
In a data-driven company like TRACTIAN, the Cloud Engineering team is essential for maintaining robust, secure, and scalable cloud infrastructures. This team implements automation, security practices, and rigorous protocols to safeguard our digital assets and data infrastructure across diverse cloud environments. The Cloud Engineering team plays a crucial role in our internal operations and client solutions by ensuring continuous integration, secure deployments, and advanced observability.
What will you do
As a Cloud Engineering Lead, you will be responsible for leading a technical team, safeguarding the company’s cloud infrastructure primarily on AWS and OCI, with occasional projects involving GCP and Azure. Your role involves implementing state-of-the-art infrastructure solutions, embedding robust security measures, and ensuring efficient deployment processes. This position requires deep technical expertise and a hands-on approach to infrastructure automation, security integration, and observability.
Responsibilities:
- Lead and mentor a hands-on, technical cloud engineering team.
- Architect, implement, and secure scalable cloud infrastructure on AWS, OCI, and occasionally GCP/Azure.
- Deploy and maintain infrastructure-as-code using Terraform and Helm.
- Oversee CI/CD pipelines, enhancing them through GitHub Actions and GitHub Enterprise.
- Maintain and optimize Kubernetes clusters and AWS ECS environments, including GPU infrastructure management.
- Embed comprehensive security measures, integrating advanced security tools and practices proactively.
- Configure and manage Cloudflare solutions for enhanced security and performance.
- Implement observability and monitoring solutions with Datadog, Grafana, OpenTelemetry, and Sentry.
- Utilize Jira effectively for project management and issue tracking.
- Collaborate closely with other engineering teams to drive secure and efficient development practices.
- Address vulnerabilities, security incidents, and tickets promptly and proactively.
Requirements:
- 5+ years of hands-on experience in Cloud Engineering, DevSecOps, or similar roles.
- Extensive knowledge of AWS and OCI; familiarity with GCP/Azure preferred.
- Expert in Terraform, Helm, Kubernetes (including GPU-based workloads), Docker, and AWS ECS.
- Strong experience with GitHub Actions, GitHub Enterprise, and Cloudflare.
- Proficiency in monitoring tools including Datadog, Grafana, OpenTelemetry, and Sentry.
- Experience managing Jira workflows effectively.
- Solid understanding of security best practices, compliance frameworks including SOC2 and ISO27001.
- Strong scripting skills (Python, Bash, or PowerShell) for automation.
- Excellent leadership skills and a commitment to staying technically hands-on.
Preferred Qualifications:
- Certifications in AWS, OCI, Kubernetes (CKA, CKAD), or relevant cloud engineering certifications.
- Prior experience in high-growth tech environments.
Why Join Us:
- Opportunity to lead and directly influence infrastructure and security strategy.
- Innovative and challenging technical environment.
- Continuous learning and career growth opportunities.
Compensation
Competitive Salary
Premium Medical, Dental, and Vision Coverage
Paid Time Off (PTO): 15 Days
401(k) Retirement Plan
Language Learning Opportunities - Take advantage of optional, fully funded Portuguese or Spanish courses to enhance your skills and global reach.
Birthday Time Off - Celebrate your birthday with a paid day off during your birthday week.
Gympass Membership - Access a wide range of gyms and training programs.
Sports Incentive - Receive a monthly bonus when you regularly participate in physical activities.
Long-Term Benefit - After four years of service, earn a fully funded trip anywhere in the world.
Top Skills
What We Do
Tractian is a machine intelligence company that offers industrial monitoring systems. Tractian builds streamlined hardware-software solutions to give maintenance technicians and industrial decision-makers comprehensive oversight of their operations. It is democratizing access to sophisticated real-time monitoring and asset operations tools.
Tractian's solutions are used in environments that address a combined total of 5% of global industrial output. The company’s broad market reach is evidenced in its customer base from various industries, such as John Deere, Procter & Gamble, Caterpillar, Goodyear, Carrier, Johnson Controls, and Bimbo, the owner of the brands Little Bites and Thomas Bagels. Tractian's customers see a 6-12x ROI with savings of $6,000 per monitored machine annually on average.
In a major milestone and a first for the industry, Tractian launched the AI-Assisted Maintenance category in the industrial sector. In this new paradigm, artificial intelligence identifies machine problems and suggests preventive actions to be taken, giving invaluable insight and support to maintenance professionals. It is important to highlight that the intent of Assisted Maintenance is firmly rooted in augmenting maintenance professionals to provide more assertive diagnosis with human-in-the-loop feedback.
Tractian's mission is to elevate this category of workers in a highly impactful way. The Assisted Maintenance category will provide unimaginable support for maintenance professionals. By combining shop floor expertise with our technology, maintainers will be able to anticipate and address issues with unprecedented accuracy and speed