Description
TripleTen for Business empowers companies to achieve their business goals by bridging talent gaps in Data Science, AI for professionals, Python Development, and Management.
Our transformative approach includes tailored training programs, informed by comprehensive pre-training assessments, ensuring precise alignment with client needs. With expert-led content and personalized mentoring, we help employees excel and achieve new levels of proficiency.
We are looking for a Senior Site Reliability Engineer. In this role, you will take ownership of ensuring high service availability, documenting infrastructure details, and empowering developers through training and guidance on working with the infrastructure.
What you will do
- Develop infrastructure and write solutions that simplify operations.
- Build processes to achieve and maintain 99.99% uptime, and improve the exercise process.
- Develop automation, improve service reliability, plan resources, and reduce the operational workload on development teams.
- Build infrastructure and monitoring, help developers solve infrastructure problems, train developers to solve problems independently, and improve the observability of infrastructure, monitoring, schedules, and alerts.
Requirements
- 2+ years of site reliability engineering experience.
- Experience working with Prometheus (must have).
- Experience working with Kubernetes, GitLab CI, and Ansible.
- Experience working with Unix systems (we use Ubuntu) and the command line.
- Understanding of TCP/IP networking basics, how web services work, REST APIs, and gRPC.
- Experience performing diagnostics, including interpreting the output of ps, top, strace, perf, and tcpdump.
- Understanding of how user applications interact with the operating system, including familiarity with system calls, processes, and threads.
- Willingness to build high-load systems and an understanding of how to do so.
- Understanding of fault tolerance and service scaling.
- High degree of emotional intelligence, and the ability to find common ground with colleagues and work as part of a team.
- Professional fluency in English.
Nice to have:
- Experience working with AWS and Terraform.
- Experience programming in Python or Go, or a desire to learn.
What we can offer you
- Full-time remote collaboration with a convenient schedule. Professional freedom, where we trust your experience instead of wasting each other's time and effort micromanaging;
- A diverse and tight-knit team. Our teammates are spread out across Serbia, the US, Israel, Georgia, Armenia, Latin America, and more. They’ve worked at big tech companies, ed-tech companies, design agencies, and cultural institutions;
- Comfortable digital workspace. We use Miro, Notion, Google Workspace, Jira, and more to make working together seamless.
What We Do
TripleTen is the premier online coding bootcamp, renowned for its exceptional completion rate and graduate employment success.
We offer comprehensive bootcamps in Software Engineering, Quality Assurance, Business Intelligence Analytics, Cyber Security, and Data Science. The cost of these bootcamps ranges from $4,900 to $9,700. Each program provides full access to an interactive online platform, engaging real-life projects, expert tutor support from experienced developers, thorough code reviews, informative online webinars, and dynamic coding sessions. Whether you require assistance with a task or simply need some motivation, our dedicated team is always available to lend a helping hand.
We have empowered 5,000+ students to graduate from our programs with impressive portfolios, featuring 6 to 15 projects that showcase their skills to employers. With an exceptional 87% employment rate, our Career Acceleration program gives graduates a competitive edge for success.