Company Description
We have an excellent opportunity for a bright and highly motivated Senior/Principal Site Reliability Engineer to join our mature project team.
You have a unique chance to become part of our team and work with best practices and methodologies. This role empowers you to take the lead and excel to your fullest potential.
CUSTOMER
Our customer is Beeswax (https://www.beeswax.com/about/), a rapidly growing US AdTech company. Founded by three former Google specialists, it has a highly technical team and an excellent technological culture.
Beeswax provides extremely high-scale Bidder-as-a-Service solutions in advertising technology, works with global businesses, and has raised $28M (including the most recent Series B raise of $15M) to date.
Sigma Software works with Beeswax to provide numerous key components of the platform and is looking for engineers to complement the Beeswax engineering team and drive further platform development.
PROJECT
We're seeking a skilled Senior Site Reliability Engineer to take responsibility for the Cloud Infrastructure and Observability solutions of the Client's platform and to ensure all systems run smoothly. If you're passionate about complex tasks, optimizing systems, driving innovation, providing the highest quality, and collaborating with top talent, this is the perfect opportunity.
The project is an easy-to-use, massive-scale, and highly available demand-side platform. Backed by Amazon Web Services and Kubernetes, the team has embraced Infrastructure as code to manage thousands of applications, servers, and containers running in multiple regions worldwide.
Bring your expertise to our dynamic and forward-thinking environment!
Job Description
- Design and build infrastructure and tooling to provide high scalability, reliability, and sub-second performance levels using security industry best practices
- Write code and scripts to support Infrastructure as code (IaC), configuration management, and automated incident resolution
- Support and extend the observability stack to capture and alert on any system issues
- Participate in on-call rotations and be an escalation contact for service incidents
- Write systems documentation, troubleshooting playbooks, and other instruction manuals
- Other duties and responsibilities as assigned
Qualifications
- Bachelor’s or higher degree in computer science, computer engineering, relevant technical field, or equivalent practical experience
- Expertise with architecture solutions and system design
- Experience in analyzing and troubleshooting large-scale distributed systems
- At least 6 years of administration experience with Linux, AWS, and Kubernetes
- At least 6 years of experience in configuration management using CloudFormation, Terraform, Ansible, or similar tools
- At least 3 years of experience with Python
- Strong problem-solving skills
- Strong verbal/written communication skills
- At least an Upper-Intermediate level of English
What We Do
Sigma Software Group, an award-winning and trusted IT partner, has been serving customers for over 21 years, providing comprehensive IT solutions to businesses ranging from startups to established software product houses. One of Europe's larger IT consultancies, it brings together a dedicated workforce of over 2,100 professionals in 40 offices across 19 countries. With a diverse client base of more than 300 enterprises, including Fortune 500 companies, Sigma Software Group is a preferred choice for developing solutions that help businesses create cutting-edge products while meeting their unique needs.
Sigma Software Group operates as a dynamic ecosystem of tech companies, offering 25 ready-to-implement innovative products and 40+ value-added services. Furthermore, Sigma Software Group is committed to fostering innovation through initiatives such as the Sigma Software Labs business incubator, Sigma Software University, the SID Venture Partners VC Fund, UA Tech Network, Techosystem, the European Business Association, and other collaborative efforts.
Since 2015, Sigma Software Group has consistently earned recognition on the IAOP's prestigious World's Top 100 Outsourcing list. The company's accomplishments have also been acknowledged by prominent global media outlets such as Forbes, CNBC, The Times, and Reuters.