Site Reliability Engineer
We are seeking a talented and experienced Senior Site Reliability Engineer (SRE) specializing in Operating Systems (OS), Applications, Databases, and Middleware to join Maersk. This role requires deep expertise in implementing SRE practices and driving engagements, with a strong focus on observability, automation, performance through open-source monitoring tools and problem solving techniques. The ideal candidate will have substantial experience with Azure Cloud, DC components like compute, storage, network, Java/JVM, Dockers Kubernetes, alongside good coding skills. Knowledge on Automation & AIOps will be an added advantage.
Job Purpose/summary
-
Lead the establishment and implementation of SRE practices across multiple platforms within Maersk to ensure the reliability, availability, and performance of critical systems and services.
-
Define and enforce Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Service Level Agreements (SLAs).
-
Oversee the administration, maintenance, and optimization of operating systems, applications, databases, and middleware.
-
Collaborate closely with development and operations teams to integrate and optimize applications for performance and reliability.
-
Manage and optimize Azure Cloud infrastructure for scalability, performance, and cost-efficiency.
-
Deploy, manage, and monitor Kubernetes clusters to ensure high availability and resilience.
-
Implement and manage observability solutions using open-source tools such as Prometheus and Grafana.
-
Develop and maintain robust monitoring, logging, and alerting systems to proactively identify and resolve issues.
-
Utilize Application Performance Management (APM) tools and practices to continuously monitor and enhance application performance.
-
Drive AIOps initiatives to streamline operations and enhance efficiency through automation and machine learning.
-
Develop automation scripts and tools to reduce manual intervention and ensure consistent deployment and management practices.
-
Lead cross-functional engagements with development, operations, and business teams to foster a culture of reliability and continuous improvement.
-
Provide technical leadership, guidance, and mentorship to team members and stakeholders.
What we are looking for:
-
Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent experience.
-
Proven experience as a Site Reliability Engineer or in a similar role.
-
Deep knowledge of operating systems (Linux/Windows), application management, databases (SQL/NoSQL), and middleware.
-
Strong expertise in Azure Cloud services and Kubernetes orchestration.
-
Hands-on experience with observability tools like Prometheus, Grafana, and APM solutions.
-
Proficiency in coding/scripting languages such as Python, Go, Shell, etc.
-
Familiarity with AIOps practices and tools.
-
Excellent problem-solving skills and a proactive approach to addressing issues.
-
Strong communication and collaboration skills.
Preferred skills:
-
Certification in Azure or Kubernetes.
-
Experience with other cloud platforms (AWS, Google Cloud).
-
Familiarity with CI/CD pipelines and DevOps practices.
-
Experience with Infrastructure as Code (IaC) tools like Terraform or Ansible.
-
Knowledge of security best practices and compliance standards.
We Offer
As an organization with global presence, joining Maersk is a wonderful and exciting opportunity for you to work with people of diverse talents & background. We offer a fast paced, challenging and truly international atmosphere with activities spread around the globe in Copenhagen, India, London, Hague and Charlotte. The environment is dynamic with focus on high performance, results, and respect for our employees. There will be the possibility of continuous professional and personal development and for gaining a professional and social network.
As a company, we are committed to growing our people. We will provide you with opportunities that broaden your knowledge and strengthen your professional & technical skills.
-
We operate in a fast-paced environment utilizing modern technologies and bias toward action
-
We value customer outcomes and are passionate about using technology to solve problems
-
We are a diverse team with colleagues from different backgrounds and cultures
-
We offer the freedom, and responsibility, to shape the setup and the processes we use in our community
-
We support continuous learning, including through conferences, workshops and meetups
Ideally, you will have (*)
-
Natural bar-raiser: curious and passionate, with a desire to continuously learn more
-
Bias to action, being familiar with methods and approaches needed to get things done in a collaborative, lean and fast-moving environment
-
Respond effectively to complex and ambiguous problems and situations
-
Simplify, clearly and succinctly convey complex information and ideas to individuals at all levels of the organization
-
Motivated by goal achievement and continuous improvement, with the enthusiasm and drive to motivate your team and the wider organization
Maersk is committed to a diverse and inclusive workplace, and we embrace different styles of thinking. Maersk is an equal opportunities employer and welcomes applicants without regard to race, colour, gender, sex, age, religion, creed, national origin, ancestry, citizenship, marital status, sexual orientation, physical or mental disability, medical condition, pregnancy or parental leave, veteran status, gender identity, genetic information, or any other characteristic protected by applicable law. We will consider qualified applicants with criminal histories in a manner consistent with all legal requirements.
We are happy to support your need for any adjustments during the application and hiring process. If you need special assistance or an accommodation to use our website, apply for a position, or to perform a job, please contact us by emailing [email protected].
Top Skills
What We Do
A.P. Moller - Maersk is an integrated transport and logistics company; going all the way, together, for our customers and society. ALL THE WAY is our commitment to connect the world so that everyone has both the possibility and the ability to trade, grow and thrive.
The company employs roughly 110.000 employees across operations in 130 countries.