SRE Engineer

Posted 15 Hours Ago
Be an Early Applicant
New Delhi, Delhi
Mid level
Food • Retail • Agriculture • Manufacturing
At McCain Foods we know the importance that food plays in people's lives.
The Role
The Site Reliability Engineer will enhance system reliability and automation, collaborating with teams to define service objectives, track performance, and improve software deployment. Responsibilities include designing and coding infrastructure software, operational support, incident analysis, and promoting reliability engineering practices throughout the development lifecycle.
Summary Generated by Built In

Position Title: SRE Engineer
Position Type: Regular - Full-Time
Position Location: New Delhi
Requisition ID: 30491
JOB PURPOSE:
Reporting to the Sr Manager, DevSecOps & SRE, the Site Reliability Engineer will be responsible for: Site reliability engineers (SREs) are responsible for improving system reliability and resilience to make it faster and easier to develop and deploy new software capabilities. SREs focus especially on building automation to reduce manual effort and prevent operations incidents.
JOB RESPONSIBILITIES:

  • Work with stakeholders such as product owners and Engineering to define service level objectives (SLOs) for s ystem operations.
  • Track performance against SLOs in partnership with monitoring teams or other stakeholders, and ensure systems continue to meet SLOs over time.
  • Create dashboards and reports to communicate key metrics.
  • Create software to improve performance, scalability, and stability of systems.
  • Collaborate with development teams to promote the concept of reliability engineering during all phases of the software development lifecycle to detect and correct performance issues and meet availability goals.
  • Design, code, test, and deliver infrastructure software to automate manual operational work (i.e., "toil").
  • Participate in operational support and on-call rotation shifts for supported systems and products.
  • Conduct blameless post mortems to troubleshoot priority incidents.
  • Perform analytics on previous incidents to understand root causes and better predict and prevent future issues.
  • Use automation to reduce the probability and/or impact of problem recurrence.
  • Identify, evaluate, and recommend monitoring tools and diagnostic techniques to improve system observability.
  • Participate in system design consulting, platform management, capacity planning and launch reviews.
  • Collaborate and share lessons learned regarding performance and reliability issues with all stakeholders including developers, other SREs, operations teams, and project management teams.
  • Participate in communities of practice to share knowledge and foster continuous improvement.
  • Remain current on site reliability engineering methods and trends such as observability-driven development and chaos engineering.
  • Drive continuous improvement in software quality and infrastructure reliability and resilience .
  • Oversee, design, implement, and manage DevOps capabilities using continuous integration/continuous delivery toolsets and automation.
  • SRE engineer will focus on Application Performance Monitoring (APM) including Design, Solution, POC, profiling and tuning application compute and data nodes and resources. Some key duties of this role are:
  • Assist in defining SRE and Observability architecture, design
  • Analyze, Implement new features of SRE and Observability Platform
  • Full stack monitoring across all layers (Infrastructure/Network/Database/Application/Services/Third Party)
  • Provide technical hands-on leadership in commercial and Open source/commercial monitoring Tool salection Implementation.
  • Implement SRE driven automated Incident Detection -> automated Engagement -> Triage/Mitigate - RCA/Postmortems -> Problem task Remediation.
  • AI Driven Correlation, De-duplication Noise Reduction and Auto Remediation
  • Provide weekly monitoring and alert analysis and continuous improvement
  • Create a model of the run-time environment (discovery)
  • Profile the performance and behavior of user-defined transactions
  • Establish Performance metrics from each of the applications/systems technical components (Webserver, App server, Database, etc.)
  • Application performance management database
  • APM tool Administration and Support
  • Monitoring Tool design and implementation
  • APM Setup/Usage policies and guidelines
  • Capacity Planning and monitoring
  • Monitor selected application performance
  • Report vital statistics of application performance in production
  • Make recommendations for improvements with Service Desk
  • Make recommendations for adjustments to runtime resources to improve overall performance profile


KEY QUALIFICATION & EXPERIENCES:

  • Strong problem solving and analytical skills.
  • Strong interpersonal and written and verbal communication skills.
  • Highly adaptable to changing circumstances. Interest in continuously learning new skills and technologies.
  • Experience with programming and scripting languages (e.g. Java, C#, C++, Python, Bash, PowerShell).
  • Experience with incident and response management.
  • Experience with Agile and DevOps development methodologies.
  • Experience with container technologies and supporting tools (e.g. Docker Swarm, Podman, Kubernetes, Mesos).
  • Experience with working in cloud ecosystems (Microsoft Azure AWS, Google Cloud Platform,).
  • Experience with monitoring and observability tools (e.g. Splunk, Cloudwatch, AppDynamics, NewRelic, ELK, Prometheus, OpenTelemetry).
  • Experience with configuration management systems (e.g. Puppet, Ansible, Chef, Salt, Terraform).
  • Experience working with continuous integration/continuous deployment tools (e.g. Git, Teamcity, Jenkin, Artifactory).
  • Experience in GitOps based automation is Plus
  • Bachelor's degree (or equivalent years of experience).
  • 5+ years of relevant work experience. SRE experience preferred.
  • Background in Manufacturing, Platform/Tech compnies is preferred.
  • Must have Public Cloud provider certifications (Azure, GCP or AWS)
  • Having CNCF certification is plus


OTHER INFORMATION

  • Travel: as required.
  • Job is primarily performed in a Hybrid office environment.


McCain Foods is an equal opportunity employer. We see value in ensuring we have a diverse, antiracist, inclusive, merit-based, and equitable workplace. As a global family-owned company we are proud to reflect the diverse communities around the world in which we live and work. We recognize that diversity drives our creativity, resilience, and success and makes our business stronger.
McCain is an accessible employer. If you require an accommodation throughout the recruitment process (including alternate formats of materials or accessible meeting rooms), please let us know and we will work with you to meet your needs.
Your privacy is important to us. By submitting personal data or information to us, you agree this will be handled in accordance with the Global Employee Privacy Policy
Job Family: Information Technology
Division: Global Digital Technology
Department: I and O Project Delivery
Location(s): IN - India : National Capital Territory : New Delhi
Company: McCain Foods(India) P Ltd

What the Team is Saying

Areej
Tanner
Sandra
Peter
Chuk
The Company
HQ: Florenceville-Bristol, NB
20,000 Employees
Hybrid Workplace
Year Founded: 1957

What We Do

The power it has to uplift and bring people, Guided by our purpose - Celebrating real connections through delicious, planet-friendly food - we believe that working together with our teams, business and community partners will bring sustainable growth and positive change - today, tomorrow and for generations to come.

As a privately owned family company with over 60 years of experience, a presence in over 160 countries and a global team of 22,000 people, our values and culture are at the heart of everything we do. Our product quality, people and customer dedication help us achieve global sales in excess of CDN $10 billion. Through our investment and innovation, we continue to be a global leader in prepared potato products, including our famous French Fries and appetizers.

We are passionate about supporting and developing our people-providing opportunities to grow and learn in their roles, as well as building careers for the long term.

Why Work With Us

We are working to bring digital tools and data into our processes to drive efficiency, automation and data-driven insights. From connecting our business, enabling our supply chain, supporting our customers, to reinventing agriculture. So if you are a tech expert looking to join a company transforming technology, think of McCain.

Gallery

Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery

McCain Foods Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Typical time on-site: Flexible
Company Office Image
HQFlorenceville
Company Office Image
HQOakbrook Terrace, IL
Company Office Image
HQToronto, ON
McCain Grand Falls
McCain Othello
McCain Rice Lake
McCain Foods India Pvt. Ltd
McCain Mehsana Plant
McCain Appleton
McCain Burley
McCain Carberry
McCain Coaldale
McCain Whittlesey Plant
McCain Easton
McCain Produce
Potato Processing Technology Centre
McCain Grand Island
McCain Hull
McCain Grantham Plant
McCain Potatoes
McCain Plover Potato
McCain Plover Appetizers
McCain Portage la Prairie
McCain Foods GB Limited
McCain Scarborough Plant
McCain Wombourne Plant
Learn more

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account