Job Title: Site Reliability Engineer II
Job Location: Chennai, India (Hybrid)
Are you looking for an exciting role that is a part of an award-winning organization and one of the fastest-growing technology companies? If so, look no further, Trimble's technology teams need your experience to build and manage mission-critical systems operating on the cloud. This role is to help build out new cloud-based environments. This role will help by providing a strong understanding of cloud-based services (e.g. Amazon AWS) through automation tooling, production, QA and development environment builds.
Technical/Operational/Functional Expertise:
- Work with team to plan, design and deploy new cloud technologies
- Create, Maintain , and Enhance Automated Product Deployments
- Develop, Modify, Support and maintain AWS based components through Infrastructure as Code and automation
- Design and implement cost control strategies.
- Enhance availability and incident management by implementing self healing of solutions based on alerts
- Continuously improve the monitoring and alerting capabilities, enabling us to be proactive instead of reactive
- Support day to day operations , measuring , monitoring and troubleshooting
- Participate in on-call rotation with mindset of automating and improving
- Design and maintain Custom monitoring dashboards for DEV/OPS/Support
- Create and maintain Cloud Operations processes and procedures
- Enhance our fault tolerance and high availability strategy
- Enhance cloud elasticity through automatic provisioning and destruction of services based on demand.
- Collaborate with our product development teams to engineer creative solutions or solve complex challenges.
- Responsible for creating processes and training engineers on common cloud administration tasks
Leadership:
- Strong interpersonal communication skills and the ability to communicate with customers, vendors and partners, and across all levels of the organization
- Explaining issues and presenting a clear cloud strategy across e-Builder
- Leading roadmap discussions with regards to the cloud in conjunction with the development and QA teams
Your goals will include:
- Meeting and achieving goals for Key Performance Indicators, Service Level Agreements and Operating Level Agreements
- Maintaining high levels of system uptime
- Increasing the percentage of monitoring detected service disruptions
- Creating, Defining, Managing, Tracking and Improving processes to ensure effective services are being provide.
What Skills & Experience You Should Bring
- 3 to 5 years of experience working within AWS(Must have)
- Strong scripting experience, preference for Python, Bash, PowerShell(Must have)
- 3 to 5 years of experience with monitoring solutions (DataDog, Nagios, Newrelic)
- 3 to 5 years of experience supporting Microsoft stack of technologies including SQL Server and Windows.
- 3 to 5 years of experience supporting gnu/linux OS
- Proficient with container technologies, like Docker, Kubernetes, ECS, and EKS.
- Must have strong problem-solving and troubleshooting skills (over the majority of ISO/OSI).
- Familiarity with continuous deployment methodology and other common DevOps tools including Git, Jenkins
- Proficient with configuration management and provisioning tools such as Chef, Puppet, Salt, or Ansible, or Terraform
- Proficient knowledge in networking technologies & Cloud-specific Network assets
- Ability and flexibility to be on-call for escalations and support, migration and deployments
- Additional Preferred Qualifications:
- Experience or familiarity with Security Certifications such as PCI, SOC2, ISO 27001, FISMA/FedRAMP and HIPAA a plus
- Any AWS certifications
- Familiarity with ITIL is a plus
- Database experience (SQL, NoSQL) is a plus
About Our Division
Our Department: e-Builder, Owner & Public Sector
Trimble’s Inclusiveness Commitment
We believe in celebrating our differences. That is why our diversity is our strength. To us, that means actively participating in opportunities to be inclusive. Diversity, Equity, and Inclusion have guided our current success while also moving our desire to improve. We actively seek to add members to our community who represent our customers and the places we live and work.
We have programs in place to make sure our people are seen, heard, and welcomed and most importantly that they know they belong, no matter who they are or where they are coming from.
Top Skills
What We Do
Trimble is transforming the way the world works by delivering products and services that connect the physical and digital worlds. Core technologies in positioning, modeling, connectivity and data analytics enable customers to improve productivity, quality, safety and sustainability. From purpose built products to enterprise lifecycle solutions, Trimble software, hardware and services are transforming industries such as agriculture, construction, geospatial and transportation. For more information about Trimble (NASDAQ:TRMB), visit: www.trimble.com.
Trimble products are used in over 141 countries around the world. Employees in more than 30 countries, coupled with a highly capable network of dealers and distribution partners serve and support customers worldwide. As the market leader in most of our businesses, we offer a compelling value proposition to our customers based on productivity, return on investment and environmental stewardship. Come position yourself with an innovative industry leader and position yourself for success.