Observability/Monitoring/Tools Engineer

Posted 8 Days Ago
Īnd, Chamba, Himāchal Pradesh
5-7 Years Experience
Big Data • Cloud • Information Technology
The Role
Seeking a proactive Engineer specializing in Observability, Monitoring, and Automation to enhance network and application performance. Responsible for configuring alerts, dashboards, automation, and data trend analysis. Experience in platforms like Datadog and SolarWinds is preferred. Ideal for individuals passionate about scalability, observability, and collaboration in a fast-paced environment.
Summary Generated by Built In

At Iron Mountain we know that work, when done well, makes a positive impact for our customers, our employees, and our planet. That’s why we need smart, committed people to join us. Whether you’re looking to start your career or make a change, talk to us and see how you can elevate the power of your work at Iron Mountain.

We provide expert, sustainable solutions in records and information management, digital transformation services, data centers, asset lifecycle management, and fine art storage, handling, and logistics. We proudly partner every day with our 225,000 customers around the world to preserve their invaluable artifacts, extract more from their inventory, and protect their data privacy in innovative and socially responsible ways. 

Are you curious about being part of our growth story while evolving your skills in a culture that will welcome your unique contributions? If so, let's start the conversation.

Job Summary

We are actively seeking a proactive and skilled Engineer specializing in Observability, Monitoring, and Automation to join our dynamic team. In this critical role, you will be responsible for implementing, managing, and enhancing observability platforms to ensure optimal network and application performance. Your responsibilities will include configuring alerts and dashboards, end-to-end automation, and conducting data trend analysis.

This position is ideal for individuals with a strong background in observability platform engineering, network and application performance monitoring, and automation, who are eager to leverage their technical expertise in a collaborative and fast-paced environment. Experience with observability platforms and tools like Datadog and SolarWinds is a plus. If you are passionate about building scalable systems, enhancing observability, and working with cross-functional teams, and are committed to delivering high-quality solutions, this could be the perfect opportunity for you.

Core experience/responsibilities
● Monitoring Platform Engineering
○ 5+ years of experience with platforms such as SolarWinds,
Datadog, HP OpenView, BMC, etc.
○ 5+ years of experience in network, application performance, and
synthetic monitoring.
○ Expertise in configuring alerts, creating dashboards, and
conducting data trend analysis.
○ Experience in automating the detection of missing assets and
configuring them into the monitoring ecosystem via REST
API/scripting.
○ Proficiency in monitoring various end devices including routers,
switches, firewalls, storage, virtual, Windows servers, Linux
servers, and UNIX servers.
○ 5+ years of experience automating infrastructure operations
using tools like Ansible and Python for event correlation.
○ Expertise in integrating monitoring data with other platforms such
as CMDB/ServiceNow.
○ Experience configuring monitors using SNMP, SSH, WinRM, WMI,
JMX, etc.
○ Ability to design and implement highly available continuous
monitoring platforms for 24x7 operations.
● Technical Solutions and Collaboration
○ Recommend baseline monitoring thresholds, KPIs, and SLAs.
○ Provide solutions to complex problems and drive process
improvements.
○ Experience with both on-premises and cloud environments.
○ Expertise in advanced troubleshooting and root cause analysis.
○ Proficiency with platforms like ServiceNow, Remedy, or Assyst.
○ Identify automation opportunities and implement proactive
monitoring solutions.
○ Work effectively with Enterprise Architects, OS engineers, and
operations support teams to provide training, develop guidelines,
and serve as a subject matter expert.
● Design and Implementation
○ Participate in technical design discussions, considering trade-offs
to support business value, scalability, and delivery timelines.
○ Ensure adherence to architectural governance and security
standards.
○ Contribute to the design and architecture of high-performance,
scalable systems, ensuring they meet business requirements and
are cost-effective.
○ Integrate security best practices into the design and
implementation of systems, ensuring robust protection against
threats.
● Process/Operational Experience
○ Plan and execute system and software installations, upgrades,
and changes across the organization.
○ Understand methodologies such as Agile and Scrum, and manage
project objectives, delivery approaches, and plans.
○ Identify and mitigate risks throughout projects and tasks,
addressing major design flaws.
○ Experience gathering and organizing large amounts of data for
instrumentation into an enterprise monitoring solution.
○ Share knowledge of monitoring best practices with system
owners and administrators to enhance overall monitoring and
alerting posture.
Operational requirements
● Available for on-call support outside of normal business hours to
address critical issues.
● Strong communication skills to relate technical details to
non-technical leaders and users.
● Promote a positive working environment, encourage teamwork, and
mentor rising talent.
● Excellent time management and organizational skills, with experience
establishing guidelines for others.
● Ability to notice differences and issues as they arise and escalate
them to management.
● Facilitate discussions and explore alternative approaches to resolve
conflicts.
● Take personal accountability for decision-making and collaborating
with cross-functional teams.

Nice to Have

● Working expertise in infrastructure/application log aggregation
ingested into a security
● Experience with log aggregation tools such as ELK, Logstash, Kibana,
Splunk, or QRadar.
● Proficiency in Ansible and Python, with the ability to create complex
SQL queries for reporting and correlation.

Education

● Bachelor's degree in Computer Science, Information Technology, or a
related field is required.

Category: Information Technology

Top Skills

Python
The Company
HQ: Boston, MA
32,000 Employees
Hybrid Workplace
Year Founded: 1951

What We Do

Iron Mountain Incorporated (NYSE: IRM) is the global leader for storage and information management services. Trusted by more than 220,000 organizations around the world, Iron Mountain boasts a real estate network of more than 80 million square feet across more than 1,350 facilities in 45 countries dedicated to protecting and preserving what matters most for its customers. Iron Mountain’s solutions portfolio, which includes records management, data management, document management, data centers, art storage and logistics, and secure shredding, helps organizations lower storage costs, comply with regulations, recover from disaster, and better use their information. Founded in 1951, Iron Mountain stores and protects billions of information assets, including critical business documents, electronic information, medical data, and cultural and historical artifacts.
