Senior Observability Engineer

Posted 2 Days Ago
Be an Early Applicant
Hiring Remotely in United States
Remote
Mid level
Cloud • Information Technology
The Role
The Senior Observability Engineer will implement and maintain observability solutions, ensuring scalability and reliability while collaborating with cross-functional teams. Responsibilities include developing monitoring systems, creating dashboards, identifying performance issues, and documenting systems. The role aims to enhance customer experiences by effectively managing application performance and system reliability.
Summary Generated by Built In

We are looking for talented, creative, and proactive individuals who are passionate about solving complex business problems and contributing to the next generation of modern applications. Our goal is to help our customers understand the connections between application performance, user experience, and business outcomes, thereby creating exceptional customer experiences. Join us in shaping the future of Observability Engineering within our Intelligent Operations team with innovative data and integration solutions tools.

Responsibilities

  • Implement and maintain cutting-edge Observability solutions utilizing tools like New Relic, Datadog, AppDynamics, or Dynatrace for our large-scale enterprise customers.
  • Develop and maintain systems for effective monitoring, logging, and tracing, ensuring scalability and reliability.
  • Collaborate with cross-functional teams, including software engineers, product managers, and data scientists, to build resilient systems.
  • Integrate observability practices into different engineering workflows and lead the adoption, optimization, and integration of products within the customer’s business infrastructure.
  • Create custom dashboards, set up alerts, and develop AIOps rules, ensuring effective tracking against goals/KPIs.
  • Provide technical support in post-sales processes, including installation, deployment, training, technical check-ups, and escalation management.
  • Identify performance bottlenecks and anomalous system behavior and resolve root causes of service issues.
  • Stay updated with the latest trends in observability, logging, monitoring, and cloud technologies and introduce innovative solutions and best practices.
  • Participate in strategic technology planning, focusing on scalability, cost-effectiveness, and risk management in observability infrastructure.
  • Document observability systems and processes comprehensively and prepare reports for management on system performance and reliability.
  • Utilize Infrastructure as Code (IaC) principles for efficient infrastructure provisioning and management.

Qualifications

  • Minimum 3-5 years of hands-on experience with Application Performance Management tools such as Datadog, New Relic, AppDynamics, Dynatrace, Splunk ITSI, Honeycomb, Chronosphere, Riverbed Aternity/Alluvio, ExtraHop, & Logic Monitor.
  • Hands-on experience with cloud-native, open-source solutions like Prometheus, Grafana, ELK stack/Elastic.io, OpenTelemetry (OTEL),
  • Experience with public cloud solutions like CloudWatch, App Insights, etc.
  • Strong understanding of network & system management solutions, distributed systems, networking, and database technologies.
  • Operational background and familiarity with ITIL ITSM, SRE, or DevOps best practices and principles.
  • Excellent problem-solving skills, organizational, project management, and communication skills.
  • Eagerness to collaborate, contribute to team success, and a continuous learning mindset.

Top Skills

Appdynamics
Datadog
Dynatrace
Elk Stack
Grafana
New Relic
Opentelemetry
Prometheus
The Company
HQ: Chicago, IL
1,154 Employees
On-site Workplace
Year Founded: 2007

What We Do

AHEAD builds platforms for digital business. By weaving together cloud infrastructure, intelligent operations, and modern applications, we help enterprises deliver on the promise of digital transformation.

Similar Jobs

NBCUniversal Logo NBCUniversal

Senior Full Stack JavaScript Developer

AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Remote
Hybrid
Seattle, WA, USA
68000 Employees
155K-175K Annually

NBCUniversal Logo NBCUniversal

ServiceNow Staff Software Engineer

AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Remote
Hybrid
New York, NY, USA
68000 Employees
130K-190K Annually

NBCUniversal Logo NBCUniversal

Sr Staff Software Engineer, Cloud Control Plane

AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Remote
Hybrid
New York, NY, USA
68000 Employees
145K-210K Annually

NBCUniversal Logo NBCUniversal

Data Engineer

AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Remote
Hybrid
New York, NY, USA
68000 Employees
100K-135K Annually

Similar Companies Hiring

MassMutual India Thumbnail
Insurance • Information Technology • Fintech • Financial Services • Big Data
Hyderabad, Telangana
Silverfort Thumbnail
Security • Sales • Information Technology • Cybersecurity • Automation
SG
357 Employees
Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account