Observability Architect

Posted 6 Days Ago
Hiring Remotely in United States
Remote
Expert/Leader
Cloud • Information Technology
The Role
The Observability Architect will design and implement scalable observability solutions, lead optimization of monitoring systems, mentor team members, guide technical direction, and drive integration of observability best practices across engineering teams. Collaboration with cross-functional stakeholders to align observability strategies with business outcomes is also key.

We are looking for talented, creative, and proactive individuals who are passionate about solving complex business problems and contributing to the next generation of modern applications. Our goal is to help our customers understand the connections between application performance, user experience, and business outcomes, thereby creating exceptional customer experiences. Join us in shaping the future of Observability Engineering within our Intelligent Operations team, using innovative data and integration solutions and tools. This role is known internally as Principal Technical Consultant.

Responsibilities

  • Architect and implement scalable and resilient observability solutions, leveraging tools like Datadog, New Relic, AppDynamics, Dynatrace, and open-source technologies for large-scale enterprise environments.
  • Lead the design, development, and optimization of monitoring, logging, and tracing systems, ensuring scalability, cost-efficiency, and business alignment.
  • Serve as a technical advisor and thought leader, driving the integration of observability best practices across our clients’ engineering and product teams.
  • Collaborate with cross-functional stakeholders, including engineering leads, product managers, and business executives, to design observability strategies aligned with business goals.
  • Mentor and coach team members, building technical expertise within the organization and fostering a culture of innovation and operational excellence.
  • Conduct architectural reviews and evaluations to ensure observability solutions meet business, security, and compliance requirements.
  • Stay ahead of industry trends in AI-powered observability, logging, monitoring, and cloud-native technologies, and guide teams in adopting these innovations.
  • Create and maintain comprehensive documentation for observability systems, frameworks, and workflows, ensuring knowledge sharing across teams.
  • Establish and refine KPIs, develop custom dashboards, implement AIOps rules, and proactively address performance bottlenecks and anomalies.
  • Participate in post-sales activities, including customer onboarding, training, and providing advanced technical escalation support when required.
  • Drive strategic technology planning, ensuring observability infrastructure remains scalable, resilient, and cost-effective as the business grows.

Qualifications

  • 10-15 years of progressive hands-on experience in Observability, Application Performance Management, or related fields, with at least 5 years in senior or lead roles.
  • Extensive experience with Application Performance Management tools, including Datadog, New Relic, AppDynamics, Dynatrace, Splunk ITSI, Honeycomb, Chronosphere, Riverbed Aternity/Alluvio, ExtraHop, and LogicMonitor.
  • Deep expertise in cloud-native, open-source observability solutions, such as Prometheus, Grafana, the ELK (Elastic) Stack, and OpenTelemetry (OTel).
  • Advanced understanding of public cloud observability tools, including Amazon CloudWatch, Azure Application Insights, and Google Cloud Operations Suite (formerly Stackdriver).
  • Strong foundational knowledge of distributed systems, networking, and database technologies, with experience architecting and scaling solutions in enterprise environments.
  • Proven experience leading teams and guiding technical direction, including architecting and building complex solutions from the ground up.
  • Operational background, including familiarity with ITIL/ITSM, SRE, and DevOps practices and the ability to implement and evangelize these principles across teams.
  • Demonstrated ability to mentor and develop junior engineers and foster a culture of innovation and collaboration.
  • Exceptional problem-solving, communication, and organizational skills, with experience presenting to executive stakeholders.

Top Skills

AppDynamics
AWS
Azure
Chronosphere
Datadog
Dynatrace
ELK
ExtraHop
GCP
Grafana
Honeycomb
New Relic
Prometheus
Splunk
The Company
HQ: Chicago, IL
1,154 Employees
On-site Workplace
Year Founded: 2007

What We Do

AHEAD builds platforms for digital business. By weaving together cloud infrastructure, intelligent operations, and modern applications, we help enterprises deliver on the promise of digital transformation.
