Sr. Site Reliability Engineer II

Posted 13 Days Ago
Be an Early Applicant
New York, NY
107K-231K
Senior level
AdTech • Marketing Tech
The Role
The Senior Site Reliability Engineer will lead team development, manage SLIs, SLOs, and SLAs, and enhance infrastructure reliability while mentoring team members.
Summary Generated by Built In

Sr. Site Reliability Engineer

Location: New York, NY - 3 days per week on-site (Required)


Who We Are

DoubleVerify (DV) is a leading independent provider of marketing measurement software, data, and analytics. We authenticate the quality and effectiveness of digital media for the world’s largest brands and media platforms, ensuring media transparency and accountability. Since 2008, DV has empowered hundreds of Fortune 500 companies to maximize their media investments by delivering best-in-class solutions across the digital ecosystem, contributing to a stronger, safer, and more secure digital advertising industry. Learn more at www.doubleverify.com.

Position Overview

As a Senior Site Reliability Engineer (SRE) at DoubleVerify, you will play a critical role in building and scaling our SRE team. This dual-role position requires both hands-on technical expertise and a passion for mentoring and educating team members. You will be responsible for implementing and promoting SRE best practices, including the development and monitoring of Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs). Your contributions will ensure the reliability, scalability, and performance of our digital media measurement platforms, directly impacting our mission of delivering media transparency and accountability.

Responsibilities

  • Team Development and Mentorship: Build and grow the SRE team by recruiting, mentoring, and educating team members on SRE principles, promoting a culture of reliability and automation.
  • Technical Contributor: Contribute directly to the design, implementation, and maintenance of highly available infrastructure and services, with a focus on automation to minimize manual intervention.
  • SLA/SLO/SLI Management: Define, monitor, and report on SLIs, SLOs, and SLAs to ensure alignment with business objectives and user expectations. Use these metrics to drive reliability improvements and guide decision-making.
  • Incident Management and Response: Develop and implement robust incident response processes, including on-call rotations and post-incident reviews, to minimize downtime and prevent recurrence.
  • Collaboration and Communication: Partner closely with development, operations, and product teams to integrate reliability into the software development lifecycle, promoting cross-functional collaboration.
  • Continuous Improvement: Analyze system performance data to identify areas for improvement, implementing solutions to enhance reliability, scalability, and efficiency.

Requirements

  • Experience: 5+ years in site reliability engineering, DevOps, or a related field, with experience mentoring and educating other engineers.
  • Technical Proficiency: Expertise in Linux/Unix systems administration, cloud platforms (AWS, GCP, or Azure), and container orchestration tools like Kubernetes.
  • Programming Skills: Proficiency in scripting and programming languages such as Python, Go, or Bash for automation and tool development.
  • Monitoring and Observability: Experience with monitoring and logging tools such as Prometheus, Grafana, Splunk, or Nagios. Proven ability to develop and track SLIs, SLOs, and SLAs.
  • Automation and Infrastructure as Code: Hands-on experience automating infrastructure and deployments using tools like Terraform, Ansible, or Chef.
  • Communication and Mentorship: Strong verbal and written communication skills, with a passion for mentoring and educating team members on technical concepts and SRE best practices.
  • Problem-Solving Aptitude: Exceptional analytical skills with a proactive approach to identifying and resolving system issues.
  • Team Collaboration: Ability to work both independently and collaboratively within a team environment.


Preferred Qualifications

  • Advanced Education: Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
  • Certifications: Relevant industry certifications such as AWS Certified DevOps Engineer, Google Professional Cloud DevOps Engineer, or Certified Kubernetes Administrator (CKA).
  • Security Awareness: Familiarity with security best practices in cloud and containerized environments.
  • Configuration Management: Experience with infrastructure as code and configuration management tools like Terraform, Ansible, or Chef.


Why Join Us

At DoubleVerify, we are committed to fostering an inclusive and dynamic workplace where employees can bring their authentic selves to work. We value passion, accountability, collaboration, and innovation, believing that diverse perspectives drive better business outcomes. Join us to contribute to a mission-driven company dedicated to enhancing the digital advertising ecosystem.

DoubleVerify is an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.


The successful candidate’s starting salary will be determined based on a number of non-discriminating factors, including qualifications for the role, level, skills, experience, location, and balancing internal equity relative to peers at DV.
The estimated salary range for this role based on the qualifications set forth in the job description is between [$107,000- $231,000]. This role will also be eligible for bonus/commission (as applicable), equity, and benefits.
The range above is for the expectations as laid out in the job description; however, we are often open to a wide variety of profiles, and recognize that the person we hire may be more or less experienced than this job description as posted.

Not-so-fun fact: Research shows that while men apply to jobs when they meet an average of 60% of job criteria, women and other marginalized groups tend to only apply when they check every box. So if you think you have what it takes but you’re not sure that you check every box, apply anyway!


Top Skills

Ansible
AWS
Azure
Bash
Chef
GCP
Go
Grafana
Kubernetes
Linux
Nagios
Prometheus
Python
Splunk
Terraform
Unix
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Chicago, IL
721 Employees
On-site Workplace
Year Founded: 2008

What We Do

DV is powering the new standard of marketing performance, giving advertisers clarity and confidence in their digital investment. Built on best practices, DV solutions create value for media buyers and sellers by bringing transparency and accountability to the market, ensuring ad viewability, brand safety, fraud protection, accurate impression delivery and audience quality across campaigns to drive performance. Since 2008, DV has helped hundreds of Fortune 500 companies gain the most value out of their media spend by delivering best in class solutions across the digital ecosystem that help build a better industry.

Learn more at doubleverify.com.

Similar Jobs

Hudson River Trading Logo Hudson River Trading

Senior Site Reliability Engineer - Enterprise Technology

Artificial Intelligence • Fintech • Other • Automation
Hybrid
New York, NY, USA
1000 Employees
150K-250K
3 Locations
14000 Employees
120K-130K Annually
Remote
2 Locations
79 Employees
180K-220K Annually

Similar Companies Hiring

Optimum Media Thumbnail
Software • Marketing Tech • Digital Media • AdTech
Long Island City, NY
270 Employees
JuiceMedia.AI Thumbnail
Marketing Tech • Machine Learning • Digital Media • Big Data Analytics • Analytics • Agency • AdTech
Marina Del Rey, CA
68 Employees
Hedra Thumbnail
Software • News + Entertainment • Marketing Tech • Generative AI • Enterprise Web • Digital Media • Consumer Web
San Francisco, CA
14 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account