Senior Site Reliability Engineer

Posted 10 Hours Ago
Be an Early Applicant
Denver, CO
137K-178K Annually
Senior level
Cloud • Enterprise Web • Information Technology • Other
We Connect What's Next
The Role
The Senior Site Reliability Engineer is responsible for maintaining the uptime and scalability of critical infrastructure, managing incidents, proactively identifying system risks, collaborating with teams, and employing best practices in automation and monitoring.
Summary Generated by Built In

Company Description

Zayo provides mission-critical bandwidth to the world’s most impactful companies, fueling the innovations that are transforming our society. Zayo’s 141,000-mile network in North America and Europe includes extensive metro connectivity to thousands of buildings and data centers. Zayo’s communications infrastructure solutions include dark fiber, private data networks, wavelengths, Ethernet, and dedicated Internet access. Zayo serves wireless and wireline carriers, media, tech, content, finance, healthcare and other large enterprises.

Do you dream in high scalable systems, thrive in fast-paced environments and enjoy tackling complex technical challenges? Are you passionate about building and maintaining highly reliable, scalable systems? If so, Zayo is seeking a Senior Site Reliability Engineer! We're looking for a talented Senior Site Reliability Engineer to play a critical role in ensuring the uptime, performance, and scalability of our critical infrastructure.

Responsibilities:

  • Incident Management: Own the incident lifecycle, from leading root cause analysis and resolution to implementing preventative measures to avoid future occurrences. Be on-call to diagnose and resolve critical service outages.

  • Reliability Engineering: Proactively identify and mitigate potential system risks, focusing on automation, monitoring, and tooling to ensure high service availability.

  • Scalability and Performance: Design and implement solutions to ensure our infrastructure can handle ever-growing demands while maintaining optimal application performance.

  • Collaboration: Work closely with developers, product managers, and other engineers to translate business needs into robust and reliable technical solutions. Become the beacon for best practices and efficient processes throughout the organization.

  • Continuous Learning: Stay up-to-date with the latest trends and technologies in SRE practices, automation tools, and cloud platforms.

Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience).

  • Minimum of seven (7) years of experience in a Site Reliability Engineering or related role.

  • Strong understanding of system administration, Linux, and scripting languages (Python and various shells).

  • Experience working in large scale distributed production environments.

  • Experience with cloud platforms especially troubleshooting and debugging in AWS using AWS native tools.

  • Experience with container orchestration (Kubernetes and Docker).

  • Experience deploying and managing scalable infrastructure within AWS and Kubernetes ecosystems using Terraform and other cloud-native approaches.

  • Experience with infrastructure management tools such as Ansible, Terrafor, Puppet, etc.

  • Experience with monitoring and alerting tools (Prometheus, Grafana, Cacti, etc.).

  • Experience with monitoring platforms such as SevOne, Assure1, and Nagios.

  • Proven ability to work independently and as part of a team.

  • Excellent problem-solving, analytical, and critical thinking skills.

  • A passion for automation and building efficient systems.

Bonus Points if you have:

  • Experience working with various vendor APIs (or netconf) including Nokia, Juniper, Fujitsu, Infinera, Cisco, and Ciena.

  • Strong working knowledge of networking concepts and application protocols, especially TCP/IP, BGP, DNS, and more with focused experience on Internet Service Provider services such as IP VPN, Transit, and Waves.

  • Experience with various network orchestration platforms such as Ciena Blue Planet MDSO, Cisco NSO, Nokia NSP, or others.

  • Previous experience with Golang.

Base Salary Range: $136,900 - $178,000 USD/annually, commensurate with experience.

Benefits, Rewards & Wellness

  • Excellent Health, Dental & Vision Insurance

  • Retirement 401(k) Savings Plan

  • Fitness membership discounts

  • Generous paid time off policy including paid parental leave

Zayo provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, provincial or local laws.

This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and training.

Top Skills

Python
The Company
HQ: Boulder, CO
4,000 Employees
Hybrid Workplace
Year Founded: 2007

What We Do

Zayo Group Holdings, Inc. is a leading global communications infrastructure platform, delivering a range of solutions, including fiber & transport, packet and managed edge services. Zayo owns and operates a Tier 1 IP backbone spanning 134,000 miles across North America and Europe. By providing this mission-critical bandwidth to its category-leading customers across the wireless, hyperscale, media, tech and finance industries, Zayo is fueling the innovations that are transforming society. For more information, visit https://zayo.com.

Why Work With Us

We are ambitious and collaborative. Our culture is centered on excellence and exceeding customer expectations through high performance, big ideas, and a growth mindset.

Gallery

Gallery

Similar Jobs

Spectrum Logo Spectrum

Sr Site Reliability Engineer: Infrastructure as a Service for On-Premises Cloud - Storage

Information Technology • Internet of Things • Mobile • On-Demand • Software
Greenwood Village, CO, USA
100000 Employees
88K-157K Annually

Visa Inc, Logo Visa Inc,

Sr. Site Reliability Engineer

Fintech • Information Technology • Payments
Highlands Ranch, CO, USA
26500 Employees
117K-150K Annually

Visa Inc, Logo Visa Inc,

Sr. Site Reliability Engineer - PRE

Fintech • Information Technology • Payments
Highlands Ranch, CO, USA
26500 Employees

Similar Companies Hiring

Silverfort Thumbnail
Security • Sales • Information Technology • Cybersecurity • Automation
GB
357 Employees
Artlist Thumbnail
Social Media • Other • Music • Digital Media
Tel Aviv, IL
450 Employees
Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account