Senior DevOps/SRE

Posted 7 Days Ago
Hiring Remotely in United States of America
Remote
Senior level
Edtech
The Role
The Senior DevOps/SRE at Lexia Learning is responsible for defining, designing, implementing, and monitoring CI/CD technologies. Key tasks include architecting automated deployments, managing infrastructure using Terraform and AWS, and ensuring system reliability. The role requires overseeing web application monitoring, mentoring junior team members, and participating in a 24x7 on-call support rotation.
Summary Generated by Built In

Job Overview:

A Lexia Learning Senior DevOps Engineer has a pivotal role in the definition, design, implementation, and monitoring of our CI/CD and related technologies. This engineer will rely on a broad skill set and work closely with Development, Database, and IT operations teams to architect automated deployments using an infrastructure-as-code methodology for all hosted application environments. This role is also strategically focused on ensuring high availability, configuration consistency, scheduled maintenance, consistent performance, and site reliability for production and non-production web applications, web application monitoring, and all components of the DevOps toolchain.

Job Responsibilities:

  • Infrastructure as Code (IaC): architect modular and reusable Ansible playbooks and roles, and extend existing playbooks for web application deployment.
  • Architect, implement, and maintain infrastructure using Terraform on AWS.
  • Architect, implement, and maintain the configuration of CI/CD jobs and delivery pipelines using Groovy/Jenkins.
  • Manage all infrastructure code through Git version control systems.
  • Architect both private cloud (VMware, F5 LTM, Linux) and public cloud (AWS) infrastructure components.
  • Build, maintain and scale infrastructure for Production and non-Production environments.
  • Develop and maintain LXC and Docker containers for CI/CD and application deployment.
  • Architect the monitoring of highly available production web applications and dependent systems as well as maintain existing HA systems with a high degree of automation.
  • Assist in the strategic development and rapid enterprise integration of cyber security capabilities and tools to protect our networks, systems, and information.
  • Coordinate with engineering, business, and technical subject matter specialists to identify and mitigate Information Security issues and incidents.
  • Perform security monitoring and follow up on alerts from Intrusion Detection Systems (IDS) and Security Information and Event Management (SIEM) systems.
  • Troubleshoot complex problems autonomously.
  • Perform support duties within a 24x7 on-call support rotation, with flexibility to adjust hours for outages and to perform after-hours and weekend support as needed.
  • Communicate and collaborate effectively at a high level with groups such as IT operations, DB Engineering, Application Development and Management in support of architecture change and application availability.
  • Mentor less senior members of the system engineering team.

Job Requirements:

  • BA/BS degree in Computer Science or related technical field, or equivalent practical experience.
  • 6+ years of progressively responsible IT industry experience with web application hosting/development, deployment, and production support, including command-line experience in a Linux-based environment.
  • 6+ years of hands-on command-line experience in a CentOS or similar Linux-based environment.
  • 4+ years of hands-on experience with any of these container architecture and orchestration technologies: Docker, Swarm, LXC, and LXD.
  • 4+ years of experience architecting complex Ansible playbooks or equivalent IaC.
  • 4+ years of architecting Jenkins CI/CD pipeline deployments to multiple environments using Groovy seeds, Freestyle jobs, and multibranch pipelines across multiple SCMs, artifact stores, and Jenkins instances.
  • 4+ years of cumulative hands-on experience in a development capacity using one or more of the following or similar languages: Bash, Java, Groovy, Python, Ruby, PHP, Node.js, Go.
  • 2+ years of AWS cloud deployment and production support experience at scale with a variety of services.
  • 2+ years of experience with Terraform/OpenTofu.
  • Experience deploying and monitoring highly available web applications without outages.
  • Ability to safely implement changes in production systems of which you have minimal knowledge.
  • Ability to participate in a 24x7 on-call rotation.

To learn more about our organization and the exciting work we do, visit https://www.lexialearning.com/

An Equal Opportunity Employer

We are dedicated to fostering a culture that celebrates unique backgrounds, ideas, and experiences. All qualified applicants will receive consideration for employment without discrimination on the basis of race, color, age, religion, sex, gender, gender identity/expression, sexual orientation, national origin, protected veteran status, or disability.

Top Skills

Bash
Go
Groovy
Java
Node.js
PHP
Python
Ruby
The Company
HQ: Dallas, TX
1,831 Employees
On-site Workplace
Year Founded: 2009

What We Do

Cambium Learning® Group is the education essentials company, providing award-winning education technology and services for PreK-12 markets. With an intentionally curated portfolio of respected global brands, Cambium serves as a leader in the education space, helping millions of educators and students feel more universally valued each and every day. In everything it does, the company focuses on the elements that are most essential to the success of education, delivering simpler, more certain solutions that make a meaningful difference right now.

The Cambium family of companies includes: Cambium Assessment, Lexia® Learning, Learning A-Z®, Voyager Sopris Learning®, ExploreLearning®, Time4Learning®, and Kurzweil Education®.
