Senior Site Reliability Engineer

Posted 18 Days Ago
Be an Early Applicant
Winston-Salem, NC
Senior level
Fintech • Information Technology • Analytics
The Role
The Senior Site Reliability Engineer ensures application reliability and scalability, automates operations, monitors performance, and maintains high availability while collaborating with development teams on best practices.
Summary Generated by Built In

As a Site Reliability Engineer (SRE), you'll play a pivotal role in ensuring the health, reliability, performance, and scalability of our applications. You'll bridge the gap between development and operations, leveraging your technical expertise and problem-solving skills to triage production issues, automate operations, optimize processes, and maintain high availability.

Key Responsibilities:

    Steward of Application Health: Work closely with application developers to design resilient, scalable, and maintainable applications, ensuring they meet operational requirements and minimize downtime.

     Collaboration: Participate in code review; mentor and train peers; advocate DevOps principals to application developers.

     Infrastructure Automation: Develop and maintain automation and tools to streamline deployments, configuration management, and infrastructure provisioning.

     Monitoring and Alerting: Implement robust monitoring systems to proactively identify and address performance bottlenecks, anomalies, and security threats.

     Capacity Planning: Forecast resource needs and optimize infrastructure utilization to ensure high availability and performance.

     Change Management: Collaborate with development teams to ensure smooth deployment of new features and updates, minimizing disruptions.

     Security and Compliance: Adhere to security best practices and implement measures to protect systems from vulnerabilities and threats.

     Guardian of SLA: Actively monitor and maintain the health and performance of applications, ensuring they meet Service Level Agreements. Respond to, triage and mitigate emergent problems in production.

     On-Call Support: Participate in on-call rotation to provide timely support and resolution for critical issues.

Required Skills and Experience:

     Strong programming skills in languages like Python, Ruby, Bash, Rust, Go.

     Experience with cloud platforms (AWS, GCP, Azure) and infrastructure as code tools (Cloudformation, Terraform, Ansible, Chef)

     Deep understanding of containerization technologies (Docker, Kubernetes)

     Proficiency with linux (Ubuntu)

     Proficiency in monitoring and alerting tools (Cloudwatch, Prometheus, Grafana)

     Knowledge of DevOps practices and methodologies (CI/CD, Agile)

     Excellent problem-solving and troubleshooting skills

     Strong communication and collaboration abilities

Preferred Skills and Experience:

     Knowledge of asynchronous processing (Kafka, Celery)

     SQL Database administration and query optimization (Postgres, MySQL)

     Experience with Github Actions

 

Why Join Us:

     Be part of a dynamic and innovative team

     Work on cutting-edge technologies and projects

     Opportunity for professional growth and development

 

If you are passionate about building reliable and scalable systems and have a strong foundation in DevOps, we encourage you to apply.

We are an Equal Opportunity Employer, including disability/vets.

This position is not eligible for student visa sponsorship, including F-1 OPT or CPT. Candidates must have authorization to work in the U.S. without the need for employer sponsorship now or in the future.

Top Skills

Ansible
AWS
Azure
Bash
Celery
Chef
CloudFormation
Cloudwatch
Docker
GCP
Github Actions
Go
Grafana
Kafka
Kubernetes
MySQL
Postgres
Prometheus
Python
Ruby
Rust
Terraform
Ubuntu
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Winston-Salem, NC
2,044 Employees
On-site Workplace
Year Founded: 1980

What We Do

We reimagine everyday business challenges through advanced analytics, technology-enabled and market-driven solutions built to solve some of industries’ biggest obstacles to growth. Inmar Intelligence’s customer-centric approach is evident through our success helping companies dynamically engage audiences, build brand loyalty, create efficiencies and drive profitable growth.

We help leading Fortune 500 companies and emerging brands stay relevant and propel growth while providing their consumers with personalized and precision-driven tools to save money, improve health and safety, and more conveniently go about their lives.

For more than 35 years, we have served retailers, manufacturers, healthcare providers, government and employers as their trusted intermediary and helped them redefine innovation.

Similar Jobs

NVIDIA Logo NVIDIA

Senior Site Reliability Engineer, HPC and LSF

Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
4 Locations
21960 Employees
Durham, NC, USA
58848 Employees
6 Locations
68 Employees

Dune Logo Dune

Senior Site Reliability Engineer

Software • Analytics • Financial Services • Cryptocurrency
Remote
42 Locations
97 Employees

Similar Companies Hiring

Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Enterprise Web • Consulting • Cloud
Chicago, IL
45 Employees
InCommodities Thumbnail
Renewable Energy • Machine Learning • Information Technology • Energy • Automation • Analytics
Austin, TX
234 Employees
HERE Technologies Thumbnail
Software • Logistics • Internet of Things • Information Technology • Computer Vision • Automotive • Artificial Intelligence
Amsterdam, NL
6000 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account