Senior Site Reliability Engineer

Posted 3 Days Ago
Winston-Salem, NC
Senior level
Fintech • Information Technology • Analytics
The Role
As a Senior Site Reliability Engineer, you will ensure application health and reliability, collaborate with developers, automate infrastructure, monitor performance, optimize resource utilization, and adhere to security best practices while participating in on-call support.

As a Site Reliability Engineer (SRE), you'll play a pivotal role in ensuring the health, reliability, performance, and scalability of our applications. You'll bridge the gap between development and operations, leveraging your technical expertise and problem-solving skills to triage production issues, automate operations, optimize processes, and maintain high availability.

Key Responsibilities:

Steward of Application Health: Work closely with application developers to design resilient, scalable, and maintainable applications, ensuring they meet operational requirements and minimize downtime.

Collaboration: Participate in code review; mentor and train peers; advocate DevOps principles to application developers.

Infrastructure Automation: Develop and maintain automation and tools to streamline deployments, configuration management, and infrastructure provisioning.

Monitoring and Alerting: Implement robust monitoring systems to proactively identify and address performance bottlenecks, anomalies, and security threats.

Capacity Planning: Forecast resource needs and optimize infrastructure utilization to ensure high availability and performance.

Change Management: Collaborate with development teams to ensure smooth deployment of new features and updates, minimizing disruptions.

Security and Compliance: Adhere to security best practices and implement measures to protect systems from vulnerabilities and threats.

Guardian of SLA: Actively monitor and maintain the health and performance of applications, ensuring they meet Service Level Agreements. Respond to, triage, and mitigate emergent problems in production.

On-Call Support: Participate in on-call rotation to provide timely support and resolution for critical issues.

Required Skills and Experience:

Strong programming skills in languages such as Python, Ruby, Bash, Rust, and Go

Experience with cloud platforms (AWS, GCP, Azure) and infrastructure-as-code tools (CloudFormation, Terraform, Ansible, Chef)

Deep understanding of containerization technologies (Docker, Kubernetes)

Proficiency with Linux (Ubuntu)

Proficiency in monitoring and alerting tools (CloudWatch, Prometheus, Grafana)

Knowledge of DevOps practices and methodologies (CI/CD, Agile)

Excellent problem-solving and troubleshooting skills

Strong communication and collaboration abilities

Preferred Skills and Experience:

Knowledge of asynchronous processing (Kafka, Celery)

SQL database administration and query optimization (Postgres, MySQL)

Experience with GitHub Actions

Why Join Us:

Be part of a dynamic and innovative team

Work on cutting-edge technologies and projects

Opportunity for professional growth and development

If you are passionate about building reliable and scalable systems and have a strong foundation in DevOps, we encourage you to apply.

We are an Equal Opportunity Employer, including disability/vets.

Top Skills

Bash
Go
Python
Ruby
Rust
The Company
HQ: Winston-Salem, NC
2,044 Employees
On-site Workplace
Year Founded: 1980

What We Do

We reimagine everyday business challenges through advanced analytics and technology-enabled, market-driven solutions built to solve some of industry’s biggest obstacles to growth. Inmar Intelligence’s customer-centric approach is evident through our success helping companies dynamically engage audiences, build brand loyalty, create efficiencies, and drive profitable growth.

We help leading Fortune 500 companies and emerging brands stay relevant and propel growth while providing their consumers with personalized and precision-driven tools to save money, improve health and safety, and more conveniently go about their lives.

For more than 35 years, we have served retailers, manufacturers, healthcare providers, government and employers as their trusted intermediary and helped them redefine innovation.
