Senior Site Reliability Engineer

Posted 2 Days Ago
Arlington Heights, IL
Expert/Leader
Artificial Intelligence • Healthtech • Analytics • Biotech
The Role
Senior Site Reliability Engineer responsible for performance, monitoring, and automation of Compute and Network infrastructure, ensuring optimal service availability and reliability.
Summary Generated by Built In

Job Description Summary

The Senior Site Reliability Engineer will be responsible for the performance and availability of Compute and Network infrastructure consumed by all business segments. The Site Reliability team is composed of highly talented individuals obsessively focused on availability through operational excellence. The ideal individual is relentlessly technical, passionate about automating everything, and committed to delivering amazing customer experiences.

Job Description

Roles and Responsibilities

  • Establish performance baselines and capacity thresholds, correlate events, and define monitoring/alerting criteria.
  • Develop automated solutions to address potential problems before they result in a service interruption.
  • Provide impact assessment and mitigation plan for changes going into the production environment.
  • Investigate root cause of severe and systemic outages, identify corrective actions and apply across the enterprise.
  • Develop availability measures that align with consumer experience to accurately assess the usability of crucial services.
  • Build capacity models to baseline transactional load compared to resource performance and leverage data to predict overall system capacity while automating load placement to avoid outages.
  • Identify thresholds for all critical links in the data path to quickly isolate where imbalances may result in potential outages.
  • Analyse failure points in services to model risk level and resolution steps if failure occurs.
  • Assist in driving architecture enhancements into the system to mitigate potential failure points.
  • Programmatically monitor for and remediate configuration drift of critical devices.
  • Develop response plans to potential failure points and evaluate effectiveness during planned tests.
  • Perform comprehensive operational health checks of all services to identify areas of concern and track activities to drive improvements at all levels of the architecture.
  • Provide technical coaching and direction to more junior teammates.

Qualifications/Essential Requirements

  • Bachelor's Degree in Computer Science or a STEM major (Science, Technology, Engineering, and Math) with at least 10 years of progressive experience.
  • Experience in site reliability engineering, with a focus on AWS.
  • Strong understanding of AWS Services, architecture, and best practices.
  • Experience with configuring, customizing, and extending monitoring/APM tools (Datadog, Kloudfuse, Grafana, Splunk, etc.).
  • Operational experience in complex distributed systems, including defining, measuring, and monitoring SLOs/SLAs for availability and reliability goals.
  • Experience with incident management and post-incident reviews.
  • AWS Certified Solutions Architect Associate or AWS Certified DevOps Engineer certification is a plus.

Preferred Qualifications

  • Expertise in the management and administration of Kubernetes clusters.
  • Strong background in scripting, automation, configuration management, and infrastructure-as-code practices (Terraform, AWS CloudFormation, Crossplane, Pulumi, etc.).
  • Good understanding of DevOps practices, CI/CD pipelines, and version control systems (Git). Experience in GitOps is a plus.
  • Strong knowledge of Unix-based operating systems, workload management, and networking systems.

#L1-Hybrid

Additional Information

GE HealthCare offers a great work environment, professional development, challenging careers, and competitive compensation. GE HealthCare is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, national or ethnic origin, sex, sexual orientation, gender identity or expression, age, disability, protected veteran status or other characteristics protected by law.

GE HealthCare will only employ those who are legally authorized to work in the United States for this opening. Any offer of employment is conditioned upon the successful completion of a drug screen (as applicable).

While GE HealthCare does not currently require U.S. employees to be vaccinated against COVID-19, some GE HealthCare customers have vaccination mandates that may apply to certain GE HealthCare employees.

Relocation Assistance Provided: No

Top Skills

AWS
AWS CloudFormation
Crossplane
Datadog
Git
Grafana
Kloudfuse
Pulumi
Splunk
Terraform
Unix

The Company
Chicago, IL
50,282 Employees
On-site Workplace
Year Founded: 1892

What We Do

Every day millions of people feel the impact of our intelligent devices, advanced analytics and artificial intelligence.

As a leading global medical technology and digital solutions innovator, GE HealthCare enables clinicians to make faster, more informed decisions through intelligent devices, data analytics, applications and services, supported by its Edison intelligence platform.

With over 100 years of healthcare industry experience and around 50,000 employees globally, the company operates at the center of an ecosystem working toward precision health, digitizing healthcare, helping drive productivity and improve outcomes for patients, providers, health systems and researchers around the world.

We embrace a culture of respect, transparency, integrity and diversity.
