Software Engineer, Reliability & Availability

Reposted 16 Days Ago
Be an Early Applicant
Mountain View, CA
130K-260K Annually
Junior
Information Technology • Software
The Role
As a Software Engineer in Reliability & Availability, you will enhance cloud infrastructure stability and resilience through automation, monitoring, and incident management.
Summary Generated by Built In
About NewsBreak

NewsBreak is redefining the way users interact with local news and their communities. By bridging local users, local content creators, and local businesses, our mission is to foster safer, more vibrant, and authentically connected lives. Through robust collaborations with thousands of local publishers and businesses across the nation, NewsBreak is revolutionizing how a new wave of readers access and engage with essential, locally sourced content & information.

Since our inception in 2015, our trajectory has been nothing short of remarkable. We proudly stand as the nation’s premier local news app.

As a Series-C unicorn startup, our headquarter nestles in the tech hub of Mountain View, California, with other offices in New York City and Seattle. For more information, visit www.newsbreak.com/about

About the role

As a Software Engineer in Reliability & Availability, you will be responsible for ensuring the stability, scalability, and resiliency of our cloud infrastructure and services. Working at the core of SRE, system performance, and availability management, you will design robust solutions to minimize downtime, optimize performance, and enhance system reliability. Your focus will be on AWS cloud infrastructure, Kubernetes (EKS), and big data processing (EMR), implementing high availability, fault tolerance, and self-healing mechanisms for distributed systems. Through automation, proactive monitoring, and incident response, you will help maintain seamless operations across our cloud-native platforms.

Responsibilities

  • Ensure service reliability and availability by designing and implementing fault-tolerant architectures leveraging AWS, EKS (Elastic Kubernetes Service), and EMR (Elastic MapReduce).
  • Build, automate, and optimize infrastructure for high-performance, scalable, and resilient cloud services.
  • Develop monitoring, observability, and alerting solutions to proactively detect and mitigate service degradation and performance bottlenecks.
  • Improve service lifecycle management, from capacity planning and launch reviews to post-incident analysis and continuous optimization.
  • Enhance auto-scaling mechanisms to dynamically adjust resources and maintain system stability under varying workloads.
  • Drive automation using Infrastructure-as-Code (IaC) and CI/CD pipelines to minimize manual intervention and improve service resilience.
  • Engage in on-call rotations, manage incidents, conduct blameless postmortems, and drive long-term reliability improvements.

Requirements

  • BS or MS in Computer Science, Engineering, or a related field, with at least 2+ years of experience in SRE, DevOps, or Infrastructure Engineering roles.
  • Strong programming experience in at least one of the following: C, C++, Java, Python, or Go.
  • Hands-on experience with cloud platforms (AWS, GCP, or Azure), with a strong emphasis on AWS services (EKS, EMR, EC2, RDS, S3).
  • Deep understanding of Kubernetes (EKS) and containerized workloads, including scaling, monitoring, and failure recovery strategies.
  • Strong experience with monitoring tools (Prometheus, Grafana,) ,log management (ELK, CloudWatch, Splunk), distributed tracing and profiling solutions.
  • Extensive experience supporting production Internet services, troubleshooting performance issues, and implementing high-availability strategies.
  • Strong problem-solving and debugging skills, with a systematic approach to incident response, root cause analysis, and continuous improvement.

The US base salary range for this full-time position is listed below. Pay may vary based on a number of factors including job-related skills, level, experience, geographic location and relevant education or training. At NewsBreak, we design our overall rewards package to attract top talents. Depending on the position, the role may also be eligible for discretionary bonus and options. Your recruiter can share more details during the hiring process.

Annual Base Pay Range

$130,000$260,000 USD

CPRA Privacy Notice for California Candidates


Top Skills

AWS
C
C++
Cloudwatch
Elk
Emr
Go
Grafana
Java
Kubernetes
Prometheus
Python
Splunk
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Mountain View, , CA
346 Employees
On-site Workplace
Year Founded: 2015

What We Do

NewsBreak is the leading platform for local news and information, with more than 40 million users across America. By using new technology, NewsBreak provides community-focused news and information from over 10,000 sources in a timely and accessible way. NewsBreak is bridging the gap between new technology and traditional local media, offering an innovative digital solution that allows users to get the information they need to live safer, more vibrant, and connected lives.

Based in Mountain View, California, NewsBreak connects users with local information, national publishers, and targeted advertising from local businesses, with increased traffic and revenue that helps strengthen local communities.

We are always looking for great talents. Contact us via [email protected]

Our website:
https://www.newsbreak.com/about

Creator Program:
https://www.newsbreak.com/creators

Publishing Platform:
https://mp.newsbreakapp.com

Android App Download:
https://play.google.com/store/apps/details?id=com.particlenews.newsbreak&hl=en

iOS App Download:
https://itunes.apple.com/us/app/news-break-personal-local/id1132762804?mt=8

Similar Jobs

Pfizer Logo Pfizer

Senior Associate Scientist, Cell Bank

Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
Hybrid
La Jolla, CA, USA
121990 Employees
67K-111K Annually

ServiceNow Logo ServiceNow

Staff Software Engineer

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Hybrid
Santa Clara, CA, USA
26000 Employees
164K-286K Annually

ServiceNow Logo ServiceNow

Senior Engineering Manager

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Hybrid
Santa Clara, CA, USA
26000 Employees
188K-328K Annually

Chime Logo Chime

Staff Software Engineer, Data Platform

Fintech • Machine Learning • Mobile • Security • Software
Easy Apply
Hybrid
San Francisco, CA, USA
1459 Employees

Similar Companies Hiring

True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees
Caliola Engineering Thumbnail
Software • Machine Learning • Hardware • Defense • Data Privacy • App development • Aerospace
Colorado Springs, CO
53 Employees
Red 6 Thumbnail
Virtual Reality • Software • Hardware • Defense • Aerospace
Orlando, Florida
113 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account