Site Reliability Engineer

Posted 14 Days Ago
Hiring Remotely in Open Road, AL
Remote
Junior
Digital Media • Social Media
The Role
As a Site Reliability Engineer at TextNow, you will ensure system reliability by designing and maintaining resilient infrastructures, automate cloud deployments with Terraform and Ansible, handle incident responses, and collaborate with teams to improve operational practices and system reliability.
Summary Generated by Built In

We believe communication belongs to everyone. We exist to democratize phone service. TextNow is evolving the way the world connects and that's because we're made up of people with curious minds who bring an optimistic, yet critical lens into the work we do. We're the largest provider of free phone service in the nation. And we're just getting started.


Join us in our mission to break down barriers to communication and free the flow of conversation for people everywhere.


TextNow is looking for a new member of our SRE team who will be responsible for our infrastructure, automation, observability, and a few other critical functions.

What You'll Do:

  • Ensure System Reliability: Design, build, and maintain scalable, resilient, and highly available systems to support TextNow’s infrastructure and services.
  • Automation & Infrastructure as Code: Develop and maintain automation using Terraform, Ansible, and other tools to enable efficient deployment, scaling, and operations of cloud-based systems (AWS preferred).
  • Incident Response & On-Call Support: Participate in an on-call rotation, troubleshoot issues, and drive incident resolution to minimize downtime and improve system performance. Conduct post-mortems and implement corrective actions to enhance reliability.
  • Performance Monitoring & Optimization: Implement and improve observability tools, logging, and monitoring solutions to identify and mitigate potential system issues proactively.
  • Collaboration & Cross-Team Engagement: Work closely with software engineers, DevOps, and product teams to align technical efforts with business objectives and improve system reliability from development to production.
  • Continuous Improvement: Identify areas for improvement in architecture, automation, and operational practices. Contribute to the design and implementation of new SRE best practices.

Who You Are:

  • Experienced in SRE/DevOps: You have 2+ years of experience in an operationally focused role, such as SRE, DevOps, or Infrastructure Engineering, with a deep understanding of reliability, scalability, and performance optimization.
  • Proficient with Key Technologies: Hands-on experience with AWS, GitHub, Terraform, Ansible, or similar tools to build and manage cloud infrastructure efficiently.
  • Incident Management Expert: You are comfortable handling production incidents, analyzing root causes, and implementing long-term fixes to prevent recurrence.
  • Automation & Observability Focused: Passionate about reducing toil through scripting and automation while ensuring robust observability using logging, metrics, and monitoring tools.
  • Collaborative & Impact-Driven: You enjoy working cross-functionally with engineers, product teams, and leadership to drive meaningful improvements to system reliability.

More about TextNow...


Our Values:

· Customer Obsessed (We strive to have a deep understanding of our customers)

· Do Right By Our People (We treat each other with fairness, respect, and integrity)

· Accept the Challenge (We adopt a "Yes, We Can" mindset to achieve ambitious goals)

· Act Like an Owner (We treat this company like it's our own... because it is!)

· Give a Damn! (We are deeply committed and passionate about our work and achieving results)


Benefits, Culture, & More:

· Strong work life blend 

· Flexible work arrangements (wfh, remote, or access to one of our office spaces)

· Employee Stock Options 

· Unlimited vacation 

· Competitive pay and benefits

· Parental leave

· Benefits for both physical and mental well being (wellness credit and L&D credit)

· We travel a few times a year for various team events, company wide off-sites, and more


Diversity and Inclusion:

At TextNow, our mission is built around inclusion and offering a service for EVERYONE, in an industry that traditionally only caters to the few who have the means to afford it. We believe that diversity of thought and inclusion of others promotes a greater feeling of belonging and higher levels of engagement. We know that if we work together, we can do amazing things, and that our differences are what make our product and company great. 


TextNow Candidate Policy

By submitting an application to TextNow, you agree to the collection, use, and disclosure of your personal information in accordance with the TextNow Candidate Policy

Top Skills

Ansible
AWS
Terraform
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Waterloo, Ontario
239 Employees
Hybrid Workplace
Year Founded: 2009

What We Do

Our mission is to break down the barriers to communication and free the flow of conversation for people everywhere.
TextNow is evolving the way the world connects and that’s because we’re made up of people with curious minds who bring an optimistic, yet critical lens into the work we do.
We’re the largest provider of free phone service in the nation. And we’re just getting started. Join TextNow and break down barriers to communication with us.

Similar Jobs

GitLab Logo GitLab

Intermediate Site Reliability Engineer, US Public Sector Services

Cloud • Security • Software • Cybersecurity • Automation
Easy Apply
Remote
US
2350 Employees
104K-222K Annually

GitLab Logo GitLab

Intermediate Site Reliability Engineer, FinOps

Cloud • Security • Software • Cybersecurity • Automation
Easy Apply
Remote
29 Locations
2350 Employees

DFIN Logo DFIN

Principal Site Reliability Engineer - Cloud (Remote)

Artificial Intelligence • Fintech • Information Technology • Software • Data Privacy
Remote
United States
2600 Employees

The PNC Financial Services Group Logo The PNC Financial Services Group

Site Reliability Engineer - Tempus

Machine Learning • Payments • Security • Software • Financial Services
Remote
USA
56000 Employees

Similar Companies Hiring

Artlist Thumbnail
Social Media • Other • Music • Digital Media
Tel Aviv, IL
450 Employees
bet365 Thumbnail
Software • Gaming • Esports • Digital Media • Automation
Denver, Colorado
9000 Employees
Hedra Thumbnail
Software • News + Entertainment • Marketing Tech • Generative AI • Enterprise Web • Digital Media • Consumer Web
San Francisco, CA
14 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account