Senior Site Reliability Engineer

Posted 7 Hours Ago
Hiring Remotely in USA
Remote
140K-240K Annually
Senior level
Software • Financial Services
The Role
As a Senior Site Reliability Engineer at Proof, you will ensure high availability of ECS-based production environments, oversee Kubernetes test clusters, support AWS ECS operations, collaborate on Azure dashboards, and implement security best practices. Your responsibilities include active-active deployment management, CI/CD optimization, and incident response.
Summary Generated by Built In

Proof is the world's first identity-assured transaction management platform, and we are on a mission to digitize trust for all of life’s most critical transactions. Developed by the same market leaders and experts who brought notarization online with Notarize℠, Proof offers trust in a digital world by verifying identities and securing transactions to protect businesses and their customers. Since 2015, we’ve completed many of the world’s first digital commerce transactions, including the first online real estate closing, online mortgage closing, online auto sale, and online will, and we're still just getting started!


This role focuses on improving the reliability of our ECS-based production systems. If you're ready to shift away from Kubernetes management to a robust ECS platform, this might be the role for you (some Kubernetes experience is valued).


At Proof we have two distinct environments:


Production: Runs on AWS ECS, emphasizing high availability and performance.

Test: Runs on AWS EKS, providing a flexible, containerized platform for development and testing.

What you’ll do as a Senior SRE at Proof:

  • Ensure high availability of our ECS-based production environments, improving our 99.9...% uptime
  • Oversee Kubernetes test clusters, with a focus on simplicity and high uptime for Jenkins and test environments
  • Support and enhance our AWS ECS production environments, ensuring smooth operation and scalability
  • Design and manage active-active cross-regional deployments for robust disaster recovery
  • Collaborate with developers on Azure ADX dashboards and build actionable Grafana alerts/playbooks
  • Streamline CI/CD processes
  • Participate in incident response activities and the on-call rotation. Thanks to our architectural choices, we’ve successfully kept on-call notifications very low
  • Implement security and DevOps best practices across teams.

What we’re looking for:

  • Containerization expertise: Hands-on experience with production containerized workloads
  • Kubernetes experience is valued, but this role emphasizes AWS ECS for production workloads
  • Proven track record with active-active cross-regional architectures
  • Proficiency with Terraform and/or other IaC tools
  • Experience with observability tools (e.g., Prometheus, Grafana) and log management solutions
  • Expertise in optimizing CI/CD pipelines
  • Strong scripting and automation skills
  • Collaborative, self-directed, and proactive problem-solving mindset
  • Comfortable setting priorities and taking initiative.

Preferred Qualifications:

  • Experience with cost optimization in cloud environments
  • Knowledge of networking and security best practices in cloud-native environments
  • Familiarity with auto-scaling strategies for ECS
  • Experience with PostgreSQL in a multi-region production environment.

Our stack:

  • React: Front-end monolith
  • Ruby on Rails: Backend monolith
  • Java: A handful of backend services
  • PostgreSQL, Redis, S3: state
  • HSM/PKI/CA infrastructure
  • Jenkins w/ Groovy & Python
  • AWS Lambda

Proof is committed to building an inclusive environment for people of all backgrounds and everyone is encouraged to apply. We are an equal opportunity employer and do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. We'd love to hear from you. 

Top Skills

Java
Ruby
The Company
Boston, MA
403 Employees
On-site Workplace
Year Founded: 2015

What We Do

Proof℠ is the world's first identity-assured transaction management platform. Developed by the same market leaders and experts who brought notarization online with Notarize℠, Proof offers trust in a digital world by verifying identities and securing transactions to protect your business and its customers. When risk is low and speed matters, get it signed. When the law dictates it, get it notarized. When trust matters, you need Proof.
