Principal Site Reliability Engineer

Sorry, this job was removed at 09:59 a.m. (CST) on Monday, Jan 27, 2025
Be an Early Applicant
Hiring Remotely in United States
Remote
Software
The Role

Jama Software® is focused on maximizing innovation success in multidisciplinary engineering organizations. Numerous firsts for humanity in fields such as fuel cells, electrification, space, software-defined vehicles, surgical robotics, and more all rely on Jama Connect® requirements management software to minimize the risk of defects, rework, cost overruns, and recalls. Using Jama Connect, engineering organizations can now intelligently manage the development process by leveraging Live Traceability™ across best-of-breed tools to measurably improve outcomes. Our rapidly growing customer base spans the automotive, medical device, life sciences, semiconductor, aerospace & defense, industrial manufacturing, consumer electronics, financial services, and insurance industries.

We are looking for an inventive Principal Site Reliability Engineer. The Site Reliability Engineering team at Jama Software is a highly skilled group of Engineers that manage and maintain the Jama Software production Cloud environment, ensuring that our customers are experiencing a fantastic SaaS experience.  If you’re the type of person that enjoys focusing on innovation and providing strong technical vision as well as working with a team to build reliable, scalable frameworks, Jama Software is the place for you. You will focus on simplifying and making the Jama Cloud easy to operate at scale. You will partner with leaders across Jama Software as a subject matter expert to help design the best architectures and processes, automations and be a role model for the SRE team in delivery and teamwork. 

We'd love it if you brought a deep understanding of modern Cloud infrastructure, programming expertise, operational experience and a desire to change the status quo. We're looking for an engineer who can analyze and help improve our services and processes to get us to an even higher level of reliability, performance, scalability, and cost efficiency. You'll achieve this by crossing team and functional boundaries to advocate for reliability methodologies and will work with a variety of platform and product teams to both build reliability into our platform and drive adoption of those practices into our products. 

What You'll Do: 

  • Architect, build, and maintain highly available, fault-tolerant systems using AWS/other services 
  • Use Terraform to define infrastructure as code, enabling scalable, repeatable, and secure deployments 
  • Continuously review and recommend the design, maintenance, development and implementation, including deployment and support, of our SaaS production platform solution using Docker and other modern web technologies 
  • Set up and enforce guardrails for databases, infrastructure, and applications, ensuring consistency and adherence to best practices 
  • Support operationally critical environments using monitoring tools, scripts, and logging 
  • Document designs and implementations 
  • Design and manage secure networking solutions, including AWS VPCs, and firewalls 
  • Partner with SRE and Engineering teams to embed reliability and security best practices into the application life-cycle 
  • Collaborate with fellow Engineers, Product Managers, and Quality Assurance Engineers to develop and deliver services that meet or exceed enterprise customer reliability and quality expectations 
  • Participate and be effective at pair/mob programing and code reviews, both giving and receiving feedback 

What You'll Bring: 

  • Bachelor’s degree in Computer Science or equivalent years of experience 
  • Minimum 5+ years experience working with Java 
  • Expert-level proficiency with 5+ years experience in AWS components like EC2, CloudFormation, RDS/Aurora, IAM Roles, etc. 
  • Expert-level proficiency with 5+ years experience in operating high-availability, fault-tolerant, scalable, distributed software in production: building monitoring into your code, tweaking dashboards, defining alerts, writing runbooks, etc. 
  • 4+ years of systems engineering/administration and/or support experience with web applications, especially J2EE technologies and technologies like Docker, Tomcat, Nginx 
  • Understanding of network fundamentals including (TCP/IP, VPN, DNS, SMTP, HTTP(S)) 
  • Experience with programmatic manipulation of cloud infrastructure such as AWS 
  • Scripting and automation skills using common scripting languages like python, bash 
  • Experience with network and web application monitoring tools, Datadog is preferred 
  • Experience with DBMS (e.g. MySQL, MS SQL, Postgres, RDS), as well as graph databases (Neo4j, ArangoDB) 
  • Experience with REST 

Nice to Have: 

  • Experience with SaaS application and/or product hosting, preferably in an enterprise software development environment 
  • Hands-on experience working with AWS Glue, Bedrock, and Sagemaker, supporting customer-facing AI applications 
  • Experience within the Scrum/Agile framework 
  • Passionate, driven, intelligent, team-oriented and hard-working with the ability to raise the performance of those around you 
  • Strong automation skills and independent time management skills 
  • Curious and enjoys technology in all layers of the application ecosystem 
  • Adopts, embraces and promotes agile and test-driven practices with your peers 
  • Strong interpersonal and communication skills, including the ability to actively listen, build trust with stakeholders, identify needs, and proactively maintain positive relationships, both in written and spoken situations 
  • Experience working with a diverse employee base across remote and local environments 

Perks and Benefits: 

  • Virtual first and culturally diverse work environment spanning 8 countries. 
  • Ambitious and fun work with a chance to define distinct, company-shaping tangible contributions. 
  • Flexible time off and leave programs crafted to meet the needs for your rejuvenation and extra support to cope with life events with a quarterly $75 wellness reimbursement. 
  • Comprehensive and affordable medical, dental and vision plans with pre-tax savings accounts and a generous 401(k) employer match.
  • 12 weeks of paid parental leave to bond with your new family member.
  • Emphasis on learning and development at all levels with a subscription to LinkedIn Learning. 

At Jama Software, we welcome passionate individuals who have the unrestricted right to work in the United States, including natural citizens and Green Card holders, and reside in eligible states to join our team.

Jama Software participates in E-Verify and will provide the federal government with your Form I-9. 

If you get to this point, we hope you're feeling excited about the job you just read. Even if you don't feel that you meet every single requirement, we still encourage you to apply. Women, BIPOC, LGBTQ, and other under-represented groups are highly encouraged to apply. We're eager to meet people that believe in Jama Software’s mission and can contribute to our team in a variety of ways – not just candidates who check all the boxes. 

Jama Software is an Equal Opportunity Employer. Qualified applicants will be considered without regard to race, color, religion, sex, national origin, age, veteran status, sexual orientation, gender identity, disability, genetic information or that of their relatives, friends or associates or any other characteristic protected under federal, state, or applicable law. 

We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform crucial job functions, and to receive other benefits and privileges of employment. Please contact us at [email protected] to request an accommodation.

The Company
Portland, OR
210 Employees
On-site Workplace
Year Founded: 2007

What We Do

Jama Software is focused on maximizing innovation success. Numerous firsts for humanity in fields such as fuel cells, electrification, space, autonomous vehicles, surgical robotics, and more all rely on Jama Connect™ to minimize the risk of product failure, delays, cost overruns, compliance gaps, defects, and rework. Jama Connect™ uniquely creates Living Requirements™ that form the digital thread through siloed development, test and risk activities to provide end-to-end compliance, risk mitigation, and process improvement.

Similar Jobs

Atlassian Logo Atlassian

Principal Site Reliability Engineer

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
Remote
San Francisco, CA, USA
11000 Employees
167K-269K Annually

NBCUniversal Logo NBCUniversal

Site Reliability Engineer

AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Remote
Hybrid
New York, NY, USA
68000 Employees
110K-145K Annually

Motive Logo Motive

Site Reliability Engineer, Embedded

Artificial Intelligence • Fintech • Hardware • Information Technology • Sales • Software • Transportation
Easy Apply
Remote
United States
3600 Employees
109K-156K Annually

Atlassian Logo Atlassian

Site Reliability Engineer

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
Remote
San Francisco, CA, USA
11000 Employees

Similar Companies Hiring

Hedra Thumbnail
Software • News + Entertainment • Marketing Tech • Generative AI • Enterprise Web • Digital Media • Consumer Web
San Francisco, CA
14 Employees
HERE Thumbnail
Software • Logistics • Internet of Things • Information Technology • Computer Vision • Automotive • Artificial Intelligence
Amsterdam, NL
6000 Employees
True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account