Orion Innovation is a premier, award-winning, global business and technology services firm. Orion delivers game-changing business transformation and product development rooted in digital strategy, experience design, and engineering, with a unique combination of agility, scale, and maturity. We work with a wide range of clients across many industries including financial services, professional services, telecommunications and media, consumer products, automotive, industrial automation, professional sports and entertainment, life sciences, ecommerce, and education.
POSITION SUMMARY
As a Site Reliability Engineer (SRE), your responsibilities will include building scalable infrastructure on which we deliver our software. You will help ensure the reliability, availability, and performance of our production and development infrastructure.
You will collaborate with cross-functional teams to drive reliability automation, optimize deployment strategies, and enhance infrastructure monitoring.
PRIMARY RESPONSIBILITIES
- Develop and maintain automation and processes to improve system reliability and enable teams to build and deploy secure and scalable applications in AWS using technologies such as Kubernetes and Terraform.
- Establish and maintain infrastructure and application monitoring systems.
- Define and monitor SLIs, SLOs, and SLAs to ensure operational excellence.
- Analyze usage trends to forecast infrastructure needs and ensure scalability.
- Conduct load testing to validate system capacity and optimize performance.
- Participate in the incident lifecycle: preparation, detection, response, analysis, and post-incident learning. Be ready to respond to a team or business critical incident in a timely manner (be a part of the on-call rotation).
- Work closely with development teams in all phases of SDLC to investigate areas of improvement and seek for bottlenecks.
- Guide and encourage teams to follow SRE best practices.
- Participate in operations efforts and be the point person for infrastructure activities.
- Participate in architectural decisions to help improve the quality of our software and infrastructure.
QUALIFICATIONS
- BS in Computer Science, Engineering, or a related field, or equivalent practical experience.
- 3 years of experience as an SRE or a similar role.
KNOWLEDGE, SKILLS, AND ABILITIES
- Strong problem-solving and analytical skills; Strong ability to troubleshoot complex issues ranging from system resourcing, network issues to application stack traces.
- Experience with monitoring tools (e.g., Prometheus, Grafana, Datadog)
- Strong proficiency in programming or scripting languages (e.g., Python or Bash).
- Hands-on experience with Kubernetes, Docker, and infrastructure-as-code tools (e.g., Terraform).
- Proven expertise in managing AWS Cloud Infrastructure.
- Experience in Linux/Unix administration.
- Ability to read and understand Java and Python code.
- Excellent communication and collaboration abilities. Be able to justify and stand for the proper solution.
- Ability to work effectively in a cross-functional, fast-paced environment.
- Nice to have:
- Knowledge of database operations and performance optimization
- Experience with GitLab
- Experience with Atlassian services
- Experience programming in Java or other OOP languages
PHYSICAL DEMANDS AND WORK ENVIRONMENT
- Sometimes, duties may require working outside normal working hours to align with U.S. time zones
WHAT WE OFFER:
- Dynamic and supporting international teams.
- Regular assessments and performance reviews. You will have the opportunity for promotion, bonuses and a raise in accordance with the pace at which you develop and your performances.
- Hybrid or office work.
- 20-25 vacation days per year.
- 4 days of sick leave (100% paid)
- Equipment for work, laptop and all necessary additions.
- Access to trainings and courses.
- Private health insurance.
- FIT Pass card for many sports’ facilities
Orion is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, gender identity or expression, pregnancy, age, national origin, citizenship status, disability status, genetic information, protected veteran status, or any other characteristic protected by law.
Candidate Privacy Policy
Orion Systems Integrators, LLC and its subsidiaries and its affiliates (collectively, “Orion,” “we” or “us”) are committed to protecting your privacy. This Candidate Privacy Policy (orioninc.com) (“Notice”) explains:
- What information we collect during our application and recruitment process and why we collect it;
- How we handle that information; and
- How to access and update that information.
Your use of Orion services is governed by any applicable terms in this notice and our general Privacy Policy.
Top Skills
What We Do
Orion is a leading digital transformation and product development services firm. Headquartered in Edison, NJ, we have a global team of 6,200+ associates, with engineers in 14 major delivery centers across North America, Europe, Asia Pacific and Latin America.
For over 25 years, Orion has been solving complex business problems for our clients. Our transformative business solutions are rooted in digital strategy, experience design, and engineering, empowering our clients to operate with agility at scale.
Our mission is to serve as an agile and trusted partner for business transformation initiatives, providing deep emerging technology, experience design, and domain expertise.
Our business has more than tripled over the last three years.
We have grown aggressively both organically and inorganically, adding new clients, complementary skills, domain expertise, and strengthening our global footprint.