Site Reliability Engineer-Application

Posted Yesterday
Be an Early Applicant
Mississauga, ON
Hybrid
110K-118K Annually
Senior level
Healthtech • Software
The Role
As a Site Reliability Engineer at PointClickCare, you'll enhance application performance and operational workflows. You'll contribute to monitoring, automation, and CI/CD processes, ensuring high resilience and availability of systems while collaborating with software development and security teams. You'll also maintain continuous improvement in service levels and performance metrics, and participate in on-call rotations.
Summary Generated by Built In

PointClickCare is a leading North American healthcare technology platform enabling meaningful care collaboration and real‐time patient insights. For over 20 years, the company has been focused on realizing its vision: to help create a world in which providers and plans can confidently deliver frictionless care. Since its inception, PointClickCare has grown exponentially, with over 2,200 employees working to impact millions across North America. Recognized by Forbes as one of the top 100 private cloud companies and acknowledged by Waterstone Human Capital as Canada’s Most Admired Corporate Cultures, PointClickCare leads the way in creating cloud-based healthcare software.

 

At PointClickCare, we offer a wealth of opportunities and a vibrant culture that empowers our employees. Our dynamic environment is the perfect place to advance your career while engaging in meaningful work alongside incredible colleagues. Here, you’ll discover a space where your talents can thrive, your career can grow, and your work will have a lasting impact on healthcare across North America. We believe that work becomes profoundly fulfilling when driven by a higher purpose.

 

Join us and be part of a team that is making a real impact.

 

To learn more about us, check out Life at PointClickCare and connect with us on Glassdoor and LinkedIn.


Site Reliability Engineer (SRE) Responsibilities:


· Lead and implement SRE best practices to foster a strong SRE culture.

· Coach and mentor intermediary staff to develop their skills and grow into SRE.

· Lead incident response calls and troubleshoot system and application-level issues.

· Lead RCAs to capture lessons learned and implement innovative solutions to prevent future incidents.

· Contribute technically to a team focused on applying Software Engineering Practices to operations at scale.

· Implement, monitor, and report on Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for application services, collaborating with business and product owners to establish key performance indicators (KPIs).

· Participate in technical training events, game day scenarios, and engineering spikes.

· Provide support for a wide range of applications with a focus on increasing automation, repeatability, and consistency.

· Create and maintain monitoring technologies and processes to enhance visibility into application performance and business metrics, ensuring manageable operational workload.

· Proactively improve application and infrastructure resiliency under various error and performance conditions.

· Collaborate with security Engineers to develop plans and automation for proactive response to new risks and vulnerabilities.

· Work with internal teams to ensure operational development solutions meet business requirements.

· Provide architectural guidance to software development teams to enhance resiliency, efficiency, performance, and cost-effectiveness.

· Develop, communicate, and monitor standard processes to promote the long-term sustainability and health of operational tasks.

· Collaborate with feature/service-oriented development teams.

· Implement and improve CI/CD pipelines to facilitate seamless and reliable software releases.

· Participate in an on-call rotation to respond to incidents and ensure 24/7 system availability.

 

Qualifications:


· Bachelor's Degree in Computer Science, Computer Engineering, Software Engineering, MIS, or related discipline.

· Prior experience as a Site Reliability Engineer (SRE) in a previous role. (Minimum 2 years’ experience.)

· Prior relevant software Development/Architecture/Engineering/DevOPS experience (Minimum 5 years’ experience).

· Strong experience in building and supporting cloud-based solutions, Azure cloud infrastructure and services knowledge preferred.

· Proficiency in cloud computing concepts.

· Experience with virtualization and container solutions such as Docker and Kubernetes.

· Familiarity with Databricks, Event Hub, Redis, Azure Service Bus, Azure Functions, and Tomcat.

· Understanding of diverse infrastructure platforms and concepts.

· Experience with Windows based systems and Linux administration.

· Experience with configuration management and deployment automation tools (e.g., Chef, Terraform, Puppet, Ansible, Jenkins, Spinnaker, ArgoCD, GitHub Actions).

· Proficiency in one or more programming languages such as Java, Python, Go, Perl, or Ruby.

· Working knowledge of database technologies (e.g., SQL Server, MySQL, PostgreSQL).

· Experience with monitoring and logging solutions (e.g., Prometheus, Grafana, ELK stack, AppDynamics, DataDog).

· Strong debugging and optimization skills, with the ability to automate routine tasks.

· Systematic problem-solving approach with strong communication skills and a proactive mindset.

· Knowledge of the Software Development Life Cycle, with experience working in QA and beta environments.

· Understanding of the Agile software development methodology.

· Knowledge of standard production practices, including change management and incident management (ITIL).


Nice to Have:


· Troubleshooting experience with diverse hosting technologies, such as web server platforms, Java application platforms, operating systems, network components, virtualization technologies, and database platforms.

· Proficiency in Linux, including experience compiling kernels, tracing syscalls, understanding TCP.

· Knowledge of open-source software and contributions to the open-source community.

· Familiarity with Rhapsody and various healthcare messaging standards, such as HL7 and FHIR.





#LI-hybrid

#LI-AJ1

PointClickCare Benefits & Perks:

Benefits starting from Day 1!

Retirement Plan Matching

Flexible Paid Time Off

Wellness Support Programs and Resources

Parental & Caregiver Leaves

Fertility & Adoption Support

Continuous Development Support Program

Employee Assistance Program

Allyship and Inclusion Communities

Employee Recognition … and more!


It is the policy of PointClickCare to ensure equal employment opportunity without discrimination or harassment on the basis of race, religion, national origin, status, age, sex, sexual orientation, gender identity or expression, marital or domestic/civil partnership status, disability, veteran status, genetic information, or any other basis protected by law. PointClickCare welcomes and encourages applications from people with disabilities. Accommodations are available upon request for candidates taking part in all aspects of the selection process. Please contact [email protected] should you require any accommodations.


When you apply for a position, your information is processed and stored with Lever, in accordance with Lever’s Privacy Policy. We use this information to evaluate your candidacy for the posted position. We also store this information, and may use it in relation to future positions to which you apply, or which we believe may be relevant to you given your background. When we have no ongoing legitimate business need to process your information, we will either delete or anonymize it. If you have any questions about how PointClickCare uses or processes your information, or if you would like to ask to access, correct, or delete your information, please contact PointClickCare’s human resources team: [email protected] 


PointClickCare is committed to Information Security. By applying to this position, if hired, you commit to following our information security policies and procedures and making every effort to secure confidential and/or sensitive information.

Top Skills

Go
Java
Perl
Python
Ruby
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Mississauga, Ontario
1,557 Employees
On-site Workplace
Year Founded: 2000

What We Do

PointClickCare is the market leader driving the transformation of healthcare vulnerable and complex populations through a broad, connected care network powered by deep insights with a commitment to value, outcomes and innovation. We connect post-acute and acute care settings, people and systems like no other company. Our steadfast commitment to our culture and to providing growth opportunities to our employees is evidenced by recent recognition of PointClickCare as one of Canada’s best-managed companies and most admired corporate cultures.

Similar Jobs

Behavox Logo Behavox

Site Reliability Engineer 3 (Next Gen)

Artificial Intelligence • Software
Toronto, ON, CAN
213 Employees
Ottawa, ON, CAN
2194 Employees

Behavox Logo Behavox

Site Reliability Engineer 3 (TS&CG)

Artificial Intelligence • Software
Toronto, ON, CAN
213 Employees

Moneris Logo Moneris

Senior Site Reliability Engineer

Fintech • Payments • Financial Services
10 Locations
2184 Employees

Similar Companies Hiring

True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees
Caliola Engineering Thumbnail
Software • Machine Learning • Hardware • Defense • Data Privacy • App development • Aerospace
Colorado Springs, CO
53 Employees
Red 6 Thumbnail
Virtual Reality • Software • Hardware • Defense • Aerospace
Orlando, Florida
113 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account