Platform Infrastructure Cloud, SRE Engineer - Assistant Vice President

Lisboa
Senior level
Financial Services
The Role
As a DevOps Engineer at iCapital, you'll ensure smooth operation of production and development environments, leverage cloud capabilities, and implement advanced infrastructure strategies. Collaborate with teams to enhance system availability and developer productivity through automation and container management.
Summary Generated by Built In


iCapital is powering the world’s alternative investment marketplace. Our financial technology platform has transformed how advisors, wealth management firms, asset managers, and banks evaluate and recommend bespoke public and private market strategies for their high-net-worth clients. iCapital services approximately $203 billion in global client assets invested in 1,655 funds, as of September 2024.

iCapital has been named to the Forbes Fintech 50 for six consecutive years (2018-2023), has been selected three times by Forbes for its list of Best Startup Employers (2021-2023), and is a three-time winner of the MMI/Barron’s Solutions Provider award (see link below).


About the Role

The Site Reliability Engineering team at iCapital is fundamental to ensuring our platform delivers consistent, reliable service to our client base. As an Assistant Vice President, you'll work at the intersection of software engineering and operations, applying engineering principles to infrastructure challenges. You'll be responsible for designing and implementing systems that scale efficiently, architecting observability solutions that provide actionable insights, and building automation that enhances our platform's reliability. This role requires someone who thinks systematically about reliability, can translate business requirements into technical implementations, and thrives on making complex systems more robust.

Responsibilities

  • Design, implement, and maintain service level objectives (SLOs) that align with business goals and customer expectations
  • Develop observability strategies, focusing on meaningful metrics that drive actionable insights
  • Drive automation initiatives to eliminate toil and improve system reliability
  • Champion reliability best practices across development teams through consultation and tooling
  • Lead incident response, conduct thorough postmortems, and drive systematic improvements
  • Participate in on-call rotations with a focus on continuous service improvement 

Qualifications

  • 5+ years of SRE or related experience, including 3+ years with AWS
  • Strong experience with container orchestration platforms like Kubernetes and related ecosystem tools
  • Working knowledge of databases such as MongoDB, Postgres, DynamoDB 
  • Strong foundation in reliability engineering principles and distributed systems behavior
  • Experience defining and implementing SLOs/SLIs and using them to drive system improvements
  • Demonstrated ability to design and implement observability solutions that provide actionable insights while minimizing alert fatigue
  • Coding ability in at least one IaC tool (Terraform preferred; Ansible, Chef, etc.) and one programming language such as Python, Ruby, or Java, with a focus on maintainable, tested code
  • Understanding of modern observability practices and experience implementing and maintaining monitoring solutions (Prometheus/Grafana, Splunk, New Relic, CloudWatch, and ELK in the cloud)
  • Strong incident response skills with experience leading incident retrospectives and driving improvements
  • Excellent problem-solving abilities and experience debugging distributed systems
  • Track record of successfully automating operations and reducing toil
  • Strong communication skills, with the ability to explain complex technical concepts to diverse audiences
  • A desire to share, teach, and learn as part of a team 

Employees in this role will work fully remotely. Every department has different needs, and some positions will be designated in-office jobs based on their function.

Benefits

iCapital offers a comprehensive benefits package that includes a total compensation program consisting of competitive salary, annual performance bonus, and equity for all full-time employees; healthcare with 100% employer-paid health and dental insurance; and generous paid time off (PTO).

For additional information on iCapital Network, please visit https://www.icapitalnetwork.com/about-us  Twitter: @icapitalnetwork | LinkedIn: https://www.linkedin.com/company/icapital-network-in

Top Skills

AWS
CloudWatch
Datadog
DevOps
Docker
DynamoDB
ELK
Grafana
Java
Kubernetes
Linux
MongoDB
New Relic
Postgres
Prometheus
Python
Ruby
Splunk
Terraform
The Company
HQ: San Francisco, CA
465 Employees
On-site Workplace
Year Founded: 2011

What We Do

ICONIQ Capital is a privately held investment firm serving some of the world’s most influential families and organizations. ICONIQ provides financial advisory and family office services, and manages direct investments where technology and traditional asset classes intersect, with a focus on technology growth equity, buyout, and real estate.
