Site Reliability Engineer - Latin America

Posted 25 Days Ago
7 Locations
Remote
Mid level
Fintech • Financial Services
The Role
The Site Reliability Engineer ensures infrastructure reliability and performance by designing monitoring solutions, managing incidents, optimizing systems, and collaborating across teams.
Summary Generated by Built In

The Company


Capital Markets Gateway LLC. (CMG) is a financial technology firm, uniquely focused on the equity capital markets (ECM), connecting investors and underwriters via a collaborative network. CMG delivers integrated ECM data and analytics, unrivaled transparency into deal flow, and workflow efficiencies for an otherwise fragmented and inefficient process. Providing a digital system of record for firm-wide deal activity, CMG helps clients make more timely, better-informed decisions. Launched in 2017 by a team of ECM practitioners, CMG has completed two successful fundraising rounds and is backed by a group of the world’s most prestigious financial institutions. The CMG platform is currently relied upon by nearly 135 buy-side firms representing $40 trillion in AUM and 17 global investment banks. For more information, please visit www.cmgx.io.


The Role


CMG is looking for a Site Reliability Engineer (SRE) with a strong focus on monitoring, observability, and alerting to ensure the reliability, performance, and scalability of our infrastructure and applications. You will be responsible for designing, implementing, and maintaining monitoring solutions to provide visibility into system health and performance, proactively detect anomalies, and reduce incident response time. 


Key Responsibilities

Monitoring & Observability

  • Design, implement, and maintain monitoring and observability solutions using tools like Prometheus, the Grafana stack (Loki/Grafana/Tempo/Alertmanager), Datadog, and OpenTelemetry.
  • Define and implement SLOs, SLIs, and error budgets to measure system reliability.
  • Develop and optimize dashboards, alerts, and reports for system performance and business metrics.

Alerting & Incident Management

  • Design actionable alerting strategies to minimize noise and ensure meaningful notifications.
  • Integrate alerting systems with Jira.
  • Establish and refine runbooks for on-call teams to handle alerts efficiently. 

Performance Optimization

  • Analyze system performance metrics, identify bottlenecks, and implement optimizations to improve system efficiency and scalability. Help conduct load testing and capacity planning to ensure systems can handle peak traffic loads.

Automation and Tooling

  • Identify opportunities for automation and develop tools to streamline operational processes, such as deployment, configuration management, and monitoring. Implement monitoring and alerting systems within automations to detect and resolve issues proactively.

Collaboration and Communication

  • Collaborate closely with cross-functional teams, including software engineers, operations, and infrastructure teams, to understand system requirements, provide technical guidance, and drive solutions. Communicate effectively to stakeholders about system changes, incidents, and improvements.

Required Qualifications

  • Must be based in Latin America
  • English level - C1 or C2
  • Proven experience as a Site Reliability Engineer or similar role. 
  • Proficiency in logging, metrics, and tracing frameworks (Datadog, Loki, Prometheus, OpenTelemetry).
  • Experience with cloud platforms (Azure preferred) and infrastructure-as-code tools (e.g., Terraform).
  • Strong programming and scripting skills (Python, Bash). 
  • Proficiency in containerization technologies and orchestration tools (Docker, Kubernetes).
  • Understanding of Linux-based systems, networking, and security principles related to containerized applications.
  • Strong problem-solving and troubleshooting skills, with a passion for identifying and resolving complex technical issues.
  • Excellent communication and collaboration abilities. 
  • Ability to thrive in a fast-paced, constantly evolving environment. 
  • Experience with PostgreSQL monitoring and optimization (Optional/Nice to have) 

If you're passionate about building resilient financial systems, optimizing observability at scale, and solving real-world reliability challenges in capital markets, we’d love to have you on our team!

Our Values

  • We innovate with purpose 
  • We focus on outcomes vs. output 
  • We believe diverse and inclusive teams fuel innovation 
  • We are humble yet candid 
  • We do right by the customer

What we offer

  • 2+ year contract
  • 15 business days of vacation
  • Tech courses and conferences
  • Top-of-the-line MacBook
  • Flexible working hours

At CMG, we embrace our ongoing commitment to build a culture reflecting the people, perspectives, and passions it represents. We will accept nothing less than equity, inclusion, and belonging for all. With the only constant in life being change, we will always listen, learn, and improve for the betterment of our teams, customers, and communities.


CMG is proud to be an Equal Opportunity and Affirmative Action Employer.

Top Skills

Azure
Bash
Datadog
Docker
Grafana
Kubernetes
OpenTelemetry
Postgres
Prometheus
Python
Terraform

The Company
Chicago, IL
91 Employees
On-site Workplace
Year Founded: 2015

What We Do

Capital Markets Gateway (CMG) is a financial technology firm that is modernizing the equity capital markets (ECM). CMG connects investors and underwriters via a neutral platform that delivers integrated ECM data and analytics, unrivaled transparency, and workflow efficiencies. Providing a digital system of record for firm-wide deal activity, CMG helps clients make more timely, better-informed decisions. Launched in 2017 by a team of ECM practitioners, the CMG platform is currently relied upon by nearly 100 buy-side firms representing $12 trillion in AUM and 15 investment banks.
