Software Developer – Reliability

Posted 2 Days Ago
Be an Early Applicant
2 Locations
Mid level
Financial Services
The Role
The Reliability Software Engineer ensures software systems' performance, stability, and availability through development of reliability features, observability capabilities, and automation tooling. Responsibilities include incident response, performance optimization, and collaboration with cross-functional teams to enhance system operations and uptime.
Summary Generated by Built In

Role: Software Developer – Reliability

Team: Market Access & Risk

As a Reliability Software Engineer, you will play a critical role in ensuring the performance, stability and availability of our software systems, as well as their day-to-day operations. As such, the team requires a high software development capacity, along with strong analytical skills.

You will primarily be developing reliability features directly in our applications, implementing observability capabilities, running benchmarks to measure performance, and building automation and tooling to support the operations of our systems. Operations are important to ensure business continuity, they include responding to level-2 support escalations, monitoring our infrastructure capacity, and tweak system configuration to address user requests.

Position Overview:

  • System Reliability: Develop incremental stability, recovery, scalability and performance improvements. Perform root cause analyses to understand the source of incidents. Suggest and implement remedial actions in response to incidents
  • Observability: Monitor, measure, and analyze the performance, availability and stability of technology systems to identify areas of improvement and allow the team to take data-driven decisions

  • Performance Optimization: Optimize performance of production systems to address bottlenecks and improve system response times, resource utilization, and overall application performance
  • Automation and Tooling: Develop and maintain automation systems and tooling for operations, deployment, and incident management to reduce manual intervention and enhance system stability

  • Production Management: Provide level-2 support for incident response to ensure business uptime. Work closely with core developers and support teams to plan and prepare for scaling technology systems to accommodate user demands

Required Qualifications:

  • Education: Bachelor’s degree in Computer Science or related subject
  • Experience: 4+ years proven experience in Software Engineering, Software Reliability, or similar role with hand-on experience in software development and providing L2 support
  • Experience of developing in Python or similar, and familiarity with version control systems such as git
  • Experience working in a Linux environment
  • Problem-Solving Skills: Strong analytical and problem-solving skills with a keen eye for detail and a proactive approach to resolving issues
  • Communication: Excellent communication and collaboration skills to work effectively with cross-functional teams
  • Adaptability: Ability to work in a fast-paced and dynamic environment, adapting to changing priorities and requirements
  • Automation and Tooling: Experience developing automation tools and implementing configuration management

Nice to have:

  • C++ or KDB/q development experience
  • Experience with Slurm, Airflow or middleware such as Kafka and AMPS


 

Top Skills

C++
Kdb/Q
Python
The Company
HQ: New York, New York
1,267 Employees
On-site Workplace
Year Founded: 2014

What We Do

Squarepoint Capital is a leading global investment management firm that develops quantitative investment strategies to achieve high quality returns for our clients. We are a data and technology driven firm who specialize in developing automated trading systems that execute across global financial markets.

Similar Jobs

Hybrid
London, Greater London, England, GBR
289097 Employees
Hybrid
London, Greater London, England, GBR
289097 Employees
Hybrid
London, Greater London, England, GBR
289097 Employees

Wayve Logo Wayve

Software Engineer, Site Reliability Engineering

Artificial Intelligence • Transportation
London, Greater London, England, GBR
200 Employees

Similar Companies Hiring

EDGE Thumbnail
Software • Fintech • Financial Services • Analytics
Chicago, IL
20 Employees
Energy CX Thumbnail
Utilities • Professional Services • Greentech • Financial Services • Energy • Consulting • Business Intelligence
Chicago, IL
55 Employees
MassMutual India Thumbnail
Insurance • Information Technology • Fintech • Financial Services • Big Data
Hyderabad, Telangana

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account