Lead Big Data Software Engineer

Posted 6 Days Ago
Belfast, County Antrim, Northern Ireland
Hybrid
5-7 Years Experience
Artificial Intelligence • Cloud • Information Technology • Sales • Security • Software • Cybersecurity
At Rapid7, we are on a mission to create a secure digital world for our customers, our industry, and our communities.
The Role
As a Lead Big Data Software Engineer, you'll optimize data pipelines and queries, implement monitoring strategies, and lead teams in improving performance and efficiency of data retrieval processes for a large-scale data platform.
Summary Generated by Built In

About the Team
The Rapid7 Data Platform is a unified, integrated platform powered by Rapid7's product suite, providing our customers with enhanced visibility into their attack surface, operational efficiency, risk management, and decision-making capabilities.
Our teams are responsible for consolidating data from all Rapid7 products, transforming it for optimised retrieval, and ensuring high-performance, seamless access for our customers. This role is crucial to the platform's success, as it focuses on building a highly scalable and reliable data mesh that powers cross-product use cases through a distributed query engine for big data analytics.
About the Role
We are seeking an innovative, self-motivated Data and Performance Engineer to act as a technical leader, collaborating with our product teams to optimise their data pipelines and retrieval processes for performance and efficiency. You will work with the Data Platform teams to implement monitoring and testing strategies that ensure the performance of the data and its queries, and to identify optimisations.
Technologies you will work with:

  • Trino
  • Iceberg
  • Parquet
  • Spark
  • Airflow
  • Kafka
  • AWS services such as Glue, S3, EKS


In this role, you will:

  • Analyse and optimise distributed SQL queries to improve performance
  • Suggest optimisations to our data pipelines
  • Provide recommendations for efficient partitioning strategies and schema designs
  • Conduct performance tuning for the data pipelines and queries
  • Develop performance monitoring strategies and tools


The skills you'll bring include:

  • 5+ years of hands-on software engineering experience, with a specific focus on database query optimisation
  • Strong database system expertise in query execution planning, query optimisation, performance tuning, parallel computing, and schema design
  • Experience in continuously monitoring and optimising data pipelines for performance and cost-effectiveness
  • Ability to design, develop, implement, and operate highly reliable large-scale data lake systems in cooperation with product teams
  • Skills to analyse and test the performance and scalability of the data mesh, identify bottlenecks, and recommend and develop improvements
  • Mentorship and guidance of junior engineers, providing technical leadership and fostering a culture of continuous improvement and innovation
  • Excellent verbal and written communication skills
  • Strong, creative problem-solving ability


Nice to haves:

  • Trino/Presto data-mesh
  • AWS, Terraform, Kubernetes
  • Java
  • Kafka


We believe the best ideas and solutions come from diverse teams. If you're excited about this role and feel your experience can make an impact, don't hesitate - apply today!
About Rapid7
Rapid7 is creating a more secure digital future for all by helping organizations strengthen their security programs in the face of accelerating digital transformation. Our portfolio of best-in-class solutions empowers security professionals to manage risk and eliminate threats across the entire threat landscape from apps to the cloud to traditional infrastructure to the dark web. We foster open source communities and cutting-edge research, using these insights to optimize our products and arm the global security community with the latest in attacker methods. Trusted by more than 10,000 customers worldwide, our industry-leading solutions and services help businesses stay ahead of attackers, ahead of the competition, and future-ready for what's next.

Top Skills

Java

The Company
HQ: Boston, MA
2,400 Employees
Hybrid Workplace
Year Founded: 2000

What We Do

We do this by embracing tenacity, passion, and collaboration to challenge what’s possible and drive extraordinary impact.

Here, we’re building a dynamic workplace where everyone can have the career experience of a lifetime. We challenge ourselves to grow to our full potential. We learn from our missteps and celebrate our victories. We come to work every day to push boundaries in cybersecurity and keep our 11,000+ global customers ahead of whatever’s next.

Why Work With Us

What makes us unique is how we embrace, model, and celebrate our core values. By challenging convention, being an advocate, creating impact together, always bringing our full selves, and recognizing that our work is never done, we are able to make an extraordinary impact on our business, our industry, and our own career growth.


Rapid7 Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Our default working model is hybrid, with employees working three days per week in the office. This approach underpins our commitment to flexibility and adaptability while supporting our dedication to development, teamwork and customer purpose.

Typical time on-site: 3 days a week
  • Boston (HQ)
  • Arlington
  • Austin, TX
  • Belfast, GB
  • Prague
  • Reading, UK
  • Town 'n' Country, FL