Data Engineer

Posted 3 Days Ago
Be an Early Applicant
ES
Junior
Artificial Intelligence • Information Technology • Internet of Things
The Role
The Data Engineer will design, implement, and optimize data pipeline flows within the ThetaRay system. Responsibilities include maintaining data flows based on designs from data scientists, building machine learning pipelines, creating data tools for analytics, and training customer teams. The role involves collaborating with product and R&D teams and maintaining relationships with customers.
Summary Generated by Built In

Description

About ThetaRay:

ThetaRay is a trailblazer in AI-powered Anti-Money Laundering (AML) solutions, offering cutting-edge technology to fintechs, banks, and regulatory bodies worldwide. Our mission is to enhance trust in financial transactions, ensuring compliant and innovative business growth. 

Our technology empowers customers to expand into new markets and introduce groundbreaking products.

Thetaray is a culture-driven company. Our values are at the heart of our success. By joining us, you'll have the opportunity to embody these values and inspire others through your actions.

Why Join ThetaRay?

At ThetaRay, you'll be part of a dynamic global team committed to redefining the financial services sector through technological innovation. You will contribute to creating safer financial environments and have the opportunity to work with some of the brightest minds in AI, ML, and financial technology. We offer a collaborative, inclusive, and forward-thinking work environment where your ideas and contributions are valued and encouraged.

Join us in our mission to revolutionize the financial world, making it safer and more trustworthy for millions worldwide. Explore exciting career opportunities at ThetaRay – where innovation meets purpose.

We are looking for a Data Engineer to join our growing team of data experts. As a Data Engineer, you will be responsible for designing, implementing, and optimizing data pipeline flows within the ThetaRay system. You will support our data scientists with the implementation of the relevant data flows based on the data scientist’s features design and construct complex rules to detect money laundering activity.

The ideal candidate has experience in building data pipelines and data transformations and enjoys optimizing data flows and building them from the ground up. They must be self-directed and comfortable supporting multiple production implementations for various use cases.


Responsibilities

  • Implement and maintain data pipeline flows in production within the ThetaRay system based on the data scientist’s design
  • Design and implement solution-based data flows for specific use cases, enabling the applicability of implementations within the ThetaRay product
  • Building a Machine Learning data pipeline
  • Create data tools for analytics and data scientist team members that assist them in building and optimizing our product into an innovative industry leader
  • Work with product, R&D, data, and analytics experts to strive for greater functionality in our systems
  • Train customer data scientists and engineers to maintain and amend data pipelines within the product
  • Travel to customer locations both domestically and abroad
  • Build and manage technical relationships with customers and partners



Requirements

  • 2+ years of Hands-on experience working with Apache Spark with Pyspark or Scala - must
  • Hands-on experience with SQL
  • Hands-on experience with version-control tools such as GIT
  • Hands-on experience with Apache Hadoop Ecosystem including Hive, Impala, Hue, HDFS, Sqoop etc..
  • Hands-on experience with data transformation, validations, cleansing, and ML feature engineering
  • BSc degree or higher in Computer Science, Statistics, Informatics, Information Systems, Engineering, or another quantitative field
  • Experience working with and optimizing big data pipelines, architectures, and data sets - an advantage
  • Strong analytic skills related to working with structured and semi-structured datasets
  • Business-oriented and able to work with external customers and cross-functional teams
  • Fluent in English & Spanish both written and spoken

Nice to have

  • Experience with Linux
  • Experience in building Machine Learning pipeline
  • Experience with Zeppelin/Jupyter
  • Experience with workflow automation platforms such as Jenkins or Apache Airflow
  • Experience with Microservices architecture components, including Docker and Kubernetes.


Top Skills

Spark
Pyspark
Scala
SQL
The Company
New York, NY
119 Employees
On-site Workplace
Year Founded: 2003

What We Do


At ThetaRay, we believe that banks no longer have to de-risk when they can expand their cross-border ecosystem safely.

ThetaRay is the developer of SONAR, a groundbreaking, AI-powered, transaction-monitoring SaaS solution for cross-border payments that allows banks to expand their business opportunities by achieving safe and reliable cross-border payment monitorisation.

ThetaRay's technology is the only packaged SaaS offering that analyzes SWIFT traffic, risk indicators, and client/payer/payee data to detect anomalies indicating money laundering activity across complex, cross-border transaction paths. It is also one of the only AI-driven AML solutions that can be easily integrated and deployed within days, with minimal implementation required. ThetaRay's solution increases detection capabilities for both supervised and unsupervised data and includes profiling and advanced analytics assessments, all in one platform. Financial organizations that rely on highly heterogeneous and complex ecosystems benefit greatly from ThetaRay's unmatchable low false-positive rates.

Similar Jobs

NN Group Logo NN Group

Junior Data Engineer

Fintech • Payments • Financial Services
Madrid, Comunidad de Madrid, ESP
21409 Employees

Kyndryl Logo Kyndryl

Data Engineer

Cloud • Information Technology • Consulting
Madrid, Comunidad de Madrid, ESP
46070 Employees

Fever Logo Fever

Junior Data Engineer

Marketing Tech • News + Entertainment • Software
Madrid, Comunidad de Madrid, ESP
1433 Employees

Dow Jones Logo Dow Jones

Data Engineer

Digital Media • Fintech • Information Technology
Barcelona, Cataluña, ESP
4898 Employees

Similar Companies Hiring

Stepful Thumbnail
Software • Healthtech • Edtech • Artificial Intelligence
New York, New York
60 Employees
HERE Technologies Thumbnail
Software • Logistics • Internet of Things • Information Technology • Computer Vision • Automotive • Artificial Intelligence
Amsterdam, NL
6000 Employees
True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account