Data Engineer

Posted 22 Days Ago
Be an Early Applicant
5 Locations
Mid level
Artificial Intelligence • Software
The Role
The Data Engineer will develop and optimize data pipelines to support machine learning research. Responsibilities include designing scalable data architectures, collaborating with ML researchers, transforming data for experiments, and managing data integration processes using Python and SQL, along with NoSQL databases.
Summary Generated by Built In

Description

About Us:

Aquaticode builds artificial intelligence solutions for aquaculture. Our core competency

lies at the intersection of biology and artificial intelligence, utilizing specialized imaging

technology to detect, identify, and predict traits of aquatic species. We value

commitment and creativity in building real-world solutions that benefit humanity.

Requirements

We are seeking a talented Data Engineer with experience in supporting Machine Learning (ML)

research to join our team. The ideal candidate will have a strong background in building robust

data pipelines and workflows that facilitate ML projects and eagerness to learn new

technologies. This role requires proficiency in data processing technologies and an

understanding of the data needs specific to ML research.

Key Responsibilities:

· Develop, maintain, and optimize data pipelines and workflows to support ML research and model development.

· Design and implement scalable data architectures for handling large datasets used in ML models.

· Collaborate closely with ML researchers and data scientists to understand data requirements and ensure data availability and quality.

· Work with databases and data integration processes to prepare and transform data for ML experiments.

· Utilize MongoDB and other NoSQL databases to manage unstructured and semi-structured data.

· Write efficient, reliable, and maintainable code in Python and SQL for data processing tasks.

· Implement data validation and monitoring systems to ensure data integrity and performance.

· Support the deployment of ML models by integrating data solutions into production environments.

· Ensure the scalability and performance of data systems through rigorous testing and optimization.

Required Skills & Qualifications:

· Proficiency in English (spoken and written).

· Strong experience in Python and SQL.

· Hands-on experience with data processing in Apache Airflow.

· Experience working with databases, including MongoDB (NoSQL) and relational databases.

· Understanding of data modeling, ETL processes, and data warehousing concepts.

· Experience with cloud platforms like AWS, GCP, or Azure.

Good to Have:

· Experience with other NoSQL databases like InfluxDB, Elasticsearch, or similar technologies.

· Experience with backend frameworks like FastAPI, Flask, or Django.

· Knowledge of containerization tools like Docker.

· Familiarity with messaging queues like RabbitMQ.

· Understanding of DevOps practices and experience with CI/CD pipelines.

· Experience with front-end development (e.g., React, NextJs).

About Nacre Capital:

We were founded by Nacre Capital, a venture builder focused on AI within the life

sciences. Nacre has an impressive track record in creating, building, and growing deep

tech startups, including Face.com (acquired by Facebook), Fairtility, FDNA, and Seed-X.

Top Skills

Python
SQL
The Company
31 Employees
Remote Workplace

What We Do

Nacre Capital is a global venture builder specialized in creating, building and growing disruptive start-ups with deep technologies that significantly impact lives.
We are an international team of entrepreneurs, business leaders and experts including – pioneering scientists, renowned technologists, researchers, growth experts and thought leaders – that together develop and transform ventures into world-class disruptive market-leading companies.
We bring onboard the best entrepreneurs and talent to imagine and create new ventures from scratch and work together to successfully transform novel ideas into fast-growing, market-leading companies.
Our inhouse portfolio of companies solve complex fundamental problems that impact society through real breakthrough innovation employing deep technology.

Similar Jobs

Smartcat Logo Smartcat

Data Engineer

Artificial Intelligence • Machine Learning • Natural Language Processing • Conversational AI
Easy Apply
Remote
28 Locations
242 Employees

Procter & Gamble Logo Procter & Gamble

Data Engineer (Databricks/Python/SQL)

AdTech • Beauty • Marketing Tech • Retail • Pharmaceutical
Warsaw, Warszawa, Mazowieckie, POL
117512 Employees

Dynatrace Logo Dynatrace

Senior DevOps Engineer for Data Engineering team

Artificial Intelligence • Big Data • Cloud • Information Technology • Software • Big Data Analytics • Automation
Hybrid
Gdańsk, Pomorskie, POL
4700 Employees

OpenX Technologies Logo OpenX Technologies

Big Data Engineer IV

AdTech • Enterprise Web • Information Technology • Machine Learning • Marketing Tech • Sales
Easy Apply
Remote
Hybrid
Kraków, Małopolskie, POL
300 Employees

Similar Companies Hiring

Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees
RunPod Thumbnail
Software • Infrastructure as a Service (IaaS) • Cloud • Artificial Intelligence
Charlotte, North Carolina
53 Employees
Hedra Thumbnail
Software • News + Entertainment • Marketing Tech • Generative AI • Enterprise Web • Digital Media • Consumer Web
San Francisco, CA
14 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account