Senior Data Engineer (Python)

Posted 11 Hours Ago
Be an Early Applicant
Paris, Île-de-France
Senior level
Software
The Role
Seeking a Senior Data Engineer proficient in Python to work on semantic processing, data augmentation, and data transformation pipelines. Responsibilities include enhancing team technical capabilities and contributing to task phasing. Opportunities to implement cloud-based ETL pipelines and expand web scraping capabilities. Career path to Tech Lead role. Looking for a candidate with 5+ years of Python development experience and expertise in data processing pipelines, MongoDB, SQL, and Elasticsearch.
Summary Generated by Built In

Datasite is where deals are made. We provide the data rooms and SaaS technology used in M&A and other high-value transactions, to deliver projects in more than 170 countries. Carrying that success into the future is all about you. Your useful skills, your unusual experience, your unique ideas. Everyone here brings something unexpected. What’s yours? Invest your talents in us, and we’ll return the compliment.

Job Description:

The kind of data engineer we are looking for is a software engineer first, with a strong focus (at least initially) on Python programming and scalability.


Primary responsibilities


Write Python pipelines for semantic processing (NLP) and data augmentation in general:
- Vectorisation (embeddings) to/from MongoDB/Pinecone
- Named Entity Recognition (NER) to/from MongoDB
- Elasticsearch indexing


Write Python transformation pipelines for derived data around companies
- Compute insightful data points from raw company data
- Maintain Python framework for data point computation

Level up the technical capabilities of your team, esp. junior teammates

Contribute to task breakdown and phasing with Tech Lead

Opportunities


Implement cloud-based data acquisition/ETL pipelines (esp. with Airflow, DBT, Snowflake)
Expand web scraping capabilities (Pub/Sub, GCS, CloudRun)

Career path


Move to a Tech Lead role as the tech organisation grows.

Skillset


5+ years developing scalable industry-ready software applications in Python
3+ years implementing data processing pipelines/ETLs
2+ years working with advanced MongoDB and SQL + Elasticsearch ideally
Solid understanding of computing scalability (multiprocessing/threading, distributed computed)
Some hands-on experience with common/modern data frameworks (esp. GCP, DBT, Snowflake)
Outstanding problem solving skills for performance optimisation

Who you might be


It is hard to set expectations with respect to soft skills, as everybody may contribute in their own special way.

Product-driven. You might feel your professional interests revolve around building complex systems thanks to advanced design patterns. We rather feel strongly about achieving product goals. This might involve taking pragmatic shortcuts, and even possibly writing some inelegant code along the way. That is just fine sometimes - as long as we factor in debt.


Socially open. We are neither the nerdy types nor some cold-blooded serial coders. We often build software making creative (and possibly weird) analogies with music, philosophy or football, and come up with solutions outside of the strict confines of our desks.


Taking responsibility for making things happen. This is about self-motivation, being forward thinking and embracing accountability for pushing the Product into the right direction.

Humble. Everybody knows very little. However, we can get really good in certain areas: we believe in the so-called growth mindset, meaning everybody can learn pretty much everything. This makes us nonetheless humble in our attitude with others.


Curious. We spend a lot of time at the office around our fellow co-workers. We like to share our passions (travel, food, books, sports, etc. etc.) and interact beyond tech.

As a global organization, Datasite knows that diverse perspectives are essential to our success. We’re committed to maintaining a diverse workforce to serve our customers around the world. Datasite is an equal opportunity employer (EEO) and furthers the principles of EEO through Affirmative Action.

Top Skills

Python
The Company
HQ: Minneapolis, CA
821 Employees
On-site Workplace
Year Founded: 1968

What We Do

Datasite the maker of Datasite Diligence virtual data room platform, helping dealmakers around the world close more deals faster.

Similar Jobs

Arrow Electronics, Inc. Logo Arrow Electronics, Inc.

Broadcom Market Analyst

Cloud • Enterprise Web • Hardware • Information Technology • Internet of Things • Robotics • Semiconductor
Courbevoie, Hauts-de-Seine, Île-de-France, FRA
22000 Employees
Hybrid
Paris, Île-de-France, FRA
289097 Employees
Hybrid
Paris, Île-de-France, FRA
289097 Employees
Hybrid
Paris, Île-de-France, FRA
289097 Employees

Similar Companies Hiring

TrainingPeaks (A Peaksware Company) Thumbnail
Software • Fitness
Louisville, CO
69 Employees
bet365 Thumbnail
Software • Gaming • eSports • Digital Media • Automation
Denver, Colorado
6100 Employees
Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account