Senior Data Engineer (AI and ML frameworks)

Posted 2 Days Ago
Be an Early Applicant
Hiring Remotely in Warsaw, Warszawa, Masovian
Remote
Senior level
Software
The Role
The Senior Data Engineer will develop applications based on Kappa architecture, unify data from EHRs, and implement machine learning models while collaborating with cross-functional teams.
Summary Generated by Built In

Company Description

We are looking for a talented Senior Data Engineer with a strong background in developing or contributing to applications based on microservices using a Kappa architecture. The project aims to unify data sourced from different EHR systems in the healthcare domain, using the FHIR data format. 

CUSTOMER
Our client is a leading analytics company operating at the intersection of technology, artificial intelligence, and big data. They support manufacturers and retailers in the fast-moving consumer goods sector, helping them better understand market dynamics, uncover consumer behavior insights, and make data-driven business decisions.  

PROJECT
The project aims to unify data sourced from various EHR systems in the healthcare domain using the FHIR data format. The company’s proprietary technology platform combines high-quality data, deep industry expertise, and advanced predictive algorithms built over decades of experience in the field. 

Job Description

Data Standardization and Transformation: 

  • Convert diverse data structures from various EHR systems into a unified format based on FHIR standards 

  • Map and normalize incoming data to the FHIR data model, ensuring consistency and completeness 

Kafka Integration: 

  • Consume and process events from the Kafka stream produced by the Data Writer Module 

  • Deserialize and validate incoming data to ensure adherence to required standards 

Data Segmentation: 

  • Separate data streams for warehousing and AI model training, applying specific preprocessing steps for each purpose 

  • Prepare and validate data for storage and machine learning model training 

Error Handling and Logging: 

  • Implement robust error handling mechanisms to track and resolve data mapping issues 

  • Maintain detailed logs for auditing and troubleshooting purposes 

Data Ingestion and Processing: 

  • Use LLMs to extract structured data from EHRs, research articles, and clinical notes 

  • Ensure semantic consistency and interoperability during data ingestion 

Knowledge Graph Construction: 

  • Integrate extracted data into a knowledge graph, representing entities and relationships for semantic data integration 

  • Implement contextual understanding and querying of complex relationships within the knowledge graph (KG) 

Advanced Predictive Modeling: 

  • Leverage KGs and LLMs to enhance data interoperability and predictive analytics 

  • Develop frameworks for contextualized insights and personalized medicine recommendations 

Feedback Loop: 

  • Continuously update the knowledge graph with new data using LLMs, ensuring up-to-date and relevant insights. 

Work Closely with Cross-Functional Teams 

  • Collaborate with data scientists, AI specialists, and software engineers to design and implement data processing solutions 

  • Communicate effectively with stakeholders to align on goals and deliverables 

Contribute to Engineering Culture: 

  • Foster a culture of innovation, collaboration, and continuous improvement within the engineering team

Qualifications

  • Deep understanding of patterns and software development practices for event-driven architectures
  • Hands-on experience with stateful stream data processing solutions (Kafka or similar streaming platforms)
  • Strong knowledge of data serialization/deserialization using various data formats (at minimum JSON and Avro), and integration with schema registries
  • Proven Python software development expertise, with experience in data processing and integration (most of the software is written in Python)
  • Practical experience building end-to-end solutions with Apache Flink or a similar platform
  • Experience with containerization and orchestration using Kubernetes (K8s) and Helm, especially on Google Kubernetes Engine (GKE)
  • Familiarity with Google Cloud Platform (GCP) or a similar cloud platform
  • Hands-on experience implementing data quality solutions for schema-on-read or schema-less data
  • Hands-on experience integrating with Apache Kafka, particularly the Confluent Platform
  • Familiarity with AI and ML frameworks
  • Proficiency in SQL and experience with both relational and NoSQL databases
  • Experience with graph databases like Neo4j or RDF-based systems
  • Experience in the healthcare domain and familiarity with healthcare standards such as FHIR and HL7 for data interoperability

WOULD BE A PLUS:

  • Experience with web data scraping

Additional Information

PERSONAL PROFILE

  • Strong problem-solving skills, with the ability to design innovative solutions for complex data integration and processing challenges 

  • Excellent communication skills, with the ability to articulate complex technical concepts and work effectively with various stakeholders 

  • Commitment to improving healthcare through data-driven solutions and technology 

  • Stay abreast of the latest technologies and industry trends while continually improving your skills and knowledge 

  • Ability to work in a collaborative environment, valuing diverse perspectives and contributing to a positive team culture 

Top Skills

Apache Flink
Fhir
Google Cloud Platform
Hl7
Kafka
Kubernetes
Neo4J
NoSQL
Python
SQL
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
New York, New York
1,516 Employees
On-site Workplace

What We Do

Sigma Software Group, an award-winning and trusted IT partner, has been serving customers for over 21 years, providing comprehensive IT solutions to various businesses, ranging from startups to established software product houses. As one of Europe's substantial IT consultancies, it brings together a dedicated workforce of over 2,100 professionals in 40 offices across 19 countries. With a diverse client base, including more than 300 enterprises, including Fortune 500 stalwarts, Sigma Software Group is a preferred choice for developing solutions that help businesses create cutting-edge products while meeting their unique needs.

Sigma Software Group operates as a dynamic ecosystem of tech companies, offering 25 ready-to-implement innovative products and 40+ value-added services. Furthermore, Sigma Software Group is committed to fostering innovation through initiatives such as the Sigma Software Labs business incubator, Sigma Software University, the SID Venture Partners VC Fund, UA Tech Network, Techosystem, the European Business Association, and other collaborative efforts.

Since 2015, Sigma Software Group has consistently earned recognition on the IAOP's prestigious World's Top 100 Outsourcing list. The company's accomplishments have also been acknowledged by prominent global media outlets such as Forbes, CNBC, The Times, and Reuters

Similar Jobs

Accuris Logo Accuris

Principal Software Engineer

Information Technology • Machine Learning • Software • Conversational AI • Generative AI • Manufacturing
Remote
Poland
1200 Employees

Altium Logo Altium

Team Leader, A365 Engineering Team

Cloud • Enterprise Web • Other • Productivity • Software • Analytics • Design
Remote
Poland
900 Employees

BigCommerce Logo BigCommerce

Software Engineer II

Cloud • Consumer Web • eCommerce • Information Technology • Software
Remote
4 Locations
1500 Employees

BigCommerce Logo BigCommerce

Software Engineer II (Scala)

Cloud • Consumer Web • eCommerce • Information Technology • Software
Remote
3 Locations
1500 Employees

Similar Companies Hiring

True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees
Caliola Engineering Thumbnail
Software • Machine Learning • Hardware • Defense • Data Privacy • App development • Aerospace
Colorado Springs, CO
53 Employees
Red 6 Thumbnail
Virtual Reality • Software • Hardware • Defense • Aerospace
Orlando, Florida
113 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account