Data Integration Engineer

Posted 11 Hours Ago
Be an Early Applicant
Hiring Remotely in United States
Remote
Mid level
Information Technology • Software
The Role
The Data Integration Engineer will design and develop data pipelines for AI solutions, manage AI projects, mentor AI engineers, and implement strategies aligned with organizational goals. Responsibilities include optimizing data storage and retrieval, training models, and ensuring data quality while collaborating with data teams.
Summary Generated by Built In

Position Type: Remote/Hybrid, will need to report onsite in Richmond, VA quarterly
VA or EST candidates only
Contract Length: 10 months
Position Overview:

The Lead Agentic Data Engineer will design, develop, and deploy data pipelines that power agentic AI solutions for real-world challenges. This role requires expertise in data processing for agentic systems, ensuring data quality, optimizing storage and retrieval, and facilitating seamless interaction between AI agents and data sources.
Required Skills:

  • Understanding the Big data Technologies
  • Experience developing ETL and ELT pipelines
  • Experience with Spark, GraphDB, Azure Databricks
  • Expertise in Data Partitioning
  • 3 years of experience with Data conflation
  • 3 years of experience developing Python Scripts
  • 2 years of experience training LLMs with structured and unstructured data sets
  • 3 years of experience with GIS spatial data

Duties:

  • Guiding and mentoring AI engineers, helping them develop their skills and knowledge in the field. 
  • Leading and managing AI projects, ensuring they stay on track, meet deadlines, and the findings are actionable and relevant. 
  • Contributing to the creation and implementation of AI strategies that align with the organization's goals and objectives. 
  • Designing and developing data pipelines for agentic systems, develop Robust data flows to handle complex interactions between AI agents and Data sources. 
  • Ability to use advanced mathematical modeling, statistical analysis, and optimization techniques to gather and analyze data, identifying problems and developing solutions to improve efficiency in prompts. 
  • Ability to train and fine tune large language models and Design and build the data architecture, including databases, data warehouses, and data lakes, to support various data engineering tasks. 
  • Develop and manage Extract, Load, transform (ELT) processes to ensure data is accurately and efficiently moved from source systems to analytical platforms used in data science. 
  • Implement data pipelines that facilitate feedback loops, allowing human input to improve system performance in human-in-the-loop systems. 
  • Work with vector databases to store and retrieve embeddings efficiently. 
  • Collaborate with data scientists and engineers to preprocess data, train models, and integrate AI into applications. 
  • Optimize data storage and retrieval with high performance 

Top Skills

Azure Databricks
Big Data
Elt
ETL
Gis
Graphdb
Python
Spark
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Camp Hill, Pennsylvania
31 Employees
On-site Workplace
Year Founded: 2012

What We Do

LingaTech, Inc., a minority owned business, NMSDC MBE Certified, and is PA Small Diverse Business (SDB) Verified since 2014. We are a member of the Harrisburg Regional Chamber of Commerce & CREDC, and are registered with Hireveterans.com & PurplePlacement.com.

We believe in technology innovation and customer partnership to deliver world class IT consultants, products and services. We provide high end consultants to partner with your organization to maximize your growth and achieve your IT goals. As your technology partner, when your business grows we grow with you.

We offer software services in three major areas – Product development, Custom software development and Project Management. With professionals having more than 15 years of experience in software industry, our clients are assured of products/services that are of great quality.

Similar Jobs

ClearCaptions Logo ClearCaptions

Real-Time Cloud Data Integration Engineer

Cloud • Hardware • Healthtech • Information Technology • Mobile • Other • Infrastructure as a Service (IaaS)
Remote
United States
273 Employees
122K-130K Annually

ClearCaptions Logo ClearCaptions

Real-Time Cloud Data Integration Engineer

Cloud • Hardware • Healthtech • Information Technology • Mobile • Other • Infrastructure as a Service (IaaS)
Remote
United States
273 Employees
122K-130K Annually
Remote
USA
52 Employees

Pendulum Logo Pendulum

Staff Systems Integration Engineer, Data and Systems

Artificial Intelligence • Information Technology • Machine Learning
Remote
USA
13 Employees
190K-205K Annually

Similar Companies Hiring

True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees
Caliola Engineering Thumbnail
Software • Machine Learning • Hardware • Defense • Data Privacy • App development • Aerospace
Colorado Springs, CO
53 Employees
Red 6 Thumbnail
Virtual Reality • Software • Hardware • Defense • Aerospace
Orlando, Florida
113 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account