Senior Data Engineer

Posted 19 Days Ago
Hiring Remotely in United States
Remote
Senior level
Cloud • eCommerce • Information Technology • Professional Services • Software
Cleo is the pioneer and global leader in the ecosystem integration software category.
The Role
The Senior Data Engineer will design, develop, and maintain data pipelines and infrastructure, focusing on optimizing data for AI/ML models. Responsibilities include leading the creation of data warehouses, ensuring data quality, and collaborating with cross-functional teams to meet data requirements for machine learning applications.
Summary Generated by Built In

The Senior Data Engineer is a hands-on leader responsible for designing, developing, and maintaining data pipelines and infrastructure at Cleo. This role involves setting the strategy for data systems, collaborating closely with cross-functional teams, and ensuring the scalability, reliability, and efficiency of data solutions. The Senior Data Engineer will focus on data infrastructure needs for AI/ML models, overseeing the creation of a data warehouse and associated systems from scratch, and ensuring data is properly transformed and optimized for machine learning and artificial intelligence applications. This role is integral to building the processes that support data transformation, data structures, metadata management, data quality controls, and workload management.
What You Will Be Doing

  • Lead the Design and Build of Data Pipelines: Develop and maintain scalable, reliable, and efficient data pipelines that collect, process, and store large datasets. Ensure these systems are optimized for AI/ML model training and inference.
  • Set Data Infrastructure Strategy: Define and execute the strategy for building and maintaining data warehouses, data lakes, and other data storage systems that support both operational and analytical needs. Ensure systems are optimized for AI/ML model workflows.
  • Hands-On Data Transformation for AI/ML Models: Design and implement data transformation processes, including feature engineering, data preprocessing, and data augmentation, to ensure data is in the right format for machine learning models.
  • Build Data Structures and Metadata Management: Build and manage the data structures, metadata repositories, and related systems that support data transformation, ensuring the organization has well-documented, accurate, and accessible data for AI/ML workflows.
  • Data Quality Controls and Risk Management: Establish and implement data quality controls to ensure that data used in machine learning models is clean, consistent, and accurate. Identify and raise risks at all stages of the data engineering process, including data ingestion, transformation, and storage.
  • Collaborate with Cross-Functional Teams: Work closely with data scientists, ML engineers, product managers, and business leaders to understand data requirements and ensure data systems meet the needs of AI/ML initiatives. Provide hands-on leadership to guide teams in transforming data for model training.
  • ETL Development and Optimization for AI/ML: Lead the development and optimization of ETL (Extract, Transform, Load) processes for ML/AI data. Focus on efficiently moving and transforming large datasets, ensuring data quality and readiness for model training and deployment.
  • Optimize Data for Model Training and Inference: Ensure data pipelines are designed to support the needs of AI/ML model training, such as handling large volumes of data, managing data quality, and enabling fast model iteration.
  • Data Governance for AI/ML: Define and implement data governance practices to ensure the secure, compliant, and ethical use of data in AI/ML workflows.
  • Stay Current: Keep up with emerging trends in data engineering, particularly those related to AI/ML model development, and implement best practices to optimize data systems for machine learning.


Your Qualifications

  • Experience: 5-7+ years of experience in data engineering, with a focus on transforming data for AI/ML models and optimizing data systems to support machine learning and artificial intelligence workflows.
  • Hands-On Expertise: Proven experience in hands-on data transformation and building data pipelines for AI/ML, including data preprocessing, feature engineering, and model-specific data handling.
  • Leadership: Experience leading or mentoring data engineering teams, providing hands-on guidance for AI/ML-related projects and collaborating with cross-functional teams.
  • Cloud and Big Data: Strong experience with cloud platforms and big data technologies, particularly in the context of AI/ML model development.


A few things we have to offer:

  • Competitive compensation
  • Great Healthcare + Dental + Vision
  • Flexible PTO
  • Culture of support, encouraging Life-Work balance
  • 401k match
  • FSA and HSA options
  • Employee Assistance Program
  • Paid Parental Leave
  • Representing a company with 4,000+ clients and a 99% retention rate
  • Accelerated title and salary growth potential
  • A fun and energetic work environment that makes you excited to go to work every day

Top Skills

AI
Data Engineering
Ml

What the Team is Saying

Sarah
Tom
Rashon
Austin
Tico
Brittney
Mahesh
Shane
Topher
The Company
HQ: Rockford, IL
400 Employees
Hybrid Workplace
Year Founded: 1976

What We Do

Cleo is an ecosystem integration software company focused on business outcomes, ensuring each customer’s potential is realized by delivering solutions that make it easy to discover and create value through the movement and integration of B2B enterprise data. Cleo gives customers strategic, “outside-in” visibility into the critical end-to-end business flows happening across their ecosystems of partners and customers, marketplaces, and internal cloud and on-premise applications. Our solutions empower teams to drive business agility, accelerate onboarding, facilitate modernization of key business processes, and capture new revenue streams by reimagining and remastering their digital ecosystem through robust application, B2B, and data integration technologies.

Cleo Integration Cloud (CIC) is a cloud-based integration platform, purpose-built to design, build, operate and optimize critical ecosystem integration processes. The CIC platform brings end-to-end integration visibility across API, EDI and non-EDI integrations that gives technical and business users the confidence to rapidly onboard trading partners, enable integration between applications, and accelerate revenue-generating business processes. On the platform, businesses have the choice of self-service, managed services, or a blended approach – ensuring complete flexibility and control over their B2B integration strategy.

Cleo isn't just about EDI, iPaaS, API integration, or managed integration services. Rather we are about blending the best elements of these traditional integration technologies into a single platform, so organizations can connect and transact business across their entire ecosystem with complete confidence.

Why Work With Us

Cleo’s people share a genuine respect for each other and for the future we’re helping our customers to create. Beyond guiding how we act, and more important than driving us to deliver solutions that help our company and our customers succeed, our core focus on Appreciation creates something even more essential, for everyone: enduring value.

Gallery

Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery

Cleo Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Typical time on-site: Flexible
HQRockford, IL
India
Chicago, IL
Netherlands
United Kingdom
Learn more

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account