Director of Data Engineering, Data and Systems

Posted 6 Hours Ago
Be an Early Applicant
San Francisco, CA
Hybrid
175K-225K Annually
Senior level
Artificial Intelligence • Information Technology • Machine Learning
The Role
The Director of Data Engineering will lead the optimization and scaling of Pendulum's data infrastructure, ensuring reliable data pipelines and compliance with governance standards. They will mentor a data engineering team, drive data strategies aligned with business goals, and support AI/ML initiatives.
Summary Generated by Built In

About Pendulum


Pendulum® is leading a revolution that is occurring around the world to improve physical and mental health by first understanding, then restoring and enhancing the human microbiome. 


Studies have shown that our microbiome (the bacterial communities in and on our bodies) is linked to everything from metabolism and diabetes, to longevity, weight loss, healthy immune systems, cancer prevention, feelings of well-being, inflammatory bowel disease, and even healthy skin. We have just scratched the surface on understanding the impact that our microbiome has on our lives. During early life we develop a diverse and balanced microbiome that plays a critical role in shaping our long-term health. Over our lives, a combination of diet, lifestyle, antibiotics, and aging can decrease the effectiveness of our microbiome.


Pendulum recognized the enormous impact they could have on people’s lives if they were able to address the imbalances in the microbiome. To accomplish this, Pendulum created proprietary probiotic pipelines and a unique discovery platform to identify key, novel bacterial strains and the prebiotics that feed them. The company has also built and developed the world’s first manufacturing technology to produce bacteria in an anaerobic (oxygen-free) environment at scale.


The medical probiotics that Pendulum has formulated have transformed the consumer probiotics market into a new category of therapeutic offerings that deliver the power and efficacy of a pharmaceutical with the safety and accessibility of a natural probiotic. Due to Pendulum’s explosive revenue and customer growth over the last two years, the company earned a spot on Forbes Magazine’s exclusive “The Next Billion Dollar Startups” list.

If you’re interested in improving the lives of people globally and you love working in a cross-functional, collaborative, inspiring environment, please continue reading.


Position Summary:


We are seeking a Director of Data Engineering to lead the development, optimization, and scaling of our data platform and infrastructure. This role will be pivotal in building and maintaining robust data pipelines, managing cloud-based data environments, and driving data platform innovations to support AI/ML initiatives. As a key leader, you will oversee the development of scalable, real-time data architectures, ensuring data reliability, accessibility, and compliance with data governance standards. The ideal candidate will have a strong technical foundation, proven leadership in managing data engineering teams, and the ability to align data strategies with business goals. The role will be fully hands-on with a couple members on the team, with plan to build out over the years.


What You'll Do:

  • Lead, mentor, and grow a high-performing data engineering team, fostering collaboration, innovation, and continuous learning to meet evolving business needs. 

  • Define and drive the data infrastructure strategy, ensuring alignment with business goals, scalability, and adaptability to support data-driven decision-making and AI/ML initiatives.

  • Own the management and optimization of data warehouses and lakehouse (e.g., Snowflake), ensuring data is reliable, performant, and accessible for stakeholders. 

  • Collaborate with cross-functional leaders (e.g., Data Science, Product, Analytics) to build data foundations that support machine learning, analytics, and business intelligence efforts. 

  • Ensure best practices in data governance, security, and compliance (GDPR, CCPA), managing data quality, lineage, and security frameworks across all platforms. 

  • Develop efficient data models and schema designs to support data integration and querying, ensuring scalability and performance across business functions. 

  • Design and maintain scalable ETL pipelines using tools like dbt, Fivetran, and Airflow, with a focus on real-time processing and automation. 

  • Leverage Docker, Kubernetes, and automation tools to streamline data processing workflows and ensure efficient resource utilization across the data platform. 

  • Establish a robust operational framework, including monitoring, alerting, and incident management, to ensure the stability and performance of data platforms in production environments.

Knowledge Requirements:

  • MSc/PhD in Computer Science or a related field. 
  • 10+ years of experience building and managing data infrastructure, with expertise in data pipelines, cloud platforms, and big data technologies. 
  • Expert in Python, SQL, and big data tools (Kafka, Spark, Hive/Iceberg), with significant experience in cloud platforms (AWS, GCP, Azure). 
  • Proven experience with containerization (Docker, Kubernetes) and orchestration tools like Airflow to scale and optimize workflows.
  • Strong understanding of data quality management, governance, and regulatory compliance, with experience in GDPR and CCPA requirements.
  • Demonstrated ability to collaborate with cross-functional teams, including Data Science and ML teams, to deliver data solutions aligned with AI/ML and business objectives. 
  • Leadership experience managing data engineering teams, driving strategic decisions, and adapting to the latest advancements in data technologies.

Salary & Benefits

  •  $175,000-$225,000
  • Medical, Dental, and Vision
  • Commuter Benefits
  • Life & STD Insurance
  • Company match on 401 (k)
  • Flexible Time Off (FTO)
  • Equity

Top Skills

AI
Data Engineering
Ml
The Company
HQ: Seattle, WA
13 Employees
On-site Workplace
Year Founded: 2021

What We Do

Pendulum is an information forensics company that classifies non-textual, publicly available content to identify the source, message, audience, and evolution of harmful narratives.

Similar Jobs

Los Angeles, CA, USA
4500 Employees

Roblox Logo Roblox

[2025] Data Scientist, Ecosystem and Learning Platform - PhD New Grad

Computer Vision • Gaming • Software • Virtual Reality • Web3 • Metaverse
San Mateo, CA, USA
2500 Employees
201K-201K Annually

Gusto Logo Gusto

FBOS Data Partner

Fintech • HR Tech
3 Locations
2674 Employees

Block Logo Block

Staff Machine Learning Engineer (Modeling), Risk

Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
Hybrid
8 Locations
12000 Employees
168K-297K Annually

Similar Companies Hiring

Silverfort Thumbnail
Security • Sales • Information Technology • Cybersecurity • Automation
GB
357 Employees
Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees
InCommodities Thumbnail
Renewable Energy • Machine Learning • Information Technology • Energy • Automation • Analytics
Austin, TX
234 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account