Experienced Data Engineer - Data Engineering

Posted 23 Days Ago
San Francisco, CA
Hybrid
182K-243K Annually
Mid level
Fintech • Software
The Role
The Data Engineer will design and maintain robust data sets, ensuring data quality and performance for Plaid's data systems. Responsibilities include building data workflows using SQL and Python, collaborating with cross-functional teams, and advocating for industry-standard tools and practices. The role requires significant experience managing large data pipelines and evolving analytics schemas.
Summary Generated by Built In

We believe that the way people interact with their finances will drastically improve in the next few years. We’re dedicated to empowering this transformation by building the tools and experiences that thousands of developers use to create their own products. Plaid powers the tools millions of people rely on to live a healthier financial life. We work with thousands of companies like Venmo, SoFi, several of the Fortune 500, and many of the largest banks to make it easy for people to connect their financial accounts to the apps and services they want to use. Plaid’s network covers 12,000 financial institutions across the US, Canada, UK and Europe. Founded in 2013, the company is headquartered in San Francisco with offices in New York, Washington D.C., London and Amsterdam.


The main goal of the DE team in 2024-25 is to build robust golden data sets to power our business goal of creating more insights-based products. Making data-driven decisions is key to Plaid's culture. To support that, we need to scale our data systems while maintaining correct and complete data. We provide tooling and guidance to teams across engineering, product, and business, and help them explore our data quickly and safely to get the insights they need, which ultimately helps Plaid serve our customers more effectively. Data Engineers rely heavily on SQL and Python to build data workflows. We use tools like DBT, Airflow, Redshift, ElasticSearch, Atlanta, and Retool to orchestrate data pipelines and define workflows. We work with engineers, product managers, business intelligence, data analysts, and many other teams to build Plaid's data strategy and a data-first mindset. Our engineering culture is IC-driven: we favor bottom-up ideation and empowerment of our incredibly talented team. We are looking for engineers who are motivated by creating impact for our consumers and customers, growing together as a team, shipping the MVP, and leaving things better than we found them.


You will be in a high-impact role that directly enables business leaders to make faster and more informed business judgements based on the datasets you build. You will have the opportunity to carve out the ownership and scope of internal datasets and visualizations across Plaid, a currently unowned area that we intend to take over and build SLAs on. You will have the opportunity to learn best practices and up-level your technical skills from our strong DE team and from the broader Data Platform team. You will build strong cross-functional partnerships with all teams at Plaid, from Engineering to Product to Marketing and Finance.

Responsibilities

  • Understanding different aspects of the Plaid product and strategy to inform golden dataset choices, design, and data usage principles.
  • Keeping data quality and performance top of mind while designing datasets.
  • Leading key data engineering projects that drive collaboration across the company.
  • Advocating for adopting industry tools and practices at the right time.
  • Owning core SQL and Python data pipelines that power our data lake and data warehouse.
  • Delivering well-documented data with defined dataset quality, uptime, and usefulness.

Qualifications

  • 4+ years of dedicated data engineering experience, solving complex data pipeline issues at scale.
  • You have experience building data models and data pipelines on top of large datasets (on the order of 500TB to petabytes).
  • You value SQL as a flexible and extensible tool, and are comfortable with modern SQL data orchestration tools like DBT, Mode, and Airflow.
  • You have experience working with different performant warehouses and data lakes, such as Redshift, Snowflake, and Databricks.
  • You have experience building and maintaining batch and realtime pipelines using technologies like Spark and Kafka.
  • You appreciate the importance of schema design, and can evolve an analytics schema on top of unstructured data.
  • You are excited to try out new technologies. You like to produce proof-of-concepts that balance technical advancement and user experience and adoption.
  • You like to get deep in the weeds to manage, deploy, and improve low-level data infrastructure.
  • You are empathetic when working with stakeholders. You listen to them, ask the right questions, and collaboratively come up with the best solutions for their needs while balancing infra and business needs.
  • You are a champion for data privacy and integrity, and always act in the best interest of consumers.

Our mission at Plaid is to unlock financial freedom for everyone. To support that mission, we seek to build a diverse team of driven individuals who care deeply about making the financial ecosystem more equitable. We recognize that strong qualifications can come from both prior work experiences and lived experiences. We encourage you to apply to a role even if your experience doesn't fully match the job description. We are always looking for team members that will bring something unique to Plaid! Plaid is proud to be an equal opportunity employer and values diversity at our company. We do not discriminate based on race, color, national origin, ethnicity, religion or religious belief, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, military or veteran status, disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state, and local laws. Plaid is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance with your application or interviews due to a disability, please let us know at [email protected]


Please review our Candidate Privacy Notice here.

Top Skills

Python
SQL
The Company
HQ: San Francisco, CA
1,012 Employees
On-site Workplace
Year Founded: 2013

What We Do

Plaid is used by thousands of digital financial apps and services like Betterment, Expensify, Microsoft and Venmo, and by many of the largest banks to make it easy for consumers to connect their financial accounts with the apps and services they want to use. Plaid connects with over 11,000 financial institutions across the U.S., Canada and Europe.

At Plaid, we have diverse backgrounds and skills, but we're all passionate about building a more efficient and inclusive financial infrastructure—together.

