Data Engineer

Posted Yesterday
United States of America
128K-267K Annually
Entry level
AdTech • Digital Media • Information Technology • Other
The Role
The Data Engineer will develop and improve data infrastructures and pipelines for machine learning and analytics. Responsibilities include designing systems for efficient data processing, collaborating with teams to implement algorithms, and troubleshooting data issues, while managing large volumes of data at petabyte scale.
Summary Generated by Built In

It takes powerful technology to connect our brands and partners with an audience of hundreds of millions of people. Whether you’re looking to write mobile app code, engineer the servers behind our massive ad tech stacks, or develop algorithms to help us process trillions of data points a day, what you do here will have a huge impact on our business—and the world.

A Little About Us

Yahoo makes the world’s daily habits inspiring and entertaining. By creating highly personalized experiences for our users, we keep people connected to what matters most to them, across devices and around the world. Yahoo’s vast businesses span across Search, Communications, Media, and many other verticals.

 

Yahoo generates terabytes of data every day and it is critical to collect, manage and process data at petabyte scale to provide timely and accurate insights to executives, sales, product managers and product developers on all aspects of user interaction. 

The Mail Analytics Engineering team at Yahoo is responsible for building mission critical data systems, pipelines, warehouses, analytics systems, and Machine Learning/AI/data mining programs for the Communications business, which includes Yahoo Mail, with 200M monthly active users. We are constantly pushing the envelope of data platforms due to the insane amount of data we need to harness. 

A Lot About You

As part of the Mail Analytics Engineering team, you will be working on data engineering infrastructures, pipelines and next generation Machine Learning- and AI-based data infrastructure, supporting new functionalities on existing platforms, and mining data for analytics insights and product features. 

Our Big Data footprints are among the largest few in the world, at double-digit petabyte scale. Developing this infrastructure presents many technical challenges in the areas of efficient query processing, large-scale stream processing, machine learning and modeling, as well as satisfying complex business rules.

If you are someone who is passionate about harnessing data at insane scale, enjoys working with new technologies, setting up petabyte data infrastructures and implementing new machine learning solutions and metrics systems, we want to hear from you!

Your Day

  • Develop new or improve existing data infrastructures for data processing machine learning, and deep learning using your core expertise

  • Work with other engineers to implement algorithms and systems in an efficient way

  • Take end to end ownership of Machine Learning-based distributed data systems - from data and training pipelines, to real time data serving engines.

  • Develop complex queries, very large volume data pipelines, and analytics applications

  • Develop complex queries and software programs to solve analytics and data mining problems

  • Interact with data analysts, data scientists, product managers, and software engineers to understand business problems, technical requirements to deliver data solutions

  • Prototype new metrics or data systems

  • Lead data investigations to troubleshoot data issues that arise along the data pipelines

  • Maintenance and improvement of released systems

  • Engineering consulting on large and complex warehouse data

You Must Have

  • BS/MS/PhD in Computer Science/Electrical Engineering, or related engineering disciplines, ideally with specialization in Data Engineering or Machine Learning

  • Strong fundamentals: algorithms, distributed computing, data structure, database

  • Fluency with: Python/Java/SQL

  • Self-driven, challenge-loving, detail oriented, teamwork spirit, excellent communication skills, ability to multitask and manage expectations

Preferred

  • Experience in Hadoop technologies (Map/Reduce, Pig, Hive, HBase, Storm, Spark, Kafka, Oozie).

  • Experience with Google Cloud Platform (BiqQuery, Dataproc, Dataflow, etc.) a big plus

  • Experience with machine learning algorithms, NLP, and/or statistical methods a big plus

  • Experience in any of: machine learning, analytics, data mining, or data mart and warehouse

  • Experience with Deep Learning platforms (Tensorflow/Keras/Spark MLlib) and SQL/Unix/Shell

The material job duties and responsibilities of this role include those listed above as well as adhering to Yahoo policies; exercising sound judgment; working effectively, safely and inclusively with others; exhibiting trustworthiness and meeting expectations; and safeguarding business operations and brand integrity.

At Yahoo, we offer flexible hybrid work options that our employees love! While most roles don’t require regular office attendance, you may occasionally be asked to attend in-person events or team sessions. You’ll always get notice to make arrangements. Your recruiter will let you know if a specific job requires regular attendance at a Yahoo office or facility. If you have any questions about how this applies to the role, just ask the recruiter!

Yahoo is proud to be an equal opportunity workplace. All qualified applicants will receive consideration for employment without regard to, and will not be discriminated against based on age, race, gender, color, religion, national origin, sexual orientation, gender identity, veteran status, disability or any other protected category. Yahoo will consider for employment qualified applicants with criminal histories in a manner consistent with applicable law. Yahoo is dedicated to providing an accessible environment for all candidates during the application process and for employees during their employment. If you need accessibility assistance and/or a reasonable accommodation due to a disability, please submit a request via the Accommodation Request Form (www.yahooinc.com/careers/contact-us.html) or call +1.866.772.3182. Requests and calls received for non-disability related issues, such as following up on an application, will not receive a response.

We believe that a diverse and inclusive workplace strengthens Yahoo and deepens our relationships. When you support everyone to be their best selves, they spark discovery, innovation and creativity. Among other efforts, our 11 employee resource groups (ERGs) enhance a culture of belonging with programs, events and fellowship that help educate, support and create a workplace where all feel welcome. Check out our diversity and inclusion (www.yahooinc.com/diversity/) page to learn more.

The compensation for this position ranges from $128,250.00 - $266,875.00/yr and will vary depending on factors such as your location, skills and experience. The compensation package may also include incentive compensation opportunities in the form of discretionary annual bonus or commissions, in addition to equity incentives. Our comprehensive benefits include healthcare, a great 401k, backup childcare, education stipends and much (much) more.

Currently work for Yahoo? Please apply on our internal career site.

Top Skills

Java
Python
SQL
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Sunnyvale, CA
10,001 Employees
On-site Workplace

What We Do

Yahoo is a global media and tech company that connects people to their passions. We reach nearly 900 million people around the world, bringing them closer to what they love—from finance and sports, to shopping, gaming and news—with the trusted products, content and tech that fuel their day. For partners, we provide a full-stack platform for businesses to amplify growth and drive more meaningful connections across advertising, search and media.

Similar Jobs

Capital One Logo Capital One

Data Engineer

Fintech • Machine Learning • Payments • Software • Financial Services
Hybrid
2 Locations
55000 Employees
121K-152K Annually

Capital One Logo Capital One

Data Engineer

Fintech • Machine Learning • Payments • Software • Financial Services
Hybrid
Richmond, VA, USA
55000 Employees
121K-138K Annually

Capital One Logo Capital One

Data Engineer Sr. Associate

Fintech • Machine Learning • Payments • Software • Financial Services
Hybrid
5 Locations
55000 Employees
121K-166K Annually

Capital One Logo Capital One

Distinguished Data Engineer - Card Technology

Fintech • Machine Learning • Payments • Software • Financial Services
Hybrid
4 Locations
55000 Employees
240K-329K Annually

Similar Companies Hiring

InCommodities Thumbnail
Renewable Energy • Machine Learning • Information Technology • Energy • Automation • Analytics
Austin, TX
234 Employees
Hedra Thumbnail
Software • News + Entertainment • Marketing Tech • Generative AI • Enterprise Web • Digital Media • Consumer Web
San Francisco, CA
14 Employees
HERE Technologies Thumbnail
Software • Logistics • Internet of Things • Information Technology • Computer Vision • Automotive • Artificial Intelligence
Amsterdam, NL
6000 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account