Big Data Lead

Posted 17 Days Ago
Be an Early Applicant
Hiring Remotely in Pune, Mahārāshtra
Remote
Senior level
Information Technology • Consulting
The Role
Lead a team in leveraging big data tech to solve complex data issues, hands-on coding, large scale text data processing, event driven data pipelines, cloud native services in AWS and GCP.
Summary Generated by Built In

Company Description

At DemandMatrix, our vision is to disrupt the $100 billion sales and marketing intelligence industry by using domain knowledge, machine learning and AI. Fortune 100 companies like Microsoft, Google, Adobe, Amazon, IBM trust us to identify their next customer.

Job Description

What will you do?

  • To help us go to the next level we are looking to onboard a hands-on SME in leveraging big data tech to solve the most complex data issues. You will spend almost half of time with hands-on coding.
  • It involves large scale text data processing, event driven data pipelines, in-memory computations, optimization considering CPU core to network IO to disk IO.
  • You will be using cloud native services in AWS and GCP.

Who Are You? 

  • Solid grounding in computer engineering, Unix, data structures and algorithms would enable you to meet this challenge.
  • Designed and built multiple big data modules and data pipelines to process large volume. 
  • Genuinely excited about technology and worked on projects from scratch. 

Must have:

  • 7+ years of hands-on experience in Software Development with a focus on big data and large data pipelines.
  • Minimum 3 years of experience to build services and pipelines using Python.
  • Expertise with a variety of data processing systems, including streaming, event, and batch (Spark, Hadoop/MapReduce)
  • Understanding of at least one NoSQL stores like MongoDB, Elasticsearch, HBase
  • Understanding of how data models, sharding and data location strategies for distributed data stores in large scale high-throughput and high-availability environments and their effect in non-structured text data processing
  • Experience with running scalable & high available systems with AWS or GCP.

Good to have:

  • Experience with Docker / Kubernetes
  • Exposure with CI/CD
  • Knowledge of Crawling/Scraping

Additional Information


  • Entire Work From Home
  • Birthday Leave
  • Remote Work 

Top Skills

Python
The Company
Cupertino, California
56 Employees
On-site Workplace
Year Founded: 2015

What We Do

DemandMatrix is an AI-powered Technographics and Intent Data provider that helps B2B marketing and sales teams identify and target the right accounts based on their propensity to buy-into a particular technology

Similar Jobs

Capco Logo Capco

Jr Data Analyst

Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
Remote
India
6000 Employees

CrowdStrike Logo CrowdStrike

OT - Vulnerability Analyst (Remote, IND)

Cloud • Information Technology • Sales • Security • Cybersecurity
Remote
India
10000 Employees

Atlassian Logo Atlassian

Sr. Data Engineer

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
Remote
India
11000 Employees

Atlassian Logo Atlassian

Senior People Operations Business Analyst

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
Remote
India
11000 Employees

Similar Companies Hiring

Silverfort Thumbnail
Security • Sales • Information Technology • Cybersecurity • Automation
GB
357 Employees
Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees
InCommodities Thumbnail
Renewable Energy • Machine Learning • Information Technology • Energy • Automation • Analytics
Austin, TX
234 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account