GCP Data Engineer

Posted 4 Days Ago
Be an Early Applicant
Johannesburg, Gauteng
Mid level
Artificial Intelligence • Information Technology • Machine Learning • Software • Virtual Reality • Analytics
The Role
The GCP Data Engineer will design and maintain scalable data pipelines and ETL processes on Google Cloud Platform, utilizing tools like BigQuery and Terraform. They will collaborate with data scientists for machine learning model deployment, ensuring data quality and integrity while writing maintainable code in Python and PySpark.
Summary Generated by Built In

Company Description

We are a Digital Product Engineering company that is scaling in a big way! We build products, services, and experiences that inspire, excite, and delight. We work at scale — across all devices and digital mediums, and our people exist everywhere in the world (18000+ experts across 37 countries, to be exact). Our work culture is dynamic and non-hierarchical. We are looking for great new colleagues. That is where you come in!

Job Description

  • Design, develop, and maintain scalable data pipelines and ETL processes on Google Cloud Platform (GCP).
  • Implement and optimize data storage solutions using BigQuery, Cloud Storage, and other GCP services.
  • Collaborate with data scientists, machine learning engineers, data engineer and other stakeholders to integrate and deploy machine learning models into production environments.
  • Develop and maintain custom deployment solutions for machine learning models using tools such as Kubeflow, AI Platform, and Docker.
  • Write clean, efficient, and maintainable code in Python and PySpark for data processing and transformation tasks.
  • Ensure data quality, integrity, and consistency through data validation and monitoring processes. - Deep understanding of Medallion architecture.
  • Develop metadata driven pipelines and ensure optimal processing of data
  • Use Terraform to manage and provision cloud infrastructure resources on GCP.
  • Troubleshoot and resolve production issues related to data pipelines and machine learning models.
  • Stay up-to-date with the latest industry trends and best practices in data engineering, machine learning, and cloud technologies. - understands data lifecycle management, data pruning, model drift and model optimisations.

Qualifications

Must have Skills : Machine Learning - General Experience, Visualization,Google Cloud Platform, Pyspark,

  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
  • Proven experience as a Data Engineer with a focus on GCP.
  • Strong proficiency in Python and PySpark for data processing and transformation.
  • Hands-on experience with machine learning model deployment and integration on GCP.
  • Familiarity with GCP services such as BigQuery, Cloud Storage, Dataflow, and AI Platform.
  • Experience with Terraform for infrastructure as code.
  • Experience with containerization and orchestration tools like Docker and Kubernetes.
  • Strong problem-solving skills and the ability to troubleshoot complex issues.
  • Excellent communication and collaboration skills.

Additional Information

*Preferred Qualifications:**

  • Experience with custom deployment solutions and MLOps.
  • Knowledge of other cloud platforms (AWS, Azure) is a plus.
  • Familiarity with CI/CD pipelines and tools like Jenkins or GitLab CI.
  • Visualisation experience is nice to have but not mandatory.
  • Certification in GCP Data Engineering or related fields.

Top Skills

Pyspark
Python
The Company
19,994 Employees
On-site Workplace
Year Founded: 1996

What We Do

Nagarro helps future-proof your business through a forward-thinking, fluidic, and CARING mindset. We excel at digital engineering and help our clients become human-centric, digital-first organizations, augmenting their ability to be responsive, efficient, intimate, creative, and sustainable. Today, we are 19,000 experts across 36 countries, forming a Nation of Nagarrians, ready to help our customers succeed.

Similar Jobs

Pfizer Logo Pfizer

MAPP Navigator Coordinator

Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
Hybrid
Sandton, City of Johannesburg, Gauteng, ZAF
121990 Employees
Hybrid
Johannesburg, Gauteng, ZAF
289097 Employees

TransUnion Logo TransUnion

Product Manager

Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
Hybrid
Johannesburg, Gauteng, ZAF
13000 Employees

TransUnion Logo TransUnion

Marketing Sales Specialist (EU Centric)

Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
Remote
Hybrid
10 Locations
13000 Employees

Similar Companies Hiring

Stepful Thumbnail
Software • Healthtech • Edtech • Artificial Intelligence
New York, New York
60 Employees
HERE Technologies Thumbnail
Software • Logistics • Internet of Things • Information Technology • Computer Vision • Automotive • Artificial Intelligence
Amsterdam, NL
6000 Employees
True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account