Data Engineer / Data Scientist

Posted 2 Days Ago
Noida, Gautam Buddha Nagar, Uttar Pradesh
Mid level
Artificial Intelligence • Big Data • Information Technology • Security • Software
The Role
The Data Engineer/Data Scientist will design and maintain data pipelines, optimize workflows for large datasets using Databricks and Spark, develop and deploy machine learning models, and monitor their performance. They should promote best practices and present insights to non-technical audiences.
Summary Generated by Built In

Location: Noida, India

Thales people architect identity management and data protection solutions at the heart of digital security. Businesses and governments rely on us to bring trust to the billions of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become smarter, and much more. More than 30,000 organizations already rely on us to verify the identities of people and things, grant access to digital services, analyze vast quantities of information, and encrypt data to make the connected world more secure.

Present in India since 1953, Thales is headquartered in Noida, Uttar Pradesh, and has operational offices and sites spread across Bengaluru, Delhi, Gurugram, Hyderabad, Mumbai, and Pune, among others. Over 1,800 employees work with Thales and its joint ventures in India. Since the beginning, Thales has played an essential role in India’s growth story by sharing its technologies and expertise in the Defence, Transport, Aerospace, and Digital Identity and Security markets.

Data Engineer Cum Data Scientist

Key Responsibilities:

Data Engineering:

  • Design, build, and maintain scalable and efficient data pipelines on Databricks for machine learning workflows.
  • Optimize data processing workflows to handle large-scale datasets using Spark and Delta Lake.
  • Implement best practices for data versioning, quality, and governance.

Model Development & Deployment:

  • Develop, train, and fine-tune machine learning models, such as cross-sell, classification, and segmentation models, to meet business objectives.
  • Use MLflow to track experiments, manage model lifecycle, and ensure reproducibility of results.
  • Register and version models in Databricks Model Registry and maintain model lineage.

Model Productionization:

  • Deploy models to production environments and integrate them into real-time or batch systems.
  • Implement robust CI/CD pipelines for machine learning workflows in Databricks.
  • Monitor model performance in production using metrics and feedback loops.

Innovation, Collaboration & Best Practices:

  • Stay updated with the latest trends and advancements in data engineering and machine learning technologies.
  • Promote best practices in machine learning, including feature engineering, hyperparameter tuning, and model evaluation.
  • Present results and insights from machine learning experiments to non-technical audiences.

Qualifications & Skills:

Must-Have Skills:

  • Experience: 3+ years in data engineering and machine learning roles, with expertise in Databricks.
  • Technical Stack: Strong experience with Databricks, Apache Spark, PySpark, SQL, MLflow, Model Registry, MLOps, and Terraform.
  • Programming: Proficiency in Python for data processing and machine learning.
  • Model Deployment: Hands-on experience with deploying and managing models in production.
  • ML Lifecycle: Deep understanding of the end-to-end machine learning lifecycle, including model evaluation, monitoring, and maintenance.
  • Cloud Platforms: Experience with cloud platforms like GCP for deploying Databricks solutions.

Good-to-Have Skills:

  • Familiarity with generative AI and deep learning frameworks such as TensorFlow or PyTorch.

At Thales, we provide CAREERS and not only jobs. With Thales employing 80,000 employees in 68 countries, our mobility policy enables thousands of employees each year to develop their careers at home and abroad, in their existing areas of expertise or by branching out into new fields. Together we believe that embracing flexibility is a smarter way of working. Great journeys start here. Apply now!

Top Skills

Python
The Company
Arlington, VA
63,258 Employees
On-site Workplace

What We Do

Thales is a global high technology leader investing in digital and “deep tech” innovations – connectivity, big data, artificial intelligence, cybersecurity and quantum technology – to build a future we can all trust, which is vital to the development of our societies. The company provides solutions, services and products that help its customers – businesses, organisations and states – in the defence, aeronautics, space, transportation and digital identity and security markets to fulfil their critical missions, by placing humans at the heart of the decision-making process.

