Company Description
Blend is a premier AI services provider, committed to co-creating meaningful impact for its clients through the power of data science, AI, technology, and people. With a mission to fuel bold visions, Blend tackles significant challenges by seamlessly aligning human expertise with artificial intelligence. The company is dedicated to unlocking value and fostering innovation for its clients by harnessing world-class people and data-driven strategy. We believe that the power of people and AI can have a meaningful impact on your world, creating more fulfilling work and projects for our people and clients. For more information, visit www.blend360.com
Job Description
About the Role
We are seeking a highly skilled and motivated Senior Data Scientist to join our dynamic team. The ideal candidate will have a proven track record in data science, with expertise in predictive modeling, natural language processing, and the ability to work with large language models (LLMs) and generative AI. If you're passionate about turning complex datasets into actionable insights and working in a fast-paced, innovative environment, this role is for you.
Key Responsibilities
- Data Exploration & Analysis: Perform data cleaning, quality control, and exploratory data analysis to uncover actionable insights.
- Predictive Modeling: Design and implement supervised and unsupervised predictive models using Python and advanced machine learning techniques.
- Natural Language Processing (NLP): Apply NLP techniques, including tokenization, named entity recognition (NER), sentiment analysis, and KPI extraction, to process and analyze unstructured data.
- Generative AI (GenAI) & LLMs: Leverage and fine-tune pre-trained large language models (e.g., GPT, BERT) for domain-specific tasks. Proficiency in prompt engineering, contextual embeddings, and text summarization is essential.
- Data Integration: Preprocess and prepare unstructured audio and text data for analysis and reporting.
- Collaboration: Work closely with cross-functional teams to define project requirements and deliver actionable solutions.
- Cloud Computing: Manage and process large datasets in cloud environments, ensuring scalability and efficiency.
- Communication: Translate technical findings into clear, concise insights for non-technical stakeholders.
Qualifications
- Proven experience as a Data Scientist with a strong portfolio of projects.
- Proficiency in Python and database technologies, including SQL.
- Deep expertise in natural language processing (NLP) and machine learning techniques.
- Hands-on experience with platforms such as OpenAI and Hugging Face for fine-tuning LLMs.
- Solid understanding of data engineering, including cleaning and preprocessing unstructured data.
- Excellent problem-solving, communication, and collaboration skills.
- Ability to handle large-scale datasets and work effectively in cloud environments.
- Fluent in English, with the ability to QA nuanced, complex medical text.
Additional Information
- Familiarity with relevant industries (e.g., manufacturing, energy, public sector, automotive, and utilities).
- Understanding of agile principles, including sprint planning, backlog grooming, and iterative development.
- Experience collaborating with cross-functional teams using tools like Jira or Trello.
Top Skills
What We Do
Our Vision is to build a company of world-class people that helps our clients optimize business performance through data, technology and analytics.
Blend360 has two divisions:
Data Science Solutions: We work at the intersection of data, technology and analytics.
Talent Solutions: We live and breathe the digital and talent marketplace.