Data Research Engineer - Data Extraction

Posted 4 Days Ago
Be an Early Applicant
Hiring Remotely in Mumbai, Maharashtra
Remote
Mid level
Insurance • Software • Energy • Financial Services
The Role
The Data Research Engineer will develop AI solutions, manage data extraction processes, build data pipelines, and leverage AI technologies to improve operations within the team.
Summary Generated by Built In

Company Description

Forbes Advisor is a new initiative for consumers under the Forbes Marketplace umbrella that provides journalist- and expert-written insights, news and reviews on all things personal finance, health, business, and everyday life decisions. We do this by providing consumers with the knowledge and research they need to make informed decisions they can feel confident in, so they can get back to doing the things they care about most.

At Marketplace, our mission is to help readers turn their aspirations into reality. We arm people with trusted advice and guidance, so they can make informed decisions they feel confident in and get back to doing the things they care about most.

We are an experienced team of industry experts dedicated to helping readers make smart decisions and choose the right products with ease. Marketplace boasts decades of experience across dozens of geographies and teams, including Content, SEO, Business Intelligence, Finance, HR, Marketing, Production, Technology and Sales. The team brings rich industry knowledge to Marketplace’s global coverage of consumer credit, debt, health, home improvement, banking, investing, credit cards, small business, education, insurance, loans, real estate and travel.

The Data Extraction Team is a brand-new team who plays a crucial role in our organization by designing, implementing, and overseeing advanced web scraping frameworks. Their core function involves creating and refining tools and methodologies to efficiently gather precise and meaningful data from a diverse range of digital platforms. Additionally, this team is tasked with constructing robust data pipelines and implementing Extract, Transform, Load (ETL) processes. These processes are essential for seamlessly transferring the harvested data into our data storage systems, ensuring its ready availability for analysis and utilization.

A typical day in the life of a Data Research Engineer will involve coming up with ideas regarding how the company/team can best harness the power of AI/LLM, and use it not only simplify operations within the team, but also to streamline the work of the research team in gathering/retrieving large sets of data. The role is that of a leader who sets a vision for the future of AI/LLM’s use within the team and the company. They think outside the box and are proactive in engaging with new technologies and developing new ideas for the team to move forward in the AI/LLM field. The candidate should also at least be willing to acquire some basic skills in scraping and data pipelining.

 

Job Description

Responsibilities:

● Develop methods to leverage the potential of LLM and AI within the team.
● Proactive at finding new solutions to engage the team with AI/LLM, and streamline processes
in the team.
● Be a visionary with AI/LLM tools and predict how the use of future technologies could be
harnessed early on so that when these technologies come out, the team is ahead of the game
regarding how it could be used.
● Assist in acquiring and integrating data from various sources, including web crawling and API
integration.
● Stay updated with emerging technologies and industry trends.
● Explore third-party technologies as alternatives to legacy approaches for efficient data
pipelines.
● Contribute to cross-functional teams in understanding data requirements.
● Assume accountability for achieving development milestones.
● Prioritize tasks to ensure timely delivery, in a fast-paced environment with rapidly changing
priorities.
● Collaborate with and assist fellow members of the Data Research Engineering Team as
required.
● Leverage online resources effectively like StackOverflow, ChatGPT, Bard, etc., while
considering their capabilities and limitations.

Qualifications

Skills and Experience

● Bachelor's degree in Computer Science, Data Science, or a related field. Higher
qualifications is a plus.
● Think proactively and creatively regarding the next AI/LLM technologies and how to use
them to the team’s and company’s benefits.
● “Think outside the box” mentality.
● Experience prompting LLMs in a streamlined way, taking into account how the LLM can
potentially “hallucinate” and return wrong information.
● Experience building agentic AI platforms with modular capabilities and autonomous
task execution. (crewai, lagchain, etc.)
● Proficient in implementing Retrieval-Augmented Generation (RAG) pipelines for
dynamic knowledge integration. (chromadb, pinecone, etc)
● Experience managing a team of AI/LLM experts is a plus: this includes setting up goals
and objectives for the team and fine-tuning complex models.
● Strong proficiency in Python programming
● Proficiency in SQL and data querying is a plus.
● Familiarity with web crawling techniques and API integration is a plus but not a must.
● Experience in AI/ML engineering and data extraction
● Experience with LLMs, NLP frameworks (spaCy, NLTK, Hugging Face, etc.)
● Strong understanding of machine learning frameworks (TensorFlow, PyTorch)
● Design and build AI models using LLMs
● Integrate LLM solutions with existing systems via APIs
● Collaborate with the team to implement and optimize AI solutions
● Monitor and improve model performance and accuracy
● Familiarity with Agile development methodologies is a plus.
● Strong problem-solving and analytical skills with attention to detail.
● Creative and critical thinking.
● Ability to work collaboratively in a team environment.
● Good and effective communication skills.
● Experience with version control systems, such as Git, for collaborative development.
● Ability to thrive in a fast-paced environment with rapidly changing priorities.
● Comfortable with autonomy and ability to work independently.

Additional Information

Perks:

● Day off on the 3rd Friday of every month (one long weekend each month)

● Monthly Wellness Reimbursement Program to promote health well-being

● Monthly Office Commutation Reimbursement Program

● Paid paternity and maternity leaves

Top Skills

AI
ETL
Git
Llm
Nlp Frameworks
Python
PyTorch
SQL
TensorFlow
Web Scraping
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Jersey City, New Jersey
563 Employees
On-site Workplace
Year Founded: 1917

What We Do

Forbes Advisor is a global platform dedicated to helping consumers make the best financial choices for their individual lives.

We support your pursuit of success by making smart financial decisions simple, to help you get back to doing the things you care about most.

We do this by helping turn your aspirations into reality. By arming you with trusted advice and guidance, you can make informed financial decisions you feel confident in and achieve your financial goals.

Visit Forbes Advisor for unbiased personal finance advice, news and reviews, plus a comparison marketplace that helps you find the financial products that best fit your life and goals.

Similar Jobs

Forbes Advisor Logo Forbes Advisor

Data Research Engineer - Data Extraction (India - Remote)

Insurance • Software • Energy • Financial Services
Remote
Mumbai, Maharashtra, IND
563 Employees

Capco Logo Capco

QA -ESG

Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
Remote
Hybrid
India
6000 Employees

MetLife Logo MetLife

Associate Consultant- Analytics

Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Remote
Hybrid
India
43000 Employees

Capco Logo Capco

Big Data Engineer - (Hadoop/Hive/python/Spark/Scala)

Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
Remote
Hybrid
Pune, Maharashtra, IND
6000 Employees

Similar Companies Hiring

True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees
Caliola Engineering Thumbnail
Software • Machine Learning • Hardware • Defense • Data Privacy • App development • Aerospace
Colorado Springs, CO
53 Employees
Red 6 Thumbnail
Virtual Reality • Software • Hardware • Defense • Aerospace
Orlando, Florida
113 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account