We are looking for data acquisition expert with strong skills and knowledge of web scraping, web services, file transfers, relational and non-relational databases to join our Provider Relations for Data Acquisition team on the position of Web Scraping / Database Administrator. The team has a special focus on novel approaches to data acquisition and analytical functions that emphasize cross-team collaboration and communication, as well as domain expertise. If this sounds like an opportunity you are interested in, then we would love to talk to you!
About You – experience, education, skills, and accomplishments
-
Bachelor’s degree in Computer Science or equivalent experience
-
2 + years of experience with Puppeteer library and/or JavaScript
-
2+ years of experience with relational and non-relational databases.
-
2+ years of hands-on industry experience working with large data sets.
It would be great if you also have...
-
Experience with document processing (OCR, XML parsing etc.).
-
Experience with Python, Java, or any other object-oriented programming languages; as well as with Node.js, Selenium, Crawlee, Axios and/or Playwright
-
Excellent knowledge of HTTP protocols, Captcha solving techniques, proxy, and Tor networks; as well as outstanding knowledge of HTML, CSS, XML, JSON, CSV, and other textual formats; knowledge of AWS or GCP.
-
Knowledge of intellectual property data sources.
-
Proven record of communicating and working effectively with multi-disciplinary teams to reach resolution, share knowledge, and maintain strong relationships. Clear understanding of information retrieval.
What will you be doing in this role?
-
Design and develop a variety of tools and infrastructure to automate the extraction of publicly available and proprietary information (writing web scrapers, calling third party APIs, creating SQL queries, etc.)
-
Create tools and processes to download data, parse it for relevant content, and store it in existing data management systems.
-
Gather and process raw data at scale (including writing scripts, web scraping, calling APIs, write SQL queries, etc.)
-
Create documentation and draft requirements for production implementation teams.
-
Contribute to problem solving, collaboration, process improvement, and information sharing.
About the Team
The team is multi-disciplinary consisting of administrative stuff as well as analysts and data acquisition experts. The team is responsible for maintaining the entire vendor network administration and data acquisition to support the Provider Relations team for data acquisition in delivering a timely, complete, and accurate data for different Clarivate IPG products as well as the other teams within Clarivate.
Hours of Work
This is a permanent full-time position. The working hours would be standard EMEA working hours.
This is a hybrid position; you will be expected to work from our Belgrade office 3 days every other week.
#LI-Hybrid
We Offer:
-
Holidays: 25 days paid leave per annum
-
Private Health Insurance
-
Paid Lunch
-
Yearly Bonus
-
Yearly Merit Plan
-
My Learning Platform
-
Fit Pass
-
Life Insurance
-
Accident Insurance
-
Company bicycles for rent free of charge
Only shortlisted candidates will be contacted.
At Clarivate, we are committed to providing equal employment opportunities for all persons with respect to hiring, compensation, promotion, training, and other terms, conditions, and privileges of employment. We comply with applicable laws and regulations governing non-discrimination in all locations.
Top Skills
What We Do
Clarivate™ is a global leader in providing solutions to accelerate the lifecycle of innovation. Our bold mission is to help customers solve some of the world’s most complex problems by providing actionable information and insights that reduce the time from new ideas to life-changing inventions in the areas of science and intellectual property. We help customers discover, protect and commercialize their inventions using our trusted subscription and technology-based solutions coupled with deep domain expertise. For more information, please visit clarivate.com.