About Us
At Zyte, we're pioneering the next revolution in web data extraction, empowering businesses to focus on innovation. Companies often struggle with challenges like navigating dynamic web structures, countering anti-scraping measures, and ensuring data accuracy. We tackle these obstacles by providing best-in-class tools for seamless, reliable web data extraction. With over a decade of expertise, we’ve become the trusted solution for over 3,000 companies and a million developers, helping them achieve a competitive edge and drive sustainable growth.
We are a diverse and enthusiastic team of data nerds, tech aficionados, and hackers. At Zyte, you'll join an inclusive community where everyone belongs. We thrive on collaborating with our smart, passionate teammates and customers to solve complex problems in data science and engineering.
Join us and be part of the data revolution!
About The Job
As a Principal Reverse Engineer, you will be integral to the development of Zyte’s flagship product, Zyte API. You will collaborate closely with the product and engineering teams engineers to build and optimize a high-performance scraping service capable of downloading data at over 10k RPS. This role offers the opportunity to innovate with anti-detect browsers, advanced automation tooling, constantly pushing the limits of web scraping technology.
Roles & Responsibilities:
- Lead the research of new methods and technologies to enhance web scraping techniques, identify patterns in web data, and design data-driven approaches that utilize these patterns to optimize scraping efficiency and accuracy.
- Devise new scraping strategies and algorithms to handle various website structures and anti-scraping measures
- Detect and analyze changes in website structures, content delivery methods, and anti-scraping technologies to adapt and refine scraping tools.
- Provide strategic insights of modern web scraping challenges to our products, rapidly developing prototypes for overcoming anti-scraping techniques and ensuring data retrieval consistency.
- Collaborate with cross-functional software engineering teams in developing robust and scalable web scraping solutions. Contribute to Zyte’s anti-detect browser and browser automation tooling.
- Identify and assess emerging risks in web scraping, ensuring compliance with legal and ethical standards.
- Mentor engineers to plan and conduct advanced web scraping research, fostering a culture of innovation and continuous improvement.
Requirements:
- Embrace the inherent uncertainty of this role and enjoy the freedom to carve your own path.
- Possess a hacker mindset, able to rapidly develop proof of concepts to achieve results.
- Approach project prioritization logically and with a measurement-backed strategy, thriving in collaborative environments with like-minded colleagues.
- Have a solid understanding of techniques and tools for crawling, extracting, and processing data
- Demonstrate strong proficiency in JavaScript/Node.js, Python, and Java, coupled with a broad programming background and exceptional problem-solving abilities.
- Possess strong research, debugging, fuzzing, and reverse engineering skills, including static, dynamic, and concolic analysis.
- Have a deep understanding of operating systems and computer networking concepts.
- Adept with tools such as Wireshark, Burp Suite etc to intercept and debug network traffic.
- Understand browser engines, browser fingerprinting, and ad-blocker mechanisms.
Bonus points for:
- Experience with Decompilers, IDA Pro, Ghidra or Frida, Jadx, and Babel
- Experience with C/C++
- Core contributions to Mozilla or Chromium projects
By joining the Zyte team, you will:
- As a new Zytan, you will:
- Become part of a self-motivated, progressive, multi-cultural team.
- Have the freedom and flexibility to work from where you do your best work.
- Attend conferences and meet with team members from across the globe.
- Work with cutting-edge open source technologies and tools.
Top Skills
What We Do
At Zyte, we’re all about empowering data-driven organizations to ethically and accurately collect web data to power their business. With over 14 years experience and our early authorship and ongoing maintenance of Scrapy, we’ve shaped the web scraping industry from Day 1.
We help our clients…
- With easy-to-use ways to collect, format and deliver web data, quickly, dependably and at scale,
- Spend more time gleaning insights from highly accurate, business-critical data, and
- Spend less money on the total cost of ownership in web data extraction.
Zyte API abstracts away a historically disparate web data extraction tech stack into a single tool. Zyte API automates most anti-bot and proxy management, so developers can spend more time on strategy.
Zyte API is a full-stack solution that crawls, unblocks and extracts data in minutes with the power of AI. Developers skip the hassle of creating manual parsing code and extract public data at unlimited scale.
Zyte Data is an expert web data extraction team in your pocket. Our white glove service extracts any web data your business needs, regardless of project size and complexity. This includes a dedicated team and round-the-clock support.
Zyte’s legal team is our backbone and is made up of the leading minds in web data extraction compliance. They stay on top of the ever-changing and opaque laws that loom over the industry. They evaluate compliance risks and inform customers about best practices.
Zyte is certified by and a co-founder of the Ethical Web Data Collection Initiative (EWDCI) which recognizes web data providers operating with the highest level of ethical and legal standards.
Come work for us!
We encourage a flexible and diverse work environment, so we embraced the benefits of remote work from our very early beginnings. Our team includes over 200 employees in over 30 countries. All sharing the same drive, to do more with web data.