Work somewhere with the creativity of a scaleup and the expertise of an enterprise.
We are looking for a savvy AI Software Engineer to join our growing AI Research Team in Milan.
In this role, you will be responsible for designing, building, and maintaining robust data pipelines to support complex model training operations. You will work with diverse data sources spanning text, image, and audio modalities. Additionally, you’ll collaborate with cross-functional teams—including researchers, engineers, and legal advisors—to ensure all data is collected, transformed, and used in a compliant and efficient manner.
Should you be successful, you would:
- Design, develop, and optimize large-scale data pipeline architectures for AI model training.
- Manage data ingestion processes (e.g., web crawling, data extraction, search) to gather high-quality, diverse, and multimodal training datasets.
- Oversee end-to-end data flow and transformation, ensuring consistent data delivery for multiple ongoing projects.
- Collaborate with various research teams to incorporate the latest methods for enhancing pre-training datasets.
- Implement Infrastructure-as-Code (IaC) practices to streamline data processing and system deployment.
- Ensure all data collection and processing initiatives comply with legal and data privacy regulations.
- Build and maintain distributed systems capable of handling terabytes of data efficiently and securely.
- Continuously research and evaluate new techniques to improve dataset quality and pipeline performance.
What You Have
- At least 5 years’ proven experience in an AI Software Engineer role or similar
- A degree in Computer Science, Informatics, Information Systems or similar
- Strong programming skills (e.g., Python, Java, or similar languages) and familiarity with distributed systems frameworks.
- Advanced working knowledge of, and experience with, both SQL and NoSQL databases.
- Experience building and optimizing data pipelines and architectures with data processing technologies (e.g., Apache Spark, Hadoop, or similar) and cloud-based platforms.
- Experience with CI/CD tools and Infrastructure-as-Code solutions (e.g., Jenkins, Travis, Argo CD, Terraform, CloudFormation).
- Experience with MLOps is a plus.
- Experience with HPC is a plus.
Who You Are
- An enthusiastic individual, excited by the prospect of optimizing, or even helping to design, our company’s data architecture
- An analytical thinker with a passion for data
- Proactive, comfortable working alone or as part of a team
- Super organized
- Accurate, with strong attention to detail
- Familiar with Agile development
- Fluent in English
Perks
- Learning Friday. If our team members know more, so do we. That’s why we give everyone a training budget that they can spend on books, online courses or other training materials.
- Smart Working. Trains can be a drag, so you can save commuting time by working from home.
- Salary is based on experience, and topped up with other bonuses.
We offer a competitive salary, as well as an opportunity to receive company equity. The typical salary for this role ranges between €50,000 and €80,000. As you gain experience and make more significant contributions to the business, your compensation will be reviewed to match your impact.
Additionally, depending on your seniority and your performance, you'll have the opportunity to receive stock options, with a variable value calculated from your base salary, giving you the chance to directly participate in the company’s success.
About iGenius
iGenius is a deep-tech company specialized in the development of Artificial Intelligence solutions for companies operating in highly regulated industries, including financial services, government, and heavy industry. iGenius’ main product, Unicorn, offers tailored solutions for companies looking to integrate AI safely and effectively, mainly through two proprietary Large Language Models (LLMs). Italia 10B is a multi-language model optimized for regulated sectors and high computational efficiency, while Colosseum 355B, built with latest-generation NVIDIA technology, is fit for mission-critical use cases. In addition to Unicorn, iGenius’ product offering includes Crystal, an AI agent for Decision Intelligence that analyzes business data in natural language and accurately supports strategic, insight-driven decision-making. In December 2024, iGenius joined forces with NVIDIA to build Colosseum, one of the largest AI supercomputers in the world, to support the deployment of its models with unrivaled speed, performance, and efficiency.
Active in both Europe and the United States, iGenius is one of the leading AI unicorns in the European landscape, and has attracted Fortune 500 companies, including Allianz and Intesa Sanpaolo. This led Gartner to recognize iGenius as a “Cool Vendor” in the AI Core Technologies category, as well as mention the company in its Market Guide for Conversational AI.
Please review our Privacy Policy.
What We Do
iGenius is an Italian AI company that develops language models and AI solutions tailored for highly regulated industries, including financial services and government.
With a mission to democratize business knowledge, iGenius is one of the top AI unicorns in Europe today.
iGenius’ flagship product, Crystal, is a Decision Intelligence tool that analyzes data in natural language, making it accessible to anyone within a business organization. From Allianz to Intesa Sanpaolo, Crystal has become the AI solution of choice for a number of Fortune 500 companies. This led Gartner to recognize iGenius as a “Cool Vendor” in the AI Core Technologies category, as well as mention the company in its Market Guide for Conversational AI and include it in the 2024 Magic Quadrant for Analytics and BI platforms.
In June 2024, iGenius released Italia, an open-source Foundational Large Language Model, developed in collaboration with Cineca, and trained entirely on high-quality native Italian sources.
That same month, iGenius launched a new enterprise solution called Unicorn, to support companies in adopting AI safely and effectively, with AI solutions tailored to their needs.