Work somewhere with the creativity of a scaleup and expertise of an enterprise.
We are looking for a savvy AI Software Engineer to join our growing AI Research Team in Milan.
In this role, you will be responsible for designing, building, and maintaining robust data pipelines to support complex model training operations. You will work with diverse data sources spanning text, image, and audio modalities. Additionally, you’ll collaborate with cross-functional teams—including researchers, engineers, and legal advisors—to ensure all data is collected, transformed, and used in a compliant and efficient manner.
Should you be successful, you would:
- Design, develop, and optimize large-scale data pipeline architectures for AI model training.
- Manage data ingestion processes (e.g., web crawling, data extraction, search) to gather high-quality, diverse, and multimodal training datasets.
- Oversee end-to-end data flow and transformation, ensuring consistent data delivery for multiple ongoing projects.
- Collaborate with various research teams to incorporate the latest methods for enhancing pre-training datasets.
- Implement Infrastructure-as-Code (IaC) practices to streamline data processing and system deployment.
- Ensure all data collection and processing initiatives comply with legal and data privacy regulations.
- Build and maintain distributed systems capable of handling terabytes of data efficiently and securely.
- Continuously research and evaluate new techniques to improve dataset quality and pipeline performance.
What You Have
- At least 5 years’ proven experience in AI Software Engineer role or similar
- A degree in Computer Science, Informatics, Information Systems or similar
- Strong programming skills (e.g., Python, Java, or similar languages) and familiarity with distributed systems frameworks.
- Advanced working knowledge and experience with a variety of databases, such as SQL and NoSQL.
- Experience building and optimizing data pipelines and architectures with data processing technologies (e.g., Apache Spark, Hadoop, or similar) and cloud-based platforms.
- Experience with CI/CD tools and Infrastructure-as-Code solutions (Jenkins, Travis, Argo CD, Terraform, CloudFormation...)
- Experience with MLOps is a plus.
- Experience with HPC is a plus.
Who You Are
- An enthusiastic individual, excited by the prospect of optimizing or even contributing to designing our company’s data architecture
- An analytical thinker with a passion for data
- Proactive, comfortable working alone or as part of a team
- Super organized
- Accurate, with strong attention to detail
- Familiar with Agile development
- Fluent in English
Perks
- Learning Fridays. If our team members know more, so do we. That’s why we give everyone a training budget that they can spend on books, online courses or other training materials
- Smart Working. Trains can be a drag, so we let our team members work from home when they can
- Salary is based on experience and topped up with other bonuses.
About iGenius
iGenius is an Italian AI company that develops language models and AI solutions tailored for highly regulated industries, including financial services and government. With a mission to democratize business knowledge, iGenius is one of the top AI unicorns in Europe today. iGenius’ flagship product, Crystal, is a Decision Intelligence tool that analyzes data in natural language, making it accessible for anyone within a business organization. From Allianz to Intesa San Paolo, Crystal has become the AI solution of choice for a number of Fortune 500 companies. This led Gartner to recognize iGenius as a “Cool Vendor” in the AI Core Technologies category, as well as mention the company in its Market Guide for Conversational AI and include it in the 2024 Magic Quadrant for Analytics and BI platforms. In June 2024, iGenius released Italia, an open-source Foundational Large Language Model, developed in collaboration with Cineca, and trained entirely on high-quality native Italian sources. That same month, iGenius launched a new enterprise solution called Unicorn, to support companies in adopting AI safely and effectively, with AI solutions tailored to their needs.
Please review our Privacy Policy here
Top Skills
What We Do
iGenius is an Italian AI company that develops language models and AI solutions tailored for highly regulated industries, including financial services and government.
With a mission to democratize business knowledge, iGenius is one of the top AI unicorns in Europe today.
iGenius’ flagship product, Crystal, is a Decision Intelligence tool that analyzes data in natural language, making it accessible for anyone within a business organization. From Allianz to Intesa San Paolo, Crystal has become the AI solution of choice for a number of Fortune 500 companies. This led Gartner to recognize iGenius as a “Cool Vendor” in the AI Core Technologies category, as well as mention the company in its Market Guide for Conversational AI and include it in the 2024 Magic Quadrant for Analytics and BI platforms.
In June 2024, iGenius released Italia, an open-source Foundational Large Language Model, developed in collaboration with Cineca, and trained entirely on high-quality native Italian sources.
That same month, iGenius launched a new enterprise solution called Unicorn, to support companies in adopting AI safely and effectively, with AI solutions tailored to their needs.