Comet is building the development platform for teams who want to ship robust, reliable, and responsible AI applications. Opik, our open source LLM evaluation framework, has quickly become one of the most popular tools in the space. Our experiment management platform is used by data scientists at companies like Uber, Netflix, and Etsy. Tens of thousands of researchers, engineers, academics, and hobbyists use Comet every day to build the future of AI.
Working at Comet will give you access to the most exciting work being done in all areas of machine learning. Some of the top researchers and companies working on self-driving cars, drug discovery, particle research, diffusion models, and LLMs use Comet every day. Your work has the potential to accelerate the development of some of the most impactful technology in the world, and you will be doing it alongside a team of passionate, caring individuals. If that sounds exciting, Comet is the right place for you.
Comet is backed by more than $63 million in venture capital funding and powers some of the best machine-learning teams in the world, including Netflix, Uber, Etsy, and Mobileye. We are a remote-first company with offices in New York City (USA) and Tel Aviv (Israel).
About the Position:
Comet is seeking a highly skilled and experienced Senior Engineering Manager - Backend to lead our backend engineering team. In this critical leadership role, you will drive the development of robust, scalable, and high-performance backend systems that power Comet’s platform. You will work closely with cross-functional teams to ensure that our platform continues to meet the evolving needs of our customers, enabling them to optimize their machine learning models effectively.
You are:
- A seasoned manager, with proven track record of managing high performing BE or FS teams
- A squad leader, who has proven experience managing projects under strict deadlines and in complex systems
- Proficient in agile methodologies and able to lead scrum teams and drive execution through collaboration
- Have a strong product understanding and data science related experience (or will be able to learn the domain quickly)
Responsibilities:
Technical Strategy & Execution:
- Architect and oversee the development of scalable, secure, and resilient backend systems that support the complex workflows of machine learning models
- Ensure the reliability and performance of backend services through robust monitoring, logging, and incident response processes
- Leverage cloud platforms (e.g., AWS, GCP, Azure) to build and maintain infrastructure that supports the scaling needs of our platform
- Drive the adoption of modern software engineering practices, including microservices architecture, Agile development, and continuous integration/continuous deployment (CI/CD).
Collaboration & Project Management:
- Work closely with the Frontend and Devops teams to ensure the backend infrastructure supports efficient model training, versioning, deployment, and monitoring
- Manage project timelines, set priorities, and allocate resources effectively to ensure timely delivery of features and enhancements
- Identify and mitigate risks associated with backend development, ensuring that technical debt is managed and reduced over time
Leadership & Vision:
- Lead, mentor, and inspire a team of backend engineers to deliver innovative and high-quality software solutions
- Develop and execute a comprehensive backend development strategy that aligns with Comet’s product roadmap and business goals
- Collaborate with product managers, data scientists, and frontend teams to design and implement backend services that support seamless integration with various ML tools and frameworks
- Promote a culture of excellence in software engineering, encouraging best practices in code quality, performance optimization, and security
Team Building & Development:
- Recruit, hire, and retain top-tier backend engineering talent
- Provide regular feedback, conduct performance reviews, and help team members grow in their careers
- Foster a collaborative, inclusive, and innovative work environment where team members feel empowered to contribute and take ownership of their work
Requirements:
- 8+ years of hands-on BE Java/C# development
- 5+ years of management experience of teams with 4+ developers
- 2+ years of remote team management
- Proven experience in building and scaling backend systems for Enterprise, preferably in the MLOps or Data Science space
- Strong full-stack capabilities
- Strong project management skills, proficient in Agile, Lean, and scrum methodologies
- Demonstrated ability to mentor, coach, and lead high-performing engineering teams
- Excellent communication and collaboration skills, with the ability to work effectively with cross-functional teams
- A passion for continuous learning and staying up-to-date with industry trends and emerging technologies
- Bachelor’s or Master’s degree in Computer Science, Software Engineering, or a related field
Why Join Comet?
- Play a key role in shaping the future of machine learning operations
- Work with a talented and passionate team on cutting-edge technology
- Competitive salary, benefits, and opportunities for career growth
- A dynamic and inclusive work environment that values innovation and creativity
Comet is an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees without regard to race, religion, color, sex, gender identity, gender expression, sexual orientation, national origin, ancestry, citizenship status, uniform service member status, marital status, pregnancy, age, medical condition, physical or mental disability, genetic information/characteristics, and any other characteristic protected by State or Federal law.
Top Skills
What We Do
Comet is a meta machine learning platform designed to help AI practitioners and teams build reliable machine learning models for real-world applications by streamlining and connecting the machine learning model lifecycle. By leveraging Comet, users can employ machine learning experiment tracking to track, compare, explain and reproduce their models. Backed by thousands of users and multiple Fortune 100 companies, Comet provides insights and data to build better, more accurate AI models while improving productivity, collaboration and visibility across teams.