AI Cloud Platform Software Engineer - New Grad

Posted 2 Days Ago
Toronto, ON
Entry level
Artificial Intelligence
The Role
As a software engineer on the AI cloud platform, you will optimize deployment for low latency and efficient load balancing while ensuring the reliability and scalability of the distributed infrastructure. You'll develop tools to identify system bottlenecks and enhance API server management.
Summary Generated by Built In

Cerebras has developed a radically new chip and system to dramatically accelerate deep learning applications. Our system runs training and inference workloads orders of magnitude faster than contemporary machines, fundamentally changing the way ML researchers work and pursue AI innovation.

We are innovating at every level of the stack – from chip, to microcode, to power delivery and cooling, to new algorithms and network architectures at the cutting edge of ML research. Our fully-integrated system delivers unprecedented performance because it is built from the ground up for deep learning workloads.

Cerebras is building a team of exceptional people to work together on big problems. Join us!

About the Role

As a software engineer, you will work on our cloud platform for AI model training and inference. In this role, you will be responsible for optimizing deployment for minimal latency and efficient load balancing, and for ensuring high reliability, scalability, security, and observability of our distributed training and inference infrastructure. You will define and document production requirements, implement scaling strategies, and ensure robust API server management and high availability.

You will develop tools to map out system bottlenecks and sources of instability, and design and build solutions to address them. 

We're looking for talented software engineers who thrive in ambiguity, view change as an opportunity, and have a voracious desire both to learn and to share knowledge clearly and concisely.

Requirements

  • Enrolled in the University of Toronto's PEY program, pursuing a degree in Computer Science, Computer Engineering, or a related discipline.
  • Exposure to or experience with cloud services, including web consoles and cloud-based APIs.
  • Exposure to or experience in developing or building ML serving and/or training services, preferably for modern generative AI models.
  • Familiarity with the latest AI model architectures and efficient implementation of serving systems for these models.
  • Strong problem-solving skills and excellent communication skills.
  • A real passion for AI!

Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.


The Company
HQ: Sunnyvale, CA
402 Employees
On-site Workplace
Year Founded: 2016

What We Do

Cerebras Systems is a team of pioneering computer architects, computer scientists, deep learning researchers, functional business experts and engineers of all types. We have come together to build a new class of computer to accelerate artificial intelligence work by three orders of magnitude beyond the current state of the art.

The CS-2 is the fastest AI computer in existence. It contains a collection of industry firsts, including the Cerebras Wafer Scale Engine (WSE-2). The WSE-2 is the largest chip ever built. It contains 2.6 trillion transistors and covers more than 46,225 square millimeters of silicon. The largest graphics processor on the market has 54 billion transistors and covers 815 square millimeters. In artificial intelligence work, large chips process information more quickly, producing answers in less time. As a result, neural networks that in the past took months to train can now train in minutes on the Cerebras CS-2 powered by the WSE-2.

Join us: https://cerebras.net/careers/
