Principal BMC Engineer

Posted 17 Hours Ago
Bristol, England
Senior level
Artificial Intelligence • Semiconductor
The Role
The Principal BMC Engineer will lead the design and implementation of the Baseboard Management Controller software stack for AI server systems, collaborate with hardware teams to influence design, work with customers to meet their needs, and ensure security compliance. They will also mentor team members and contribute to code development and documentation.
Summary Generated by Built In

About Graphcore

How often do you get the chance to build a technology that transforms the future of humanity?

Graphcore products have set the standard in made-for-AI compute hardware and software, gaining global attention and industry acclaim. Now we are developing the next generation of artificial intelligence compute with systems that will allow AI researchers to develop more advanced models, help scientists unlock exciting new discoveries, and power companies around the world as they put AI at the heart of their business.

Graphcore recently joined SoftBank Group – bringing large and ongoing investment from one of the world’s leading backers of innovative AI companies.

Job Summary

Reporting to the Firmware Manager, the Principal BMC Engineer is responsible for designing and supporting the delivery of the Baseboard Management Controller software stack for the next generation of AI server-class systems. They will work closely with our partners, hardware vendors and customers to help build best-in-class systems for large-scale data-centre environments. The Principal BMC Engineer is expected to demonstrate technical leadership and support any future growth of the team.

About the Firmware Team

The firmware team writes the software that ensures the complete boot-up of Graphcore hardware and silicon. We design and maintain the interfaces that allow our driver software to interact with Graphcore silicon. We present telemetry and monitoring data to the host system via the SMBus interface, and to data-centre operatives via the BMC.

Responsibilities and Duties

  • Leading the design and implementation of an OpenBMC solution for a server-class platform for Artificial Intelligence.
  • Working closely with hardware teams to influence hardware design and review HW architecture & schematics.
  • Working closely with customers and end users to align on expectations and deliverables for their deployment needs and environments.
  • Working with our partners and security team to ensure we meet product security goals.
  • Designing solutions for errors, stats & configuration appropriate to CPU, GPU, DIMM, SSDs, NICs, IB, PSU, BMC, FPGA, CPLD etc. for enterprise readiness of Server platforms.
  • Designing and developing performance-optimised active monitoring BMC solutions using DMTF standards, including the MCTP, Redfish, SPDM and PLDM specifications.
  • Developing and reviewing code, writing and reviewing design documents, reviewing QA test plans and working closely with all team members to achieve consensus for design and testability as per product requirements.
  • Assisting in the hiring of additional team members, acting as a mentor and promoting excellence across the team.

Candidate Profile

Essential:

  • Domain expertise in BMC Firmware development on x86 or ARM Platforms including BMC-BIOS communications, thermal management, power management, firmware update, device monitoring, firmware security, etc.
  • Excellent programming and scripting skills using C/C++, Rust, Python, Go and Bash, both for Linux user-space programs and system programs, with thorough code reviewing skills.
  • Strong in Linux fundamentals, various Linux distributions and packages, Linux upgrade mechanisms, building and deploying Linux images.
  • Out-of-band and in-band system management experience with exposure to standards including IPMI, KCS, DMTF standards (PLDM, MCTP, Redfish, etc.), PMBus, NVMe, etc.
  • Understanding of the REST architectural style, especially JSON over HTTPS with OAuth.
  • Expertise in system software and platform security for x86 or ARM based Rack/Blade server systems.
  • Excellent written and oral communication skills, a good work ethic, a strong sense of teamwork, a love of producing quality work and a commitment to finishing your tasks every single day. You are a self-starter who loves to find creative solutions to challenging problems.

Desirable:

  • Industry experience using Data Centre – Secure Control Modules (DC-SCMs) at either version 1 or version 2 of the OCP specification.
  • Board bring-up expertise with hands-on experience of device drivers such as I2C/I3C, SPI, PCIe, SMBus, mailbox, etc., as well as the device trees for U-Boot and the Linux kernel.
  • Ideally, contributions to industry standards and projects such as the Open Compute Project, OpenBMC, IPMI, DMTF, etc.

Benefits

In addition to a competitive salary, Graphcore offers flexible working, a generous annual leave policy, private medical insurance and health cash plan, a dental plan, pension (matched up to 5%), life assurance and income protection. We have a generous parental leave policy and an employee assistance programme (which includes health, mental wellbeing, and bereavement support). We offer a range of healthy food and snacks at our central Bristol office and have our own barista bar!

We welcome people of different backgrounds and experiences; we’re committed to building an inclusive work environment that makes Graphcore a great home for everyone. We offer an equal opportunity process and understand that there are visible and invisible differences in all of us. We can provide a flexible approach to interviews and encourage you to talk to us if you require any reasonable adjustments.

Top Skills

Bash
C
C++
Go
Python
Rust
The Company
HQ: Palo Alto, CA
389 Employees
On-site Workplace
Year Founded: 2016

What We Do

Graphcore has created a new processor, the Intelligence Processing Unit (IPU), specifically designed for artificial intelligence. The IPU’s unique architecture means developers can run current machine learning models orders of magnitude faster. More importantly, it lets AI researchers undertake entirely new types of work, not possible using current technologies, to drive the next great breakthroughs in general machine intelligence.

Our next generation 3D Wafer-on-Wafer Bow IPU systems are helping AI innovators worldwide to build better, more innovative AI solutions, whether their focus is on language and vision, exploring graph neural networks and LSTMs or creating something entirely new.

We believe our IPU technology will become the worldwide standard for artificial intelligence compute. The performance of Graphcore’s IPU is going to be transformative across all industries and sectors whether you are a medical researcher, roboticist or building autonomous cars.

Our team is at the forefront of the artificial intelligence revolution, enabling innovators from all industries and sectors to expand human potential with technology. What we do really makes a difference.

We're always interested in hearing from exceptional people to join our team.
