Principal BMC Engineer

Posted 8 Days Ago
Be an Early Applicant
Gdańsk, Województwo pomorskie
Senior level
Artificial Intelligence • Semiconductor
The Role
The Principal BMC Engineer designs and supports the Baseboard Management Controller software stack for AI server systems, collaborating with hardware teams and customers to ensure system performance and security. Responsibilities include implementing OpenBMC solutions, developing monitoring solutions, and mentoring team members.
Summary Generated by Built In

About Graphcore

How often do you get the chance to build a technology that transforms the future of humanity?

Graphcore products have set the standard in made-for-AI compute hardware and software, gaining global attention and industry acclaim. Now we are developing the next generation of artificial intelligence compute with systems that will allow AI researchers to develop more advanced models, help scientists unlock exciting new discoveries, and power companies around the world as they put AI at the heart of their business.

Graphcore recently joined SoftBank Group – bringing large and ongoing investment from one of the world’s leading backers of innovative AI companies.

Job Summary

Reporting to the Firmware Manager, the Principal BMC Engineer is responsible for designing and supporting the delivery of the Baseboard Management Controller software stack for the next generation of AI server class systems. They will work in close conjunction with our partners, hardware vendors and our customers to help build best-in-class systems for large scale data-centre environments. The Principal BMC engineer is expected to demonstrate technical leadership and support any future growth of the team.

About the Firmware Team

The firmware team writes the software that ensures the full and complete bootup of Graphcore Hardware and Silicon. We design and maintain interfaces to allow our Drivers software to interact with Graphcore Silicon. We present telemetry and monitoring data for use by the host system via the SMBus interface, and by data-centre operatives via the BMC.

Responsibilities and Duties

  • Leading in the design and implementation of an OpenBMC solution for a server-class platform for Artificial Intelligence
  • Working closely with hardware teams to influence hardware design and review HW architecture & schematics.
  • Working closely with customers and end users to align on expectations and deliverables for their deployment needs and environments
  • Working with our partners and security team to ensure we meet product security goals.
  • Designing solutions for errors, stats & configuration appropriate to CPU, GPU, DIMM, SSDs, NICs, IB, PSU, BMC, FPGA, CPLD etc. for enterprise readiness of Server platforms.
  • Designing and developing performance optimised active monitoring BMC solutions using DMTF Standards including MCTP, Redfish, SPDM and PLDM specifications
  • Developing and reviewing code, writing and reviewing design documents, reviewing QA test plans and working closely with all team members to achieve consensus for design and testability as per product requirements.
  • Assisting in the hiring of additional team members, act as a mentor, promote excellence across the team.

Candidate Profile

Essential:

  • Domain expertise in BMC Firmware development on x86 or ARM Platforms including BMC-BIOS communications, thermal management, power management, firmware update, device monitoring, firmware security, etc.
  • Excellent programming and scripting skills using C/C++, Rust, Python, Go, Bash. both for Linux user-space programs and system programs with thorough code reviewing skills.
  • Strong in Linux fundamentals, various Linux distributions and packages, Linux upgrade mechanisms, building and deploying Linux images.
  • Out-of-band and In-band System Management experience with exposure to standards including IPMI, KCS, DMTF Standards (PLDM, MCTP, Redfish, etc), PMBus, NVMe, etc.
  • Understanding of REST architecture style especially JSON over HTTPs with OAuth
  • Expertise in system software and platform security for x86 or ARM based Rack/Blade server systems.
  • Possess excellent written and oral communication skills, good work ethics, high sense of team-work, love to produce quality work and commitment to finish your tasks every single day. You are a self-starter who loves to find creative solutions to challenging problems

Desirable

  • Industry experience with using Data Centre – Secure Control Modules (DC-SCMs) at either version 1 or version 2 of the OCP specification.
  • Board Bring-up expertise with hands-on experience in Device drivers like I2C/I3C, SPI, PCIe, SMBus, Mail-box etc. as well as the device trees for uboot and Linux kernel.
  • Ideally made some contributions to industry standards like the Open Compute Project, OpenBMC, IPMI, DMTF etc.

Benefits

In addition to a competitive salary, Graphcore offers flexible working, annual leave policy, medical and dental health plans, a gym card, medical assessments and employee pension (matched up to 4%). We also have an employee assistance programme (which includes health, mental wellbeing, and bereavement support). We review our benefits on a yearly basis to ensure we offer a valuable and rewarding benefits programme to our employees. We welcome people of different backgrounds and experiences; we’re committed to building an inclusive work environment that makes Graphcore a great home for everyone. We offer an equal opportunity process and understand that there are visible and invisible differences in all of us. We can provide a flexible approach to interview and encourage you to chat to us if you require any reasonable adjustments.

Top Skills

Bash
C
C++
Go
Python
Rust
The Company
HQ: Palo Alto, CA
389 Employees
On-site Workplace
Year Founded: 2016

What We Do

Graphcore has created a new processor, the Intelligence Processing Unit (IPU), specifically designed for artificial intelligence. The IPU’s unique architecture means developers can run current machine learning models orders of magnitude faster. More importantly, it lets AI researchers undertake entirely new types of work, not possible using current technologies, to drive the next great breakthroughs in general machine intelligence.

Our next generation 3D Wafer-on-Wafer Bow IPU systems are helping AI innovators worldwide to build better, more innovative AI solutions, whether their focus is on language and vision, exploring graph neural networks and LSTMs or creating something entirely new.

We believe our IPU technology will become the worldwide standard for artificial intelligence compute. The performance of Graphcore’s IPU is going to be transformative across all industries and sectors whether you are a medical researcher, roboticist or building autonomous cars.

Our team is at the forefront of the artificial intelligence revolution, enabling innovators from all industries and sectors to expand human potential with technology. What we do, really makes a difference.

We're always interested in hearing from exceptional people to join our team.

Similar Jobs

Arrow Electronics, Inc. Logo Arrow Electronics, Inc.

Junior Engineering Operations Specialist

Cloud • Enterprise Web • Hardware • Information Technology • Internet of Things • Robotics • Semiconductor
Gdańsk, Województwo pomorskie, POL
22000 Employees

Acoustic Logo Acoustic

Senior Software Engineer (.NET)

Marketing Tech • Software
Gdańsk, Województwo pomorskie, POL
650 Employees

Acoustic Logo Acoustic

Senior Software Engineer (Data)

Marketing Tech • Software
Gdańsk, Województwo pomorskie, POL
650 Employees
Gdańsk, Województwo pomorskie, POL
1000 Employees

Similar Companies Hiring

Eastwall Thumbnail
Software • Information Technology • Consulting • Cloud • Big Data Analytics • Artificial Intelligence • App development
Denver, CO
20 Employees
Smartcat Thumbnail
Natural Language Processing • Machine Learning • Conversational AI • Artificial Intelligence
Boston, Massachusetts
242 Employees
RunPod Thumbnail
Software • Infrastructure as a Service (IaaS) • Cloud • Artificial Intelligence
Charlotte, North Carolina
53 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account