Principal MLOps Engineer

Posted 9 Days Ago
Hiring Remotely in US
Remote
140K-225K Annually
Senior level
Artificial Intelligence • Big Data • Cloud • Cybersecurity • Defense
The Role
The Principal MLOps Engineer will support the development of a real-time data platform for the DoD, involving deployment of ML infrastructure, building MLOps pipelines, and developing a full-lifecycle ML platform while processing vast amounts of data with low latency.
Summary Generated by Built In

This is a U.S.-based position. All of the programs we support require U.S. citizenship to be eligible for employment. All work must be conducted within the continental U.S.

Who we are:

Raft (https://TeamRaft.com) is a customer-obsessed, non-traditional small business with a purposeful focus on Distributed Data Systems, Platforms at Scale, and Complex Application Development, with headquarters in McLean, VA. Our clients include innovative federal and public agencies that leverage design thinking, a cutting-edge tech stack, and a cloud-native ecosystem. We build digital solutions that impact the lives of millions of Americans.

We’re looking for an experienced Principal MLOps Engineer to support our customers and join our passionate team of high-impact problem solvers.

About the role:

Raft is building a real-time data platform for the Department of Defense (DoD), aimed at enhancing operators' awareness of critical events such as the Chinese balloon and the Cessna-over-the-White-House incidents. Central to this platform is the aggregation of real-time data from over 750 sensors. This data is then enriched, made queryable, and presented as a common operational picture that helps operators make time-critical and pertinent decisions. The system processes over a billion events daily with millisecond-level latency. Key technologies include Kafka, Kafka Streams, Pinot, Java, Scala, and Kubernetes. In this role you will work hands-on with a team of accomplished individuals. Your primary responsibilities will be to deploy ML infrastructure, build MLOps pipelines, and contribute to the development of a full-lifecycle ML platform.

What we are looking for:

  • 7+ years of relevant hands-on experience
  • 5+ years of experience with Docker and Kubernetes, including provisioning production clusters and maintaining their compliance
  • 5+ years of experience supporting enterprise cloud applications or infrastructure (AWS, Azure, etc.)
  • Solid understanding of Helm Charts
  • Practical experience with Machine Learning on Kubernetes
  • Experience managing clusters with GPU machines
  • Experience building and maintaining machine learning platforms and pipelines
  • Practical programming and scripting skills (Python preferred)
  • Fast learner, analytical thinker, creative, hands-on, strong communication skills
  • Able to work both independently and as part of a team
  • Excellent problem-solving skills and attention to detail
  • Proven experience with modern software development and engineering practices including scrum/agile, Git, and DevOps
  • Ability to obtain a Security+ certification within the first 90 days of employment with Raft

Highly preferred:

  • Current or past security clearance
  • Experience using or managing Kubeflow deployments
  • Knowledge of Istio
  • Comfortable provisioning and debugging complex CI/CD pipelines
  • Prior experience with Terraform 

Clearance Requirements:

  • Ability to obtain and maintain a Top Secret clearance 

Work Type: 

  • Remote (Local to Tampa, FL is highly preferred)
  • May require up to 30% travel

Salary Range:

  • $140,000 - $225,000 
  • The determination of compensation is predicated upon a candidate's comprehensive experience, demonstrated skill, and proven abilities

What we will offer you:

  • Highly competitive salary
  • Fully covered healthcare, dental, and vision
  • 401(k) and company match
  • Take-as-you-need PTO + 11 paid holidays
  • Education & training benefits
  • Annual budget for your tech/gadget needs
  • Monthly box of yummy snacks to eat while doing meaningful work
  • Remote, hybrid, and flexible work options
  • Team off-site in fun places!
  • Generous Referral Bonuses
  • And More!

Our Vision Statement: 

We bridge the gap between humans and data through radical transparency and our obsession with the mission. 

Our Customer Obsession: 

We will approach every deliverable like it's a product. We will adopt a customer-obsessed mentality. As we grow, and our footprint becomes larger, teams and employees will treat each other not only as teammates but customers. We must live the customer-obsessed mindset, always. This will help us scale and it will translate to the interactions that our Rafters have with their clients and other product teams that they integrate with. Our culture will enable our success and set us apart from other companies.

How do we get there? 

Public-sector modernization is critical for us to live in a better world. We, at Raft, want to innovate and solve complex problems. And, if we are successful, our generation and the ones that follow us will live in a delightful, efficient, and accessible world where out-of-the-box thinking and collaboration are the norm.

Raft’s core philosophy is Ubuntu: I Am, Because We Are. We support our “nadi” by elevating the other Rafters. We work as a hyper-collaborative team where each team member brings a unique perspective, adding value that did not exist before. People make Raft special. We celebrate each other and our cognitive and cultural diversity. We are devoted to our practice of innovation and collaboration.

We’re an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.

Top Skills

Java
Python
Scala
The Company
HQ: Reston, VA
200 Employees
Hybrid Workplace
Year Founded: 2018

What We Do

Raft is a digital consulting firm with niche expertise in the rapid delivery of modern, user-first, scalable, and data-intensive digital solutions. We accelerate the missions of our federal partners through human-centered design (HCD) and agile development practices, bringing deep technical expertise in DevSecOps, Kubernetes management, cloud-native microservice architectures, and secure, open source delivery.

We put our people and our culture first. We work with some of the smartest, hardest-working experts in their field. We owe it to them to make sure our team is free of meaningless restrictions and internal politics so they can focus on what they love: finding innovative solutions.

We build on open source and resist vendor lock-in. That way you’re not paying us to reinvent the wheel, and our solutions play nice with others and are always adaptable.

We know digital innovation doesn’t always operate during bank hours, so neither do we. (It's also why we don’t dress like bankers.) We aren’t afraid to go the extra mile or put in the extra time to find a solution. We get it right the first time; we won’t waste our time (and your money) trying to fix something that’s fundamentally broken. It’s kind of like making an exceptional cup of coffee: if the beans are roasted wrong, it doesn’t matter how much pumpkin spice you dump in your cup, it’s still not going to taste great.

And above all else we’re up for a challenge; it’s in our DNA. Ask us about something that’s never been done, or better yet something that can’t be done, and we’ll find a way to do it. It’s just who we are.

If you need a partner that’s looking toward the future and not the past, don’t go traditional, go Raft.

Why Work With Us

Raft partners with public agencies to solve hard, complex problems that impact the lives of millions of Americans. We work hard to make it easy for humans to connect with data.
