Engineering Farm Engineer

Posted 6 Days Ago
Bengaluru, Bengaluru Urban, Karnataka
Senior level
Artificial Intelligence • Hardware • Robotics • Software • Metaverse
The Role
The Engineering Farm Engineer is responsible for maintaining and architecting solutions for a large compute cluster, enhancing efficiency through automation and performance tuning. Key responsibilities include monitoring system health, resolving production issues, and engaging in the entire service lifecycle from design to operation, alongside mentoring junior engineers.
Summary Generated by Built In

For two decades, we have pioneered visual computing, the art and science of computer graphics. With our invention of the GPU - the engine of modern visual computing - the field has expanded to encompass video games, movie production, product design, medical diagnosis, and scientific research. Today, we stand at the beginning of the next era, the AI computing era, ignited by a new computing model, GPU deep learning. This new model - where deep neural networks are trained to recognize patterns from massive amounts of data - has proven deeply effective at solving some of the most complex problems in everyday life.

The Engineering Farm Engineer is responsible for architecting solutions around our large compute cluster so that it runs efficiently and offers a better experience for customers and for the engineers who support it. Much of our software engineering work focuses on eliminating manual work through automation, tuning performance, and improving the efficiency of production systems. Practices such as limiting time spent on reactive operational work, holding blameless postmortems, and proactively identifying potential outages drive the iterative improvement that is key to both product quality and interesting, dynamic day-to-day work. We promote self-direction to work on meaningful projects, while we also strive to build an environment that provides the support and mentorship needed to learn and grow.

What you will be doing:

  • Maintain server infrastructure and services once they are live by measuring and monitoring availability, latency, and overall system health.

  • Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity.

  • Work with subject-matter experts (SMEs) to deliver quality resolutions to customers' production issues.

  • Practice sustainable incident response and blameless postmortems.

  • Understand a vast, complex infrastructure and support it during on-call weeks.

  • Independently architect and design solutions with a software engineering approach, choosing appropriate and efficient algorithms and following a regular SDLC process that includes requirements gathering, software design, testing, deployment, and release.

  • Support large-scale server infrastructure with monitoring, logging, and alerting to meet promised uptime.

  • Engage in and improve the whole lifecycle of services—from inception and design through deployment, operation, and refinement.

  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity management, and launch reviews.

What we need to see:

  • BS degree in Computer Science or a related technical field involving coding (e.g., physics or mathematics) with 8+ years of experience, or equivalent practical experience.

  • Experience with software design, algorithms, and data structures.

  • Experience in one or more of the following: Python, Perl, Go, or Ruby, using an object-oriented approach.

  • Experience in mentoring junior engineers or leading a team.

  • Basic understanding of SQL and NoSQL data platforms, database queries, and data analysis.

  • Interest in crafting, analyzing, and fixing large-scale distributed systems.

  • Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.

  • Ability to debug and optimize code and automate routine tasks.

  • Ability to learn quickly and adapt to different platforms as per the needs of the project.

Ways to stand out from the crowd:

  • Demonstrated experience architecting and building scalable, maintainable tools following software best practices.

  • Demonstrated experience leading a project from inception to completion while making significant independent contributions.

  • Good hands-on experience with schedulers like LSF and SLURM.

  • Good understanding of Linux administration, or experience building automation around it.

  • Experience debugging infrastructure or UNIX-related issues.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

Go
Perl
Python
Ruby
The Company
HQ: Santa Clara, CA
21,960 Employees
On-site Workplace
Year Founded: 1993

What We Do

NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, NVIDIA is increasingly known as “the AI computing company.”

