Software Engineering Manager - Triton Inference Server

NVIDIA
Apply Now

Job Description

We are looking for Software Engineering Manager to lead the development efforts for the Triton Inference Server team! Academic and commercial groups around the world are using GPUs to power a revolution in deep learning, enabling breakthroughs in problems from image classification to recommenders to large language models. We are a fast-paced, agile team building tools and software to make design and deployment of new deep learning models easier and accessible to more inference solution providers and data scientists.

In this role, you will manage an engineering team designing, developing, and optimizing software that streamlines AI inferencing. Ideal candidates will not only have experience leading an agile, system software engineering team, but also motivated to push the boundaries of what is possible with AI inferencing on both CPUs and GPUs. If this sounds exciting, we would love to hear from you!

What you'll be doing:

  • Lead, mentor, and grow the Triton engineering team and be responsible for planning and execution of projects as well as the quality and performance of the Triton Inference Server.
  • Work closely with Product and Program Management to establish feature roadmaps and coordinate project dependencies; load-balance asynchronous requests across available resources; and collaborating on all feature designs.
  • Engage with internal and external partners and costumers to understand their use cases and requirements.

What we need to see:

  • Masters or PhD or equivalent experience in Computer Science, computer architecture, or related field.
  • 8+ years of overall experience in developing customer facing software.
  • 3+ years of experience recruiting, training, and leading software engineering teams.
  • Strong fundamentals in building and deploying cloud services using HTTP REST, gRPC, protobuf, and related technologies.
  • Excellent C/C++ and Python programming and software design skills, including debugging, performance analysis, and test design. Emphasis on clean and SOLID object-oriented programming principles are a plus.
  • Experience running a large open source project - use of GitHub, bug tracking, branching and merging code, OSS licensing issues handling patches, etc.
  • Experience with agile software development practices is a requirement, including familiarity with tools such as JIRA and AHA.

Ways to stand out from the crowd:

  • Experience working in a globally distributed organization.
  • Experience with machine learning algorithms and frameworks. Especially experience frameworks such as TensorFlow, PyTorch, ONNX, TensorRT, OpenVino, and vLLM.
  • Good knowledge of CPU and/or GPU hardware architecture.
  • Knowledge of GPU programming such as OpenCL or CUDA

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most experienced and passionate people in the world working for us. Are you creative and autonomous? Do you love a challenge? If so, we want to hear from you. Come help us build the real-time, efficient computing platform driving our success in the multifaceted and quickly growing field Deep Learning and Artificial Intelligence!

The base salary range is 216,000 USD - 414,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Company Info.

NVIDIA

NVIDIA’s invention of the GPU sparked the PC gaming market. The company’s pioneering work in accelerated computing—a supercharged form of computing at the intersection of computer graphics, high performance computing and AI—is reshaping trillion-dollar industries, such as transportation, healthcare and manufacturing, and fueling the growth of many others.

  • Industry
    Cloud computing,Video games,Computer software,Semiconductors,Computer hardware,Consumer electronics,Artificial intelligence
  • No. of Employees
    22,473
  • Location
    2701 San Tomas Expressway, Santa Clara, CA 95050, USA
  • Website
  • Jobs Posted

Get Similar Jobs In Your Inbox

NVIDIA is currently hiring Software Engineering Manager Jobs in Hillsboro, OR, USA with average base salary of $216,000 - $414,000 / Year.

Similar Jobs View More