Solutions Architect, Retrieval Augmented Generation

NVIDIA
Apply Now

Job Description

A successful candidate will be working with ground breaking LLM models that are fundamentally changing the way people use technology! You will be the first line of technical expertise between NVIDIA and our customers. Your duties will vary from working on proof-of-concept demonstrations, to driving relationships with key executives and managers in order to promote adoption of RAG pipelines streamline their deployment to production. Dynamically engaging with developers, scientific researchers, data scientists, IT managers and senior leaders is a significant part of the Solutions Architect role and will give you experience with a range of partners and technologies.

What You’ll Be Doing:

  • Work directly with key customers to understand their technology and provide the best solutions.
  • Develop and demonstrate solutions based on NVIDIA’s and open source LLM technology.
  • Perform in-depth analysis and optimization of RAG pipeline components to ensure the best performance on GPU systems.
  • Partner with Engineering, Product and Sales teams to develop, plan best suitable solutions for customers. Enable development and growth of product features through customer feedback and proof-of-concept evaluations
  • Build industry expertise and become a contributor in integrating NVIDIA technology into Enterprise Computing architectures.

What We Need to See:

  • MS/PhD or equivalent experience in Computer Science, Data Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering fields
  • Excellent verbal, written communication, and technical presentation skills in English
  • 6+ years' work or research experience with Python/ C++ / other software development
  • Academic and/or experience in fields related to machine learning, deep learning and/or data science.
  • Work experience deploying and maintaining AI based systems and knowledge of modern DevOps / MLOps tools and standards.
  • Understanding of key libraries used for LLM and RAG development: for NLP models development (e.g. NeMo, DeepSpeed, HuggingFace), for deployment (e.g. TensorRT-LLM, Triton Inference Server) for Information Retrieval (e.g. RAPIDS, Milvus, Pinecone, Elastic Search).
  • You are excited to work with multiple levels and teams across organizations (Engineering, Product, Sales and Marketing team) and Capable of working in a constantly evolving environment without losing focus.
  • Ability to multitask in a fast-paced environment and Driven with strong analytical and problem-solving skills.
  • Strong time-management and organization skills for coordinating multiple initiatives, priorities and implementations of new technology and products into very sophisticated projects
  • You are a self-starter with demeanor for growth, passion for continuous learning and sharing findings across the team

Ways to Stand Out from The Crowd:

  • Experience working with larger transformer-based architectures for NLP, CV, ASR or other.
  • Experience optimizing DNN architecture using tools such as TRT/TRT-LLM or model compression.
  • Understanding of AI/HPC systems: data center design, high speed interconnect InfiniBand, Cluster Storage and Scheduling related design and/or management experience.

Company Info.

NVIDIA

NVIDIA’s invention of the GPU sparked the PC gaming market. The company’s pioneering work in accelerated computing—a supercharged form of computing at the intersection of computer graphics, high performance computing and AI—is reshaping trillion-dollar industries, such as transportation, healthcare and manufacturing, and fueling the growth of many others.

  • Industry
    Cloud computing,Video games,Computer software,Semiconductors,Computer hardware,Consumer electronics,Artificial intelligence
  • No. of Employees
    22,473
  • Location
    2701 San Tomas Expressway, Santa Clara, CA 95050, USA
  • Website
  • Jobs Posted

Get Similar Jobs In Your Inbox

NVIDIA is currently hiring Solutions Architect, AI Jobs in Switzerland with average base salary of CHf100,000 - CHf140,000 / Year.

Similar Jobs View More