Senior Deep Learning Data Scientist, Speech

Job Description

NVIDIA is looking for Speech Data Scientists to develop Riva, a high-impact, high-visibility Speech AI product, and improve the experience of millions of customers. If you're creative and passionate about solving real-world conversational AI problems, come join our Riva product engineering team. For more details on Riva check

What you’ll be doing:

  • Train speech recognition acoustic, language, and punctuation models.
  • Measure and benchmark model performance.
  • Maintain the ASR model evaluation system.
  • Analyze model accuracy and bias, and recommend next steps and improvements.
  • Improve processes for speech data processing, augmentation, filtering, and ASR training set preparation.
  • Gather know-how on speech datasets for training and evaluation.
  • Characterize performance and quality metrics across platforms for various speech AI components.
  • Collaborate with various teams on new product features and improvements of existing products.
  • Participate in developing and reviewing code, design documents, use case reviews, and test plan reviews.
  • Help innovate, identify problems, recommend solutions and perform triage in a collaborative team environment.

What we need to see:

  • Bachelor's, Master's, or PhD degree (or equivalent experience) in Computer Science, Electrical Engineering, Artificial Intelligence, or Applied Math, with 2+ years of experience.
  • Native or near-native fluency in a non-English language: Spanish / Mandarin / German / Japanese / Russian / French / UK English / Arabic / Hindi / Korean / Italian / Portuguese.
  • Excellent programming skills in Python, as well as strong fundamentals in programming, optimization, and software design.
  • Strong knowledge of ML/DL techniques, algorithms, and tools, with exposure to CNNs, RNNs (LSTMs), and Transformers.
  • Strong knowledge of RNN-T and CTC decoders, and know-how in applying deep learning to speech and NLP.
  • Hands-on experience with speech technologies such as automatic speech recognition, speech command detection, text-to-speech, speaker recognition and identification, speaker diarization, noise robustness techniques, voice activity detection, and end-of-utterance detection.
  • Experience training acoustic models and using KenLM, OpenLM, and other tools to create language models.
  • Experience with the PyTorch deep learning framework.
  • Exposure to basic speech digital signal processing and feature extraction techniques such as FFT, MFCC, and Mel spectrogram.
  • General familiarity with version control and code review tools such as Git, Gerrit, and GitLab.

Ways to stand out from the crowd:

  • Strong C++ programming skills.
  • Familiarity with GPU-based technologies like CUDA, cuDNN, and TensorRT.
  • Background with Docker and Kubernetes.
  • Background with deploying machine learning models on data center, cloud, and embedded systems.

NVIDIA is committed to fostering a diverse work environment and is proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.

Company Info.


NVIDIA’s invention of the GPU sparked the PC gaming market. The company’s pioneering work in accelerated computing—a supercharged form of computing at the intersection of computer graphics, high performance computing and AI—is reshaping trillion-dollar industries, such as transportation, healthcare and manufacturing, and fueling the growth of many others.

  • Industry
    Cloud computing, Video games, Computer software, Semiconductors, Computer hardware, Consumer electronics, Artificial intelligence
  • Location
    2701 San Tomas Expressway, Santa Clara, CA 95050, USA

NVIDIA is currently hiring for Data Scientist, Deep Learning jobs in Hyderabad, Telangana, India, with an average base salary of ₹90,000 - ₹250,000 per month.