Deep Learning Infrastructure Engineer, Autonomy Software

Tesla
Apply Now

Job Description

As a Software Engineer within the Autonomy group, you will work on reinforcing, optimizing, and scaling our neural network training & auto-labeling infrastructure both for Autopilot and the Humanoid robot.

At the core of our autonomy capabilities are multiple neural networks that the Deep Learning team is designing to train on very large amounts of data, across large-scale GPU clusters and soon our supercomputer Dojo. Robustly training networks at scale, should it be for production models or quick experiments, and completing them in the shortest amount of time possible, is critical to our mission.

Responsibilities

  • Write robust Python software code in our machine learning training repository while applying best software practices to support machine learning scientists in tasks such as fetching training data, preprocessing it, and orchestrating the training runs.
  • Integrate the training software into our continuous integration cluster to support metrics persistence across experiments, weekly/nightly neural network builds, and other unit / throughput tests.
  • Profile performance of training software in our training cluster, identify bottlenecks in and between CPU/GPU code execution, and work on optimizing its throughput and scalability within and across nodes to ultimately reduce convergence time.
  • Coordinate with the team managing the hardware cluster to maintain high availability / jobs throughput for Machine Learning.

Requirements

  • Practical experience programming in Python and/or C/C++.
  • Proficient in system-level software, in particular hardware-software interactions and resource utilization.
  • Understanding of modern machine learning concepts and state of the art deep learning.
  • Experience working with training frameworks, ideally PyTorch.
  • Demonstrated experience scaling neural network training jobs across clusters of GPU’s.
  • Optional: Experience programming in Cuda.
  • Optional: Profiling and optimizing CPU-GPU interactions (pipelining compute/transfers, etc).
  • Optional: Devops experience, in particular dealing with clusters of training nodes, and filesystems for very large amount of training data.

Compensation and Benefits

Benefits

Along with competitive pay, as a full-time Tesla employee, you are eligible for the following benefits at day 1 of hire:

  • Aetna PPO and HSA plans > 2 medical plan options with $0 payroll deduction
  • Family-building, fertility, adoption and surrogacy benefits
  • Dental (including orthodontic coverage) and vision plans, both have options with a $0 paycheck contribution
  • Company Paid (Health Savings Account) HSA Contribution when enrolled in the High Deductible Aetna medical plan with HSA
  • Healthcare and Dependent Care Flexible Spending Accounts (FSA)
  • LGBTQ+ care concierge services
  • 401(k) with employer match, Employee Stock Purchase Plans, and other financial benefits
  • Company paid Basic Life, AD&D, short-term and long-term disability insurance
  • Employee Assistance Program
  • Sick and Vacation time (Flex time for salary positions), and Paid Holidays
  • Back-up childcare and parenting support resources
  • Voluntary benefits to include: critical illness, hospital indemnity, accident insurance, theft & legal services, and pet insurance
  • Weight Loss and Tobacco Cessation Programs
  • Tesla Babies program
  • Commuter benefits
  • Employee discounts and perks program

Expected Compensation

$104,000 - $360,000/annual salary + cash and stock awards + benefits

Pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. The total compensation package for this position may also include other elements dependent on the position offered. Details of participation in these benefit plans will be provided if an employee receives an offer of employment.

Company Info.

Tesla

Tesla, Inc., headquartered in Austin, Texas, is an American multinational corporation specializing in electric vehicle design and manufacturing, as well as the production of stationary battery energy storage solutions spanning from residential to grid-scale applications. Additionally, Tesla offers a range of solar panels, solar shingles, and complementary products and services within the clean energy sector.

  • Industry
    Automotive,Energy,Autonomous technology
  • No. of Employees
    127,000
  • Location
    13101 Tesla Road, Austin, TX, USA
  • Website
  • Jobs Posted

Get Similar Jobs In Your Inbox

Tesla is currently hiring Deep Learning Infrastructure Engineer Jobs in Palo Alto, CA, USA with average base salary of $104,000 - $360,000 / Year.

Similar Jobs View More