Technical Lead, Foundation Model Infrastructure, Machine Learning Platform

Waymo

Job Description

Join the Training Infrastructure group within ML Platform, and help us make training and evaluating ML models at Waymo easier, faster, and better! We develop and maintain a set of frameworks and tools on top of TensorFlow that address many of the pain points experienced by ML practitioners: training fast and at scale, discovering optimal hyperparameters, automatically retraining nets on a schedule, computing reliable and noiseless metrics on validation sets, and validating newly trained nets when deployed into the full onboard software stack. We work hand in hand with machine learning experts in all parts of the company and our collaborators across Alphabet.

We are looking for an individual contributor (IC) with a strong background in ML deployment toolchains, ML accelerator (GPU/TPU) profiling and application, and remote procedure call (RPC) service management and optimization. Non-exhaustive examples of our work include:

  • Build a comprehensive and user-friendly toolchain for scalable and flexible ML deployment workflows
  • Develop and improve our scalable and performant ML training library
  • Profile ML performance on accelerators (e.g. GPU and TPU) at both the model level and the system level, and identify performance bottlenecks and optimization opportunities

At a minimum, we’d like you to have:

  • BS in Computer Science, Math, or equivalent real-world experience
  • Solid Python or C++ skills
  • A passion for infrastructure work: building libraries, tools, and pipelines for machine learning practitioners
  • Experience with TensorFlow and Keras, e.g. distributed training and distribution strategies
  • Experience contributing to the architecture and design (design patterns, reliability, and scaling) of new and existing systems

It’s preferred if you have:

  • MS or PhD in Computer Science, Math, or equivalent real-world experience
  • Practical expertise in training models on TPUs
  • Experience building and architecting large-scale, production-quality backend systems, especially in applied machine learning or data pipelines
  • Knowledge of and experience with machine learning algorithms
  • Experience with multi-threaded and stream-based programming models
  • Familiarity with RPC services
  • Experience with high performance computing or data mining

The expected base salary range for this full-time position across US locations is listed below. Actual starting pay will be based on job-related factors, including exact work location, experience, relevant training and education, and skill level. During the hiring process, your recruiter can share more about the specific salary range for the role location or, if the role can be performed remotely, the specific salary range for your preferred location.

Waymo employees are also eligible to participate in Waymo’s discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements. 

Salary Range

$270,000—$334,000 USD

While at Waymo, you will enjoy benefits that cover…

Health and wellness: Our people are at the heart of everything we do. At Waymo, you can enjoy top-notch medical, dental and vision insurance, mental wellness support, a Flexible Spending Account (FSA), a Health Savings Account (HSA), on-site physicians and/or nurses in some locations, and special wellness programs.

Financial wellness: Your financial peace of mind is important to us. At Waymo, we offer competitive compensation, bonus opportunities, equity, a generous 401(k) plan or regional retirement plans, 1-on-1 financial coaching, a 529 College Savings Plan and lots of other perks and employee discounts. 

Company Info.

Waymo

Waymo LLC, formerly the Google Self-Driving Car Project, is an American autonomous driving technology company. It is headquartered in Mountain View, California, and is a subsidiary of Alphabet Inc., Google's parent company.

  • Industry
    Robotics company, Autonomous technology
  • No. of Employees
    2,301
  • Location
    Mountain View, CA, USA

Waymo is currently hiring Technical Lead, Machine Learning Jobs in San Francisco, CA, USA with average base salary of $270,000 - $334,000 / Year.