Senior Performance Engineer, Automation Framework - Machine Learning and HPC

Advanced Micro Devices, Inc.

Job Description

What you do at AMD changes everything

At AMD, we push the boundaries of what is possible. We believe in changing the world for the better by driving innovation in high-performance computing, graphics, and visualization technologies – building blocks for gaming, immersive platforms, and the data center.

Developing great technology takes more than talent: it takes amazing people who understand collaboration and respect, and who will go the “extra mile” to achieve unthinkable results. It takes people who have the passion and desire to disrupt the status quo, push boundaries, deliver innovation, and change the world. If you have this type of passion, we invite you to take a look at the opportunities available and come join our team.

THE ROLE:

Our team ensures AMD-based systems and GPUs are operating at their best before they are deployed to solve the world’s most challenging problems. We are seeking software developers to design, build and maintain a world-class workload automation system for running large-scale, GPU-enabled data center applications. In this role, you will create a system to provision, run, monitor and analyze workloads used in supercomputing, academia, and the largest data centers on the planet. You will also design and develop web applications that enable users to sift through mountains of ML / HPC performance data and create insightful, graphically rich reports.

KEY RESPONSIBILITIES:

  • Build and maintain a workload automation system based on Ansible using an infrastructure-as-code model
  • Work with ML/HPC experts to help them automate their workloads so they can self-serve
  • Create database schemas and interfaces that enable the automation system to store workload performance results
  • Develop a custom web application that allows users to search, retrieve, display and report performance results
  • Build a reporting system that automates the process of creating informative tables and graphs for engineers, business units and their management

PREFERRED EXPERIENCE:

  • Experience with workload automation and management systems
  • Extensive Python and shell script experience
  • Experience with web application development frameworks such as Django
  • Experience with data visualization tools such as Tableau
  • Experience with SQL and NoSQL databases
  • Previous use of GitHub

Ways to Stand Out

  • Experience with Ansible
  • Experience with Django

LOCATION: The team is based in Austin, TX, but we are open to hiring at the following AMD sites: Austin, TX; Bellevue, WA; Orlando, FL; San Diego, CA; and Boxborough, MA.

Company Info.

Advanced Micro Devices, Inc.

Advanced Micro Devices, Inc. (AMD) is an American multinational semiconductor company based in Santa Clara, California, that develops computer processors and related technologies for business and consumer markets. While it initially manufactured its own processors, the company later outsourced its manufacturing, a practice known as going fabless, after GlobalFoundries was spun off in 2009. AMD's main products include microprocessors, motherboard

  • Industry
    Artificial intelligence, Video games, Semiconductors, Computer hardware
  • No. of Employees
    15,500
  • Location
    Santa Clara, CA, USA

Advanced Micro Devices, Inc. is currently hiring for Machine Learning Engineer jobs in Austin, TX, USA, with an average base salary of $160,000 - $240,000 per year.
