Research Intern - Model Distillation

Microsoft
Apply Now

Job Description

Overview

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment.

We're looking for a research intern to work on distillation of large language models (LLMs) -- i.e., training smaller and more efficient LLMs from larger models without serious drops in performance. We're looking for distilled models for some of the applications we are building on the Special Projects team in Microsoft Research (MSR). You would be applying cutting-edge distillation methods such as the approach used to train Phi in the “Textbooks Are All You Need” paper. The standard approach to distillation encourages the distilled model to emulate the hidden states of the larger teacher model. In this internship, we're looking to augment that standard approach with methods that align more structured domain knowledge that we might see in a knowledge graph, simulator/process model, or some other structured representation of knowledge.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Qualifications

Required Qualifications

  • Currently enrolled in a PhD program,
  • OR a research master’s degree program with the intention of completing a PhD, in Computer Science, Artificial Intelligence, or a related field.
  • At least 1 year in programming languages used in AI research, such as Python, and familiarity with ML frameworks (e.g., TensorFlow, PyTorch).

Other Requirements

  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship.
  • In addition to the qualifications above, you’ll need to submit a minimum of two reference letters for this position. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates. You may wish to alert your letter writers in advance, so they will be ready to submit your letter.

Preferred Qualifications

  • Background in machine learning, particularly in natural language processing and model distillation.
  • Experience with techniques for distilling large language models.
  • Knowledge of structured domain knowledge integration into AI models.
  • Proficient analytical, problem-solving, and research skills.
  • Ability to work collaboratively in a fast-paced research environment.

The base pay range for this internship is - Applied Sciences IC2 : USD $5,090 - $10,120 per month. There is a different range applicable to specific work locations, with the San Francisco Bay area and New York City Metropolitan area, and the base pay range for this role in those locations is USD $6,690 -$11,030 per month.

The base pay range for this internship is- Applied Sciences IC3 : USD $6,290 - $12,170 per month. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $8,060 - $13, 240 per month.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay

Responsibilities

Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community. Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer.

  • Implement and refine distillation techniques for LLMs
  • Explore and integrate structured representations of domain knowledge (e.g., from knowledge graphs, entity-relational graphs, markup-languages, causal models, simulators, process models) into the distillation process.
  • Collaborate with a multidisciplinary team of researchers and engineers to apply distilled models to practical applications.
  • Analyze performance metrics to ensure minimal loss of effectiveness compared to larger models.
  • Document and present research findings

Company Info.

Microsoft

Microsoft Corporation is an American multinational technology company with headquarters in Redmond, Washington. It develops, manufactures, licenses, supports, and sells computer software, consumer electronics, personal computers, and related services. It is one of the Big Five American information technology companies, alongside Google, Amazon, Apple, and Meta.

  • Industry
    Information Technology,Computer software,Consumer electronics
  • No. of Employees
    223,000
  • Location
    Redmond, WA, USA
  • Website
  • Jobs Posted

Get Similar Jobs In Your Inbox

Microsoft is currently hiring Research Internship Jobs in Redmond, WA, USA with average base salary of $5,090 - $10,120 / Month.

Similar Jobs View More