Senior Researcher – Large Language Models

The Microsoft Applied Sciences Group incubates disruptive technologies for Microsoft’s next-gen hardware products and is working on several exciting projects that will shape how computers and other devices perceive the user and the user’s environment. Operating as a startup within the company, this team works closely with several research and product teams to bring compelling new experiences to the market. A lot of these experiences will be powered by speech and computer vision – and as part of this team, you will have the unique opportunity to work on almost every aspect of a shipping audio and vision system: camera optics, sensors, data pipeline and of course, developing and implementing the algorithms that make magic happen!

We are seeking a S enior Researcher (Large Language Models) with a proficient coding and research skills to join our dynamic team. Our mission is to revolutionize the use of Large Language Models on edge and limited-resource devices such as laptops and phones. We aim to achieve this by focusing on model compression and capability expansion techniques, including compression, quantization, sparsification, and the extension of input and output scopes and modalities.

As a S enior Researcher (Large Language Models) you will play a crucial role in developing cutting-edge techniques to optimize the performance of large language models on edge devices and potentially data centers amplifying the impact of your contributions. You will work closely with a team of dedicated professionals and contribute to pushing the boundaries of what’s possible in both AI research and cutting-edge technology

Qualifications:
Required/Minimum Qualifications

  • Bachelor’s Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience (e.g., statistics predictive analytics, research)
    • OR Master’s Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research)
    • OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 1+ year(s) related experience (e.g., statistics, predictive analytics, research)
    • OR equivalent experience.
  • Proficient coding skills, proficiency in Python and familiarity with machine learning frameworks such as TensorFlow, PyTorch, or ONNX.
  • Demonstratable Proficiency with natur al language processing and language model architectures like GPT, BERT, and Transformer models.

Additional or Preferred Qualifications

  • Master’s Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 6+ years related experience (e.g., statistics, predictive analytics, research)
    • OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research)
    • OR equivalent experience.
  • 3+ years experience creating publications (e.g., patents, libraries, peer-reviewed academic papers).
  • Experience presenting at conferences or other events in the outside research/industry community as an invited speaker.
  • 3+ years experience conducting research as part of a research program (in academic or industry settings).
  • 1+ year(s) experience developing and deploying live production systems, as part of a product team.
  • 1+ year(s) experience developing and deploying products or systems at multiple points in the product cycle from ideation to shipping.

Applied Sciences IC4 – The typical base pay range for this role across the U.S. is USD $112,000 – $218,400 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $145,800 – $238,600 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form .

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

Responsibilities:

  • Develop and implement state-of-the-art techniques for model compression, and scope/modality expansion in Large language models.
  • Benchmark model performance across various dimensions – language coherency and cognition, speed, vRAM usage and utilization, power consumption, etc.
  • Stay abreast of the latest research in the field and contribute to the research community through publications.
Job Category
Data and Analytics
Job Type
Full Time/Permanent
Salary
USD 238,600.00 per year
Country
United States
City
Redmond
Career Level
unspecified
Company
Microsoft
JOB SOURCE
https://jobs.careers.microsoft.com/global/en/job/1597809/