Microsoft’s DeepSpeed is an open-source library built on the PyTorch (machine learning framework) ecosystem that combines numerous research innovations and technology advancements to make deep learning efficient and easier to use. DeepSpeed can parallelize across thousands of GPUs (graphics processing unit) and train models with trillions of parameters. Our OSS (open source software) has powered many advanced models like MT-530B and BLOOM, and it supports unprecedented scale and speed for both training and inference.
Our past research has been incorporated into Microsoft’s platforms such as Bing and Azure Machine Learning (ML). Many of our systems artifacts have been open-sourced and have had a broad impact across industry and academia. Our research has had significant impact on the academic community with influential publications at top conferences such as NeurIPS, ICML, ICLR, SC, USENIX ATC, etc.
The DeepSpeed team is looking for Senior Researcher with passion for innovations and for building high-quality algorithms and systems that will make significant impact inside and outside of Microsoft. Our team is collaborative, innovative, and end-user obsessed. We are looking for researchers with algorithm and system backgrounds and passionate about driving innovations to improve the efficiency and effectiveness of deep learning algorithms and systems. We value creativity, agility, accountability, and a desire to learn new technologies.
Qualifications:
Required Qualifications:
- Doctorate in relevant field
- OR equivalent experience.
- 2+years of experience in Machine Learning, Computer Vision, Natural Language Processing, Multimodal or related fields.
Preferred Qualifications:
- PhD degree in Computer Science, a related field, or equivalent practical experience.
- Research publications/submissions for conferences, journals, or public repositories as a main contributor.
- Passionate about delivering research.
- Ability to communicate technical concepts and insights to non-technical audiences.
Research Sciences IC4 – The typical base pay range for this role across the U.S. is USD $112,000 – $218,400 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $145,800 – $238,600 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form .
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
#DeepSpeed #MachineLearning #Cloud #WebXTPlatform
Responsibilities:
- Make breakthrough innovations to advance deep learning systems and algorithms.
- Identify new and upcoming research areas by interacting with potential external and internal collaborators.
- Drive cutting-edge research prototypes and assist in preparation for production deployment.
- Discover/solve impactful technical problems, advance state-of-the-art technologies, and translate ideas into production.
- Develop and maintain a cutting-edge open-source project to advance deep learning at scale.