The Microsoft Applied Sciences Group incubates disruptive technologies for Microsoft’s next-gen hardware products and is working on several exciting projects that will shape how computers and other devices perceive the user and the user’s environment. Operating as a startup within the company, this team works closely with several research and product teams to bring compelling new experiences to the market. Many of these experiences will be powered by speech and computer vision – and as part of this team, you will have the unique opportunity to work on almost every aspect of a shipping audio and vision system: camera optics, sensors, data pipelines, and, of course, developing and implementing the algorithms that make magic happen!
We are looking for a Senior Researcher – Applied Science in the field of audio, speech, natural language, and/or computer vision with expertise in deep learning techniques to help our devices better understand the user and the environment. The ability to analyze multimodal sensor data and interpret various human and human-object interactions is key to Applied Sciences’ mission of enabling a seamless set of human-computer interactions. As part of this team, you will work with a growing team of talented researchers already dedicated to this mission and use data and hardware available only to a select few. Naturally, the opportunity for you to push the state of the art in this field is huge.
Qualifications:
Required/Minimum Qualifications
- Doctorate degree in Computer Science, Electrical Engineering, or related field
- OR Master’s degree and 3+ years of related experience.
- OR equivalent experience.
- 3+ years of experience with deep learning techniques (e.g. CNN, RNN, Transformer, reinforcement learning), AND with machine learning frameworks and tools (e.g. Python, TensorFlow, PyTorch, and/or ONNX).
- Publication in top-tier audio/speech/NLP/vision/ML conferences and journals (e.g. ICASSP, Interspeech, CVPR, ECCV, ICCV, ICML, ICLR).
Additional or Preferred Qualifications
- Doctorate degree in Computer Science, Electrical Engineering, or related field, AND 2+ years of related research experience
- OR Master’s degree and 5+ years of related experience.
- OR equivalent experience
- 5+ years of experience with deep learning techniques (e.g. CNN, RNN, Transformer, reinforcement learning), AND with machine learning frameworks and tools (e.g. Python, TensorFlow, PyTorch, and/or ONNX).
- Publication as lead author or essential contributor in top-tier audio/speech/NLP/vision/ML conferences and journals (e.g. ICASSP, Interspeech, CVPR, ECCV, ICCV, ICML, ICLR).
- Experience participating in a top conference in a relevant research domain
- Proficient knowledge of computer science and signal processing, and the ability to understand and implement complex algorithms.
- Proficiency with C/C++
Applied Sciences IC4 – The typical base pay range for this role across the U.S. is USD $112,000 – $218,400 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $145,800 – $238,600 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.
Responsibilities:
- Research and develop audio/speech/NLP/vision algorithms with Python and other relevant programming languages
- Train deep learning models in TensorFlow and PyTorch, including training data engineering
- Build pipelines to test algorithms and models and analyze the results
- Optimize algorithms and models for speed and accuracy on target hardware