Senior Software Engineer – High Performance Computing

As a Senior Software Engineer on the High Performance Computing (HPC)/AI (Artifical Intelligence) team, you’ll have the opportunity to work on cutting-edge technology that powers our cloud AI supercomputers (Azure High Performance Computing documentation | Microsoft Learn). You will be working directly with GPU (Graphics Processing Unit) and other accelerators, delivering world-class performance and enabling breakthroughs in industry, research, and AI. Join us and help shape the future of high-performance computing!

You will play a critical role in delivering and maintaining the infrastructure for our cloud supercomputers and enabling the revolution of Artifical Intelligence. You will be responsible owning the delivery and burn-in of clusters into Azure independently, ensuring that the hardware is stable for customers to run their applications. This will involve working closely with hardware vendors and other teams to ensure that the clusters are properly configured and optimized for performance across CPU (Central Processing Unit), accelerators, and network infrastructure.

In addition, you will be responsible for automating the quality process and debugging issues as they arise, ensuring successful resolution. This will involve developing and maintaining tools and processes to automate testing and ensure that quality is built into every step of the development process. You will also work closely with other teams to diagnose and resolve issues, and to ensure that our customers have a seamless experience using our cloud supercomputers.

Your attention to detail will be critical in this role, as you will be responsible for ensuring that quality is always front and center. This will involve reviewing code and configurations, and working with other teams to identify and address any issues that arise. You will also be responsible for documenting processes and procedures, and for ensuring that our team is following best practices and standards for software development and deployment.

This opportunity will allow you to participate in a highly agile and fast paced environment of High Performance Computing, accelerate your career growth and be part of the Artifical Intelligence revolution.

Qualifications:
Required/Minimum Qualifications:

  • Bachelor’s Degree in Computer Science, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    • OR equivalent experience.

Other Requirements:

Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:

  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Additional or Preferred Qualifications:

  • Bachelor’s Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    • OR Master’s Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    • OR equivalent experience.
  • Experience with large scale and distributed system automation/execution frameworks preferred.

Software Engineering IC4 – The typical base pay range for this role across the U.S. is USD $112,000 – $218,400 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $145,800 – $238,600 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form .

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

Responsibilities:

  • Collaborates with appropriate stakeholders to determine user requirements for a scenario.
  • Drives identification of dependencies and the development of design documents for a product, application, service, or platform.
  • Creates, implements, optimizes, debugs, refactors, and reuses code to establish and improve performance and maintainability, effectiveness, and return on investment (ROI).
  • Leverages subject-matter expertise of product features and partners with appropriate stakeholders (e.g., project managers) to drive a workgroup’s project plans, release plans, and work items.
  • Acts as a Designated Responsible Individual (DRI) and guides other engineers by developing and following the playbook, working on call to monitor system/product/service for degradation, downtime, or interruptions, alerting stakeholders about status and initiates actions to restore system/product/service for simple and complex problems when appropriate.
  • Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale.
Job Category
Software Engineering
Job Type
Full Time/Permanent
Salary
USD 238,600.00 per year
Country
United States
City
Redmond
Career Level
unspecified
Company
Microsoft
JOB SOURCE
https://jobs.careers.microsoft.com/global/en/job/1560346/