Every minute of every day, customers stake their entire business and reputation on the Microsoft Cloud. The Azure Customer Experience (CXP) team believes that when we meet our high standards for quality and reliability, our customers win. If we falter, our customers fail their end-customers. Our vision is to turn Microsoft Cloud customers into fans.
We are customer obsessed problem-solvers. We orchestrate deep engagements in areas like incident management, support and enablement. We analyze and amplify those customer voices, both within our own team, and across the Cloud + Artificial intelligence (AI) team, bringing the customer connection to the Quality vision for Azure. We innovate ways to scale what we learn across our customer base. Diversity and inclusion are central to who we are, how we work, and what we enable our customers to achieve. We know that empowering our customers starts with empowering our team to show up authentically, work in ways that are best for them, and achieve their career goals.
Azure Reliability’s vision is to make it easy for everyone to create, consume, and manage planetary-scale, reliable cloud production services and infrastructure to achieve more. Azure Reliability is dedicated to making “Azure the safest and most reliable Cloud”. We are an engineering organization working horizontally across Azure to systemically lift overall platform reliability and security to help our customer experience of Azure as a whole. As a team, we bring together significant and complementary capabilities and tooling across Security, Risk Management, Incident Lifecycle, and Site Reliability Engineering in new ways to increase Azure platform reliability.
In this role as a Software Engineer -Site Reliability, you will understand the distributed systems design, interactions between cloud technology layers and components, basic dependencies at scale, and the code that defines infrastructures. We support ongoing engagements with product engineering teams by participating in static analysis reviews, meetings, on-call rotations, and incident responses throughout product development and operations cycles. We focus on insights from analyses of telemetry data to propose potential improvements to for a specific reliability or security problem with guidance from other engineers.
We strive to improve reliability and security fundamentals via software engineering, preferring long-lasting platform improvements delivered as engineering projects over repetitive manual operations. We contribute to the product fundamentals and architecture, share knowledge, and code, and prefer reuse over re-invention, always looking for ways to make what we build useful to multiple teams and products.
This role provides an exciting opportunity to expand knowledge and skills and to work with a globally distributed system. We’re looking for individuals who are inclusive, self-motivated, and curious. If you are enthusiastic about Site Reliability Engineering and understanding and improving our customer experience Microsoft is a great place to grow in your career in a team-oriented environment.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Qualifications:
Required/Minimum Qualifications
- Bachelor’s Degree in Computer Science, or related technical discipline with proven experience coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
Other Requirements:
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
Additional or Preferred Qualifications
- Bachelor’s Degree in Computer Science or related technical field AND 1+ year(s) technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR Master’s Degree in Computer Science or related technical field with proven experience coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- 2+ years of experience using programming languages such as C#, Java, Python or Go.
- 2+ years experience with data pipelines and analytical skills.
- 2+ years of SRE or other dev/ops experience
Software Engineering IC2 – The typical base pay range for this role across the U.S. is USD $76,400 – $151,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $100,300 – $165,400 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form .
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
#AzCXP #AzRelJobs
Responsibilities:
The Software Engineer-Site Reliability key focus areas are:
- Improving our systems’ production posture via targeted observability and operability enhancements (telemetry, alerting, incident management, change management, making production changes safe).
- Building reusable automation to empower multiple teams to achieve their security and reliability goals.
- Must be able to participate in a global multi-location on-call rotation.
We are looking for an engineer passionate about the above areas who are also interested in:
- Providing technical leadership for engineers across multiple teams within Azure.
- Mentoring engineers on site reliability engineering principles, practices, and tools.