Industry Job Opportunities
Position Type: Full-time
Locations: Greater Seattle Area
Qualifications: Hiring at all levels
Are you looking to be a key part of one of Microsoft’s fastest-growing businesses? Do you want to build a planet-scale AI system that powers the state-of-the-art bleeding-edge AI workloads? We have a number of positions open/opening up for you on Singularity!
Who We Are
We are the engineers on the Singularity team. We believe that building a planet-scale AI supercomputer from the ground up, one that addresses the fundamental pain points of data scientists and AI practitioners and takes AI to unprecedented scale, is the opportunity of a lifetime. If you share this dream, come join us!
What is Singularity?
Ultimately, democratization of AI is all about enabling data scientists to productively build, scale, experiment, and iterate on their models on top of a robust, performant, scalable, and cost-effective distributed infrastructure built for AI. Singularity is a globally distributed, multi-tenant service that provides robust, cost-effective, and competitive AI infrastructure (compute, networking, and storage) for AI training and inferencing. On the team, we are constantly seeking to apply the best ideas from AI, machine learning, distributed systems, databases, information retrieval, networking, and security.
Software Engineer – Singularity, Control Plane
As an engineer on the Singularity Control Plane team, you will be at the forefront of building a planet-scale, fully decentralized control plane, which is at the core of the AI supercomputer's global distribution infrastructure.
Software Engineer - Singularity, Distributed Scheduler
You will be working on a distributed scheduler for AI workloads that is aware of the workloads themselves, the capabilities of the diverse accelerator resources, and dynamic environments.
Software Engineer - Singularity, Storage/Distributed Cache
You will work on a co-located, co-partitioned AI cache layer that helps speed up training jobs and increase utilization of compute and hardware accelerator resources.
Software Engineer - Singularity, Distributed Training and Inferencing
You will be working on devising native support for diverse distributed execution strategies in Singularity: data parallelism, model parallelism, general pipelining (e.g., PipeDream, GPipe), and 2D/3D parallelism techniques.
Software Engineer - Inferencing
Do you want to work on building a planet-scale artificial intelligence (AI) system? The central Azure team is looking for truly exceptional software engineers to be part of a specialized startup team to build the next generation of cloud-based AI systems. Our work encompasses a wide array of hardware, compilers, distributed systems, operating systems, networking, and datacenter technologies. The platform and infrastructure part of the new AI systems team is looking for engineers who are passionate about solving problems in the performance, scalability, reliability, and efficiency of large-scale AI services, and who want to build and manage one of the largest AI services in the industry.
Software Engineer – Distributed Service
In this role, you will build the scheduling subsystem that is responsible for delivering on the SLAs for AI training and inferencing workloads. Specifically, you will be working on fault detection mechanisms, topology-aware scheduling algorithms, checkpoint/restore, and elasticity capabilities across hardware and software stacks.
Software Engineer - Singularity, Data Plane/Compute
You will shape the future of the compute technology in the AI supercomputer, including cluster availability orchestration and containerization/virtualization technology, to bring distributed deep learning training and inferencing to life.
Software Engineer - Singularity, Data Plane, Networking
You will work on programming a specialized high-bandwidth network to optimize the performance of synchronous SGD-based distributed model training. You will also work on providing network integration with diverse accelerator types, including GPUs, with InfiniBand (IB).
Software Engineer - Singularity, GPU/FPGA/AI Accelerators
You will be working on a consistent model to support a diverse set of accelerators (GPU/FPGA/AI Accelerators) and to enable provisioning and scaling of accelerator devices based on AI workload needs.
Software Engineer - Billing Service
You will be working on building a Billing Service for Singularity that supports a diverse set of financial offers for AI training and inferencing workloads targeting diverse accelerators (GPU/FPGA/AI Accelerators) and that enables the lowest possible infrastructure cost for our customers based on AI workload needs.
Software Engineer - Singularity, Quality of Service (QoS)
The platform and infrastructure part of the new AI systems team is looking for engineers who are passionate about solving problems in the performance, scalability, reliability, and efficiency of large-scale AI services, and who want to build and manage one of the largest AI services in the industry.
Software Engineer - Singularity, PyTorch, TensorFlow Internals
You will work on deeply integrating frameworks like PyTorch and TensorFlow within Singularity, and on providing native support for elasticity, checkpointing, data loading, and other optimizations as the model execution progresses.
Equal Opportunity Statement
Cloud Background Check: The ability to meet Microsoft, customer, and/or government security screening requirements is required for these roles. These requirements include, but are not limited to, the following specialized security screening: Microsoft Cloud Background Check. These positions will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form. Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
How to Apply