Software Dev Engineer – AI/ML, AWS Neuron Distributed Training

Job Type: Full Time
Job Location: United States

Company Overview

Launched in 2006, Amazon Web Services (AWS) began exposing key infrastructure services to businesses in the form of web services — now widely known as cloud computing. The ultimate benefit of cloud computing, and AWS, is the ability to leverage a new business model and turn capital infrastructure expenses into variable costs. Businesses no longer need to plan and procure servers and other IT resources weeks or months in advance. Using AWS, businesses can take advantage of Amazon’s expertise and economies of scale to access resources when their business needs them, delivering results faster and at a lower cost. Today, Amazon Web Services provides a highly reliable, scalable, low-cost infrastructure platform in the cloud that powers hundreds of thousands of businesses in 190 countries around the world. With data center locations in the U.S., Europe, Singapore, and Japan, customers across all industries are taking advantage of our low cost, elastic, open and flexible, secure platform.

About the job

Description

Are you excited about machine learning, chip acceleration, compilers, storage, systems, or EC2? Are you passionate about delivering high-quality services that affect hundreds of thousands of users? Dubbed the “secret sauce” behind AWS’s success, and with development centers in the U.S. and Israel, Annapurna is at the forefront of innovation, combining cloud scale with the world’s most talented engineers.

The Annapurna team hires software and hardware engineers across multiple disciplines, including but not limited to compiler engineers, machine learning engineers, runtime engineers, performance engineers, and engineers working on ML chip acceleration, ASIC, physical design, and SDE in Test. Because of our team’s breadth of talent, we’ve been able to improve AWS cloud infrastructure in networking and security with products such as AWS Nitro, Elastic Network Adapter (ENA), and Elastic Fabric Adapter (EFA); in compute with AWS Graviton and F1 EC2 instances; in machine learning with AWS Neuron and the Inferentia and Trainium ML accelerators; and in storage with scalable NVMe.

As an SDE ML Apps Engineer, you will work alongside Research Engineers and Applied Scientists to build backend science components, including deep learning models that power our platform. Our platform enables non-tech-savvy customers to understand and solve their computer vision

Key job responsibilities

  • Innovating and delivering creative software designs to develop new services, solve operational problems, drive improvements in developer velocity, or positively impact operational safety
  • Writing requirements-capture documents, design documents, integration test plans, and deployment plans
  • Communicating the status and progress of deliverables against schedule, and sharing learnings and innovations with your team and stakeholders

Basic Qualifications

  • Currently enrolled in, or have completed, a Bachelor’s degree program or higher in Computer Science, Computer Engineering, Electrical Engineering, or a related field
  • To qualify, applicants should have earned a Bachelor’s or Master’s degree between April 2022 and September 2024. Possible start dates for this role are between March 2024 and October 2024.
  • Programming experience, gained through internships or coursework, in a language such as Python, C, or C++
  • 1+ years of internship or coursework experience in deep learning and transformer architectures

Preferred Qualifications

  • Previous software engineering experience (internship or professional) with PyTorch/JAX/TensorFlow, distributed libraries and frameworks, end-to-end model training, and sharding. The group offers many opportunities to optimize and scale large deep learning models on the Trainium (AWS machine learning accelerator) architecture.
  • Experience with distributed, multi-tiered systems, algorithms, and relational databases.
  • Experience in optimization mathematics such as linear programming and nonlinear optimization
