Machine Learning Engineer, Multimodal Data

Job Type: Full Time
Job Location: United States
Company Name: Runway

About the job

Runway is a research company pioneering new tools for human imagination. Runway has been at the forefront of multi-modal AI systems ensuring that the future of content creation is accessible, controllable and empowering for creatives. Runway’s mission is to ensure that anyone anywhere can tell their stories. We believe that deep learning techniques applied to audiovisual content will forever change art, creativity, and design tools.

Runway is leading a shift to generative media that is unlocking an unprecedented level of creative potential. The invention of the camera 200 years ago forever changed our world – AI is a new kind of camera that will reshape storytelling forever and lead to full feature films that are entirely generated.

About the role

We’re looking for Dataset Engineers to help curate, build, and optimize datasets for model training. The ideal candidate for this role has strong machine learning skills, extensive experience working with and analyzing large-scale datasets, and an understanding of creativity tools. You should be proficient in ensuring data quality and tight feedback loops between data preprocessing and model training.

What you’ll do

  • Develop and maintain large-scale, multimodal datasets for training and evaluating models
  • Optimize models for data preprocessing tasks
  • Create and run evaluations and benchmark analyses for datasets and models
  • Implement fast iteration cycles and feedback loops to continuously improve model datasets
  • Work with a world-class research team to push the boundaries of content creation
  • Evaluate new datasets and models for upstream data tasks that feed into our products

What you’ll need

  • 4+ years of relevant experience in machine learning or dataset engineering, ideally with multimodal datasets
  • Experience with running and optimizing models offline at large scale
  • Excellent data modeling skills and experience with data curation
  • Proficiency in model finetuning and optimization for data preprocessing
  • Strong data analysis and SQL skills
  • Experience in creating evaluations and running benchmark analyses
  • Solid knowledge of at least one machine learning framework (e.g. PyTorch, JAX, TensorFlow)
  • Very strong programming skills and ability to write clean and maintainable code
  • Deep interest in building human-in-the-loop systems for creativity
  • Ability to rapidly prototype solutions and iterate on them with tight product deadlines
  • Strong familiarity with tools such as Ray, Kubernetes, Airflow, Prefect
  • Excellent communication, collaboration, and documentation skills

    APPLY

Apply for this position

Allowed Type(s): .pdf, .doc, .docx