The Machine Learning Podcast

Build More Reliable Machine Learning Systems With The Dagster Orchestration Engine



December 2nd, 2022  •  45 mins 43 secs  •  Download (29.4 MB)  •  Link with Timestamp

RSS Feed

Building a machine learning model one time can be done in an ad-hoc manner, but if you ever want to update it and serve it in production you need a way of repeating a complex sequence of operations. Dagster is an orchestration engine that understands the data that it is manipulating so that you can move beyond coarse task-based representations of your dependencies. In this episode Sandy Ryza explains how his background in machine learning has informed his work on the Dagster project and the foundational principles that it is built on to allow for collaboration across data engineering and machine learning concerns.