Powering Up Your Data Versioning and Data Lineage With Pachyderm

Powering Up Your Data Versioning and Data Lineage With Pachyderm

Wednesday, May 5, 2021 7:30 PM to 7:40 PM · 10 min. (Africa/Abidjan)

Information

Pachyderm is powerful data versioning and data lineage engine for machine learning that solves six unique problems in the AI/ML world:  Data Versioning; Data Lineage; Data driven pipelines; Incrementality; Language agnosticism; Scalability.  I'll walk you through the platform's robust copy-on-write filesystem and rock solid immutability to show you how it keeps track of every change, from your data, to your models, to your code, while they're all changing at the same time.  I'll also show you how Pachyderm lets you bring any tool you want to the table, whether that's Python, R, C++, Rust, or any cutting edge framework that just came fresh from the research lab.

Log in