What is Data Lineage?


Learn more about a staple in the machine learning industry: data lineage.

Data lineage is one of those problems an AI research team doesn’t know it has until it starts to scale. When a team is only working on one or two projects, it’s easy to keep track of things with spreadsheets,
word of mouth, and Slack. But as the team grows and they take on  more projects, that kind of ad-hoc system breaks down fast.

To really take control of your data and your AI development workflows you need a strong, fourth generation version control system like Pachyderm. Building on that platform lets you build for tomorrow’s challenges today.