The Data Foundation for Machine Learning

Pachyderm is the data layer that powers your machine learning lifecycle

  • Data Driven Automation
  • Petabyte Scalability
  • End-to-End Reproducibility
Information Graphic

Trusted by Forward-Thinking Companies

  • “The difference was an order of magnitude faster...if it took 10 hours on the old system then it would only take an hour with Pachyderm”

    George Bonev, PHD Machine Learning Engineering, Liveperson
  • “Prior to using Pachyderm, we thought we’d never be able to execute those training sessions so fast. But because the data preparation process became so short, the research team was able to deliver much faster and create a lot of new models because of it”

    Voice AI Product Manager at Large Identity Provider

What is Pachyderm

Enterprise Edition

Pachyderm Enterprise Edition is designed for large-scale collaboration in highly secure environments.

Learn More


With Pachyderm Hub, you get enterprise-grade Pachyderm on-demand, without installing or maintaining Kubernetes, unless you want to (see Enterprise or Community Editions).

Learn More

Community Edition

This is our open source version of Pachyderm. With Pachyderm Community Edition you get the core Data Versioning and Pipeline features of Pachyderm, and can deploy locally or in the cloud of your choosing.

Learn More


All over the world data scientists and ML engineers are discovering how much better applied data science can be when Pachyderm is involved. Here's just a few examples of they're saying.