Engineered to make data science

Explainable. Repeatable. Scalable.

Our free and open-source version of Pachyderm is built and backed by a community of experts. With Pachyderm Community Edition, you can quickly and easily build, train, and deploy your data science workloads on whatever Kubernetes deployment you call home.

What is Pachyderm

Pachyderm is a data science platform that combines Data Lineage with End-to-End Pipelines on Kubernetes, engineered for the enterprise. And… It’s open source!

Data Lineage

Think "Git for data", but better. Pachyderm not only version-controls all data types but also delivers true data lineage. Data Lineage means knowing, with certainty, the complete journey of your data, code, and models, and the relationships between them.

Learn More

End-To-End Pipelines

Pachyderm makes it simple to build end-to-end data science workflows using any language or framework you want. Turn your existing manual processes into an automated workflow where everything is tracked and versioned regardless of what data, language, or framework you use.

Learn More

Enterprise Scale

Kubernetes makes software scalable. We built Pachyderm on top of Kubernetes to provide you with a direct path to production, using your choice of infrastructure. Whether you're still in the proof-of-concept phase or processing petabytes of data, Pachyderm makes scaling simple.

Learn More

What’s your Pachyderm use case?

Pachyderm brings together version control for data with the tools to build scalable end-to-end ML/AI pipelines, while empowering users to develop their code in any language, framework, or tool of their choice. Pachyderm has been chosen time and again as the foundation for teams looking to solve real-world AI and ML problems reliably.

Learn more

Automotive

Since its inception, the automotive industry has been high tech and fueled by data. From the moment the first Model T rolled off the assembly line, the entire world has witnessed firsthand what technology driven by data can do.

Learn More

Banking

Algorithms have supported financial services for decades. They have helped companies grow nest eggs and enhance products for customers since before the computer was invented.

Learn More

BioTech

As a discipline, Life Sciences has long been at the forefront of data science.

Learn More

Testimonials

All over the world, data scientists and ML engineers are discovering how much better applied data science can be when Pachyderm is involved. Here are just a few examples of what they're saying.

"A Pachyderm Hub cluster equals Data Scientist autonomy!"

Raanan Hadar, Data Scientist

"Setting up and provisioning a Kubernetes cluster can be a huge pain, so seeing a cluster spin up immediately on Hub was immensely satisfying."

Matt Usifer, Software Engineer

"We use Pachyderm as our data pipeline orchestrator. For us, the fact that you can deploy it so easily to a k8s cluster, and use language-agnostic, container-based workloads are absolute killer features."

Guilherme Caminha, Senior Software Engineer - Precis Digital

"Pachyderm makes it easy to organise and run complicated pipelines. It gets you up and running in a matter of seconds."

Samanvay Karambhe, Data Scientist @ Nearmap

Companies who use Pachyderm

Agbiome, Digital Reasoning, General Fusion

News and Announcements

Say Hello To Pachyderm: Hub

Today we’re excited to announce the public beta of our …

Learn More

(Kubernetes as a Service) as a Service

At Pachyderm, we just released our hosted service for public beta. …

Learn More

Pachyderm 1.9 is GA

The Pachyderm team is proud to announce our first major release of …

Learn More

Resources to get you started

There's a better way to do data science, and we'll show you how. Enterprise-grade data science is hard enough. Our advice: get the easy stuff right. Here are a few links that will get you started.

Get Started

The quickest and easiest way for you to get started is with Pachyderm Hub. With Hub, you get to skip the hassle of managing your own infrastructure and get right to building scalable, repeatable data science pipelines.

Documentation

Open Office Hours

Sometimes you need to talk it out. That’s why we host bi-weekly open office hours sessions where you can ask questions, discuss ideas, or troubleshoot something with Pachyderm employees. The floor is yours.

Sign Up