eBooks
A collection of eBooks and other content.
Practical Data-Centric AI in the Real World
The focus on models has driven innovative and stable machine learning tools for research and enterprise. The rise of data-centric AI shifts the focus from models and code to the quality and context of your datasets.
Accelerate Time to Value for Machine Learning Projects
The time required to realize machine learning value can be a significant investment, but the right tools and strategy can help teams achieve this success in less time.
Top Reasons to Choose Pachyderm
Why do people choose Pachyderm for machine learning data versioning and pipelines? See the top ten reasons here.
What People Are Saying about Pachyderm
Top quotes about why data engineering and machine learning teams are choosing Pachyderm
Pachyderm Solution Brief
This PDF provides a fast and easy guide to Pachyderm's capabilities for data engineering teams.
Corporate Fact Sheet
This PDF has all the facts about Pachyderm as a company, our growth so far, and who leads our team.
5 Tips and Tricks: Scaling ML with Pachyderm
Learn some best practices for file management, command line fundamentals, and common workflow improvements for working with Pachyderm.
Completing the Machine Learning Loop
The machine learning workflow loop involves two CI/CD pipelines - one for code, and one for data. This ebook demonstrates how they work for MLOps.
Pachyderm and NVIDIA DGX Systems
As an NVIDIA DGX partner, Pachyderm's data-driven, reproducible pipelines and automated versioning improve machine learning productivity, output, and efficacy.
What is Data Lineage?
Data lineage is one of those problems an AI research team doesn’t know it has until it starts to scale. Learn more about a staple in the machine learning industry: data lineage.
Machine Learning and the Coming Transformation of Finance
Gain future insights on the finance industry due to machine learning and AI.
ThoughtWorks Guide to Evaluating MLOps Platforms
This guide includes an overview of the future of MLOps, and profiles the platforms and tools used by machine learning leaders today.