Harness the power and elasticity of Amazon Web Services to automate data transformations with data versioning and lineage. Pachyderm actively participates in AWS ISV Accelerate and runs seamlessly on AWS services such as Elastic Kubernetes Services (EKS), Simple Cloud Storage (S3), Relation Data Services (RDS), Elastic Block Storage (EBS), AWS Fargate, and more.
import cv2
import numpy as np
from matplotlib import pyplot as plt
import os
# edges.py reads an image and outputs transformed image
def make_edges(image):
img = cv2.imread(image)
tail = os.path.split(image)[1]
edges = cv2.Canny(img,100,200)
plt.imsave(os.path.join("/pfs/out", os.path.splitext(tail)[0]+'.png?as=webp'), edges, cmap = 'gray')
# walk images directory and call make_edges on every file found
for dirpath, dirs, files in os.walk("/pfs/images"):
for file in files:
make_edges(os.path.join(dirpath, file))
Pachyderm is available in two editions, Enterprise and Community. Choose the edition that is right for your use case. Read more.
For organizations that require advanced features and unlimited potential.
Pachyderm is cost-effective at scale and enables data engineering teams to automate complex pipelines with sophisticated data transformations
Deliver reliable results faster maximizes dev efficiency.
Automated diff-based data-driven pipelines.
Deduplication of data saves infrastructure costs.
Immutable data lineage ensures compliance.
Data versioning of all data types and metadata.
Familiar git-like structure of commits, branches, & repos.
Leverage existing infrastructure investment.
Language agnostic - use any language to process data
Data agnostic - unstructured, structured, batch, & streaming
Going beyond the limitations of SQL and using Python to speed development and insight with Snowflake.
Learn more about this feature in our documention website. This covers getting started with using Pachyderm on AWS.
This covers how to deploy a Pachyderm cluster on Amazon Elastic Kubernetes Service (EKS)..
Watch a short demo which outlines the product in action
Learn how companies around the world are using Pachyderm to automate complex pipelines at scale.
Request a Demo