On-Prem Deploy
Learn how to install on your premises.
May 30, 2023
Before you start #
Before you can deploy Pachyderm, you will need to perform the following actions:
- Install kubectl
- Install Helm
- Deploy Kubernetes on-premises.
- Deploy two Kubernetes persistent volumes for Pachyderm metadata storage.
- Deploy an on-premises object store using a storage provider like MinIO, EMC’s ECS, or SwiftStack to provide S3-compatible access to your data storage.
How to Deploy Pachyderm On-Premises #
1. Install Pachyderm via Helm #
helm repo add pachyderm https://helm.pachyderm.com
helm repo update
2. Add Storage classes to Helm Values #
Update your Helm values file to include the storage classes you are going to use:
etcd:
storageClass: MyStorageClass
size: 10Gi
postgresql:
persistence:
storageClass: MyStorageClass
size: 10Gi
3. Size & Configure Object Store #
- Determine the endpoint of your object store, for example
minio-server:9000
. - Choose a unique name for the bucket you will dedicate to Pachyderm.
- Create a new access key ID and secret key for Pachyderm to use when accessing the object store.
- Update the Pachyderm Helm values file with the endpoint, bucket name, access key ID, and secret key.
pachd:
storage:
backend: minio
minio:
endpoint: minio-server:9000
bucket: pachyderm-bucket
id: pachyderm-access-key
secret: pachyderm-secret-key
secure: false
💡
You can update your Helm values file using the following command:
helm upgrade pachyderm pachyderm/pachyderm -f values.yml
4. Install PachCTL #
Install PachCTL and PachCTL Auto-completion.