Run Commands
Read the GLOSSARY series >

Input Repository

Learn about the concept of an input repository, which is a location where data resides that is used as input for a pipeline.

About #

In Pachyderm, an input repository is a location where data resides that is used as input for a Pachyderm pipeline. To define an input repository, you need to fill out the input attribute in pipeline’s specification file.

There are several ways to structure the content of your input repos, such as:

Once you have defined an input repository, you can use it as the input source for a Pachyderm pipeline. The pipeline will automatically subscribe to the branch of the input repository and process any new data that is added to the branch according to the pipeline configuration.