Components of a Pipeline
Was this helpful?
Was this helpful?
A standard Pipeline consists of a Data Asset followed by a series of Capsules that write results to a Results Bucket.
Each Capsule is a standalone and fully reproducible process that reads data from the /data
folder and writes results to the /results
folder. When implemented in a Pipeline, the contents of each Capsule’s /data
folder are ignored. Input data can be specified by attaching a Capsule or Data Asset upstream of the Capsule so that the results of the first Capsule are passed to the /data
folder of the second Capsule. Results from each Capsule will only be saved if it is connected to the Results Bucket.
This section covers the main components of a pipeline: