Understand the top-level Flow entity, the catalog
A catalog is a collection of configurations that defines your entire Flow workflow. Catalogs tell Flow how to obtain, process, and persist data by defining various entities that Flow understands. These include captures, collections, derivations, materializations, and schemas.
The various pages in this documentation section discuss the details behind each of these key entities, as well as other important concepts that help the catalog run.
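As a rough mental model, a catalog can be pictured as a set of named entities grouped by kind. The sketch below is illustrative only: the `Catalog` class, its methods, the entity names such as `acmeCo/orders`, and the config keys are all hypothetical, and none of it reflects Flow's actual catalog specification format.

```python
from dataclasses import dataclass, field

# The five entity kinds named in this page's description of a catalog.
ENTITY_KINDS = ("captures", "collections", "derivations", "materializations", "schemas")


@dataclass
class Catalog:
    """Hypothetical in-memory stand-in for a catalog; not Flow's real format."""

    # Maps each entity kind to a dict of {entity name: configuration}.
    entities: dict = field(
        default_factory=lambda: {kind: {} for kind in ENTITY_KINDS}
    )

    def add(self, kind: str, name: str, config: dict) -> None:
        """Register a named entity under one of the known kinds."""
        if kind not in self.entities:
            raise ValueError(f"unknown entity kind: {kind}")
        self.entities[kind][name] = config

    def summary(self) -> dict:
        """Return how many entities of each kind the catalog holds."""
        return {kind: len(named) for kind, named in self.entities.items()}


# Assumed example entities: names and config keys are made up for illustration.
catalog = Catalog()
catalog.add("captures", "acmeCo/orders-source", {"target": "acmeCo/orders"})
catalog.add("collections", "acmeCo/orders", {"key": ["/id"]})
catalog.add("materializations", "acmeCo/orders-warehouse", {"source": "acmeCo/orders"})
print(catalog.summary())
```

The point of the sketch is only that a catalog is a single container holding every entity kind, so one catalog can describe how data is obtained, processed, and persisted end to end.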
Large businesses can have thousands of separate data products, and Flow is designed with this in mind. Flow can manage catalogs of any size and is flexible enough to support collaboration, even in the largest organizations.
This section assumes you already have a basic knowledge of Flow. If you're new to Flow, start by following the tutorial and reading the high-level concepts page.
If you need details on specific options and programming elements, see the catalog entities reference section instead.