Prefect Flows (Overview)
Last Updated: 2026-01-18
The data platform is driven by a small set of Prefect flows that extract, clean, and load data.
Flow graph
graph LR
A[Main Pipeline] --> B[Data Extraction]
A --> C[Data Cleaning]
A --> D[Graph Load]
B --> C
C --> D
Canonical flows
- Main pipeline: coordinates end-to-end runs
- Extraction: scrapes raw data from sources into the raw layer
- Cleaning: normalizes raw data into graph-ready CSVs in the clean layer
- Graph load: creates constraints and loads nodes/relationships into Neo4j
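
The ordering implied by the flow graph (extraction feeds cleaning, cleaning feeds graph load, and the main pipeline coordinates all three) can be sketched in plain Python. This is a minimal illustration, not the platform's actual Prefect code: the names `STAGE_DEPENDENCIES`, `run_order`, and `main_pipeline` are hypothetical, and the stage callables are stand-ins for the real flows.

```python
from graphlib import TopologicalSorter

# Hypothetical stage names. Each stage maps to the set of stages that must
# finish before it starts, mirroring the B --> C and C --> D edges above.
STAGE_DEPENDENCIES = {
    "extraction": set(),
    "cleaning": {"extraction"},
    "graph_load": {"cleaning"},
}


def run_order(dependencies):
    """Return the stages in an order that respects every dependency."""
    return list(TopologicalSorter(dependencies).static_order())


def main_pipeline(stages):
    """Call each stage in dependency order and return the order used.

    `stages` maps a stage name to a zero-argument callable (a stand-in
    for the real flow entrypoint, which is assumed, not shown here).
    """
    order = run_order(STAGE_DEPENDENCIES)
    for name in order:
        stages[name]()
    return order


if __name__ == "__main__":
    executed = []
    # Stub stages that only record that they ran.
    stubs = {name: (lambda n=name: executed.append(n)) for name in STAGE_DEPENDENCIES}
    print(main_pipeline(stubs))
```

In a Prefect setup, the same shape would typically be a parent flow calling the three stage flows one after another; the dependency table here only makes the ordering explicit.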
Where they live
All flow entrypoints are in data/platform/flows/.