Prefect Flows (Overview)

Last Updated: 2026-01-18

The data platform is driven by four Prefect flows: a main pipeline that coordinates end-to-end runs, and three stage flows that extract, clean, and load data.

Flow graph

graph LR
    A[Main Pipeline] --> B[Data Extraction]
    A --> C[Data Cleaning]
    A --> D[Graph Load]
    B --> C
    C --> D
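
The edges above imply a single valid run order. As a minimal sketch, assuming each arrow means "starts before" (the graph itself does not say so), the order can be derived with Python's standard-library graphlib. The names EDGES and run_order are illustrative and are not taken from the codebase.

```python
from graphlib import TopologicalSorter

# Edges copied from the flow graph above as (upstream, downstream) pairs.
EDGES = [
    ("Main Pipeline", "Data Extraction"),
    ("Main Pipeline", "Data Cleaning"),
    ("Main Pipeline", "Graph Load"),
    ("Data Extraction", "Data Cleaning"),
    ("Data Cleaning", "Graph Load"),
]


def run_order(edges):
    """Return the nodes in an order that respects every edge."""
    sorter = TopologicalSorter()
    for upstream, downstream in edges:
        # add(node, *predecessors): downstream depends on upstream.
        sorter.add(downstream, upstream)
    return list(sorter.static_order())


if __name__ == "__main__":
    for step in run_order(EDGES):
        print(step)
```

Because every stage has the previous stage as a predecessor, only one ordering satisfies the graph: Main Pipeline, Data Extraction, Data Cleaning, Graph Load.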

Canonical flows

  • Main pipeline: coordinates end-to-end runs
  • Extraction: scrapes raw data from sources into the raw layer
  • Cleaning: normalizes raw data into graph-ready CSVs in the clean layer
  • Graph load: creates constraints and loads nodes/relationships into Neo4j
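
The coordination described above can be sketched with Prefect's flow decorator, where calling one flow from inside another runs it as a subflow. This is a hedged illustration, not the project's actual code: the function names, flow names, return values, and placeholder bodies are all assumptions, and the real entrypoints in data/platform/flows/ may be organized differently.

```python
try:
    from prefect import flow
except ImportError:
    # Fallback so the sketch still runs without Prefect installed:
    # a no-op stand-in that leaves the decorated function unchanged.
    def flow(**_kwargs):
        return lambda fn: fn


@flow(name="extraction")
def extraction() -> str:
    """Scrape sources into the raw layer (placeholder body)."""
    return "raw"  # hypothetical marker for the raw layer


@flow(name="cleaning")
def cleaning(raw_layer: str) -> str:
    """Normalize raw data into graph-ready CSVs (placeholder body)."""
    return "clean"  # hypothetical marker for the clean layer


@flow(name="graph-load")
def graph_load(clean_layer: str) -> str:
    """Create constraints and load nodes/relationships (placeholder body)."""
    return "loaded"  # hypothetical status value


@flow(name="main-pipeline")
def main_pipeline() -> list[str]:
    """Run the three stages in order; each call is a subflow under Prefect."""
    raw = extraction()
    clean = cleaning(raw)
    status = graph_load(clean)
    return [raw, clean, status]


if __name__ == "__main__":
    print(main_pipeline())
```

Passing each stage's result to the next makes the extraction, cleaning, graph load ordering explicit in the code rather than relying on call position alone.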

Where they live

All flow entrypoints are in data/platform/flows/.

Pages