Legacy Notes (Deprecated)

Last Updated: 2026-01-18

This page intentionally captures legacy / deprecated information that still shows up in older docs and scripts.

What changed

  • Older docs refer to blog_data / blog_code naming. In this monorepo, the canonical names are Rocket Club and data/platform.
  • Some earlier pipeline approaches have been superseded by the current Prefect flows and the S3-layered data approach.
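
To spot the legacy names in older docs or scripts, a small scan along the following lines may help. This is a minimal sketch, not an existing tool in the repo: the pattern and the find_legacy_names helper are hypothetical, and only the names blog_data and blog_code come from this page.

```python
import re
from typing import List, Tuple

# Legacy names listed on this page. The pattern and helper are illustrative
# assumptions, not part of the monorepo.
LEGACY_NAME_PATTERN = re.compile(r"\bblog_(?:data|code)\b")


def find_legacy_names(text: str) -> List[Tuple[int, str]]:
    """Return (line_number, legacy_name) pairs for each legacy name in text."""
    hits = []
    for line_number, line in enumerate(text.splitlines(), start=1):
        for match in LEGACY_NAME_PATTERN.finditer(line):
            hits.append((line_number, match.group(0)))
    return hits


if __name__ == "__main__":
    sample = "cd blog_data\nrun the flow\nimport blog_code"
    for line_number, name in find_legacy_names(sample):
        print(f"line {line_number}: legacy name {name}")
```

A hit only marks a place to review. Whether the surrounding text is still accurate has to be judged against the current docs.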

Guidance

If a doc refers to an “old way” of doing something, treat it as historical context unless the pages above also describe it.

Archived notebooks

The original pipeline implementation used a sequence of Jupyter notebooks (00-03) that have since been replaced by Prefect flows.

Notebook folder (reference only): data/platform/notebooks/archive/

Flow mapping:

Old notebook                  New flow
00_schema_generation.ipynb    flows/schema_generation.py
01_data_extraction.ipynb      flows/data_extraction.py
02_data_cleaning.ipynb        flows/data_cleaning.py
03_graph_load.ipynb           flows/graph_load.py
(new)                         flows/main_pipeline.py

The notebooks are kept for reference and should not be used for production data processing.
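
When an older script or doc points at one of the archived notebooks, the flow mapping can be expressed as a simple lookup. The sketch below is illustrative only: the dictionary copies the mapping from this page, while the replacement_flow helper is a hypothetical name and not something that exists in the repo.

```python
from pathlib import PurePosixPath
from typing import Optional

# Archived notebook -> replacement flow, copied from the flow mapping on this page.
# flows/main_pipeline.py is new and has no notebook predecessor, so it is not a key.
NOTEBOOK_TO_FLOW = {
    "00_schema_generation.ipynb": "flows/schema_generation.py",
    "01_data_extraction.ipynb": "flows/data_extraction.py",
    "02_data_cleaning.ipynb": "flows/data_cleaning.py",
    "03_graph_load.ipynb": "flows/graph_load.py",
}


def replacement_flow(notebook_path: str) -> Optional[str]:
    """Return the flow that replaced an archived notebook, or None if unknown."""
    # Only the file name matters, so a full archive path works as input too.
    notebook_name = PurePosixPath(notebook_path).name
    return NOTEBOOK_TO_FLOW.get(notebook_name)


if __name__ == "__main__":
    legacy = "data/platform/notebooks/archive/02_data_cleaning.ipynb"
    print(replacement_flow(legacy))
```

A result of None means the path is not one of the four archived notebooks listed above.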