Legacy Notes (Deprecated)
Last Updated: 2026-01-18
This page intentionally captures legacy / deprecated information that still shows up in older docs and scripts.
What changed
- Older docs refer to
blog_data/blog_codenaming. In this monorepo, the canonical names are Rocket Club and data/platform. - Some earlier pipeline approaches have been superseded by the current Prefect flows + S3-layered data approach.
Guidance
- For current architecture, start at:
If you find a doc referring to an “old way”, treat it as historical context unless it is also described in the pages above.
Archived notebooks
The original pipeline implementation used a sequence of Jupyter notebooks (00-03) that have since been replaced by Prefect flows.
Notebook folder (reference only): data/platform/notebooks/archive/
Flow mapping:
| Old notebook | New flow |
|---|---|
| 00_schema_generation.ipynb | flows/schema_generation.py |
| 01_data_extraction.ipynb | flows/data_extraction.py |
| 02_data_cleaning.ipynb | flows/data_cleaning.py |
| 03_graph_load.ipynb | flows/graph_load.py |
| (new) | flows/main_pipeline.py |
The notebooks are kept for reference and should not be used for production data processing.