Two values in DatasetStatus are defined but never written by any code in the repo:
PENDING = 2 (src/datachain/dataset.py:263)
STALE = 6 (src/datachain/dataset.py:266)
Grep shows only reads:
PENDING - appeared in a defensive guard list at catalog.complete_dataset_version ([CREATED, PENDING]) and in the wipe-routing list in catalog.remove_dataset_version. Both removed in the soft-delete PR.
STALE - read in:
DatasetVersion.is_final_status() (dataset.py)
get_dataset_versions_to_clean() cleanup query (metastore.py)
catalog.remove_dataset_version routing (removed in the soft-delete PR)
No production code sets either status. Dead branches in the state machine.
Tasks
Two values in
DatasetStatusare defined but never written by any code in the repo:PENDING = 2(src/datachain/dataset.py:263)STALE = 6(src/datachain/dataset.py:266)Grep shows only reads:
PENDING - appeared in a defensive guard list at
catalog.complete_dataset_version([CREATED, PENDING]) and in the wipe-routing list incatalog.remove_dataset_version. Both removed in the soft-delete PR.STALE - read in:
DatasetVersion.is_final_status()(dataset.py)get_dataset_versions_to_clean()cleanup query (metastore.py)catalog.remove_dataset_versionrouting (removed in the soft-delete PR)No production code sets either status. Dead branches in the state machine.
Tasks
PENDINGandSTALEfrom theDatasetStatusenum