Skip to content

Fix KeyError and recursion issue in control_dataset.py when loading cached text embeddings#56

Open
tori29umai0123 wants to merge 1 commit intoFlyMyAI:mainfrom
tori29umai0123:main
Open

Fix KeyError and recursion issue in control_dataset.py when loading cached text embeddings#56
tori29umai0123 wants to merge 1 commit intoFlyMyAI:mainfrom
tori29umai0123:main

Conversation

@tori29umai0123
Copy link
Copy Markdown

Fix KeyError and recursion issue in control_dataset.py when loading cached text embeddings

This pull request fixes a bug that caused a KeyError('.txt') and subsequent infinite recursion in control_dataset.py when a missing text embedding was encountered.

Changes:
Added a safe check for missing keys in cached_text_embeddings
Prevented infinite recursion in getitem by handling missing samples gracefully
Improved logging for debugging missing .txt entries

Impact:
This change stabilizes data loading during training and prevents crashes due to invalid or missing text files.
No modification to train_qwen_edit_lora.py is required.

Fix KeyError and recursion issue in control_dataset.py when loading cached text embeddings
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant