Skip to content

Add training code and update documentation for InternVL and BAGEL#32

Merged
oneScotch merged 12 commits into
mainfrom
wrs/training_code
Apr 29, 2026
Merged

Add training code and update documentation for InternVL and BAGEL#32
oneScotch merged 12 commits into
mainfrom
wrs/training_code

Conversation

@oneScotch

Copy link
Copy Markdown
Collaborator
  • Add training code for BAGEL and InternVL. Introduces complete training pipelines under training/Bagel/ and training/InternVL/, including model definitions, data loaders, dataset configs, FSDP utilities, training scripts, and dependency files (requirements.txt, environment.yml, DeepSpeed zero-stage configs).
  • Update README (EN & CN) with training instructions. Adds step-by-step guides for downloading the SenseNova-SI-800K dataset, setting up environments (conda + uv), and launching training for Bagel, and InternVL.

@oneScotch oneScotch self-assigned this Apr 18, 2026
Comment thread README.md Outdated
@caizhongang

Copy link
Copy Markdown
Collaborator

Under training/, align capitalization (i.e., small letters for both Bagel/ and InternVL/)
Also, clean training_qwen3_vl.md to merge with the main README.md

@caizhongang

caizhongang commented Apr 18, 2026

Copy link
Copy Markdown
Collaborator

Consider fixing this issue in this PR: #25

Comment thread training/qwen3_vl/train_config.yaml
@oneScotch oneScotch merged commit 1eee868 into main Apr 29, 2026
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants