Adding training recipes for Nemotron 3 - Nano#46
Open
marcromeyn wants to merge 21 commits intomainfrom
Open
Conversation
Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
* Add initial CI Signed-off-by: Charlie Truong <chtruong@nvidia.com> * Fix CI pull request target Signed-off-by: Charlie Truong <chtruong@nvidia.com> * Fix copyright Signed-off-by: Charlie Truong <chtruong@nvidia.com> * Add dev branch as GHA CI target Signed-off-by: Charlie Truong <chtruong@nvidia.com> * Fix uv install in GHA Signed-off-by: Charlie Truong <chtruong@nvidia.com> * ruff format ignores .ipynb Signed-off-by: Charlie Truong <chtruong@nvidia.com> * Ignore ipynb files in pre-commit Signed-off-by: Charlie Truong <chtruong@nvidia.com> * Revert "Ignore ipynb files in pre-commit" This reverts commit 41c4d94. Signed-off-by: Charlie Truong <chtruong@nvidia.com> * Revert "ruff format ignores .ipynb" This reverts commit 5fcce7e. Signed-off-by: Charlie Truong <chtruong@nvidia.com> * Format notebooks Signed-off-by: Charlie Truong <chtruong@nvidia.com> --------- Signed-off-by: Charlie Truong <chtruong@nvidia.com>
* Fix broken links to usage-recipes (#9) Also updates a typo - usage cookbooks are now available, and examples are coming soon. Signed-off-by: Shashank Verma <shashank3959@gmail.com> * Final Pre-release commit (#10) * Examples Directory Signed-off-by: Chris Alexiuk <calexiuk@nvidia.com> * Examples Directory Signed-off-by: Chris Alexiuk <calexiuk@nvidia.com> * add Nemotron_Parse_V1_1.ipynb & Nemotron_Nano2_VL.ipynb to usage-cookbook * update Nemotron-Parse endpoint URL * docs: add Nemotron Ideas Portal feature voting section to README * Final pre-release updates Signed-off-by: Chris Alexiuk <calexiuk@nvidia.com> --------- Signed-off-by: Chris Alexiuk <calexiuk@nvidia.com> Co-authored-by: Chia-Chih Chen <chiachihc@nvidia.com> * Final pre-release updates (#11) * Final pre-release updates Signed-off-by: Chris Alexiuk <calexiuk@nvidia.com> * Final pre-release updates Signed-off-by: Chris Alexiuk <calexiuk@nvidia.com> --------- Signed-off-by: Chris Alexiuk <calexiuk@nvidia.com> * Bug/rename notebook (#12) * Final pre-release updates Signed-off-by: Chris Alexiuk <calexiuk@nvidia.com> * Final pre-release updates Signed-off-by: Chris Alexiuk <calexiuk@nvidia.com> * Final pre-release updates Signed-off-by: Chris Alexiuk <calexiuk@nvidia.com> * Final pre-release updates Signed-off-by: Chris Alexiuk <calexiuk@nvidia.com> --------- Signed-off-by: Chris Alexiuk <calexiuk@nvidia.com> * Nemotron Parse v1.1 and Nemotron Nano 2 VL notebook README.md (#13) Signed-off-by: Chia-Chih Chen <chiachihc@nvidia.com> * Add Data Science ML Agent Use Case Example (#14) * Add Data Science ML Agent example Signed-off-by: Allison Ding <allison.j.ding@gmail.com> * Add data folder with .gitkeep Signed-off-by: Allison Ding <allison.j.ding@gmail.com> * update the README.md in use-case-examples Signed-off-by: Allison Ding <allison.j.ding@gmail.com> * update the README in ds ml agent Signed-off-by: Allison Ding <allison.j.ding@gmail.com> --------- Signed-off-by: Allison Ding <allison.j.ding@gmail.com> * Adding structure Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Adding data_prep Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Adding data-prep for nano-3 Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Adding nemotron.kit Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Allowing defaults Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Integrate run Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Fixing data prep Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Improving wandb artifact integration Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Various improvements Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Some fixes (#7) * Some fixes Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Some fixes Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Some fixes Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * End of day commit Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Some fixes Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Some fixes Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Some fixes Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Some fixes Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Some fixes Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Some fixes Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> --------- Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Remove tui for now (#8) Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Removing tui properly (#9) Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Move data-prep configs (#10) Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Adding structure (#11) Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Some fixes for pre-training (#12) Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Fix linting Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Disable CI for now Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> --------- Signed-off-by: Shashank Verma <shashank3959@gmail.com> Signed-off-by: Chris Alexiuk <calexiuk@nvidia.com> Signed-off-by: Chia-Chih Chen <chiachihc@nvidia.com> Signed-off-by: Allison Ding <allison.j.ding@gmail.com> Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> Co-authored-by: Shashank Verma <shashank3959@gmail.com> Co-authored-by: Chris Alexiuk <161380339+chrisalexiuk-nvidia@users.noreply.github.com> Co-authored-by: Chia-Chih Chen <chiachihc@nvidia.com> Co-authored-by: Allison Ding <141533440+AllisonDing@users.noreply.github.com>
Signed-off-by: Marc Romeyn <marcromeyn@gmail.com>
* Adding tests to pretrain and fix bugs Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Some fixes Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Trying to get sft to work Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> --------- Signed-off-by: Marc Romeyn <marcromeyn@gmail.com>
* Fixing data-prep for rl Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * fix rl data prep Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> --------- Signed-off-by: Marc Romeyn <marcromeyn@gmail.com>
Signed-off-by: Marc Romeyn <marcromeyn@gmail.com>
Signed-off-by: Marc Romeyn <marcromeyn@gmail.com>
Signed-off-by: Marc Romeyn <marcromeyn@gmail.com>
Signed-off-by: Marc Romeyn <marcromeyn@gmail.com>
Signed-off-by: Marc Romeyn <marcromeyn@gmail.com>
Signed-off-by: Marc Romeyn <marcromeyn@gmail.com>
Signed-off-by: Marc Romeyn <marcromeyn@gmail.com>
Signed-off-by: Marc Romeyn <marcromeyn@gmail.com>
Signed-off-by: Marc Romeyn <marcromeyn@gmail.com>
Signed-off-by: Marc Romeyn <marcromeyn@gmail.com>
…hing (#47) Signed-off-by: Marc Romeyn <marcromeyn@gmail.com>
Signed-off-by: Marc Romeyn <marcromeyn@gmail.com>
* Updated pretrain and sft docs Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> * Some fixes and add update rl.md Signed-off-by: Marc Romeyn <marcromeyn@gmail.com> --------- Signed-off-by: Marc Romeyn <marcromeyn@gmail.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
No description provided.