Skip to content

Conversation

@vukrosic
Copy link
Contributor

No description provided.

…hanced reinforcement learning, including a link for more information.
… rank effects, including detailed descriptions in figure captions for improved understanding of model performance and exploration benefits.
…planations of dynamic noise injection and its impact on training performance, and emphasize the benefits of exploration strategies in achieving superior model accuracy.
…RMSNorm scaling parameters, detailing the mechanism of noise addition for improved exploration without extra parameters or computational overhead.
…ance figure captions for better understanding of training performance, and emphasize the efficiency of QeRL in training large models on limited hardware resources.
…ine the document, updating the conclusion section for clarity, and correcting the names of collaborating institutions for accuracy.
…ntent to declutter the repository and improve organization.
@vukrosic vukrosic merged commit 8d29f4b into main Oct 17, 2025
1 check failed
@vukrosic vukrosic deleted the qerl branch October 17, 2025 16:44
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants