Add support for the Training Method for finetuning, and for Direct-Preference Optimization (DPO) #262
GitHub Advanced Security / CodeQL
succeeded
Mar 11, 2025 in 2s
No new alerts in code changed by this pull request
Loading