forked from anantn/hn-chatgpt-plugin
Open
Description
User Story:
As a developer, I want to reconfigure NVIDIA Triton to work with the embedding model I select, so that I can serve that model efficiently.
Acceptance Criteria:
Select the appropriate embedding model.
Update the Triton configuration to include this model.
Verify that the model loads correctly in Triton, and run a sample inference to confirm.
Definition of Done:
Triton configuration updated.
Model successfully loaded and sample inference completed.