Allow for control over which skills are loaded (i.e., could be 'none') to do comparison grading #126

@richardpark-msft

Description

This is the "can the LLM do the task without a skill?" type of testing: run the same prompt once with a skill loaded and once without, then compare the two runs to see whether the task was accomplished in each.
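To make the request concrete, here is a minimal sketch of what such a comparison could look like. Everything in it is hypothetical and for illustration only: `compare`, `Comparison`, `fake_agent`, the `run_agent(prompt, skills)` signature, and the `deploy-skill` name are assumptions, not part of any existing API in this project. An empty skill list stands in for the 'none' case.

```python
from dataclasses import dataclass
from typing import Callable, Sequence

# Hypothetical signature: (prompt, skills to load) -> agent output text.
RunAgent = Callable[[str, Sequence[str]], str]
# Hypothetical signature: output text -> True if the task was accomplished.
Grader = Callable[[str], bool]


@dataclass
class Comparison:
    passed_with_skills: bool
    passed_without_skills: bool

    @property
    def skill_needed(self) -> bool:
        # The skill made the difference only if the run with it passed
        # and the baseline run without it failed.
        return self.passed_with_skills and not self.passed_without_skills


def compare(prompt: str, skills: Sequence[str],
            run_agent: RunAgent, grade: Grader) -> Comparison:
    """Run the same prompt with and without skills and grade both runs."""
    with_skills = grade(run_agent(prompt, skills))
    baseline = grade(run_agent(prompt, []))  # empty list models 'none'
    return Comparison(with_skills, baseline)


# Stand-in agent for illustration: it only succeeds when the skill is loaded.
def fake_agent(prompt: str, skills: Sequence[str]) -> str:
    return "task done" if "deploy-skill" in skills else "gave up"


result = compare("Deploy the app", ["deploy-skill"],
                 fake_agent, lambda out: "done" in out)
print(result.skill_needed)
```

If the baseline run also passes, `skill_needed` is false, which is the signal that the LLM could already do the task without the skill.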

CC: @ronniegeraghty
