You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The judge should have the goal/objective and response to do the rating. But am I missing something here?
P.S: I changed the prompt a bit but GPT4 is refusing to provide ratings. I marked this issue in JailBreakBench as well (JailbreakBench/jailbreakbench#34)
The text was updated successfully, but these errors were encountered:
Hi @patrickrchao and @eltociear,
Wonderful repo, thanks a lot!
I am wondering if the Judge System prompt for GPT is actually correct i.e Section E in the paper and/or code - https://github.com/patrickrchao/JailbreakingLLMs/blob/main/system_prompts.py#L50
The judge should have the goal/objective and response to do the rating. But am I missing something here?
P.S: I changed the prompt a bit but GPT4 is refusing to provide ratings. I marked this issue in JailBreakBench as well (JailbreakBench/jailbreakbench#34)
The text was updated successfully, but these errors were encountered: