Hi, @potsawee , thanks for sharing your nice work.
There is something unclear in your paper. I wonder if you can give some explanations?
After we obtain the three score, how to combine them for the final score? The three scores have different scale:
- $\mathcal{S}_{BERT} \in [0.0, 1.0]$
- $\mathcal{S}_{QA} \in [0.0, 1.0]$
- $\mathcal{S}_{n-gram} \in [0.0, inf]$
I noticed that in section 5.4, you wrote "As a result, we consider SelfCheckGPT-Combination, which is a simple combination of the normalized scores of the three variants...."
So what is the combination strategy?
Thanks in advance! Looking forward to your reply.