Hi, all:
Thanks for your usefull benchmark!
I would like to know the basis for your classification of question_type and the differences between them. For example, key information retrieval and entity recognition seem to be repeated or unclear.
Thanks in advance.