Skip to content

New Web Evaluation Application#171

Open
tanmay-9 wants to merge 71 commits intoqlever-dev:mainfrom
tanmay-9:new-eval-app
Open

New Web Evaluation Application#171
tanmay-9 wants to merge 71 commits intoqlever-dev:mainfrom
tanmay-9:new-eval-app

Conversation

@tanmay-9
Copy link
Collaborator

@tanmay-9 tanmay-9 commented Jul 9, 2025

No description provided.

hannahbast pushed a commit that referenced this pull request Jul 9, 2025
…ueries` (#126)

1. Support input file (with SPARQL queries and a description for each) in both TSV and YML; refactor the parsing code accordingly
2. Add option to generate a YML result file suitable for processing with our evaluation web app; see #171 
3. Add unit tests
4. Rename from `example-queries` to `benchmark-queries` because that is really what this command is doing; the old functionality, which was a special case, is still there using the `--example-queries` option

Co-authored-by: Hannah Bast <[email protected]>
Copy link
Collaborator

@hannahbast hannahbast left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you please resolve the conflict?

Copy link
Collaborator

@ullingerc ullingerc left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks a lot for adding the penalty option, however there is still a problem with the geometric mean. Can you please fix it?

bw_1_to_5 += 1
runtimes.append(runtime)
total_time += runtime
total_log_time += max(math.log(runtime), 0.001)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
total_log_time += max(math.log(runtime), 0.001)
total_log_time += math.log(runtime)

This line is the problem, why we see different results on the evaluation web app and it is not quite correct. While the inputs to the geometric mean are supposed to be positive (the times in this case), the result of the logarithm needn't be positive. Please change this line or just import the stdlib library statistics and apply query_data["gmeanTime"] = statistics.geometric_mean(runtimes). (The library also has mean and median)

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, you are right. I have updated the code to use the statistics module for mean, geometric_mean and median.

tanmay-9 and others added 30 commits September 22, 2025 03:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants