New Web Evaluation Application by tanmay-9 · Pull Request #171 · qlever-dev/qlever-control

tanmay-9 · 2025-07-09T16:19:21Z

No description provided.

…rror handling

… branch

… hiding

…ueries` (#126) 1. Support input file (with SPARQL queries and a description for each) in both TSV and YML; refactor the parsing code accordingly 2. Add option to generate a YML result file suitable for processing with our evaluation web app; see #171 3. Add unit tests 4. Rename from `example-queries` to `benchmark-queries` because that is really what this command is doing; the old functionality, which was a special case, is still there using the `--example-queries` option Co-authored-by: Hannah Bast <[email protected]>

hannahbast

Can you please resolve the conflict?

ullingerc

Thanks a lot for adding the penalty option, however there is still a problem with the geometric mean. Can you please fix it?

ullingerc · 2025-08-08T09:40:00Z

src/qlever/commands/serve_evaluation_app.py

+                bw_1_to_5 += 1
+        runtimes.append(runtime)
+        total_time += runtime
+        total_log_time += max(math.log(runtime), 0.001)


Suggested change

total_log_time += max(math.log(runtime), 0.001)

total_log_time += math.log(runtime)

This line is the problem, why we see different results on the evaluation web app and it is not quite correct. While the inputs to the geometric mean are supposed to be positive (the times in this case), the result of the logarithm needn't be positive. Please change this line or just import the stdlib library statistics and apply query_data["gmeanTime"] = statistics.geometric_mean(runtimes). (The library also has mean and median)

Yes, you are right. I have updated the code to use the statistics module for mean, geometric_mean and median.

…to output yaml file in `benchmark-queries`

…and add long query to tooltip (but not copied text!)

…ge and fix UI issues

- Add --title and --description arguments to override title and description in --queries_yml - Make output yml structure the same as input yml structure for consistency

- Add --title-overview-page argument - Keep old query and sparql keys intact for query objects to not break the web app

…o new-eval-app

…rees page

…rees buttons to icon-only buttons

…trol into new-eval-app

…es and web-app

…es as dict

…ML file.

…t in all the tables

tanmay-9 added 10 commits July 4, 2025 13:34

example-queries from add-evaluation-web-app branch

2292c8f

Half-done main page with ag-grid and serve-evaluation-app command

2f929c9

Pull updated benchmark-queries code from add-evaluation-web-app pr

3379a88

First completed version of new eval web app

42762b6

Second completed version with comapre-exec-trees page and site-wide e…

e07a81a

…rror handling

First fully-completed version with all the functionalities

02d02ea

Added CompareExecTreesBtn to details page and moved some things around

8f81d5b

Add updates and tests from add-evaluation-branch

3bd7ec0

Add new pr changes to benchmark-queries from add-evaluation-web-app…

0a5ef5b

… branch

Maintain column order in tables when data changes and suppress column…

803e6d9

… hiding

hannahbast mentioned this pull request Jul 9, 2025

Extend and refactor example-queries command, rename to benchmark-queries #126

Merged

hannahbast reviewed Jul 10, 2025

View reviewed changes

tanmay-9 added 3 commits July 11, 2025 00:08

Merge remote-tracking branch 'origin/main' into new-eval-app

01b1ed9

Added penalized failed queries for aggregate metrics in web app

de569cf

Have some metrics shown when all queries fail

cf14ca0

ullingerc requested changes Aug 8, 2025

View reviewed changes

tanmay-9 and others added 13 commits August 8, 2025 12:18

Use stdlib statistics for mean, median and geometric mean

037a7c5

Split query text into short and long query and add index description …

702a47f

…to output yaml file in `benchmark-queries`

Serve additional_data (penalty and per kb index description) to web app

58bf62a

Add index description info pill to main page

abe3e9e

Add sparql formatter for sparql queries

71f94c2

Move page headers to nav-bar with resources

4ec2587

Split query into short and long query in details and comparison grid …

389c32b

…and add long query to tooltip (but not copied text!)

Add show metrics and order columns by metric options to comparison pa…

28f3958

…ge and fix UI issues

Some leftover changes related to order columns and index description

3140ddb

Merge branch 'ad-freiburg:main' into new-eval-app

9b46cbf

Merge branch 'ad-freiburg:main' into new-eval-app

2c98148

Change benchmark-queries command for the new --queries-yml format

e7557a5

- Add --title and --description arguments to override title and description in --queries_yml - Make output yml structure the same as input yml structure for consistency

Update serve-evaluation-app command for the new output yml format

19cd871

- Add --title-overview-page argument - Keep old query and sparql keys intact for query objects to not break the web app

tanmay-9 and others added 30 commits September 22, 2025 03:00

Merge remote-tracking branch 'origin/main' into new-eval-app

55ecde4

Merge remote-tracking branch 'origin/main' into new-eval-app

74e8309

Make the eval-app more responsive and look good on smartphones

4e186f4

SPARQL Engine -> RDF Graph Database and Engine -> System

460cf0c

Merge branch 'new-eval-app' of github.com:tanmay-9/qlever-control int…

2b1ab9d

…o new-eval-app

Merge remote-tracking branch 'origin/main' into new-eval-app

e976f56

More conflicts resolved

3e4f0b4

Rename "Compare Results" to "Detailed Results per Query"

922a77d

Fix column order on overview page and improve styling of compareExecT…

2d3e4ff

…rees page

Fix compareExecTrees zoom buttons not working because of icon change

8ef6314

Update styling of overview page table cards and change compare exec t…

70ddd06

…rees buttons to icon-only buttons

Further UI enhancements

3e9ef0d

Improve theme toggling code

4523257

Merge branch 'qlever-dev:main' into new-eval-app

7b5e402

Merge branch 'new-eval-app' of https://github.com/tanmay-9/qlever-con…

7db6872

…trol into new-eval-app

Merge branch 'qlever-dev:main' into new-eval-app

94cb777

Fix overview page kg-header padding on smaller screens

7ecee39

Remove old benchmark title and use benchmark name for benchmark-queri…

c067636

…es and web-app

Update the Details page header to have the full benchmark name

36f101a

Modify index-stats command to return the computed index times and siz…

653e1d0

…es as dict

Compute index stats inside benchmark-queries command and output to YA…

f02b74e

…ML file.

Send index stats to the web app

4e80547

Add index stats to main screen tables and have a space before the uni…

497ed5d

…t in all the tables

Dummy dropdown to show/hide columns of main screen tables

b0d423b

Merge branch 'qlever-dev:main' into new-eval-app

c08dd20

Add functionality to show/hide metrics on main screen tables

82150e6

Make qlever index-stats more extensible for other engines

2f9ae66

Add index stats to comparison page

4b69ef5

Merge remote-tracking branch 'origin/main' into new-eval-app

9973248

Merge branch 'qlever-dev:main' into new-eval-app

6b7ef04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New Web Evaluation Application#171

New Web Evaluation Application#171
tanmay-9 wants to merge 71 commits intoqlever-dev:mainfrom
tanmay-9:new-eval-app

tanmay-9 commented Jul 9, 2025

Uh oh!

hannahbast left a comment

Uh oh!

ullingerc left a comment

Uh oh!

ullingerc Aug 8, 2025

Uh oh!

tanmay-9 Aug 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	total_log_time += max(math.log(runtime), 0.001)
	total_log_time += math.log(runtime)

Conversation

tanmay-9 commented Jul 9, 2025

Uh oh!

hannahbast left a comment

Choose a reason for hiding this comment

Uh oh!

ullingerc left a comment

Choose a reason for hiding this comment

Uh oh!

ullingerc Aug 8, 2025

Choose a reason for hiding this comment

Uh oh!

tanmay-9 Aug 8, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants