[Task Submission] Quantifier Understanding (`quantifier_understanding`) by lerow · Pull Request #20 · GenBench/genbench_cbt_2023

lerow · 2023-08-01T23:48:40Z

Quantifier Understanding

The task evaluates generalization in the understanding of quantifiers. It aims to measure
how well can language models capture the semantics of logical quantifiers in natural language.

Authors

Leroy Wang lryw@uw.edu
Shane Steinert-Threlkeld shanest@uw.edu

Implementation

The task re-implements the evaluation function to compute accuracy scores.

Usage

Given predictions and gold labels, evaluate_predictions() outputs the accuracy score.

Checklist

I and my co-authors agree that, if this PR is merged, the code will be available under the same license as the genbench_cbt repository.
Prior to submitting, I have ran the GenBench CBT test suite using the genbench-cli test-task tool.
I have read the description of what should be in the doc.md of my task, and have added the required arguments.
I have submitted or will submit an accompanying paper to the GenBench workshop.

kazemnejad · 2023-08-23T18:23:50Z

Thanks for submitting a task to GenBench. Please be aware that you're submitting the data files of the tasks in your PR. For the final submission, you will need to host the dataset files somewhere else (preferably as a Huggingface dataset). Additionally, in your current submission, the data_source field in the config.jsonnet does not contain the correct url. It should be probably https://raw.githubusercontent.com/lerow/genbench_cbt/quantifier_understanding/src/genbench/tasks/quantifier_understanding/test_data.jsonl
This is why the tests are failing.

vernadankers · 2023-09-01T10:31:08Z

Hello!

We are getting quite close to the deadline (September 1, 11:59PM anywhere on earth), which is why I wanted to remind you of the fact that your PR still needs some attention: please double-check the automated checks that failed, and ensure that the dataset files are hosted somewhere else; see Amir's message above.

Please don't forget to submit your accompanying paper to Openreview via https://openreview.net/group?id=GenBench.org/2023/Workshop by September 1.

Good luck finalising your PR and paper, feel free to tag us if you have questions.
Cheers, Verna
On behalf of the GenBench team

lerow · 2023-09-01T21:08:16Z

Hi,
Thanks for the reminder. I just fixed the URL - the tests should pass now.
Leroy

lerow added 4 commits August 1, 2023 13:34

added quant understand

f3fa937

added data

6b5ef0a

added url

1017944

added doc

210f810

vernadankers added the task-submission label Aug 2, 2023

lerow and others added 2 commits August 3, 2023 02:57

fixed style

16f34a4

Merge branch 'main' into quantifier_understanding

1625ece

kazemnejad added task-submission and removed task-submission labels Aug 23, 2023

Update config.jsonnet

ca2a457

kazemnejad added task-submission and removed task-submission labels Sep 4, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

[Task Submission] Quantifier Understanding (`quantifier_understanding`)#20

[Task Submission] Quantifier Understanding (`quantifier_understanding`)#20
lerow wants to merge 7 commits intoGenBench:mainfrom
lerow:quantifier_understanding

lerow commented Aug 1, 2023

Uh oh!

kazemnejad commented Aug 23, 2023

Uh oh!

vernadankers commented Sep 1, 2023 •

edited

Loading

Uh oh!

lerow commented Sep 1, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Comments

Conversation

lerow commented Aug 1, 2023

Quantifier Understanding

Authors

Implementation

Usage

Checklist

Uh oh!

kazemnejad commented Aug 23, 2023

Uh oh!

vernadankers commented Sep 1, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lerow commented Sep 1, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

vernadankers commented Sep 1, 2023 •

edited

Loading