Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

remove fallback row add #32

Open
wants to merge 1 commit into
base: main
Choose a base branch
from
Open

Conversation

sumukshashidhar
Copy link
Collaborator

There is an unnatural row add that occurs if the JSON is unparsable. We do not need this, as it produces a bogus question row

Copy link
Member

@clefourrier clefourrier left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would tend to agree that we don't want to have bogus rows among the normal rows, but at the same time it provides a good way to store the doc/chunk which failed.
If we remove this, I believe a logging of the chunks and docs in error should occur in a dedicated file.
wdyt @alozowski ?

@@ -291,19 +291,6 @@ def _process_responses_and_build_dataset(
logger.warning(
f"No parseable JSON found (or empty list) for row_index={row_index}, chunk_id={chunk_id}, model={model_name}. Creating fallback row."
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
f"No parseable JSON found (or empty list) for row_index={row_index}, chunk_id={chunk_id}, model={model_name}. Creating fallback row."
f"No parseable JSON found (or empty list) for row_index={row_index}, chunk_id={chunk_id}, doc_id={doc_id}, and model={model_name}."

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

unfortunately, we’ll also need to cascade catch this during answer generation, judging, etc, which seems like asking for inaccuracies / errors.

+1 for logging it somewhere however

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yep I agree - just want to make sure with Alina that there isn't something we haven't thought of handled here

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not sure if this deletion might cause issues, @sumukshashidhar did you test it with this fallback_row removed?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yes I did. the only difference is that we no longer have an "unknown" "unknown" row that would cause cascading errors

@clefourrier
Copy link
Member

Approving, but please create an issue so we remember to log this

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants