news-interview-question-generation

Repository for: NewsInterview: a Dataset and a Playground to Evaluate LLMs' Ground Gap via Informational Interviews

To run the human game simulation, navigate to game_sim and run:

python conduct_interviews_advanced.py \
    --model_name "gpt-4o" \
    --batch_size 5 \
    --dataset_path "output_results/game_sim/outlines/final_df_with_outlines.csv" \
    --output_dir "test" --human_eval

If you enjoyed this work, please cite:

  title={NewsInterview: a Dataset and a Playground to Evaluate LLMs’ Grounding Gap via Informational Interviews},
  author={Lu, Michael and Kalyan, Sriya and Cho, Hyundong and Shi, Weiyan and May, Jonathan and Spangher, Alexander},
  journal={arXiv preprint arXiv:2411.13779},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 145 Commits
chain_of_thought		chain_of_thought
data_processing		data_processing
evaluators		evaluators
game_sim		game_sim
latex/figures		latex/figures
notebooks		notebooks
output_results/game_sim		output_results/game_sim
sbatch-scripts		sbatch-scripts
variations		variations
.gitignore		.gitignore
LLM_question_generation.py		LLM_question_generation.py
README.md		README.md
env_setup.sh		env_setup.sh
helper_functions.py		helper_functions.py
prompts.py		prompts.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

news-interview-question-generation

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

news-interview-question-generation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages