Multi-Task Learning of Query Generation and Classification for Generative Conversational Question Rewriting

This repository contains the code and datasets for the EMNLP 2023 paper "Multi-Task Learning of Query Generation and Classification for Generative Conversational Question Rewriting". The paper proposes a novel multi-task learning approach to rewrite ambiguous conversational questions into well-defined queries while also identifying topic continuity. The models, based on BART and T5 architectures, demonstrate significant performance improvements over single-task baselines.
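As a rough illustration of the rewriting task only (not the trained models, prompts, or settings from the paper), the sketch below loads an off-the-shelf T5 checkpoint with Hugging Face Transformers and generates a rewrite for a context-dependent question. The checkpoint name, input format, and generation settings are illustrative assumptions.

# Minimal sketch of conversational question rewriting with a seq2seq model.
# The checkpoint, prompt format, and decoding settings are assumptions for
# illustration; they are not the fine-tuned models released with the paper.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

model_name = "t5-base"  # placeholder checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

history = "Who wrote Pride and Prejudice? ||| Jane Austen"
question = "When was she born?"
# Hypothetical input format: conversation history followed by the current question.
inputs = tokenizer(f"rewrite: {history} ||| {question}", return_tensors="pt")

outputs = model.generate(**inputs, max_new_tokens=64, num_beams=4)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))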

Install dependencies:

git clone https://github.com/terrierteam/mtl_gen_class.git
cd mtl_gen_class
pip install -r requirements.txt

Data Preparation

By default, we expect raw and processed data to be stored in ./data/:

OR-QuAC

OR-QuAC files download

Download the necessary OR-QuAC files and store them in ./data/or-quac:

mkdir -p data/or-quac
cd data/or-quac
wget https://ciir.cs.umass.edu/downloads/ORConvQA/all_blocks.txt.gz
wget https://ciir.cs.umass.edu/downloads/ORConvQA/qrels.txt.gz
gzip -d *.txt.gz
mkdir preprocessed
cd preprocessed
wget https://ciir.cs.umass.edu/downloads/ORConvQA/preprocessed/train.txt
wget https://ciir.cs.umass.edu/downloads/ORConvQA/preprocessed/test.txt
wget https://ciir.cs.umass.edu/downloads/ORConvQA/preprocessed/dev.txt
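As a quick sanity check of the download, the sketch below assumes the preprocessed splits are JSON-lines files (one example per line); this is an assumption about the file format, so adjust the parsing if it differs.

import json

# Assumes each line of the preprocessed split is a standalone JSON object.
with open("data/or-quac/preprocessed/train.txt") as f:
    examples = [json.loads(line) for line in f]

print(len(examples), "training examples")
print("fields in the first example:", sorted(examples[0].keys()))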

Baselines

To compare our results with the baseline, you can use the models and datasets provided in the LIF GitHub repository.

Downloading the Baseline

  1. Navigate to the LIF GitHub repository.
  2. Follow their installation and setup instructions to download the models and datasets.
  3. Optionally, you can directly clone their repository using the following command:
git clone https://github.com/nusnlp/LIF.git

Using the Baseline for Comparison

Once you've downloaded the LIF baseline, follow its instructions to run the model and obtain results.

You can then compare these results directly against our multi-task learning models.
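A minimal comparison sketch is shown below. It assumes both systems emit one JSON object per question with an "id" and a binary follow-up label; the file names and field names are hypothetical, not the actual output format or evaluation script of either repository.

import json
from sklearn.metrics import classification_report

# Hypothetical output layout: one JSON object per line with an "id" and a
# binary "follows_up" label; adapt the field names to the real formats.
def load_labels(path):
    with open(path) as f:
        return {record["id"]: record["follows_up"] for record in map(json.loads, f)}

gold = load_labels("outputs/gold_labels.jsonl")
mtl = load_labels("outputs/mtl_predictions.jsonl")
lif = load_labels("outputs/lif_predictions.jsonl")

ids = sorted(gold)
print("MTL model:")
print(classification_report([gold[i] for i in ids], [mtl[i] for i in ids], digits=3))
print("LIF baseline:")
print(classification_report([gold[i] for i in ids], [lif[i] for i in ids], digits=3))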
