Understanding LLMs' Ability in Causal Discovery

This repository explores the factors that influence Large Language Models' (LLMs) understanding of causal discovery questions. We conduct experiments using various datasets and employ state-of-the-art LLMs to analyze the performance of LLMs in identifying causal relationships from textual data.

Dataset

The datasets used for our experiments are stored in the ./data folder. Please refer to the documentation within that folder for details on the structure and contents of each dataset.

Code.

interact_llm.py: Contains the implementation for LLM inference. This script is used to interact with pre-trained language models to evaluate their understanding of causal questions.
search_causal_relation.py: Manages the retrieval of queries related to causal discovery. For more details on the retrieval process, visit WIMBD.

Model Fine-tuning

We use the official OLMo implementation for fine-tuning the models on causal discovery tasks. The OLMo-7b-instruct model can be fine-tuned using the scripts and guidelines available at here. The sampled dataset from Dolma can be found here.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
data		data
LICENSE		LICENSE
README.md		README.md
interact_llm.py		interact_llm.py
search_causal_relation.py		search_causal_relation.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Understanding LLMs' Ability in Causal Discovery

Dataset

Code.

Model Fine-tuning

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Understanding LLMs' Ability in Causal Discovery

Dataset

Code.

Model Fine-tuning

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages