diff --git a/README.md b/README.md
index a47b7ff4..366f4740 100644
--- a/README.md
+++ b/README.md
@@ -30,7 +30,7 @@ GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthe
 GraphGen is a framework for synthetic data generation guided by knowledge graphs. It begins by constructing a fine-grained knowledge graph from the source text,then identifies knowledge gaps in LLMs using the expected calibration error metric, prioritizing the generation of QA pairs that target high-value, long-tail knowledge.
-Furthermore, GraphGen incorporates multi-hop neighborhood sampling to capture complex relational information and employs style-controlled generation to diversify the resulting QA data.
+Furthermore, GraphGen incorporates multi-hop neighborhood sampling to capture complex relational information and employs style-controlled generation to diversify the resulting QA data. Here is our [paper](https://github.com/open-sciencelab/GraphGen/tree/main/resources/GraphGen.pdf).
 ## 🚀 Quick Start
diff --git a/resources/GraphGen.pdf b/resources/GraphGen.pdf
new file mode 100644
index 00000000..3532a3e1
Binary files /dev/null and b/resources/GraphGen.pdf differ