Improving Real-World RAG Systems - Key Challenges and Practical Solutions

Detailed Article

We worked with Analytics Vidhya on a detailed write-up of the content covered in this talk. Check out the detailed article on the Analytics Vidhya Blog.

Free Course

We also worked with Analytics Vidhya to create a FREE short course based on the content covered in this talk. Check out the free short course.

Session Details

Everyone knows how to build RAG systems, but how do you improve them? Retrieval Augmented Generation (RAG) systems have quickly become one of the industry's biggest successes for driving Generative AI use cases on custom enterprise data. However, that success comes with a long list of pain points that can lead to failure or sub-optimal performance in RAG systems.

This session is inspired by the well-known paper “Seven Failure Points When Engineering a Retrieval Augmented Generation System” by Barnett et al., which discusses some of the major challenges and points of failure in RAG systems. However, the paper does not cover clear solutions to these challenges in detail.

This session aims to bridge that gap by covering the major challenges and pain points in building real-world RAG systems, which include:

Missing Content
Missed the Top Ranked Documents
Not in Context
Not Extracted
Wrong Format
Incorrect Specificity
Incomplete

Besides the challenges themselves, we will also discuss practical ways to address them using current techniques, including:

Better data cleaning and prompting
More intelligent chunking
Better retrieval strategies like Reranking and Compression
The effect of embedding models and how to fine-tune them
Output parsers for better response format adherence
Query transformations
Recent advancements in RAG systems such as GraphRAG, Agentic RAG, CRAG, and RAFT
Can long-context LLMs help?
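
To make "more intelligent chunking" from the list above concrete, here is a minimal sketch in plain Python. It is not taken from the session notebooks: the function name `chunk_text` and the parameters `max_chars` and `overlap_sentences` are hypothetical, and the regex sentence splitter is a naive stand-in for whatever splitter a real pipeline would use. The idea it shows is keeping sentences whole and repeating the last sentence or two at the start of the next chunk so context is not cut off at chunk boundaries.

```python
import re


def chunk_text(text: str, max_chars: int = 200, overlap_sentences: int = 1) -> list[str]:
    """Split text into chunks of whole sentences with a small sentence overlap.

    max_chars is a soft limit: a chunk is closed as soon as adding the next
    sentence would exceed it, so one long sentence can still go over the limit.
    """
    # Naive sentence splitter (assumption): break after '.', '!' or '?' plus whitespace.
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

    chunks: list[str] = []
    current: list[str] = []
    for sentence in sentences:
        candidate = " ".join(current + [sentence])
        if current and len(candidate) > max_chars:
            chunks.append(" ".join(current))
            # Carry the last few sentences into the next chunk for context.
            current = current[-overlap_sentences:] if overlap_sentences > 0 else []
        current.append(sentence)
    if current:
        chunks.append(" ".join(current))
    return chunks


if __name__ == "__main__":
    for chunk in chunk_text("A one. B two. C three. D four.", max_chars=15):
        print(repr(chunk))
```

With a larger `max_chars` the chunks get longer and fewer; setting `overlap_sentences=0` turns the overlap off entirely.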

The talk walks through each challenge, discusses potential solutions, and showcases some of them with hands-on code using popular frameworks like LangChain and LlamaIndex.
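
As a framework-free illustration of the reranking idea mentioned among the retrieval strategies, the sketch below runs a cheap first-stage retrieval and then reorders only the surviving candidates with a finer score. Everything here is an assumption made for illustration: the function names are hypothetical, and both scorers are toy lexical measures standing in for the embedding retriever and reranker model a real system (or the LangChain and LlamaIndex demos) would use.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    return re.findall(r"[a-z0-9]+", text.lower())


def overlap_score(query: str, doc: str) -> int:
    """Cheap first-stage score: number of distinct query terms found in the doc."""
    return len(set(tokenize(query)) & set(tokenize(doc)))


def cosine_score(query: str, doc: str) -> float:
    """Finer second-stage score: cosine similarity of term-count vectors."""
    q, d = Counter(tokenize(query)), Counter(tokenize(doc))
    dot = sum(q[t] * d[t] for t in q)
    norm = math.sqrt(sum(v * v for v in q.values())) * math.sqrt(sum(v * v for v in d.values()))
    return dot / norm if norm else 0.0


def retrieve_then_rerank(query: str, docs: list[str], first_k: int = 3, final_k: int = 2) -> list[str]:
    # Stage 1: keep the first_k candidates according to the cheap score.
    candidates = sorted(docs, key=lambda doc: overlap_score(query, doc), reverse=True)[:first_k]
    # Stage 2: reorder only those candidates with the finer (costlier) score.
    return sorted(candidates, key=lambda doc: cosine_score(query, doc), reverse=True)[:final_k]


if __name__ == "__main__":
    corpus = [
        "Rerank retrieved documents with a second, slower model after a fast first pass over the whole index.",
        "Rerank retrieved documents.",
        "Chunking splits documents.",
        "Embedding models map text to vectors.",
    ]
    for doc in retrieve_then_rerank("rerank retrieved documents", corpus):
        print(doc)
```

The design point is that the expensive scorer only ever sees `first_k` documents, so `first_k` trades recall against reranking cost.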

Key Takeaways:

Learn about the common challenges and pain points when building real-world RAG Systems
Understand practical solutions for tackling each pain point that can lead to failure in RAG systems
Learn concepts and hands-on implementations of solutions, including data processing, chunking, reranking, embedding models, parsers, query transformers, and more
Discuss some of the latest advancements in Generative AI and RAG systems like Agentic RAG, CRAG, RAFT, and long-context LLMs
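
For the parsers mentioned in the takeaways, which target the Wrong Format pain point, the following is a rough sketch of what an output parser does, written with only the Python standard library. The `Answer` schema, the `OutputParseError` class, and `parse_answer` are hypothetical names chosen for this example and do not come from the session notebooks or from any framework; a real project would more likely use the parser classes its framework provides.

```python
import json
import re
from dataclasses import dataclass


@dataclass
class Answer:
    """Hypothetical target schema for the model's response."""
    answer: str
    sources: list[str]


class OutputParseError(ValueError):
    """Raised when the model output does not match the expected format."""


def parse_answer(raw: str) -> Answer:
    # Models often wrap JSON in prose or code fences, so take the outermost {...} span.
    match = re.search(r"\{.*\}", raw, flags=re.DOTALL)
    if match is None:
        raise OutputParseError("no JSON object found in model output")
    try:
        data = json.loads(match.group(0))
    except json.JSONDecodeError as exc:
        raise OutputParseError(f"invalid JSON: {exc}") from exc

    # Validate the fields instead of trusting the model to follow the schema.
    if not isinstance(data.get("answer"), str):
        raise OutputParseError("missing or non-string 'answer' field")
    sources = data.get("sources", [])
    if not isinstance(sources, list) or not all(isinstance(s, str) for s in sources):
        raise OutputParseError("'sources' must be a list of strings")
    return Answer(answer=data["answer"], sources=sources)


if __name__ == "__main__":
    raw_output = 'Here you go: {"answer": "Paris", "sources": ["doc1"]} Hope that helps.'
    print(parse_answer(raw_output))
```

One common way to use such a parser is to catch `OutputParseError` and re-prompt the model with the error message, so the format failure is handled in code rather than passed on to the user.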

Repository Files

.gitignore
Demo_1_Solutions_for_Missing_Content_in_RAG.ipynb
Demo_2_Solutions_for_Missed_Top_Ranked,_Not_in_Context,_Not_Extracted_&_Incorrect_Specificity.ipynb
Demo_3_Solutions_for_Wrong_Format.ipynb
Improving Real-World RAG Systems Key Challenges & Practical Solutions - DJ Presentation - PDF.pdf
Improving Real-World RAG Systems Key Challenges and Practical Solutions - DJ Presentation - PDF.pdf
LICENSE
README.md
rag_course.gif

dipanjanS/improving-RAG-systems-dhs2024

Folders and files

Latest commit

History

Repository files navigation

Improving Real-World RAG Systems - Key Challenges and Practical Solutions

Detailed Article

Free Course

Session Details

Key Takeaways:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages