Name	Name	Last commit message	Last commit date
parent directory ..
img	img
.env.example	.env.example
chatbot.py	chatbot.py
config.py	config.py
document_processor.py	document_processor.py
main.py	main.py
readme.md	readme.md
requirements.txt	requirements.txt
vector_store.py	vector_store.py

RAG Chatbot with Qdrant, Gemma3, and Docling

This chatbot answers questions about your documents using Retrieval-Augmented Generation (RAG). Simply upload a PDF and start chatting! Inspired by Sabrina Aquino, this version offers a more structured approach with runnable scripts.

Features

Load and process PDF documents
Smart document search and retrieval
Interactive chat interface
Powered by Google Gemini AI
Easy setup and use

Architecture

graph TD
    %% Input
    PDF[PDF Document] 
    USER[User Question]
    
    %% Core Process
    PROCESS[Document Processing<br/>Docling + BAAI/bge-small-en-v1.5]
    STORE[Qdrant Vector DB<br/>Store Embeddings]
    SEARCH[Vector Search<br/>Find Similar Content]
    GENERATE[Gemini AI<br/>gemma-3-27b-it]
    
    %% Output
    ANSWER[AI Answer]
    
    %% Simple Flow
    PDF --> PROCESS
    PROCESS --> STORE
    USER --> SEARCH
    STORE --> SEARCH
    SEARCH --> GENERATE
    GENERATE --> ANSWER
    
    %% Styling
    classDef input fill:#e3f2fd,stroke:#1976d2,stroke-width:3px
    classDef process fill:#e8f5e8,stroke:#388e3c,stroke-width:2px
    classDef output fill:#fff3e0,stroke:#f57c00,stroke-width:3px

    class PDF,USER input
    class PROCESS,STORE,SEARCH,GENERATE process
    class ANSWER output

    %% Force black text color
    style PDF fill:#e3f2fd,stroke:#1976d2,color:#000
    style USER fill:#e3f2fd,stroke:#1976d2,color:#000
    style PROCESS fill:#e8f5e8,stroke:#388e3c,color:#000
    style STORE fill:#e8f5e8,stroke:#388e3c,color:#000
    style SEARCH fill:#e8f5e8,stroke:#388e3c,color:#000
    style GENERATE fill:#e8f5e8,stroke:#388e3c,color:#000
    style ANSWER fill:#fff3e0,stroke:#f57c00,color:#000

Project Structure

rag-chatbot/
├── main.py              # Main script to run
├── chatbot.py           # RAG chatbot logic
├── config.py            # Configuration settings
├── document_processor.py # PDF processing
├── vector_store.py      # Document storage
├── requirements.txt     # Dependencies
├── setup.py            # Setup script
├── .env                # Your API key (create this)
└── data/               # Your documents go here

Quick Start

1. Setup Environment

# Clone or download this project
# Navigate to the project folder

# Create virtual environment
python -m venv venv

# Activate it
# Windows:
venv\Scripts\activate
# Mac/Linux:
source venv/bin/activate

# Install dependencies
pip install -r requirements.txt

2. Get API Key

Go to Google AI Studio
Create a new API key
Create a .env file in the project folder:

You can use .env.example as a template

GEMINI_API_KEY=your_api_key_here

3. Add Your Document

# Create data folder
mkdir data

# Put your PDF in the data folder
# Example: data/my_document.pdf (e.g., your resume for quick testing)

4. Run the Chatbot

# Interactive chat mode
python main.py --document data/my_document.pdf

# Or use the default document path
python main.py

Usage Examples

Interactive Mode

python main.py --document data/rust_book.pdf

Then type your questions:

👤 You: What is ownership in Rust?
🤖 Bot: Ownership is a key concept in Rust...

Single Question Mode

python main.py --document data/rust_book.pdf --query "What is a variable?"

Check Status

# In interactive mode, type:
status

Commands

status - Show chatbot info
help - Show available commands
quit, exit, or q - Exit the chatbot

What to Expect After Running

Running Locally

Keep in mind that running the scripts locally can take some time. The duration depends on factors like file size, connection speed, and your computer's performance.

If you're still up for it, here's what you can expect:

For example, if you upload your resume and ask a simple question, you might see something like this:

Running on Colab

Here's an example of the output you might see when running on Colab:

If you use your resume (like mine, for example):

If you're using a different PDF, you can run a command like this:

!python main.py --document /content/rust_book.pdf

Note: This might take a while, especially since the book we are using Rust book is 670 pages long!

Running the Notebook on Colab (Recommended)

For a complete walkthrough and example outputs, check out this tutorial by Sabrina Aquino.

Troubleshooting

Common Issues

"GEMINI_API_KEY not found"

Make sure you created the .env file
Check that your API key is correct

"Document not found"

Make sure your PDF is in the data/ folder
Check the file path is correct

"Module not found"

Make sure you activated your virtual environment
Run pip install -r requirements.txt again

"Model not available"

The Gemini model might not be available in your region
Check Google AI Studio for available models

Need Help?

Make sure Python 3.8+ is installed
Check that all dependencies are installed
Verify your API key is valid
Ensure your document is a readable PDF

Configuration

You can modify settings in config.py:

MAX_TOKENS: Chunk size for document processing
RETRIEVAL_LIMIT: Number of relevant chunks to retrieve
GEMINI_MODEL: AI model to use

License

This project is open source. Feel free to modify and use it!

Happy chatting!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

readme.md

RAG Chatbot with Qdrant, Gemma3, and Docling

Features

Architecture

Project Structure

Quick Start

1. Setup Environment

2. Get API Key

3. Add Your Document

4. Run the Chatbot

Usage Examples

Interactive Mode

Single Question Mode

Check Status

Commands

What to Expect After Running

Running Locally

Running on Colab

Running the Notebook on Colab (Recommended)

Troubleshooting

Common Issues

Need Help?

Configuration

License

FilesExpand file tree

Docling-Qdrant-RAG

Directory actions

More options

Directory actions

More options

Latest commit

History

Docling-Qdrant-RAG

Folders and files

parent directory

readme.md

RAG Chatbot with Qdrant, Gemma3, and Docling

Features

Architecture

Project Structure

Quick Start

1. Setup Environment

2. Get API Key

3. Add Your Document

4. Run the Chatbot

Usage Examples

Interactive Mode

Single Question Mode

Check Status

Commands

What to Expect After Running

Running Locally

Running on Colab

Running the Notebook on Colab (Recommended)

Troubleshooting

Common Issues

Need Help?

Configuration

License