AI Document Chatbot (RAG)

This project demonstrates a Retrieval Augmented Generation (RAG) system that answers questions about company reports using vector search.

Architecture

Documents (PDF)

↓

Chunking

↓

Embeddings (Sentence Transformers)

↓

FAISS Vector Search

↓

Relevant Text Chunks

↓

Displayed in Streamlit Web App

Project Structure

rag-chatbot

│

├── data

├── vectorstore

├── ingest.py

├── vectorstore.py

├── app.py

└── requirements.txt

How it works

ingest.py
- Loads PDFs
- Splits text into chunks
vectorstore.py
- Converts chunks into embeddings
- Creates FAISS vector index
app.py
- Streamlit interface
- User asks question
- System retrieves relevant text chunks

Run the project

Install dependencies

pip install -r requirements.txt

Create vector database

python ingest.py python vectorstore.py

Run the app

streamlit run app.py

Example Questions

What risks are mentioned in the report?
What sustainability initiatives are described?
What are the main financial highlights?

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
__pycache__		__pycache__
data		data
.gitignore		.gitignore
README.md		README.md
app.py		app.py
ingest.py		ingest.py
requirements.txt		requirements.txt
streamlit_rag_app.mp4		streamlit_rag_app.mp4
vectorstore.py		vectorstore.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Document Chatbot (RAG)

Architecture

Project Structure

How it works

Run the project

Example Questions

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AI Document Chatbot (RAG)

Architecture

Project Structure

How it works

Run the project

Example Questions

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages