Chat with PDFs

A Streamlit application that lets users upload PDF documents and chat with them using large language models (LLMs).

Overview

Chat with PDFs processes PDF documents with LlamaIndex and generates responses using language models from OpenAI, Ollama, or custom OpenAI-compatible providers. The interface covers document management, chat history, and document visualization.

Features

Document Processing: Upload and process single or multiple PDF documents
Chat Interface: Natural language interaction with document content
Citation Support: Responses include citations to specific parts of the document
PDF Viewer: View documents with highlighted citations
Image Extraction: Automatically extracts and displays images from documents
Multi-Document Support: Switch between multiple uploaded documents
Multi-Provider Support: Works with OpenAI, Ollama, and custom OpenAI-compatible API providers
Model Selection: Choose from different language models with provider-specific configurations

Project Structure

The application follows a modular architecture for better maintainability and extensibility:

chat-with-docs/
├── app.py                    # Main application
├── assets/img                # Static assets (images)
├── src/
│   ├── core/                 # Business logic
│   │   ├── __init__.py
│   │   ├── document_manager.py
│   │   ├── chat_engine.py
│   │   ├── state_manager.py
│   │   └── file_processor.py
│   ├── ui/                   # UI-related components
│   │   ├── __init__.py
│   │   ├── components.py
│   │   ├── layouts.py
│   │   └── handlers.py
│   ├── utils/                # Utility functions
│   │   ├── __init__.py
│   │   ├── common.py
│   │   ├── source.py
│   │   ├── image.py
│   │   └── logger.py
│   ├── config.py             # Configuration settings
│   ├── custom_retriever.py   # Custom retrieval logic
│   └── __init__.py

Getting Started

Prerequisites

Python 3.9+
OpenAI API key or other LLM provider credentials

Installation

Clone the repository:

git clone https://github.com/yourusername/chat-with-docs.git
cd chat-with-docs

Install the required packages:

pip install -r requirements.txt

Create a .env file with your API keys and configuration:

# Required for OpenAI models
OPENAI_API_KEY=your_api_key_here

# Optional: Logging level
LOG_LEVEL=INFO  # DEBUG, INFO, WARNING, ERROR

# Optional: For Ollama integration
OLLAMA_ENDPOINT=http://localhost:11434
OLLAMA_MODELS=llama3,gemma,mistral  # Comma-separated list of models

# Optional: For custom OpenAI-compatible providers
CUSTOM_API_ENDPOINT=https://your-custom-endpoint.com/v1
CUSTOM_API_KEY=your_custom_api_key
CUSTOM_MODELS=model1,model2  # Comma-separated list of custom models
CUSTOM_SUFFIX=(Custom)  # Display suffix for UI

# Optional: Model display names
OPENAI_SUFFIX=(OpenAI)  # Display suffix for OpenAI models
OLLAMA_SUFFIX=(Ollama)  # Display suffix for Ollama models

# Optional: Default chat model. Must be in MODELS, OLLAMA_MODELS, or CUSTOM_MODELS.
# If not set or invalid, defaults to "gpt-4o-mini".
DEFAULT_MODEL=gpt-4o-mini

# Optional: Default summary model
SUMMARY_MODEL=gpt-4o-mini  # Model to use for document summarization and automatic query generation
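
The fallback rule described for DEFAULT_MODEL can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the application's actual code in src/config.py: the helper names `parse_model_list` and `resolve_default_model` are hypothetical, and the model lists are assumed to be plain comma-separated strings as in the example above.

```python
FALLBACK_MODEL = "gpt-4o-mini"  # fallback value named in the DEFAULT_MODEL comment


def parse_model_list(raw):
    """Split a comma-separated string into trimmed, non-empty model names."""
    return [name.strip() for name in (raw or "").split(",") if name.strip()]


def resolve_default_model(env):
    """Return DEFAULT_MODEL if it is in a configured list, else the fallback.

    `env` is any mapping of variable names to strings, e.g. os.environ.
    """
    available = (
        parse_model_list(env.get("MODELS"))
        + parse_model_list(env.get("OLLAMA_MODELS"))
        + parse_model_list(env.get("CUSTOM_MODELS"))
    )
    requested = (env.get("DEFAULT_MODEL") or "").strip()
    return requested if requested in available else FALLBACK_MODEL
```

Calling `resolve_default_model(os.environ)` after the .env file has been loaded would apply the same rule to the real environment. Taking a mapping instead of reading `os.environ` directly keeps the helper easy to test.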

Running the Application

Run the application with Streamlit:

streamlit run app.py

Usage

Upload one or several PDF documents using the sidebar upload button
Wait for the document(s) to be processed
Ask questions about the document in the chat input
View responses with citations and annotations to the source material
Switch between different language models using the dropdown in the sidebar
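
As a rough illustration of how the sidebar dropdown entries could combine model names with the display suffixes from the .env file, consider the sketch below. The function `build_model_labels` is hypothetical and the application's own UI code may work differently. It only shows one way to map a label such as "llama3 (Ollama)" back to the raw model name.

```python
def build_model_labels(models_by_suffix):
    """Map display labels like 'llama3 (Ollama)' to raw model names."""
    labels = {}
    for suffix, models in models_by_suffix.items():
        for model in models:
            # strip() keeps the label clean when a suffix is empty
            label = f"{model} {suffix}".strip()
            labels[label] = model
    return labels


# Example values taken from the .env sample in this README
labels = build_model_labels({
    "(OpenAI)": ["gpt-4o-mini"],
    "(Ollama)": ["llama3", "gemma", "mistral"],
})
```

In a Streamlit sidebar, the keys of `labels` could be passed to `st.sidebar.selectbox`, and the selected label looked up in the dictionary to recover the model name.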

TODO

Configurable chunk size
Chat with all the PDFs
Allow users to refer to images if the model supports vision

Authors

virtUOS
