This repository demonstrates a hybrid RAG (Retrieval-Augmented Generation) setup that pairs local Ollama embeddings with Azure OpenAI for chat.
The application scans files and folders, builds a FAISS index of document chunks from their vector embeddings, and enables fast semantic search and chat over large local document collections. The index uses approximate nearest-neighbor (ANN) search for efficient similarity lookups. The typical workflow is:
- Index: load documents, split them into chunks, compute vector embeddings, and store them in a FAISS index.
- Retrieve: given a user question, retrieve the most relevant chunks using ANN similarity search.
- Generate: pass retrieved context to a chat model to produce a concise answer, optionally returning source file paths.
This project uses nomic-embed-text (running locally via Ollama) to compute the embeddings and an Azure OpenAI chat deployment to generate the responses.
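A minimal query-time sketch of the retrieve-and-generate steps, assuming a LangChain-style setup (langchain-community and langchain-openai packages), an index already saved under a local folder, and Azure credentials exported as environment variables (AZURE_OPENAI_ENDPOINT / AZURE_OPENAI_API_KEY). The index path and deployment name below are illustrative, not the repository's actual configuration:

```python
# Sketch: retrieve relevant chunks from a saved FAISS index and answer with Azure OpenAI.
# Assumes the index was built with the same embedding model (nomic-embed-text via Ollama)
# and that the Azure OpenAI endpoint/key are set in the environment.
from langchain_community.embeddings import OllamaEmbeddings
from langchain_community.vectorstores import FAISS
from langchain_openai import AzureChatOpenAI

embeddings = OllamaEmbeddings(model="nomic-embed-text")  # local Ollama must be running
vectorstore = FAISS.load_local(
    "faiss_index",                         # illustrative path, not necessarily the repo's
    embeddings,
    allow_dangerous_deserialization=True,  # required by recent LangChain when loading a local index
)

llm = AzureChatOpenAI(
    azure_deployment="gpt-4o-mini",   # hypothetical deployment name
    api_version="2024-02-15-preview",
)

question = "How is authentication configured?"
docs = vectorstore.similarity_search(question, k=4)  # vector similarity search over the index
context = "\n\n".join(d.page_content for d in docs)

answer = llm.invoke(
    f"Answer concisely using only this context:\n{context}\n\nQuestion: {question}"
)
print(answer.content)
print("Sources:", sorted({d.metadata.get("source", "?") for d in docs}))
```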
- Build a FAISS index from a folder of documents for fast similarity search (ANN).
- Chat with files to quickly extract information from large folders.
- Return source file paths for traceability.
- Language: Python
- Vector store: FAISS (local)
- Embeddings: nomic-embed-text (local via Ollama)
- Chat / response generation: Azure OpenAI (chat deployment)
- Orchestration / helper libraries: LangChain and related extensions
- API: FastAPI (lightweight server example)
- UI: Streamlit (simple demo)
- Copy .env.example to .env and fill in your Azure credentials. Do NOT commit .env.
- Install dependencies: pip install -r requirements.txt
- Ensure Ollama is running locally (with the nomic-embed-text model pulled, e.g. ollama pull nomic-embed-text) if you use local embeddings.
- Build the index: python indexing/build_index.py (an illustrative sketch of this step follows the list)
- Run the API: uvicorn core.api_server:app --reload (a minimal endpoint sketch also follows the list)
- Run the Streamlit demo: streamlit run app/streamlit_app.py
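The indexing step referenced above could look roughly like the sketch below. The folder path, glob pattern, chunk sizes, and output directory are placeholders, not the actual values used by indexing/build_index.py:

```python
# Sketch: load documents, split into chunks, embed with nomic-embed-text, store in FAISS.
from langchain_community.document_loaders import DirectoryLoader, TextLoader
from langchain_community.embeddings import OllamaEmbeddings
from langchain_community.vectorstores import FAISS
from langchain_text_splitters import RecursiveCharacterTextSplitter

# Load every matching file under ./docs (glob and loader class are illustrative choices).
loader = DirectoryLoader("docs", glob="**/*.txt", loader_cls=TextLoader, show_progress=True)
documents = loader.load()

# Split long documents into overlapping chunks so each embedding stays focused.
splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=150)
chunks = splitter.split_documents(documents)

# Embed locally via Ollama and persist the FAISS index to disk.
embeddings = OllamaEmbeddings(model="nomic-embed-text")
vectorstore = FAISS.from_documents(chunks, embeddings)
vectorstore.save_local("faiss_index")
print(f"Indexed {len(chunks)} chunks from {len(documents)} documents.")
```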
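Similarly, here is a minimal sketch of how the API server could expose the pipeline. The route name, request model, deployment name, and resource loading are assumptions and may differ from core/api_server.py:

```python
# Sketch: a single /ask endpoint that retrieves context and asks the chat model.
from fastapi import FastAPI
from pydantic import BaseModel
from langchain_community.embeddings import OllamaEmbeddings
from langchain_community.vectorstores import FAISS
from langchain_openai import AzureChatOpenAI

app = FastAPI(title="Local RAG demo")

# Load heavyweight resources once at startup, not per request.
embeddings = OllamaEmbeddings(model="nomic-embed-text")
vectorstore = FAISS.load_local("faiss_index", embeddings, allow_dangerous_deserialization=True)
llm = AzureChatOpenAI(azure_deployment="gpt-4o-mini", api_version="2024-02-15-preview")

class AskRequest(BaseModel):
    question: str
    k: int = 4  # number of chunks to retrieve

@app.post("/ask")
def ask(req: AskRequest):
    docs = vectorstore.similarity_search(req.question, k=req.k)
    context = "\n\n".join(d.page_content for d in docs)
    answer = llm.invoke(
        f"Answer concisely using only this context:\n{context}\n\nQuestion: {req.question}"
    )
    return {
        "answer": answer.content,
        "sources": sorted({d.metadata.get("source", "?") for d in docs}),
    }
```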
- This repo uses environment variables for secrets.
- The FAISS index and the indexed documents are excluded from version control via .gitignore.