This application integrates a Next.js frontend with a Flask backend to deliver a full-stack Retrieval-Augmented Generation (RAG) system powered by various Large Language Models (LLMs).
```bash
git clone https://github.com/sharukat/rag-pdf-assessment.git
cd rag-pdf-assessment
```
- Install dependencies:

```bash
npm install
```
- Create a `.env.local` file in the frontend directory with the following content:

```
NEXT_PUBLIC_API_URL=http://localhost:5328
GROQ_API_KEY=your_groq_api_key_here
```
- Start the server:
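Assuming the standard Next.js `dev` script in `package.json`:

```bash
npm run dev
```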
The Next.js frontend will be available at http://localhost:3000.
- Create and activate a virtual environment:

```bash
python -m venv venv
source venv/bin/activate
```
- Install Python dependencies:

```bash
pip install -r requirements.txt
```
- Create a `.env` file in the backend directory with the following content:

```
COHERE_API_KEY=your_cohere_api_key_here
NOMIC_API_KEY=your_nomic_api_key_here
```
- Start the Flask server:

```bash
cd api
python3 index.py
```

or

```bash
cd api
flask run --port=5328
```
The Flask backend will be available at http://localhost:5328.
This application leverages two large language models (LLMs) through Groq:

- `llama-3.3-70b-versatile`: Used for Hypothetical Document Embedding (HyDE).
- `deepseek-r1-distill-llama-70b`: Used for final answer generation when contextual information is provided.
- Nomic Embedding: A powerful embedding model that captures semantic relationships between text chunks, enabling accurate document retrieval.
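As a rough illustration of how the two Groq models above might be invoked (a sketch using the official Groq Python SDK; the prompt strings are placeholders, not code from this repo):

```python
from groq import Groq

client = Groq()  # reads GROQ_API_KEY from the environment

# Step 1 (HyDE): expand the user question into a hypothetical passage.
hyde = client.chat.completions.create(
    model="llama-3.3-70b-versatile",
    messages=[{"role": "user",
               "content": "Write a short passage that would answer: <question>"}],
)

# Step 2: generate the final answer from the retrieved context.
answer = client.chat.completions.create(
    model="deepseek-r1-distill-llama-70b",
    messages=[{"role": "user",
               "content": "Using this context: <chunks>\n\nAnswer: <question>"}],
)
print(answer.choices[0].message.content)
```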
Documents are processed using a semantic chunking strategy that:

- Leverages Nomic embeddings to determine breakpoints
- Automatically adjusts chunk sizes based on the semantics of the text
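A minimal sketch of this kind of chunking, assuming LangChain's experimental `SemanticChunker` paired with Nomic embeddings (the repo's actual implementation may differ):

```python
from langchain_experimental.text_splitter import SemanticChunker
from langchain_nomic import NomicEmbeddings  # needs NOMIC_API_KEY

# Breakpoints fall where the embedding similarity between adjacent
# sentences drops below a percentile threshold, so chunk boundaries
# follow the semantics of the text instead of a fixed size.
embeddings = NomicEmbeddings(model="nomic-embed-text-v1.5")
chunker = SemanticChunker(embeddings, breakpoint_threshold_type="percentile")

with open("document.txt") as f:
    chunks = chunker.split_text(f.read())
```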
This system implements HyDE to improve retrieval relevance:
- The user query is expanded into a hypothetical document that might answer it
- The embedding of this hypothetical document is then used to search for relevant document chunks
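Conceptually, the HyDE step looks like the sketch below; `client` and `vector_store` are illustrative stand-ins rather than names from this codebase:

```python
def hyde_search(client, vector_store, question: str, k: int = 5):
    # 1. Expand the query into a hypothetical document that might answer it.
    hypothetical = client.chat.completions.create(
        model="llama-3.3-70b-versatile",
        messages=[{"role": "user",
                   "content": f"Write a short passage answering: {question}"}],
    ).choices[0].message.content

    # 2. Search with the hypothetical document instead of the raw query;
    #    answer-shaped text tends to land closer to relevant chunks in
    #    embedding space.
    return vector_store.similarity_search(hypothetical, k=k)
```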
This system uses a hybrid (dense + sparse) search technique:

- Leverages Nomic embeddings for dense retrieval, capturing semantic relationships.
- Uses the BM25 algorithm for sparse retrieval, matching specific terms.
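A toy sketch of hybrid scoring that blends dense cosine similarity with BM25 scores from the `rank_bm25` package; the 50/50 weighting and helper names are assumptions, not taken from this repo:

```python
import numpy as np
from rank_bm25 import BM25Okapi

def hybrid_scores(query_vec, doc_vecs, query_tokens, corpus_tokens, alpha=0.5):
    # Dense side: cosine similarity between the query and each chunk vector.
    doc_vecs = np.asarray(doc_vecs)
    dense = doc_vecs @ query_vec / (
        np.linalg.norm(doc_vecs, axis=1) * np.linalg.norm(query_vec) + 1e-9
    )

    # Sparse side: BM25 term matching over tokenized chunks.
    sparse = np.asarray(BM25Okapi(corpus_tokens).get_scores(query_tokens))

    # Min-max normalize both score sets so they are comparable, then blend.
    norm = lambda s: (s - s.min()) / (s.max() - s.min() + 1e-9)
    return alpha * norm(dense) + (1 - alpha) * norm(sparse)
```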
- Uses the `search_document` prefix for document chunks.
- Uses the `search_query` prefix for the search query.
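With the hosted Nomic API, these prefixes are applied through the `task_type` argument; a minimal sketch (the sample texts are placeholders):

```python
from nomic import embed  # needs NOMIC_API_KEY

chunks = ["First document chunk...", "Second document chunk..."]
user_query = "What does the report conclude?"

# task_type="search_document" applies the search_document prefix to chunks.
doc_vectors = embed.text(texts=chunks, model="nomic-embed-text-v1.5",
                         task_type="search_document")["embeddings"]

# task_type="search_query" applies the search_query prefix to the question,
# telling the model which side of the retrieval task the text is on.
query_vector = embed.text(texts=[user_query], model="nomic-embed-text-v1.5",
                          task_type="search_query")["embeddings"][0]
```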
- Reranking: Improves the relevance ordering of the retrieved documents so that the most important information appears first.
- Repacking: The order of the chunks can affect response generation, so this technique repacks the chunks in ascending order of relevance, placing the most relevant text closest to the query in the prompt.
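Given the `COHERE_API_KEY` in the backend `.env`, one plausible way to implement both steps is Cohere's rerank endpoint; the model name and function shape here are assumptions:

```python
import os
import cohere

co = cohere.Client(os.environ["COHERE_API_KEY"])

def rerank_and_repack(query: str, chunks: list[str], top_n: int = 5) -> list[str]:
    # Reranking: score every chunk against the query with a cross-encoder.
    response = co.rerank(model="rerank-english-v3.0", query=query,
                         documents=chunks, top_n=top_n)

    # Repacking: sort the kept chunks by ascending relevance so the most
    # relevant text sits closest to the question in the final prompt.
    ordered = sorted(response.results, key=lambda r: r.relevance_score)
    return [chunks[r.index] for r in ordered]
```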
The backend exposes the following endpoints:

- `POST /api/upload`: Uploads new documents, performs semantic chunking, and creates a vector database.
- `POST /api/getdocuments`: Retrieves relevant contextual information from the vector database.
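Once both servers are running, the endpoints can be exercised with Python's `requests`; the payload field names below (`file`, `query`) are guesses, so check the Flask routes for the exact contract:

```python
import requests

BASE = "http://localhost:5328"

# Upload a PDF; the backend chunks it and builds the vector database.
# ("file" is a guessed multipart field name.)
with open("report.pdf", "rb") as f:
    print(requests.post(f"{BASE}/api/upload", files={"file": f}).json())

# Retrieve contextual chunks for a question.
# ("query" is a guessed JSON key.)
resp = requests.post(f"{BASE}/api/getdocuments",
                     json={"query": "What are the key findings?"})
print(resp.json())
```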