A lightweight, production-ready Agentic RAG (Retrieval-Augmented Generation) service that analyzes startup signals (hiring, funding, remote readiness) using semantic search + LLM reasoning.
Built with:
β’ FastAPI
β’ FAISS (vector search)
β’ Sentence Transformers
β’ Groq LLM (Llama 3.3 70B)
β’ Docker-ready architecture
β’ Tavily and Streamlit
β’ Render Deployement
- Embeds startup-related signals into a FAISS vector store
- Retrieves the most relevant context using semantic similarity
- Sends structured context to an LLM
- Returns structured JSON analysis
Example analysis output:
{
"startup_name": "ExampleAI",
"hiring_signal": true,
"remote_possible": true,
"funding_stage": "Seed",
"reasoning": "Raised seed round and actively hiring Flutter developer.",
"source_url": "https://example.com/post"
}User Query
β
Retriever (FAISS + Embeddings)
β
Context Assembly
β
Groq LLM (Structured JSON Output)
β
FastAPI Responseβ’ embedding_service.py β Generates sentence embeddings
β’ vector_store.py β FAISS index + persistence
β’ retrieval_service.py β Semantic retrieval logic
β’ llm_service.py β Groq structured JSON generation
β’ main.py β FastAPI endpoints
Python 3.10+
FastAPI
FAISS (CPU)
SentenceTransformers (all-MiniLM-L6-v2)
Groq LLM API
Docker
git clone https://github.com/bold-ronin/Lightweight-Agentic-RAG-Service.git
cd agentic-lead-ragpython -m venv .venv
source .venv/bin/activate # Windows: .venv\Scripts\activatepip install -r requirements.txtCreate .env file:
LLM_API_KEY=your_api_key_here [I used Groq you an whatever you want]uvicorn app.main:app --reloadOpen:
Use /analyze endpoint.
Dockerized FastAPI agent for RAG tasks.
Build image:
docker build -t agentic-lead-rag .Run container:
docker run -p 8000:8000 --env-file .env agentic-lead-ragVisit: http://localhost:8000/docs
Deployed on render for global access
Visit: https://agentic-lead-rag.onrender.com/docs
The GUI on Streamlit also deployed on render you can access it
Here: https://agentic-lead-rag-gui.onrender.com/
| Interface | Result |
|---|---|
![]() |
![]() |
| Search Agent | Output |
|---|---|
![]() |
![]() |
| Workflow |
|---|
![]() |
| Additional Example |
|---|
![]() |
- Structured JSON enforcement from LLM
- Async Groq integration
- Semantic search retrieval
- Source URL tracking
- FAISS index persistence
- Dockerized for portability
"Startup hiring Flutter developer remotely after seed funding"Returns structured intelligence analysis based on stored signals.
- Live internet ingestion
- Web signal scraping
- Scheduled background refresh
- Frontend dashboard
- Multi-source ingestion pipeline
- Deployment (Render / Railway)
- Usage-based monetization
This is not a chatbot.
It is a structured intelligence engine designed to extract startup signals for: β’ Freelancers β’ Recruiters β’ Founders β’ Investors
Built by Naol β AI-focused mobile + systems engineer exploring Agentic architectures and applied intelligence systems.
.png)
.png)
.png)
.png)
.png)
.png)