- Node.js 20+
- Ollama installed locally
Install dependencies:

npm install
Create `.env.local` based on `.env.example`:
cp .env.example .env.local
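For reference, a `.env.local` for a local setup might look something like this; the variable names below are illustrative assumptions, and `.env.example` defines the actual keys:

```env
# Hypothetical keys – copy the real names from .env.example
OLLAMA_BASE_URL=http://localhost:11434
OLLAMA_MODEL=llama3
```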
Pull a local model for Ollama:
ollama pull llama3
Build the knowledge base. This step parses the HTML/PDF files in `docs/`, chunks them, generates embeddings, and writes `data/knowledge-base.json`:
npm run ingest
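As a rough sketch of what the ingestion step does (not the exact contents of `scripts/ingest.ts`; the chunk sizes and embedding model name below are assumptions), the core loop looks roughly like this:

```ts
// Sketch of the ingestion flow – simplified; chunking parameters and model name are assumptions.
import { pipeline } from '@xenova/transformers';
import { writeFile } from 'node:fs/promises';

type Chunk = { id: string; doc: string; text: string; embedding: number[] };

// Naive fixed-size chunker with overlap (the real script may chunk by section or page instead).
function chunkText(text: string, size = 1000, overlap = 200): string[] {
  const chunks: string[] = [];
  for (let start = 0; start < text.length; start += size - overlap) {
    chunks.push(text.slice(start, start + size));
  }
  return chunks;
}

async function ingest(docs: { name: string; text: string }[]) {
  // Local embedding pipeline; 'Xenova/all-MiniLM-L6-v2' is an assumed model choice.
  const embed = await pipeline('feature-extraction', 'Xenova/all-MiniLM-L6-v2');

  const knowledgeBase: Chunk[] = [];
  for (const doc of docs) {
    for (const [i, text] of chunkText(doc.text).entries()) {
      const output = await embed(text, { pooling: 'mean', normalize: true });
      knowledgeBase.push({
        id: `${doc.name}#${i}`,
        doc: doc.name,
        text,
        embedding: Array.from(output.data as Float32Array),
      });
    }
  }
  await writeFile('data/knowledge-base.json', JSON.stringify(knowledgeBase));
}
```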
Start the dev server:

npm run dev
Then open http://localhost:3000 and ask questions about the documents.
- Ingestion: `scripts/ingest.ts` parses the HTML/PDFs, normalizes the text, chunks it, and generates embeddings with `@xenova/transformers`. The output is stored locally as a JSON knowledge base.
- Retrieval: `src/lib/retrieval.ts` loads the knowledge base, embeds the user query, computes cosine similarity, and returns the top-K chunks with source metadata (see the sketch after this list).
- LLM: `src/lib/ollama.ts` calls a local Ollama model with the retrieved context.
- API: `/api/chat` wires retrieval and Ollama together and returns `{ answer, sources }`.
- UI: A simple chat UI renders messages and source citations.
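For illustration, here is a minimal sketch of the retrieval-plus-generation path. It is not the exact code in `src/lib/retrieval.ts` or `src/lib/ollama.ts`; the embedding model, prompt template, and top-K default are assumptions:

```ts
// Sketch of retrieval + answer generation – simplified; names and parameters are assumptions.
import { readFile } from 'node:fs/promises';
import { pipeline } from '@xenova/transformers';

type Chunk = { id: string; doc: string; text: string; embedding: number[] };

function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0, normA = 0, normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

async function retrieve(query: string, topK = 5): Promise<Chunk[]> {
  const chunks: Chunk[] = JSON.parse(await readFile('data/knowledge-base.json', 'utf8'));
  const embed = await pipeline('feature-extraction', 'Xenova/all-MiniLM-L6-v2');
  const q = await embed(query, { pooling: 'mean', normalize: true });
  const queryEmbedding = Array.from(q.data as Float32Array);

  // Score every chunk against the query and keep the top-K.
  return chunks
    .map((chunk) => ({ chunk, score: cosineSimilarity(queryEmbedding, chunk.embedding) }))
    .sort((a, b) => b.score - a.score)
    .slice(0, topK)
    .map((s) => s.chunk);
}

async function answer(query: string) {
  const sources = await retrieve(query);
  const context = sources.map((c) => `[${c.doc}] ${c.text}`).join('\n\n');

  // Ollama's local HTTP API; the prompt wording is an assumption.
  const res = await fetch('http://localhost:11434/api/generate', {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({
      model: 'llama3',
      prompt: `Answer using only this context:\n${context}\n\nQuestion: ${query}`,
      stream: false,
    }),
  });
  const data = await res.json();
  return { answer: data.response, sources };
}
```

The `/api/chat` route presumably wraps something like `answer()` and returns its `{ answer, sources }` result as JSON.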
- The documents live in `docs/`; HTML and PDF are the only supported formats.
- Source attribution is derived from document metadata (doc name, section, page).