π Finalist at ConTech 2025! An AI-driven, RAG-based conversational assistant designed to optimize document processing, enhance legal compliance, and streamline construction management.
β¨ Features & Capabilities
πΉ Conversational AI π€ β’ AI-powered chatbot designed for intelligent document retrieval and question answering. β’ Uses RAG (Retrieval-Augmented Generation) to provide precise, context-aware responses. β’ Query Optimization for improved accuracy and Context Enhancement for better results.
π AI-Powered Document Processing β’ Extracts text, images, and numerical data from PDFs, legal documents, and reports. β’ Converts unstructured information into structured insights.
π Graphical Data Visualization β’ Automatically generates graphs & plots from extracted statistical/numerical data. β’ AI-driven analysis to enhance decision-making.
π Legal Compliance & Policy Checker β’ Extracts and analyzes legal regulations and policies from stored + live sources. β’ Ensures documents comply with construction laws & expiry rules.
π AI-Powered Report Generation β’ Summarizes extracted data into visually appealing, structured reports. β’ Supports editable and downloadable formats.
π Project Structure
π¦ BuildWise
βββ Backend
β βββ API Setup
β β βββ main.py # Exposes API using FastAPI for frontend interactions
β β βββ .env # Stores API keys, LLM credentials, and database credentials
β β
β βββ Data Extraction
β β βββ Dynamic Extracted Datas
β β β βββ download_info.json # Metadata of downloaded documents
β β β βββ google_search_results.json # Web-scraped data for legal compliance
β β β βββ links.csv # List of extracted URLs for dynamic legal research
β β βββ Uploaded
β β β βββ analyse_Data.py # Handles uploaded PDFs for text/image extraction
β β β βββ dynamic_data_extraction.py # Automates extraction & stores in Pinecone
β β β βββ scrape_Web.py # Web scraping for real-time legal updates
β β β βββ storeLawsInVdb.py # Stores structured legal info in vector DB
β β
β βββ Raw Datas
β β βββ Images
β β β βββ Image_data.md # Static image interaction logs
β β βββ PDFs
β β β βββ Pdf_data.md # Static PDF document metadata
β β
β βββ Utils
β β βββ legal_Data_Retriever.py # Fetches legal rules dynamically from the web
β β βββ llm_setup.py # LLM wrapper for sentence embeddings & custom functions
β β βββ query_answering.py # **Enhances query understanding & improves accuracy**
β β β π **Query Optimization** for relevant search retrieval
β β β π **Context Enhancement** to improve AI-generated responses
β β
βββ Frontend
β βββ (React.js UI with interactive dashboard & chatbot)
π οΈ Tech Stack
| Component | Tech Used |
|---|---|
| Frontend | React.js βοΈ |
| Backend API | FastAPI π |
| Database | Pinecone & VectorDB π |
| LLMs | OpenAI/GPT & Sentence Embeddings π§ |
| Data Processing | Python π, Pandas, NumPy |
| Visualization | Matplotlib, Plotly π |
| Storage & Logs | JSON, CSV, Pinecone |
πΈ Screenshots & Demo
π₯ Demo Video π Presentation Slides https://docs.google.com/presentation/d/1I3ZEpiBcTtXK0AsyRkMTCxSGAX7xXHCNHcf5aUEr9qY/edit?usp=sharing
π How to Run the Project
1οΈβ£ Clone the Repository
git clone https://github.com/rockramsri/BuildWise.git
cd BuildWise2οΈβ£ Install Dependencies
pip install -r requirements.txt3οΈβ£ Set Up Environment Variables
Create a .env file and add:
OPENAI_API_KEY=your_openai_key
PINECONE_API_KEY=your_pinecone_key
DATABASE_URI=your_database_url4οΈβ£ Run the Backend API
uvicorn Backend.API_Setup.main:app --reload5οΈβ£ Start the Frontend
cd Frontend
npm install
npm startπ Future Improvements
βοΈ Enhance RAG model for better retrieval accuracy βοΈ Expand legal compliance checker with more live sources βοΈ Add multilingual support for global adoption βοΈ Optimize query processing for even faster responses
π‘ Open a pull request or start a discussion if youβd like to contribute! π