Skip to content

SoumojitDalui/scipy-codebase-search-assistant

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SciPy Codebase Search Assistant

Streamlit app for semantic search over a curated SciPy documentation and source-code corpus, with simple and advanced retrieval views across docs, code, and hybrid ranking.

Live demo: soumojitdalui-scipy-codebase-search-assistant-app-6r2myg.streamlit.app

Overview

This project explores retrieval over a large technical codebase by indexing both documentation chunks and source-code chunks, then exposing them through a searchable interface.

The app supports:

  • Simple mode for a cleaner product-style search experience
  • Advanced mode for separate Docs, Code, and Hybrid retrieval views
  • hybrid ranking with reciprocal rank fusion
  • result inspection at the chunk level

Why This Project

Large libraries like SciPy are difficult to navigate through keyword search alone. This project focuses on a more useful developer-search workflow:

  • search for APIs, usage examples, and concepts in natural language
  • retrieve grounded context from docs and source code
  • compare documentation-only, code-only, and fused retrieval behavior

Repository Structure

Tech Stack

  • Python
  • Streamlit
  • scikit-learn
  • pandas
  • NumPy

Run Locally

pip install -r requirements.txt
streamlit run app.py

Notes

  • The app uses a curated corpus derived from SciPy documentation and source code.
  • The full vendored SciPy repository is not included in this GitHub-ready project copy.

About

Streamlit app for semantic search over a curated SciPy docs and source-code corpus, with simple and advanced retrieval views across docs, code, and hybrid ranking.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors