🏛️ Open Navigator

title	Open Navigator
emoji	🏛️
colorFrom	blue
colorTo	green
sdk	docker
app_port	7860
pinned	false
license	apache-2.0

CommunityOne: One Map for Every Community

Every person deserves to find the help they need and have a voice in the decisions that shape their lives. But public resources are scattered, gaps go unseen, and communities are left navigating alone.

CommunityOne changes that. One platform connects residents, leaders, and funders to what's really happening on the ground — so no community has to fight just to be seen.

AI-powered civic engagement platform with React + FastAPI web interface

Quick Links

⚛️ Open Navigator - LIVE APPLICATION (search, filters, heatmap, data exploration)

📖 Documentation - Complete guides, architecture, and feature details

The documentation site includes:

Features and capabilities
Data sources and integrations
Architecture and deployment options
Policy topics and advocacy tools
API reference and examples

Quick Start

Three Services

This project runs three separate services:

Service	Port (Local)	Live URL	Description
⚛️ Open Navigator 🚀	5173	www.communityone.com	MAIN APPLICATION - Search, filters, heatmap, data exploration
📚 Documentation	3000	www.communityone.com/docs	Docusaurus site with complete guides and tutorials
🔥 API Backend	8000	www.communityone.com/api	FastAPI server with AI agents

💡 LIVE DEMO: Visit www.communityone.com to use the application!

💻 LOCAL DEV: After running ./start-all.sh, visit http://localhost:5173
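
To confirm that all three services came up after ./start-all.sh, a small port check can help. The following is a minimal sketch, not part of the repository: it only assumes the local ports listed in the table above, and the names SERVICES, is_listening, and service_status are illustrative.

```python
import socket

# Local ports taken from the service table above.
SERVICES = {
    "Open Navigator (React)": 5173,
    "Documentation (Docusaurus)": 3000,
    "API Backend (FastAPI)": 8000,
}


def is_listening(port: int, host: str = "127.0.0.1", timeout: float = 0.5) -> bool:
    """Return True if something accepts TCP connections on host:port."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.settimeout(timeout)
        # connect_ex returns 0 on success and an errno value otherwise.
        return sock.connect_ex((host, port)) == 0


def service_status(services: dict[str, int] = SERVICES) -> dict[str, bool]:
    """Map each service name to whether its local port is reachable."""
    return {name: is_listening(port) for name, port in services.items()}


if __name__ == "__main__":
    for name, up in service_status().items():
        print(f"{name}: {'up' if up else 'down'}")
```

A port that accepts connections only shows that some process is bound to it; it does not prove the service itself is healthy.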

🚀 Deployment

Deploy to Hugging Face Spaces in three steps:

echo "HF_USERNAME=your_username" >> .env
./deploy-huggingface.sh
# Configure hardware and secrets at https://huggingface.co/spaces/YOUR_USERNAME/www.communityone.com

Full deployment guides:

Hugging Face Spaces - Docker deployment (~$22/month)
Databricks Apps - Enterprise deployment
Local Development - Complete deployment documentation

The deploy-huggingface.sh script automatically:

✅ Tests builds locally (catches errors before pushing)
✅ Creates the Space on Hugging Face
✅ Pushes code and triggers automatic build (~10-15 min)

Prerequisites

Python 3.11+
Node.js 18+
Docker (optional)
OpenAI API key

Installation

Option 1: Start Everything at Once (Recommended)

# Clone repository
git clone https://github.com/getcommunityone/open-navigator.git
cd open-navigator

# Install dependencies
./install.sh                          # Python backend
cd frontend && npm install && cd ..   # React app
cd website && npm install && cd ..    # Documentation

# Setup git hooks for build protection (one-time)
./setup-git-hooks.sh

# Start all services in tmux
./start-all.sh

Option 2: Using Makefile

# Install
make install
make install-frontend
make install-docs

# Start all services
make start-all

# Or individually:
make dev           # API only
make dev-frontend  # React app only
make dev-docs      # Docs only

Option 3: Manual Setup

# Python backend
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

# Optional: Spark + Delta Lake (only if you'll run Databricks/Spark scripts).
# Requires a Java runtime (e.g. `sudo apt install openjdk-17-jre-headless`).
# pip install -r requirements-spark.txt

# React app
cd frontend && npm install && cd ..

# Documentation
cd website && npm install && cd ..

# Configure environment
cp .env.example .env
# Edit .env with your API keys

# Start services (separate terminals)
source .venv/bin/activate && python main.py serve  # Terminal 1
cd frontend && npm run dev                          # Terminal 2
cd website && npm start                             # Terminal 3

Access Points

🌐 LIVE APPLICATION:

🚀 Open Navigator: https://www.communityone.com - Main application
📚 Documentation: https://www.communityone.com/docs - Guides and API reference
🔥 API Docs: https://www.communityone.com/api/docs - FastAPI interactive documentation

💻 LOCAL DEVELOPMENT:

🚀 Main App: http://localhost:5173
📚 Documentation: http://localhost:3000
🔥 API Docs: http://localhost:8000/docs

Stop Services

./stop-all.sh
# or
make stop-all

Usage

Command Line Interface

Always activate the virtual environment first:

source .venv/bin/activate

API Server

python main.py serve --host 0.0.0.0 --port 8000

Jurisdiction Discovery

# Test run
python main.py discover-jurisdictions --limit 100

# Single state
python main.py discover-jurisdictions --state CA

# Full discovery (~30k jurisdictions)
python main.py discover-jurisdictions

# View statistics
python main.py discovery-stats

Data Ingestion

# Census data (90,000+ jurisdictions)
python -m discovery.census_ingestion

# Census shapefiles (geographic boundaries)
python scripts/datasources/census/download_shapefiles.py --year 2023 --extract

# NCES school districts (13,000+)
python -m discovery.nces_ingestion

# Pre-built meeting datasets
python scripts/discovery/meetingbank_ingestion.py
python scripts/datasources/cityscrapers/city_scrapers_urls.py
python scripts/discovery/openstates_sources.py

# LocalView (requires Dataverse API key)
python scripts/discovery/localview_ingestion.py

Scraping & Analysis

# Scrape batch from discovered sites
python main.py scrape-batch --source discovered --limit 50

# Scrape single source
python main.py scrape --url "https://city.legistar.com" \
                      --state "CA" \
                      --municipality "San Francisco"

# Run analysis pipeline
python main.py analyze --targets-file examples/targets.json

# Generate heatmap
python main.py generate-heatmap --output heatmap.html

Publishing Datasets

# Publish to HuggingFace (requires HF_TOKEN in .env)
python main.py publish-to-hf --dataset all
python main.py publish-to-hf --dataset discovered-urls
python main.py publish-to-hf --dataset census --sample

API Usage

Start a workflow:

curl -X POST "http://localhost:8000/workflow/start" \
     -H "Content-Type: application/json" \
     -d '{
       "scrape_targets": [
         {
           "url": "https://example-city.legistar.com",
           "municipality": "Example City",
           "state": "CA",
           "platform": "legistar"
         }
       ]
     }'
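
The same request can be built from Python with only the standard library. This is a sketch under the assumption that the endpoint accepts exactly the JSON body shown in the curl example; build_workflow_request and API_BASE are illustrative names, and the request is constructed but not sent.

```python
import json
import urllib.request

API_BASE = "http://localhost:8000"  # local API port from the service table


def build_workflow_request(targets: list[dict], base_url: str = API_BASE) -> urllib.request.Request:
    """Build (but do not send) the POST request for /workflow/start."""
    body = json.dumps({"scrape_targets": targets}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/workflow/start",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_workflow_request([
    {
        "url": "https://example-city.legistar.com",
        "municipality": "Example City",
        "state": "CA",
        "platform": "legistar",
    }
])

# To actually send it (requires the API server to be running):
# with urllib.request.urlopen(request) as response:
#     print(response.status, response.read().decode("utf-8"))
```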

Query opportunities:

curl "http://localhost:8000/opportunities?state=CA&urgency=critical"
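
When filters come from user input, it is safer to let the standard library encode the query string than to concatenate it by hand. A minimal sketch follows; opportunities_url is a hypothetical helper, and state and urgency are the only filter names confirmed by the curl example above.

```python
from urllib.parse import urlencode


def opportunities_url(base_url: str = "http://localhost:8000", **filters) -> str:
    """Build the /opportunities URL, skipping filters that are None."""
    query = urlencode({key: value for key, value in filters.items() if value is not None})
    return f"{base_url}/opportunities" + (f"?{query}" if query else "")


url = opportunities_url(state="CA", urgency="critical")
print(url)  # http://localhost:8000/opportunities?state=CA&urgency=critical
```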

Get heatmap:

curl "http://localhost:8000/heatmap" > heatmap.html

Python API

import asyncio
from agents.orchestrator import OrchestratorAgent
from agents.scraper import ScraperAgent
from agents.parser import ParserAgent
from agents.classifier import ClassifierAgent


async def main():
    # Initialize orchestrator
    orchestrator = OrchestratorAgent()

    # Register agents
    orchestrator.register_agent(ScraperAgent())
    orchestrator.register_agent(ParserAgent())
    orchestrator.register_agent(ClassifierAgent())

    # Execute pipeline
    targets = [
        {
            "url": "https://city.legistar.com",
            "municipality": "Example City",
            "state": "CA",
            "platform": "legistar"
        }
    ]

    # execute_pipeline is awaited, so it must run inside an async function
    return await orchestrator.execute_pipeline(targets)


if __name__ == "__main__":
    # asyncio.run starts the event loop and returns the pipeline results
    results = asyncio.run(main())

Project Structure

open-navigator/
├── agents/                 # Multi-agent AI system
├── api/                   # FastAPI application
├── frontend/             # React application (Open Navigator)
├── website/              # Docusaurus documentation
├── discovery/            # Data discovery modules
├── extraction/           # Document extraction
├── pipeline/             # Data pipeline components
├── visualization/        # Heatmap and charts
├── config/               # Configuration
├── tests/                # Test suite
├── dbt_project/          # dbt transformations (Bronze → Production)
├── scripts/              # Data ingestion and processing
├── main.py              # CLI entry point
└── requirements.txt     # Python dependencies

Data Pipelines

Hybrid Python + dbt Architecture

Open Navigator uses a hybrid ETL approach:

Python scripts (scripts/datasources/*/load_*.py) for data ingestion, API calls, and AI analysis
dbt (dbt_project/) for SQL-based transformations and data quality testing

# 1. Load data with Python
python scripts/datasources/gemini/load_meeting_transcripts_bronze.py  # AI extraction
python scripts/datasources/openstates/load_openstates_bulk.py        # State legislation
python scripts/datasources/irs/load_irs_bmf.py                       # Nonprofit data

# 2. Transform with dbt
cd dbt_project
dbt run      # Bronze → Production transformations
dbt test     # Data quality checks
dbt docs serve  # Interactive documentation

# 3. Export (optional)
python scripts/data/export_to_gold_parquet.py  # For HuggingFace distribution
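
The load-then-transform order matters: dbt should only run once ingestion has succeeded. The sketch below chains a subset of the commands above and stops at the first failure. It is an illustration rather than a script from the repository; PIPELINE_STEPS and run_pipeline are hypothetical names, and the working directories assume the commands are run from the repository root.

```python
import subprocess
import sys

# Commands copied from the steps above; each entry is (working directory, argv).
PIPELINE_STEPS = [
    (".", [sys.executable, "scripts/datasources/openstates/load_openstates_bulk.py"]),
    ("dbt_project", ["dbt", "run"]),
    ("dbt_project", ["dbt", "test"]),
]


def run_pipeline(steps=PIPELINE_STEPS, runner=subprocess.run) -> int:
    """Run steps in order and return how many completed before a failure."""
    completed = 0
    for cwd, argv in steps:
        result = runner(argv, cwd=cwd)
        if result.returncode != 0:
            # Stop here so dbt never transforms a partially loaded dataset.
            break
        completed += 1
    return completed
```

Calling run_pipeline() executes the real commands; passing a different runner makes the ordering logic easy to exercise without touching any data.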

Learn more:

dbt ETL Strategy - Full architecture
Bronze to Production Merge - Entity resolution strategy
dbt Project README - Quick start guide

Deployment Options

1. Databricks Apps (Production)

export DATABRICKS_HOST=https://your-workspace.cloud.databricks.com
export DATABRICKS_TOKEN=dapi...
export OPENAI_API_KEY=sk-...

./scripts/deploy-databricks-app.sh

See DATABRICKS_APP_GUIDE.md for details.

2. Docker

docker-compose up -d

Starts:

API server (port 8000)
Qdrant vector database (port 6333)
Jupyter notebook (port 8888)

3. Local Development

See Quick Start above.

⚡ Intel Arc GPU Optimization

Run Llama 4 at NVIDIA-like speeds on Intel Arc integrated graphics!

If you have Intel Core Ultra 7 (or similar) with Arc Graphics + NPU, you can use DuckDB + VSS for 10-50x faster legislative analysis:

# Setup Intel-optimized environment
./scripts/enrichment_ai/intel_llm_setup.sh
source .venv-intel/bin/activate

# Run DuckDB vector search demo
python scripts/enrichment_ai/duckdb_vss_demo.py

# Run legislative analysis with LLM
python scripts/enrichment_ai/legislative_analysis_intel.py

Why DuckDB for Local AI?

⚡ 10-50x faster than Postgres for context injection
🎯 < 20ms vector similarity search across 10K bills
🧠 Embedded - no server needed, runs locally
🤗 Hugging Face Integration - query HF datasets directly

Performance:

Context Injection: 20ms vs 500ms (Postgres) = 25x faster
LLM Inference: 1,200 tok/s (Arc GPU) vs 350 tok/s (CPU) = 3.4x faster
Vector Search: 18ms vs 800ms = 44x faster
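
The speedup factors above follow directly from the quoted figures. As a quick arithmetic check (latency ratios divide the baseline by the faster time, throughput ratios divide the faster rate by the baseline; the function names are illustrative):

```python
def latency_speedup(baseline_ms: float, optimized_ms: float) -> float:
    """Speedup when lower is better (milliseconds)."""
    return baseline_ms / optimized_ms


def throughput_speedup(optimized: float, baseline: float) -> float:
    """Speedup when higher is better (tokens per second)."""
    return optimized / baseline


print(round(latency_speedup(500, 20)))          # 25
print(round(throughput_speedup(1200, 350), 1))  # 3.4
print(round(latency_speedup(800, 18)))          # 44
```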

Features:

Extract interest groups from legislative testimony
Identify lobbyists and their positions
Analyze support/oppose scores with confidence
Detect tradeoffs and compromises

See full guide: Intel Arc Optimization Guide

🤖 AI Integration (MCP Server)

Connect your civic data to Claude and other AI assistants!

Open Navigator includes a Model Context Protocol (MCP) server that lets AI assistants directly access your data:

# Install MCP dependencies
pip install mcp anthropic-mcp-sdk

# Run the server
python scripts/mcp/open_navigator_server.py

What AI assistants can do:

🏛️ Search 90,000+ jurisdictions by name or location
🏢 Query 1.8M nonprofits with Form 990 data
📜 Semantic search across 4.5M+ legislative documents
📊 Get real-time statistics and analytics
🔍 Vector search meetings and bills with natural language

Example queries to Claude:

"Find all cities named Springfield in the database"

"Show me 501c3 nonprofits in San Francisco focused on education"

"What bills related to oral health were introduced in California?"

Configure Claude Desktop:

Add to ~/.config/Claude/claude_desktop_config.json:

{
  "mcpServers": {
    "open-navigator": {
      "command": "python",
      "args": ["/path/to/open-navigator/scripts/mcp/open_navigator_server.py"],
      "env": {
        "DATABASE_URL": "postgresql://postgres:password@localhost:5433/open_navigator"
      }
    }
  }
}
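
If the config file already lists other MCP servers, the new entry should be merged in rather than pasted over them. The following is a sketch using only the standard library; add_mcp_server and update_config_file are hypothetical helpers, and the entry shape mirrors the JSON above.

```python
import json
from pathlib import Path


def add_mcp_server(config: dict, name: str, command: str,
                   args: list[str], env: dict[str, str]) -> dict:
    """Return a copy of config with one entry added under "mcpServers"."""
    servers = dict(config.get("mcpServers", {}))
    servers[name] = {"command": command, "args": args, "env": env}
    return {**config, "mcpServers": servers}


def update_config_file(path: Path, **server) -> None:
    """Read the config if it exists, merge the server entry, and write it back."""
    config = json.loads(path.read_text()) if path.exists() else {}
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(add_mcp_server(config, **server), indent=2) + "\n")


# Example call (not executed here); the path is the one named above:
# update_config_file(
#     Path.home() / ".config/Claude/claude_desktop_config.json",
#     name="open-navigator",
#     command="python",
#     args=["/path/to/open-navigator/scripts/mcp/open_navigator_server.py"],
#     env={"DATABASE_URL": "postgresql://postgres:password@localhost:5433/open_navigator"},
# )
```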

See full guide: MCP Server Documentation

Testing

# Run all tests
pytest

# With coverage
pytest --cov=agents --cov=pipeline --cov=visualization

# Specific test file
pytest tests/test_agents.py

Configuration

Create .env file:

# OpenAI
OPENAI_API_KEY=sk-...

# Databricks (optional)
DATABRICKS_HOST=https://your-workspace.cloud.databricks.com
DATABRICKS_TOKEN=dapi...

# HuggingFace (optional)
HF_TOKEN=hf_...

# Dataverse (optional)
DATAVERSE_API_KEY=...
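
A quick check that the required key is present can catch a misconfigured .env before any service starts. This is a minimal sketch, assuming simple KEY=value lines as shown above; treating only OPENAI_API_KEY as required follows the "(optional)" labels in the example, and parse_env and missing_keys are illustrative names rather than project functions.

```python
REQUIRED_KEYS = ("OPENAI_API_KEY",)


def parse_env(text: str) -> dict[str, str]:
    """Parse KEY=value lines, ignoring blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(values: dict[str, str], required=REQUIRED_KEYS) -> list[str]:
    """Return required keys that are absent or empty."""
    return [key for key in required if not values.get(key)]


sample = "# OpenAI\nOPENAI_API_KEY=sk-test\n\n# HuggingFace (optional)\nHF_TOKEN=\n"
print(missing_keys(parse_env(sample)))  # []
```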

Contributing

Contributions are welcome! Please:

Fork the repository
Create a feature branch
Make your changes
Add tests
Submit a pull request

See CONTRIBUTING.md for details.

Documentation

Full Documentation - Complete guides and API reference
Architecture - System architecture overview
Quick Start - Detailed setup instructions
Quick Reference - Command reference card
MCP Server - AI assistant integration guide
Deployment - Production deployment guides
Case Studies - Real-world examples
CONTRIBUTING.md - How to contribute

Citations

This project uses several open datasets and research contributions. Please see CITATIONS.md for complete citation information.

Key Dataset:

MeetingBank: Hu et al., "MeetingBank: A Benchmark Dataset for Meeting Summarization", ACL 2023
- Used for meeting discovery and analysis
- 1,366 city council meetings from 6 U.S. cities
- See CITATIONS.md for full citation and BibTeX

License

Apache License 2.0 - see LICENSE file for details.

Support

GitHub Issues: github.com/getcommunityone/open-navigator-for-engagement/issues
Email: johnbowyer@communityone.com

Note: This system is designed to support advocacy efforts. All generated content should be reviewed by humans before use.
