PiSight-Omni

PiSight-Omni is a privacy-aware Raspberry Pi edge vision toolkit with local face embeddings, FAISS vector search, optional VLM scene reasoning, safe action boundaries and dry-run swarm telemetry hooks. It uses OpenCV for camera I/O, facenet-pytorch for MTCNN face detection and 512-dimensional embeddings, FAISS for local vector search, and an optional advisory agent loop for scene-level reasoning.

The default workflow does not write cropped face images to data/faces/. Enrollment converts live camera frames into embeddings and stores a local FAISS index plus JSON label metadata.

Repository About

Privacy-aware Raspberry Pi edge vision toolkit with local face embeddings, FAISS vector search, optional VLM scene reasoning and safe PiSight-Omni agent mode.

Overview

PiSight-Omni is designed for edge computer vision experiments where raw camera frames should stay on the device by default. The project is not a cloud recognition service and is not a security authentication product.

The current pipeline is:

Camera / Video Source
  -> OpenCV Frame Capture
  -> MTCNN Face Detection
  -> InceptionResnetV1 Embeddings
  -> FAISS Vector Index
  -> Local Match Result
  -> Optional VLM Scene Reasoning
  -> Advisory Action Decision
  -> Optional Dry-Run Swarm Telemetry
  -> Console / Optional Preview Window
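
The "FAISS Vector Index -> Local Match Result" step can be illustrated with a small pure-Python stand-in. This is a sketch, not the project's code: `squared_l2` and `nearest_label` are hypothetical names, and the brute-force loop only mimics what a flat L2 index computes. FAISS `IndexFlatL2` reports squared L2 distances, so the sketch compares squared distances against the threshold; whether PiSight-Omni applies its threshold in exactly this way is an assumption.

```python
from typing import List, Sequence, Tuple


def squared_l2(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance, the metric a flat L2 index reports."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest_label(
    query: Sequence[float],
    vectors: List[Sequence[float]],
    labels: List[str],
    threshold: float,
    unknown_label: str = "unknown",
) -> Tuple[str, float]:
    """Return (label, distance) for the closest enrolled vector.

    Hypothetical helper: anything farther than `threshold` is reported
    as `unknown_label` instead of the nearest enrolled label.
    """
    if not vectors:
        return unknown_label, float("inf")
    distances = [squared_l2(query, v) for v in vectors]
    best = min(range(len(distances)), key=distances.__getitem__)
    if distances[best] > threshold:
        return unknown_label, distances[best]
    return labels[best], distances[best]
```

The returned distance is not a probability. A smaller threshold rejects more queries as unknown, and a larger one accepts looser matches.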

Design Philosophy

Local-first processing: In the default recognition flow, frames are processed on the device and are not uploaded. Only the opt-in autonom and omni commands may send frames to a configured VLM endpoint.
Image-minimizing enrollment: The default CLI stores embeddings, not cropped face image datasets.
Explicit biometric risk: Embeddings are still sensitive biometric-derived data. Treat the FAISS index and labels as private runtime artifacts.
Edge-device honesty: Deep models are heavier than Haar/LBPH. Measure Raspberry Pi CPU performance on the actual device before making FPS claims.

Architecture Maturity

Area	Implementation
Detection	MTCNN through `facenet-pytorch`
Embedding	`InceptionResnetV1(pretrained="vggface2")`, 512-dimensional vectors
Search	FAISS `IndexFlatL2` local vector index
Reasoning	Optional OpenAI/Ollama-compatible VLM analysis through `autonom` or `omni`
Actions	Dry-run action dispatcher; no GPIO, locks or alerts execute by default
Omni layer	Local telemetry, visual signal summary and optional MQTT dry-run swarm publisher
Privacy posture	No raw face crop storage in the default collect/enroll flow
Compatibility	Legacy Haar/LBPH helper modules remain for older tests and migration context
Operations	CLI, config files, systemd examples, doctor command and CI
Testability	Camera-independent tests with mocks/fakes; no real face images in CI

Core Features

Raspberry Pi camera or USB webcam input through OpenCV VideoCapture
Deep face detection and embedding path with MTCNN and InceptionResnetV1
Real-time vector enrollment through collect or enroll
Local FAISS index and JSON label metadata
Real-time recognition through vector nearest-neighbor search
Optional agentic autonom loop for Perception -> Reasoning -> Action demos
omni command with local telemetry and dry-run swarm output for multi-node experiments
train compatibility command that explains offline training is no longer required
JSON and YAML config support
Dataset/model audit command retained for legacy runtime hygiene checks
systemd service templates for Linux/Raspberry Pi deployment
Camera-independent pytest suite and GitHub Actions CI

Tech Stack

Python 3.9+
OpenCV for camera capture and drawing
NumPy
PyYAML
PyTorch
facenet-pytorch
FAISS CPU
OpenAI Python SDK for optional VLM reasoning
requests for future local agent integrations
paho-mqtt for optional PiSight-Omni swarm telemetry
pytest and ruff
systemd for Linux service deployment

Hardware Notes

Raspberry Pi 4/5 can run the pipeline, but deep embeddings are CPU-heavy.
For smoother real-time performance, test an NVIDIA Jetson, a Google Coral-class accelerator, or a machine with CUDA/MPS support.
Lighting, camera quality, resolution and number of faces directly affect latency and match quality.
No fixed FPS or accuracy number is claimed by this repository.
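
Since no FPS number is claimed, one way to get a figure for a specific device is to time a single pipeline step directly. The helper below is a minimal sketch using only the standard library. `measure_latency` is a hypothetical name, not part of the PiSight-Omni CLI, and the callable passed in stands for whatever step you want to measure (for example one detect-and-embed pass on a captured frame).

```python
import statistics
import time
from typing import Callable, Dict


def measure_latency(
    step: Callable[[], object], warmup: int = 3, runs: int = 20
) -> Dict[str, float]:
    """Time `step` repeatedly and summarize per-call latency."""
    if runs < 1:
        raise ValueError("runs must be at least 1")
    # Warm-up calls are discarded so one-time setup cost does not skew results.
    for _ in range(warmup):
        step()
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        step()
        samples.append(time.perf_counter() - start)
    mean_s = statistics.fmean(samples)
    return {
        "runs": float(runs),
        "mean_ms": mean_s * 1000.0,
        "median_ms": statistics.median(samples) * 1000.0,
        "max_ms": max(samples) * 1000.0,
        # Approximate throughput if this step were the only per-frame cost.
        "approx_fps": 1.0 / mean_s if mean_s > 0 else float("inf"),
    }
```

Record the hardware, resolution and number of faces alongside the numbers, because each of them changes the result.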

Project Structure

raspberry_face_recognition/  CLI, config, vision, vector DB, agent and Omni helpers
pisight/                     Compatibility module for python -m pisight
docs/                        Architecture, setup, privacy, testing and deployment notes
examples/                    Placeholder config, commands, service file and console output
scripts/install_pi.sh        Raspberry Pi setup helper
systemd/pisight.service      Installable systemd service template
tests/                       Camera-independent tests
config.example.yaml          YAML config example
config.example.json          JSON config example
requirements.txt             Full deep-runtime dependency list
pyproject.toml               Package metadata, extras and console scripts

Installation

On Raspberry Pi OS, install OpenCV from apt so camera backends work cleanly:

sudo apt update
sudo apt install -y python3 python3-venv python3-pip python3-opencv python3-numpy
python3 -m venv .venv --system-site-packages
. .venv/bin/activate
python -m pip install --upgrade pip
python -m pip install -e ".[deep,agent,omni]"
cp config.example.yaml config.yaml

For development and CI-style checks without the heavy deep/agent runtime:

python -m pip install -e ".[dev]"
python -m pytest -q

Configuration

camera:
  source: 0
  width: 640
  height: 480
  fps: 15

paths:
  vector_index_path: "./data/embeddings/faiss.index"
  vector_labels_path: "./data/embeddings/labels.json"
  log_dir: "./logs"

recognition:
  backend: "deep"
  confidence_threshold: 0.8
  unknown_label: "unknown"
  draw_bounding_boxes: true

embedding:
  dim: 512

runtime:
  debug: false
  save_unknown_faces: false

agent:
  model: "gpt-4.1-mini"
  base_url: ""
  interval_frames: 30
  max_tokens: 300
  action_mode: "dry_run"

omni:
  device_id: "node-001"
  swarm_enabled: false
  swarm_host: "localhost"
  swarm_port: 1883
  swarm_topic: "pisight/omni/swarm"
  swarm_dry_run: true

Notes:

camera.source can be a camera index such as 0 or a video path.
paths.vector_index_path stores the FAISS index.
paths.vector_labels_path stores label metadata in JSON.
recognition.confidence_threshold is a vector distance threshold, not an accuracy percentage.
runtime.save_unknown_faces remains false by default and the current CLI does not save unknown face crops.
agent.base_url can point to an OpenAI-compatible local VLM endpoint. If it is empty, the OpenAI SDK default endpoint is used.
agent.action_mode is limited to dry_run or disabled; physical actions are not implemented.
omni.swarm_enabled is false by default. Even when enabled, omni.swarm_dry_run keeps MQTT output in dry-run mode unless explicitly changed in config or overridden with --swarm-live.
The visual signal summary in Omni mode is not a heart-rate, stress, liveness or health detector.
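
The notes on agent.action_mode and the swarm flags describe a small decision table, which can be sketched as below. This is an illustration under assumptions, not the project's config loader: `validate_action_mode` and `resolve_swarm_mode` are hypothetical names, and the sketch assumes that `--swarm` enables swarm output for one run and that `--swarm-live` has no effect unless swarm output is enabled.

```python
from typing import Any, Dict

ALLOWED_ACTION_MODES = {"dry_run", "disabled"}


def validate_action_mode(config: Dict[str, Any]) -> str:
    """Reject any action mode other than dry_run or disabled."""
    mode = config.get("agent", {}).get("action_mode", "dry_run")
    if mode not in ALLOWED_ACTION_MODES:
        raise ValueError(f"unsupported agent.action_mode: {mode!r}")
    return mode


def resolve_swarm_mode(
    config: Dict[str, Any], cli_swarm: bool = False, cli_swarm_live: bool = False
) -> str:
    """Return "off", "dry_run" or "live" for the swarm publisher.

    Assumed precedence: swarm output must be enabled first (config or
    --swarm); only then can --swarm-live or swarm_dry_run: false make it live.
    """
    omni = config.get("omni", {})
    enabled = bool(omni.get("swarm_enabled", False)) or cli_swarm
    if not enabled:
        return "off"
    dry_run = bool(omni.get("swarm_dry_run", True))
    if cli_swarm_live:
        dry_run = False
    return "dry_run" if dry_run else "live"
```

With the example config above, `resolve_swarm_mode(cfg, cli_swarm=True)` stays in dry-run mode, which matches the default described in the notes.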

Usage

Check local dependencies:

pisight --config config.yaml doctor

Enroll embeddings for a placeholder label:

pisight --config config.yaml collect --name demo-user-001 --count 10

Equivalent alias:

pisight --config config.yaml enroll --name demo-user-001 --count 10

Run recognition:

pisight --config config.yaml recognize

Run headless over SSH or systemd:

pisight --config config.yaml recognize --no-window

Run the optional agentic vision loop:

pisight --config config.yaml autonom --no-window --interval-frames 30

autonom sends selected frames to the configured VLM endpoint for scene reasoning. Use a local VLM endpoint if the reasoning step must remain on the device or local network.
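
Frame selection with `--interval-frames` can be pictured as simple modulo sampling. The generator below is a sketch only: `sample_every` is a hypothetical name, and whether the real loop sends the very first frame or waits a full interval is an assumption.

```python
from typing import Iterable, Iterator, Tuple, TypeVar

T = TypeVar("T")


def sample_every(frames: Iterable[T], interval_frames: int) -> Iterator[Tuple[int, T]]:
    """Yield (index, frame) for every `interval_frames`-th frame, starting at 0."""
    if interval_frames < 1:
        raise ValueError("interval_frames must be at least 1")
    for index, frame in enumerate(frames):
        if index % interval_frames == 0:
            yield index, frame
```

At 15 FPS and an interval of 30, this selects roughly one frame every two seconds for the reasoning step, while every frame can still go through local recognition.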

Run PiSight-Omni advisory mode with local telemetry and dry-run swarm output:

pisight --config config.yaml omni --no-window --interval-frames 30 --swarm

Allow live MQTT publishing only when the broker and privacy boundary are reviewed:

pisight --config config.yaml omni --no-window --swarm --swarm-live

Compatibility train command:

pisight --config config.yaml train

The command does not train an offline model in the embedding pipeline; it explains that enrollment writes vectors directly.
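
To make "enrollment writes vectors directly" concrete, the sketch below keeps vectors and labels in row-aligned lists and saves the labels as JSON. It is a pure-Python stand-in under stated assumptions: `EmbeddingStore` is a hypothetical class, the JSON layout is invented for illustration and is not the project's labels.json schema, and a real deployment would add the vectors to a FAISS index instead of a list.

```python
import json
from pathlib import Path
from typing import List, Sequence, Union


class EmbeddingStore:
    """Row-aligned vectors and labels: row i of the index belongs to labels[i]."""

    def __init__(self, dim: int = 512) -> None:
        self.dim = dim
        self.vectors: List[List[float]] = []
        self.labels: List[str] = []

    def enroll(self, label: str, embedding: Sequence[float]) -> int:
        """Append one embedding for `label` and return its row id."""
        if len(embedding) != self.dim:
            raise ValueError(f"expected {self.dim} values, got {len(embedding)}")
        self.vectors.append([float(v) for v in embedding])
        self.labels.append(label)
        return len(self.labels) - 1

    def save_labels(self, path: Union[str, Path]) -> None:
        """Write label metadata as JSON (hypothetical layout)."""
        target = Path(path)
        target.parent.mkdir(parents=True, exist_ok=True)
        payload = {"dim": self.dim, "labels": self.labels}
        target.write_text(json.dumps(payload, indent=2), encoding="utf-8")

    @staticmethod
    def load_labels(path: Union[str, Path]) -> List[str]:
        """Read back the row-aligned label list."""
        return json.loads(Path(path).read_text(encoding="utf-8"))["labels"]
```

Because each enrollment is just an appended row, there is no separate training step: a new vector is searchable as soon as it is added.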

Privacy and Safety

Raw face crops are not written in the default collect/enroll flow.
FAISS index files and label metadata are local runtime artifacts.
Embeddings are biometric-derived data and should not be committed or published.
PiSight-Omni does not upload frames, embeddings or labels in the default local recognition flow.
The autonom and omni commands are opt-in and may send encoded frames to the configured VLM provider.
The omni command can publish local telemetry to MQTT only when explicitly enabled.
The codebase does not execute model-generated Python code and does not control locks, relays, GPIO, alarms or access-control systems.
Do not use real names or real face data in public demos.
This project is not identity verification, surveillance infrastructure or access-control authentication.

For security-grade use, add liveness detection, consent workflow, encrypted storage, access control, retention policy and formal risk assessment.
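
The dry-run action boundary described in this section can be sketched as a dispatcher that only records what it was asked to do. This is an illustrative sketch, not the project's dispatcher: `ActionRequest` and `DryRunDispatcher` are hypothetical names, and the record fields are assumptions.

```python
import logging
from dataclasses import dataclass
from typing import Dict, List

logger = logging.getLogger("pisight.sketch")


@dataclass(frozen=True)
class ActionRequest:
    name: str    # e.g. "notify"; treated as an opaque label, never executed
    reason: str  # free-text explanation from the reasoning step


class DryRunDispatcher:
    """Records requested actions; contains no code path that performs them."""

    def __init__(self, action_mode: str = "dry_run") -> None:
        if action_mode not in ("dry_run", "disabled"):
            raise ValueError(f"unsupported action_mode: {action_mode!r}")
        self.action_mode = action_mode
        self.audit_log: List[Dict[str, object]] = []

    def dispatch(self, request: ActionRequest) -> Dict[str, object]:
        """Log the request and return an audit record with executed=False."""
        status = "skipped" if self.action_mode == "disabled" else "dry_run"
        record: Dict[str, object] = {
            "action": request.name,
            "reason": request.reason,
            "status": status,
            "executed": False,
        }
        self.audit_log.append(record)
        logger.info("action %s: %s (%s)", status, request.name, request.reason)
        return record
```

Keeping the allowed modes in one constructor check means a model-suggested action name can never select a different execution path.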

Testing

python -m pytest -q
python -m ruff check .

CI intentionally avoids real camera access, real face images, private videos and hardware-specific FPS assertions.
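
A camera-independent test usually replaces the capture object with a fake that follows the same read loop contract. The sketch below mirrors the `isOpened()`, `read()` and `release()` methods of OpenCV's `VideoCapture`, where `read()` returns a `(success, frame)` pair. `FakeCapture` and `count_frames` are hypothetical names and are not taken from the project's test suite.

```python
from typing import Iterable, List, Optional, Tuple


class FakeCapture:
    """Minimal stand-in for the parts of cv2.VideoCapture used by a read loop."""

    def __init__(self, frames: Iterable[object]) -> None:
        self._frames: List[object] = list(frames)
        self._position = 0
        self._opened = True

    def isOpened(self) -> bool:
        return self._opened

    def read(self) -> Tuple[bool, Optional[object]]:
        # Like VideoCapture.read(), signal end of stream with (False, None).
        if not self._opened or self._position >= len(self._frames):
            return False, None
        frame = self._frames[self._position]
        self._position += 1
        return True, frame

    def release(self) -> None:
        self._opened = False


def count_frames(capture: FakeCapture) -> int:
    """Example loop under test: consume frames until the source is exhausted."""
    count = 0
    while capture.isOpened():
        ok, _frame = capture.read()
        if not ok:
            break
        count += 1
    capture.release()
    return count
```

Synthetic placeholder frames (strings here, or zero-filled arrays in a real test) keep real face images out of the repository and CI.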

Limitations

Deep face embeddings are heavier than Haar/LBPH on Raspberry Pi CPU.
FAISS nearest-neighbor distance is not a calibrated identity guarantee.
Embeddings reduce raw image exposure but remain sensitive.
Omni visual signal telemetry is a non-clinical debugging signal only.
Swarm telemetry is a transport hook, not distributed intelligence or autonomous enforcement.
Face recognition alone should not be used as a security verification system.
Lighting, camera quality, pose and occlusion can affect detection and embedding quality.

Roadmap

Encrypted local embedding store
Liveness/anti-spoofing module
Detector backend selection from config
Structured event logging
Benchmark script with hardware metadata
Local-only dashboard
Edge accelerator notes for Jetson/Coral-class devices
Reviewed action adapters for notifications or hardware integrations
Optional depth-estimation adapter with measured hardware notes
Local VLM deployment guide for private omni reasoning

License

PiSight-Omni is released under the MIT License. See LICENSE.
