Docsy

Docsy is a hackathon-ready AI document workflow agent built on top of the Markdown Muse editor.

It focuses on one core interaction:

a user edits a technical document
Gemini analyzes document context
the system proposes a structured action or patch
the user reviews the change before applying it

Language

Why this project exists

Technical documentation usually breaks in the same places:

procedures drift across documents
updates are applied to one file but not related files
AI can draft text, but it rarely returns reviewable actions

Docsy is designed to show a different pattern:

document-aware editing
review-first AI patch workflow
Gemini-powered structured output
a path to multimodal reasoning with editor screenshots

Hackathon focus

This repository is being prepared for a Gemini hackathon submission.

Current submission priorities:

GenAI SDK based Gemini integration
Cloud Run deployment for the AI service
multimodal request path for editor screenshot + document context
action-oriented JSON response from Gemini
one real UI action wired from AI output
review-first patch workflow demo

Demo flow

Open multiple technical documents in the editor.
Update a procedure in one document.
Send document context and editor state to the AI service.
Gemini returns a structured action or patch proposal.
The app opens the patch review flow.
The user accepts or rejects the change.

Architecture Diagram

Docsy's deployed architecture uses a React + Vite frontend served from Firebase Hosting and a separate Node.js AI backend deployed on Cloud Run. The backend accesses Gemini on Vertex AI through the Google GenAI SDK, stores shared Google Workspace session and state data in Firestore, and brokers Google Docs/Drive plus LaTeX service requests without exposing credentials to the browser.

For hackathon submission, export the diagram below as a PNG and upload it via File Upload.

flowchart LR
  U["User in Browser"] --> FE["Docsy Frontend<br/>React + Vite<br/>Firebase Hosting"]

  FE -->|"/api"| API["Docsy AI Backend<br/>Node.js<br/>Cloud Run"]
  FE -->|"document context<br/>viewport screenshot<br/>UI action request"| API

  API -->|"Google GenAI SDK"| GEM["Gemini on Vertex AI"]
  GEM -->|"structured JSON action<br/>patch proposal"| API
  API -->|"assistant response<br/>patch data"| FE

  API -->|"session + workspace state"| DB["Firestore"]
  DB -->|"shared state lookup"| API

  API -->|"OAuth + Docs/Drive operations"| GW["Google OAuth + Google Docs/Drive"]
  GW -->|"auth callback + document data"| API

  API -->|"TeX preview / validate / export"| TEX["TeX Service<br/>separate Cloud Run service"]
  TEX -->|"job result / PDF / preview"| API

For more detail, see Hackathon Architecture and the GCP deployment guide.

Repository structure

src/       frontend editor, patch review UI, workspace panels
server/    Gemini-backed AI service for structured responses
docs/      engineering notes, plans, and architecture docs
PRD/       product requirement documents for the hackathon direction
public/    static web assets

Core capabilities

multi-document editor with tabs and workspace sidebar
Markdown, LaTeX, HTML, JSON, and YAML editing
document AST generation and structured patch handling
Google Workspace connection, Google Docs import/export, and bound-document save
AI-assisted summaries, section generation, TOC suggestions, and update proposals
patch review dialog instead of silent document mutation
local knowledge indexing and related document workflows
version history and share/export flows

Local development

Requirements

Node.js 18+
npm

Install

npm install

Start the web app

npm run dev

Start the AI service

Create a local env file:

cp .env.example .env.local

PowerShell:

Copy-Item .env.example .env.local

Set at least:

GOOGLE_GENAI_USE_VERTEXAI=true
GOOGLE_CLOUD_PROJECT
GOOGLE_CLOUD_LOCATION
GEMINI_MODEL
GEMINI_FALLBACK_MODEL
AI_SERVER_PORT
AI_ALLOWED_ORIGIN
VITE_AI_API_BASE_URL

Authenticate locally with Application Default Credentials:

gcloud auth application-default login

Then run:

npm run ai:server

Cloud Run deployment

The AI service is prepared for Cloud Run.

Runtime contract:

PORT is provided by Cloud Run
GOOGLE_GENAI_USE_VERTEXAI=true
GOOGLE_CLOUD_PROJECT selects the Vertex AI project
GOOGLE_CLOUD_LOCATION selects the Vertex AI region
GEMINI_MODEL selects the primary model
GEMINI_FALLBACK_MODEL selects the fallback model used for model/quota failures
AI_ALLOWED_ORIGIN controls allowed frontend origins
AI_ALLOWED_ORIGIN must be an explicit frontend origin outside local development

Build and deploy pipeline:

Cloud Run auth:

the runtime service account must have Vertex AI permissions
the local API key path is no longer used for Cloud Run

Frontend deployment:

when Firebase Hosting rewrites /api/** to Cloud Run, set VITE_AI_API_BASE_URL to the deployed frontend origin
if you intentionally split frontend and API domains, set VITE_AI_API_BASE_URL to the deployed API origin
if omitted in a non-localhost environment, the frontend falls back to same-origin

Health check:

GET /api/ai/health
returns configuration-level configured state for the public AI runtime
GET /api/internal/ai/health includes model and fallbackModel when called with the diagnostics token
node scripts/check-ai-runtime-smoke.mjs --origin https://YOUR_HOST is the runtime readiness check and fails when POST /api/ai/agent/turn returns agentStatus

Google OAuth production guard:

set GOOGLE_OAUTH_PUBLISHING_STATUS=testing|production
set GOOGLE_WORKSPACE_SCOPE_PROFILE=restricted|reduced
deployed Google Workspace state now defaults to Firestore on Cloud Run so OAuth/session state is shared across instances
Firebase Hosting rewrites only forward the __session cookie to Cloud Run, so hosted workspace auth must use that cookie name in deployed HTTPS environments
the current deployed auth contract is documented in docs/current-workspace-auth-contract-2026-03-16.md
run npm run check:public-deploy before public deploys to validate custom-domain and OAuth settings

Main scripts

npm run dev - start the Vite dev server
npm run ai:server - start the AI service locally
npm run build - desktop-oriented production build
npm run build:web - web-oriented production build
npm run preview - preview the web build
npm run lint - run ESLint
npm run test - run Vitest
npm run typecheck:server - type-check server code

Important documents

Security notes

Vertex AI credentials are never exposed to the browser bundle.
Gemini calls are routed through the server layer.
The frontend sends document payloads to the AI service, while credentials stay in the server runtime and service account.
Workspace state now defaults outside the repository, and .data/ is ignored to prevent token-bearing state from being committed.

Status

This repository is under active hackathon preparation.

The current implementation already contains editor, patch review, and document analysis building blocks. The remaining work is focused on tightening the Gemini demo path: screenshot input, multimodal prompting, action JSON, and one end-to-end UI action.

Name		Name	Last commit message	Last commit date
Latest commit History 447 Commits
.claude		.claude
.github/workflows		.github/workflows
.playwright-cli		.playwright-cli
PRD		PRD
docs		docs
e2e		e2e
ops/gcp/monitoring		ops/gcp/monitoring
output		output
public		public
scripts		scripts
server		server
src		src
test-results		test-results
.env.example		.env.example
.firebaserc		.firebaserc
.gitignore		.gitignore
.playwright-editor-after.png		.playwright-editor-after.png
.playwright-editor-console.png		.playwright-editor-console.png
.playwright-editor.png		.playwright-editor.png
CODE_ANALYSIS_REPORT.md		CODE_ANALYSIS_REPORT.md
Dockerfile.ai		Dockerfile.ai
Dockerfile.tex		Dockerfile.tex
README.en.md		README.en.md
README.ko.md		README.ko.md
README.md		README.md
bun.lock		bun.lock
bun.lockb		bun.lockb
cloudbuild.ai.yaml		cloudbuild.ai.yaml
cloudbuild.tex.yaml		cloudbuild.tex.yaml
components.json		components.json
editor-after-fix.har		editor-after-fix.har
editor.har		editor.har
eslint.config.js		eslint.config.js
firebase.json		firebase.json
gcloud-version.err		gcloud-version.err
gcloud-version.out		gcloud-version.out
index.html		index.html
live-agent-server.err		live-agent-server.err
live-agent-server.out		live-agent-server.out
package-lock.json		package-lock.json
package.json		package.json
playwright.config.ts		playwright.config.ts
postcss.config.js		postcss.config.js
tailwind.config.ts		tailwind.config.ts
temp-editor-check.spec.js		temp-editor-check.spec.js
temp-import.docsy		temp-import.docsy
tsconfig.app.json		tsconfig.app.json
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
tsconfig.server.json		tsconfig.server.json
vite.config.ts		vite.config.ts
vite.config.ts.timestamp-1773157207115-b1812c62d88988.mjs		vite.config.ts.timestamp-1773157207115-b1812c62d88988.mjs
vite.config.ts.timestamp-1773214794362-07cbc34cffe098.mjs		vite.config.ts.timestamp-1773214794362-07cbc34cffe098.mjs
vite.config.ts.timestamp-1773216187204-496139fd43aa2.mjs		vite.config.ts.timestamp-1773216187204-496139fd43aa2.mjs
vite.config.ts.timestamp-1773217700694-c8c3e067ecab5.mjs		vite.config.ts.timestamp-1773217700694-c8c3e067ecab5.mjs
vite.config.ts.timestamp-1773218337162-31f6b7ccc46128.mjs		vite.config.ts.timestamp-1773218337162-31f6b7ccc46128.mjs
vite.config.ts.timestamp-1773283906363-e28d25a648116.mjs		vite.config.ts.timestamp-1773283906363-e28d25a648116.mjs
vite.config.ts.timestamp-1773284452363-64287acd9bf1a8.mjs		vite.config.ts.timestamp-1773284452363-64287acd9bf1a8.mjs
vite.config.ts.timestamp-1773284477960-2e2e9e5c876a18.mjs		vite.config.ts.timestamp-1773284477960-2e2e9e5c876a18.mjs
vite.config.ts.timestamp-1773290370603-6594c3e62f1548.mjs		vite.config.ts.timestamp-1773290370603-6594c3e62f1548.mjs
vite.config.ts.timestamp-1773292431647-f37e3a9beb0f98.mjs		vite.config.ts.timestamp-1773292431647-f37e3a9beb0f98.mjs
vite.config.ts.timestamp-1773384593505-d7e6b326d88128.mjs		vite.config.ts.timestamp-1773384593505-d7e6b326d88128.mjs
vite.config.ts.timestamp-1773392779125-75ade131f09bc.mjs		vite.config.ts.timestamp-1773392779125-75ade131f09bc.mjs
vite.config.ts.timestamp-1773629898178-9b264dfba17f08.mjs		vite.config.ts.timestamp-1773629898178-9b264dfba17f08.mjs
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Docsy

Language

Why this project exists

Hackathon focus

Demo flow

Architecture Diagram

Repository structure

Core capabilities

Local development

Requirements

Install

Start the web app

Start the AI service

Cloud Run deployment

Main scripts

Important documents

Security notes

Status

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Docsy

Language

Why this project exists

Hackathon focus

Demo flow

Architecture Diagram

Repository structure

Core capabilities

Local development

Requirements

Install

Start the web app

Start the AI service

Cloud Run deployment

Main scripts

Important documents

Security notes

Status

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages