ResearchGraph

Live Site: https://research-graph-seven.vercel.app

A full-stack application for exploring academic paper citation networks with AI-powered Q&A. Discover papers from arXiv, visualize their citation relationships as an interactive graph, and ask questions answered directly from the paper content using RAG (Retrieval-Augmented Generation).

Architecture

┌──────────────┐      ┌──────────────┐      ┌─────────────────────────┐
│  Next.js 14  │─────▶│  FastAPI     │─────▶│  PostgreSQL + pgvector  │
│  (port 3000) │◀─────│  (port 8000) │◀─────│  (port 5432)            │
└──────────────┘      └──────┬───────┘      └─────────────────────────┘
                             │
                             ├──▶ Gemini API (embeddings + chat)
                             └──▶ Redis (task queue)

Backend: FastAPI, SQLAlchemy (async), pgvector, Gemini API

Frontend: Next.js 14 (App Router), Tailwind CSS, React Flow, Lucide React

Database: PostgreSQL 16 with pgvector extension for vector similarity search

Features

Topic Discovery — Enter any research topic (CS, physics, biology, math, etc.) and background level. Gemini orders papers into an optimal reading path from foundational to advanced.
Citation Graph — Interactive node-based visualization of paper relationships using React Flow. Click a node to expand its citations dynamically.
Paper Q&A — Ask questions about any paper. Answers are generated from the paper's metadata and content using vector similarity search + Gemini 2.5 Flash.
Response Caching — Repeated questions are served from a cache layer instead of triggering a new LLM call, which cuts both API cost and latency.
Split-Screen UI — 65/35 layout with the graph on the left and chat on the right. Clicking a paper node switches the chat context.
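
As a rough illustration of the Citation Graph feature, the sketch below turns a list of papers and citation pairs into the `nodes` / `edges` shape that React Flow consumes. The `Paper` record, the `to_react_flow` helper, the `paperNode` type name, and the grid layout are assumptions made for this example; the project's actual GraphNode / GraphEdge schemas may differ.

```python
from dataclasses import dataclass


@dataclass
class Paper:
    # Hypothetical minimal record; the real ORM model likely has more fields.
    paper_id: str
    title: str
    year: int


def to_react_flow(papers, citations, x_gap=250, y_gap=120, columns=4):
    """Convert papers and (citing_id, cited_id) pairs into React Flow nodes/edges."""
    known = {p.paper_id for p in papers}

    # Lay nodes out on a simple grid, oldest paper first (assumed layout).
    nodes = []
    ordered = sorted(papers, key=lambda p: (p.year, p.paper_id))
    for i, p in enumerate(ordered):
        nodes.append({
            "id": p.paper_id,
            "type": "paperNode",  # assumed key for the custom node component
            "position": {"x": (i % columns) * x_gap, "y": (i // columns) * y_gap},
            "data": {"title": p.title, "year": p.year},
        })

    # Keep only edges between known papers and drop duplicates.
    edges = []
    seen = set()
    for citing, cited in citations:
        pair = (citing, cited)
        if citing not in known or cited not in known or pair in seen:
            continue
        seen.add(pair)
        edges.append({"id": f"{citing}->{cited}", "source": citing, "target": cited})

    return {"nodes": nodes, "edges": edges}
```

Expanding a node on click would then amount to fetching the same structure for the clicked paper and merging the new nodes and edges into the existing ones by `id`.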

Prerequisites

Docker and Docker Compose
A Google Gemini API key

Quick Start

# 1. Clone and configure
git clone <repo-url> && cd research-graph
cp .env.example .env
# Edit .env and add your GEMINI_API_KEY

# 2. Start all services
docker compose up --build -d

# 3. Open the app (`open` is macOS-only; on Linux use xdg-open, or visit the URL in a browser)
open http://localhost:3000
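
Because `docker compose up -d` returns before the services have finished starting, it can help to wait for the backend before opening the app. The following is a minimal sketch that polls the `/health` endpoint on port 8000; the helper names `http_status` and `wait_until_healthy` are invented for this example, and it only checks for an HTTP 200 because the response body of `/health` is not documented here.

```python
import time
import urllib.error
import urllib.request


def http_status(url, timeout=2.0):
    """Return the HTTP status code for a GET, or None if the server is unreachable."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as exc:
        # The server answered, but with an error status (e.g. 503 while starting).
        return exc.code
    except (urllib.error.URLError, OSError):
        # Connection refused, DNS failure, timeout, etc.
        return None


def wait_until_healthy(url, attempts=30, delay=1.0, fetch=http_status, sleep=time.sleep):
    """Poll `url` until it returns 200; return the number of attempts used."""
    for attempt in range(1, attempts + 1):
        if fetch(url) == 200:
            return attempt
        if attempt < attempts:
            sleep(delay)
    raise TimeoutError(f"{url} not healthy after {attempts} attempts")
```

For a local stack this would be called as `wait_until_healthy("http://localhost:8000/health")`.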

Project Structure

backend/
  app/
    main.py                 # FastAPI app, CORS, route registration
    db.py                   # Async SQLAlchemy session factory
    api/
      chat.py               # POST /chat — RAG pipeline with caching
      discover.py           # POST /discover — arXiv search + Gemini reading path
      graph.py              # GET /graph/{paper_id} — citation network
    models/
      paper.py              # ORM: Paper, PaperChunk, ChatCache, IngestionJob
    schemas/
      chat.py               # ChatRequest / ChatResponse
      graph.py              # GraphNode / GraphEdge / GraphResponse
    core/
      config.py             # Pydantic Settings (env-based config)
    services/               # Business logic (ingestion, etc.)

frontend/
  src/
    app/
      layout.tsx            # Root layout
      page.tsx              # Dashboard — state orchestrator
    components/
      GraphView.tsx         # React Flow graph with expand-on-click
      PaperNode.tsx         # Custom node component (title, year, authors)
      ChatPanel.tsx         # Chat UI with source badges
    hooks/
      useChat.ts            # Chat state management, clears on paper change
    lib/
      api.ts                # Typed API client

migrations/
  init.sql                  # Full database schema (tables, indexes, functions)

API Endpoints

| Method | Path | Description |
|--------|------|-------------|
| `POST` | `/discover` | Search arXiv for papers and build a Gemini-ordered reading path |
| `POST` | `/chat` | Ask a question about a paper (cache check → vector search → LLM) |
| `GET` | `/graph/{paper_id}` | Get citation network as React Flow nodes/edges |
| `GET` | `/health` | Health check |
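
The `/chat` flow (cache check → vector search → LLM) can be sketched in plain Python as follows. This is an in-memory illustration under stated assumptions, not the project's implementation: the real service presumably ranks chunks with pgvector inside PostgreSQL and calls Gemini for embeddings and generation, whereas here `embed` and `generate` are caller-supplied stand-ins, the cache is a dict, and the names `cosine`, `cache_key`, and `answer` are hypothetical.

```python
import hashlib
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def cache_key(paper_id, question):
    """Hash of paper id plus the question, normalized for case and whitespace."""
    normalized = " ".join(question.lower().split())
    return hashlib.sha256(f"{paper_id}:{normalized}".encode()).hexdigest()


def answer(paper_id, question, cache, chunks, embed, generate, top_k=2):
    # 1. Cache check: a repeated question skips retrieval and generation.
    key = cache_key(paper_id, question)
    if key in cache:
        return {**cache[key], "source": "cache"}

    # 2. Vector search: rank the paper's chunks by similarity to the question.
    q_vec = embed(question)
    ranked = sorted(chunks, key=lambda c: cosine(q_vec, c["embedding"]), reverse=True)
    context = [c["text"] for c in ranked[:top_k]]

    # 3. LLM: generate an answer grounded in the retrieved chunks, then cache it.
    result = {"answer": generate(question, context), "context_used": context}
    cache[key] = result
    return {**result, "source": "llm"}
```

Normalizing the question before hashing is one possible design choice: it lets trivially different spellings of the same question hit the cache, at the cost of treating case-sensitive questions as identical.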

POST /chat

Request:

{
  "paper_id": "1706.03762",
  "question": "What is the Transformer architecture?"
}

Response:

{
  "answer": "The Transformer is a model architecture...",
  "source": "llm",
  "context_used": ["chunk text 1", "chunk text 2"]
}

The `source` field is `"cache"` for cached responses and `"llm"` for freshly generated ones.
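
For scripting against this endpoint, a small standard-library client might look like the sketch below. The request and response fields come from the example above; the `ChatResponse` dataclass and the helper names `build_chat_request`, `parse_chat_response`, and `ask` are assumptions introduced for illustration, and the base URL is whatever your backend is reachable at (for local development, `http://localhost:8000`).

```python
import json
import urllib.request
from dataclasses import dataclass


@dataclass
class ChatResponse:
    answer: str
    source: str          # "cache" or "llm"
    context_used: list   # chunk texts the answer was grounded in


def build_chat_request(base_url, paper_id, question):
    """Build (but do not send) the POST /chat request."""
    body = json.dumps({"paper_id": paper_id, "question": question}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url.rstrip('/')}/chat",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def parse_chat_response(raw):
    """Parse a /chat response body (str or bytes) into a ChatResponse."""
    payload = json.loads(raw)
    if payload["source"] not in ("cache", "llm"):
        raise ValueError(f"unexpected source: {payload['source']!r}")
    return ChatResponse(
        answer=payload["answer"],
        source=payload["source"],
        context_used=payload.get("context_used", []),
    )


def ask(base_url, paper_id, question, timeout=30):
    """Send the question and return the parsed response."""
    req = build_chat_request(base_url, paper_id, question)
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return parse_chat_response(resp.read())
```

Calling `ask("http://localhost:8000", "1706.03762", "What is the Transformer architecture?")` twice should show `source` changing from `"llm"` to `"cache"` if caching behaves as described.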

Environment Variables

| Variable | Description | Default |
|----------|-------------|---------|
| `GEMINI_API_KEY` | Google Gemini API key (required) | — |
| `DATABASE_URL` | PostgreSQL connection string | `postgresql+asyncpg://...@localhost:5432/researchgraph_db` |
| `REDIS_URL` | Redis connection string | `redis://localhost:6379/0` |
| `FRONTEND_URL` | Vercel frontend URL for CORS (production) | — |
| `NEXT_PUBLIC_API_URL` | Backend URL for the frontend | `http://localhost:8000` |
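
To show how the backend variables might be consumed, here is a standard-library sketch of environment-based configuration. The project itself uses Pydantic Settings in `core/config.py`; the `Settings` dataclass, `load_settings`, and `cors_origins` below are stand-ins written for this example. `DATABASE_URL` has no default in the sketch because the documented default elides the credentials, and allowing `http://localhost:3000` as a CORS origin is an assumption based on the local frontend port.

```python
import os
from dataclasses import dataclass
from typing import List, Mapping, Optional


@dataclass(frozen=True)
class Settings:
    gemini_api_key: str
    redis_url: str = "redis://localhost:6379/0"
    database_url: Optional[str] = None   # documented default has elided credentials
    frontend_url: Optional[str] = None   # only set in production

    def cors_origins(self) -> List[str]:
        """Origins allowed to call the API: local dev plus the deployed frontend."""
        origins = ["http://localhost:3000"]  # assumed local frontend origin
        if self.frontend_url:
            origins.append(self.frontend_url.rstrip("/"))
        return origins


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Read settings from a mapping (the process environment by default)."""
    key = env.get("GEMINI_API_KEY", "").strip()
    if not key:
        raise RuntimeError("GEMINI_API_KEY is required")
    return Settings(
        gemini_api_key=key,
        redis_url=env.get("REDIS_URL", "redis://localhost:6379/0"),
        database_url=env.get("DATABASE_URL"),
        frontend_url=env.get("FRONTEND_URL") or None,
    )
```

Failing fast on a missing `GEMINI_API_KEY` surfaces a misconfigured deployment at startup rather than on the first request.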

Deployment

Backend + Database — Render

1. Push the repo to GitHub.
2. Go to the Render Dashboard and click New > Blueprint.
3. Connect your repo. Render reads render.yaml and creates:
   - PostgreSQL database (Free plan, pgvector-ready)
   - Web service running the FastAPI backend
4. In the Render dashboard, set the environment variables:
   - GEMINI_API_KEY: your Gemini API key
   - FRONTEND_URL: your Vercel URL (e.g. https://research-graph.vercel.app)
5. The first deploy runs start.sh, which auto-applies migrations/init.sql.
6. Note your Render service URL (e.g. https://researchgraph-api.onrender.com).

Frontend — Vercel

1. Go to vercel.com/new and import the repo.
2. Set the Root Directory to frontend.
3. Add the environment variable:
   - NEXT_PUBLIC_API_URL = your Render backend URL (e.g. https://researchgraph-api.onrender.com)
4. Deploy. Vercel auto-detects Next.js and builds it.

Development (Local)

Run services individually for local development:

# Database + Redis
docker compose up postgres redis -d

# Backend (with hot reload)
cd backend
pip install -r requirements.txt
uvicorn app.main:app --reload --port 8000

# Frontend (run in a second terminal from the repo root; uvicorn keeps the first one busy)
cd frontend
npm install
npm run dev

License

MIT
