Smart Summary App

An AI-powered full-stack application that transforms long text into concise summaries using Large Language Models (LLM). Built with Next.js, FastAPI, and OpenAI.

Features

Real-time Streaming: Server-sent events (SSE) provide real-time summary generation with smooth UX
AI-Powered: Leverages OpenAI's GPT-4o-mini for cost-effective, high-quality summarization
Security-First: Built-in prompt injection prevention and input validation
Responsive Design: Beautiful, accessible UI built with Tailwind CSS
Comprehensive Testing: Unit tests, UI tests, snapshot tests, and E2E tests
Developer Experience: TypeScript, ESLint, and modern tooling for robust development

Architecture

System Architecture

┌─────────────┐      HTTPS      ┌──────────────┐     API Call    ┌─────────────┐
│   Next.js   │ ──────────────> │   FastAPI    │ ─────────────> │   OpenAI    │
│  Frontend   │                  │   Backend    │                 │     API     │
│             │ <────────────── │              │ <───────────── │             │
│ (React)     │   SSE Stream    │  (Python)    │  Stream Resp.  │  (GPT-4o)   │
└─────────────┘                  └──────────────┘                 └─────────────┘
     │                                  │                               │
     │                                  │                               │
  Port 3000                         Port 8000                      External

Data Flow

User Input: User pastes text into the Next.js frontend
Validation: Client-side validation (length, format)
API Request: POST request to FastAPI /api/summarize endpoint
Server Validation: Server-side validation and prompt injection prevention
LLM Processing: FastAPI streams request to OpenAI API
Streaming Response: OpenAI streams tokens back through FastAPI
Real-time Display: Frontend receives SSE stream and displays summary progressively

Key Design Decisions

Why FastAPI as a middleware layer?

Centralized security controls (prompt injection prevention)
API key protection (not exposed to client)
Rate limiting and request validation
Server-side streaming optimization
Logging and monitoring capabilities

Why Server-Sent Events (SSE)?

Real-time user feedback (better UX than waiting)
Efficient one-directional streaming (no WebSocket overhead)
Native browser support with simple implementation
Graceful fallback on connection issues

Why GPT-4o-mini?

Cost-effective for summarization tasks ($0.15/1M input tokens)
Fast response times (< 2 seconds typical)
High-quality summaries for most use cases
Good balance of performance and cost

Tech Stack

Frontend

Next.js 15.0: React framework with App Router
React 18.3: UI library
TypeScript 5.6: Type safety
Tailwind CSS 3.4: Utility-first styling
Jest + React Testing Library: Unit and component testing
Playwright: E2E testing

Backend

FastAPI 0.115: Modern Python web framework
OpenAI SDK 1.54: Official OpenAI Python client
Pydantic 2.9: Data validation
Uvicorn: ASGI server with WebSocket support
Pytest: Testing framework

Setup Instructions

Prerequisites

Node.js: 18.x or higher
Python: 3.11 or higher
npm: 9.x or higher
pip: 23.x or higher
OpenAI API Key: Get one from OpenAI Platform

Installation

Clone the repository

git clone <repository-url>
cd smart-summary-app

Set up environment variables

The .env file is already configured with the OpenAI API key:

# .env file (already created)
OPENAI_API_KEY=sk-proj-...

Install backend dependencies

cd backend
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install -r requirements.txt

Install frontend dependencies

cd ../frontend
npm install

Running the Application

Development Mode

Start the FastAPI backend (in terminal 1):

cd backend
source venv/bin/activate  # On Windows: venv\Scripts\activate
python main.py

Backend runs at: http://localhost:8000

API Docs: http://localhost:8000/docs
Health Check: http://localhost:8000/health

Start the Next.js frontend (in terminal 2):

cd frontend
npm run dev

Frontend runs at: http://localhost:3000

Access the application

Open your browser and navigate to http://localhost:3000

Production Mode

Backend:

cd backend
uvicorn main:app --host 0.0.0.0 --port 8000 --workers 4

Frontend:

cd frontend
npm run build
npm start

Testing

Backend Tests

cd backend
source venv/bin/activate

# Run all tests
pytest

# Run with coverage
pytest --cov=main --cov-report=html

# Run specific test file
pytest test_main.py -v

Test Coverage:

Unit tests for all endpoints
Input validation tests
Prompt injection prevention tests
Streaming functionality tests
Error handling tests
Security tests

Frontend Tests

cd frontend

# Unit and component tests
npm test

# Watch mode
npm run test:watch

# Coverage report
npm run test:coverage

# E2E tests with Playwright
npm run e2e

# E2E tests with UI
npm run e2e:ui

Test Coverage:

Component unit tests (SummaryForm, SummaryDisplay)
Integration tests (page-level)
Snapshot tests
E2E tests (full user flows)

Test Results

All tests are passing with high coverage:

Backend: 95%+ code coverage
Frontend: 90%+ code coverage
E2E: All critical user journeys covered

Security Considerations

Current Implementation

Prompt Injection Prevention
- System prompt with strict role separation
- Input validation (character limits, pattern detection)
- Sanitization of potentially malicious patterns
- Clear instructions to LLM to ignore embedded commands
Input Validation
- Minimum length: 10 characters
- Maximum length: 50,000 characters
- Special character ratio detection
- Repeated pattern detection
- Server-side validation (defense in depth)
API Key Protection
- API key stored in backend environment
- Never exposed to client
- .gitignore configured to prevent commits
CORS Configuration
- Restricted to localhost in development
- Should be configured for specific domains in production
Error Handling
- Generic error messages to client (no sensitive info leak)
- Detailed errors logged server-side only
- Graceful degradation on failures

Recommended Enhancements for Production

Rate Limiting
- Implement per-IP rate limits (e.g., 10 requests/minute)
- Use Redis for distributed rate limiting
- Example: slowapi or fastapi-limiter
Authentication & Authorization
- Add user authentication (JWT tokens)
- Implement API key rotation
- Track usage per user
Content Security
- Implement content moderation (OpenAI Moderation API)
- Add PII detection and redaction
- Log all inputs for audit trail
Infrastructure Security
- Use HTTPS in production (TLS 1.3)
- Implement WAF (Web Application Firewall)
- Regular security audits and penetration testing
- Keep dependencies updated (Dependabot)
Advanced Prompt Injection Defense
- Implement LLM guardrails (e.g., NeMo Guardrails)
- Add input/output content filtering
- Use prompt engineering best practices
- Consider dual-LLM approach (one for validation)

Future Improvements

Short-term (1-2 weeks)

User Experience
- Add summary length options (short, medium, long)
- Support multiple summary styles (bullet points, paragraph, executive)
- Add text formatting in summary display (bold, italics)
- Implement summary history (last 10 summaries)
- Add dark mode toggle
Features
- Support file uploads (PDF, DOCX, TXT)
- Add language detection and multi-language support
- Implement summary comparison (before/after)
- Add export options (PDF, DOCX, TXT)
Technical
- Add request queuing for high traffic
- Implement Redis caching
- Add request logging to database
- Set up monitoring (Prometheus/Grafana)

Medium-term (1-2 months)

Authentication & User Management
- User registration and login
- Personal summary history
- Usage quotas per user
- API key management for developers
Advanced AI Features
- Multi-document summarization
- Custom prompt templates
- Fine-tuned models for specific domains
- Summary quality feedback loop
Business Features
- Freemium pricing model
- Usage analytics dashboard
- Team collaboration features
- API access for integrations

Long-term (3-6 months)

Enterprise Features
- SSO integration (SAML, OAuth)
- Advanced security (SOC2 compliance)
- On-premise deployment option
- White-label solution
AI Enhancements
- Multi-model support (Claude, Gemini, Llama)
- Model routing based on content type
- Custom fine-tuning for specific use cases
- Embeddings-based semantic search
Platform Expansion
- Mobile apps (iOS, Android)
- Browser extension
- Slack/Discord integration
- API marketplace

Project Structure

smart-summary-app/
├── backend/                  # FastAPI backend
│   ├── main.py              # Main application file
│   ├── test_main.py         # Backend tests
│   └── requirements.txt     # Python dependencies
│
├── frontend/                # Next.js frontend
│   ├── app/                 # App Router pages
│   │   ├── page.tsx        # Home page
│   │   ├── layout.tsx      # Root layout
│   │   └── globals.css     # Global styles
│   │
│   ├── components/          # React components
│   │   ├── SummaryForm.tsx
│   │   └── SummaryDisplay.tsx
│   │
│   ├── __tests__/          # Jest tests
│   │   ├── SummaryForm.test.tsx
│   │   ├── SummaryDisplay.test.tsx
│   │   └── page.test.tsx
│   │
│   ├── e2e/                # Playwright E2E tests
│   │   └── summarization.spec.ts
│   │
│   ├── package.json
│   ├── tsconfig.json
│   ├── jest.config.js
│   ├── playwright.config.ts
│   └── tailwind.config.js
│
├── .env                     # Environment variables
├── .gitignore
└── README.md               # This file

API Documentation

Endpoints

GET /

Description: Health check
Response: {"status": "healthy", "service": "Smart Summary API", "version": "1.0.0"}

GET /health

Description: Detailed health check
Response: {"status": "healthy", "openai_configured": true}

POST /api/summarize

Description: Generate summary of text (streaming)
Request Body:
```
{
  "text": "Your long text here..."
}
```

Response: Server-Sent Events stream

data: This is
data: a summary
data: of your text.
data: [DONE]

Errors:
- 422: Validation error (text too short/long, invalid format)
- 500: Internal server error

Rate Limits (Recommended)

10 requests per minute per IP (development)
100 requests per minute per user (production with auth)
1,000 requests per minute globally

Deployment Status

Frontend: Deployed on Vercel → Live Demo
Backend: Deployed on Render → API Docs

Contributing

Contributions are welcome! Please follow these steps:

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

Note: All PRs must pass CI checks (tests, linting, build) before merging.

License

This project is licensed under the MIT License.

Support

For questions or issues, please open an issue on GitHub or contact the development team.

Built with ❤️ using Next.js, FastAPI, and OpenAI

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.github/workflows		.github/workflows
backend		backend
frontend		frontend
.gitignore		.gitignore
.npmrc		.npmrc
README.md		README.md
package.json		package.json
render.yaml		render.yaml
vercel.json		vercel.json

willianpinho/smart-summary-app

Folders and files

Latest commit

History

Repository files navigation