# AI4Free API Wrapper

An industry-grade, multi-provider API wrapper for chat completions.
## Table of Contents

- Overview
- Features
- Architecture & Directory Structure
- Installation
- Configuration
- Running the Application
- API Endpoints
- Testing
- Database Management
- Contributing
- License
- Contact
## Overview

The AI4Free API Wrapper is a robust, production-ready API wrapper that provides a unified interface to multiple Large Language Model (LLM) providers. It supports both streaming and non-streaming chat completions and includes built-in functionality such as API key management, rate limiting, detailed usage tracking, and a comprehensive testing suite.
This repository is maintained by SreejanPersonal and is intended to serve as a backend service for applications requiring integration with various LLMs using a single, consistent API.
## Features

- Multi-Provider Support: Seamlessly switch between providers such as DeepSeek-R1, gpt-4o, o3-mini, DeepSeekV3, and more.
- Unified API Interface: Consistent request and response schemas across different providers.
- Streaming & Non-Streaming: Supports both streaming responses and standard completions.
- Robust Rate Limiting: Protect your services with configurable rate limiting using Redis and Lua scripting.
- API Key Management: Generate, validate, and manage API keys securely.
- Token Usage & Cost Tracking: Detailed tracking of prompt, completion tokens, and associated costs.
- Flask-based REST API: Powered by Flask with CORS enabled for cross-origin requests.
- PostgreSQL & SQLAlchemy Integration: Reliable data persistence and ORM support.
- Comprehensive Testing Suite: End-to-end testing scripts for API endpoints, provider integrations, and usage metrics.
- Production-Ready Configuration: Gunicorn with gevent for rapid asynchronous handling and optimal CPU resource utilization.
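The "Robust Rate Limiting" feature above combines Redis with Lua scripting so that the check-and-increment happens atomically across workers. As a rough illustration, here is a pure-Python fixed-window limiter that mirrors that logic in-process; the class and method names are hypothetical and this is not the repository's actual implementation:

```python
import time

# Illustrative sketch only: the repository performs this check atomically in
# Redis via a Lua script; this in-process version mirrors the same
# fixed-window logic for one API key per time window.
class FixedWindowLimiter:
    def __init__(self, limit, window_seconds):
        self.limit = limit
        self.window = window_seconds
        self.counters = {}  # (api_key, window_start) -> request count

    def allow(self, api_key, now=None):
        """Return True and count the request if the key is under its limit."""
        now = time.time() if now is None else now
        window_start = int(now // self.window) * self.window
        bucket = (api_key, window_start)
        count = self.counters.get(bucket, 0)
        if count >= self.limit:
            return False  # over the limit for this window
        self.counters[bucket] = count + 1
        return True
```

On Redis, the equivalent would typically be an `INCR` plus `EXPIRE` wrapped in a single `EVAL`'d Lua script, so concurrent Gunicorn workers cannot race on the counter.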
## Architecture & Directory Structure

The repository is organized following industry best practices using the application factory pattern. The key directories include:
- `app/`: The core Flask application, including configuration files, extensions, API routes, controllers, services, and models.
- `providers/`: Integrations for multiple LLM providers with alias-based mapping to ensure flexibility. Each provider has its own implementation module.
- `services/`: Business logic such as API key management, rate limiting, and detailed usage recording.
- `utils/`: Utility functions for database context management, token counting, stream processing, and helper methods.
- `data/`: Centralized data files such as `models.json`, with provider-specific and versioned model metadata.
- `Testing/`: Testing scripts and clients that validate API endpoints, including chat completions and usage statistics.
- Additional files: `db_manager.py` for database creation, reset, and management; `run.py` for running the Flask application; `gunicorn.config.py` for production deployment configuration; `requirements.txt` listing all dependencies.
## Installation

### Prerequisites

- Python 3.10+
- PostgreSQL
- Redis

### Setup

Clone the repository:

```bash
git clone https://github.com/SreejanPersonal/ai4free-wrapper.git
cd ai4free-wrapper
```

Create and activate a virtual environment using `venv`, then install the dependencies:

```bash
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install -r requirements.txt
```
## Configuration

- Environment Variables:
  Create a `.env` file in the project root. You may refer to `.env.example` for the complete set of environment variables. The default settings include:
  - Provider Base URLs & API Keys: Set your provider endpoints and API keys accordingly.
  - Local API URL: `LOCAL_API_URL=http://127.0.0.1:5000`
  - Database Settings: Configure PostgreSQL connection details.
  - Redis Settings: Configure your Redis instance.
  - Flask Settings: Set `FLASK_SECRET_KEY` and toggle `FLASK_DEBUG` as needed.
  - System Secret: Used for secure API key generation and management.
- Model Configuration:
  Models and provider mappings are defined in `data/models.json` and further refined in `app/config.py`.
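A minimal `.env` sketch follows. Only `LOCAL_API_URL`, `FLASK_SECRET_KEY`, and `FLASK_DEBUG` are named above; the database and Redis variable names below are hypothetical placeholders, so consult `.env.example` for the authoritative names:

```env
# Local API URL (from the defaults above)
LOCAL_API_URL=http://127.0.0.1:5000

# Flask settings
FLASK_SECRET_KEY=change-me
FLASK_DEBUG=false

# Hypothetical variable names: check .env.example for the real ones
DATABASE_URL=postgresql://user:password@localhost:5432/ai4free
REDIS_URL=redis://localhost:6379/0
```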
## Running the Application

Start the Flask development server using:

```bash
python run.py
```

The API will be accessible at http://127.0.0.1:5000.

For production, use Gunicorn with the provided configuration:

```bash
gunicorn -c gunicorn.config.py run:app
```
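Gunicorn reads its settings from `gunicorn.config.py`, which is an ordinary Python module. A minimal sketch of such a file, assuming the gevent workers mentioned in the features list (the bind address, worker count, and timeout here are placeholders, not the repository's actual values):

```python
# Illustrative gunicorn.config.py sketch; all values are placeholders.
import multiprocessing

bind = "0.0.0.0:5000"                          # listen address (placeholder)
worker_class = "gevent"                        # async workers, suited to streaming
workers = multiprocessing.cpu_count() * 2 + 1  # common CPU-based sizing heuristic
timeout = 120                                  # allow long-running completions
```

Gevent workers let each process multiplex many concurrent streaming responses, which matters for long-lived chat completion connections.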
## API Endpoints

The API is designed to be OpenAI-compatible. Some key endpoints include:

- Health Check: `GET /health`
  Returns a simple JSON response indicating the service status.
- Chat Completions: `POST /v1/chat/completions`
  Handles both streaming and non-streaming chat completions. Requires a valid API key passed in the `Authorization` header (Bearer token).
- List Models: `GET/POST /v1/models`
  Returns available models and their configurations.
- API Key Generation: `POST /v1/api-keys`
  Generates a new API key for a user. Requires system secret verification.
- Usage Details: `POST /v1/usage`
  Returns usage details for the provided API key, including token counts, request statistics, and cost metrics.
- Uptime Check: `GET /v1/uptime/<model_id>`
  Performs a minimal streaming check to verify that a model is up.

Note: Ensure your requests include the appropriate JSON payloads and headers as specified in the code comments and schema validations.
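Since the API is OpenAI-compatible, a chat completion call is an ordinary authenticated JSON POST. A minimal sketch using only the standard library; the base URL, model name, and API key are placeholders to substitute with your own:

```python
import json
import urllib.request

BASE_URL = "http://127.0.0.1:5000"  # placeholder; match your deployment


def build_chat_request(api_key, model, messages, stream=False):
    """Assemble the URL, headers, and JSON body for /v1/chat/completions."""
    url = f"{BASE_URL}/v1/chat/completions"
    headers = {
        "Authorization": f"Bearer {api_key}",  # Bearer token, as noted above
        "Content-Type": "application/json",
    }
    body = json.dumps({"model": model, "messages": messages, "stream": stream})
    return url, headers, body


def chat(api_key, model, messages):
    """Send a non-streaming completion request and return the parsed JSON."""
    url, headers, body = build_chat_request(api_key, model, messages)
    req = urllib.request.Request(url, data=body.encode(), headers=headers)
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())
```

For example, `chat("your-api-key", "gpt-4o", [{"role": "user", "content": "Hello"}])` against a running server; the scripts in `Testing/` exercise these endpoints more thoroughly.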
## Testing

A comprehensive suite of test scripts is provided in the `Testing/` directory. These scripts cover:

- Chat Completion Testing: `API_Endpoint_Testing.py` and `OpenAI_Client_Testing.py`
- API Usage Testing: `test_api_usage.py`

To run the tests, execute the scripts directly, for example:

```bash
python Testing/API_Endpoint_Testing.py
python Testing/OpenAI_Client_Testing.py
python Testing/test_api_usage.py
```

These tests verify provider integration, streaming versus non-streaming behavior, and rate limiting.
## Database Management

The repository includes a CLI tool for managing your PostgreSQL database:

- Create database & tables: `python db_manager.py create-db`
- Clean (drop) database tables: `python db_manager.py clean-db`
- Reset database (drop & recreate tables): `python db_manager.py reset-db`
- List all tables: `python db_manager.py list-tables`

Warning: These commands modify your database schema. Ensure you have proper backups before running them in production.
## Contributing

Contributions are welcome! Please follow these guidelines:

- Fork the Repository: Create your own branch from the `main` branch.
- Code Standards: Follow PEP 8 for Python code. Ensure proper logging, error handling, and adherence to the application architecture.
- Commit Messages: Use descriptive commit messages detailing your changes and reasoning.
- Pull Request: Submit a pull request with a clear description of your changes and any additional context or testing instructions.
## License

This project is licensed under the MIT License. See the LICENSE file for details.
## Contact

For questions or suggestions, please reach out via GitHub:
- GitHub: SreejanPersonal
Thank you for using the AI4Free API Wrapper. Happy coding!