thisishugow/create-llama-ollama

A simple create-llama template using llama-index v0.10, integrated with Ollama.

This is a LlamaIndex project using FastAPI bootstrapped with create-llama.

Demo

Getting Started

Quickstart

Option 1) Clone the project and set it up yourself

git clone https://github.com/thisishugow/create-llama-ollama.git
cd create-llama-ollama
poetry install

Download and install Ollama (Guide).

# After Ollama is installed.
ollama pull llama3
ollama serve &
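
If you want to sanity-check that the Ollama server is reachable before starting the app, its HTTP API listens on port 11434 by default; a minimal check from Python:

# Quick smoke test for the local Ollama server (default port 11434).
import urllib.request

with urllib.request.urlopen("http://localhost:11434") as resp:
    print(resp.read().decode())  # Ollama's root endpoint replies "Ollama is running"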

Start the app:

poetry run python ./backend/main.py -c ./backend/config.json # point -c at your own configuration file if needed
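
For reference, a config file for an app like this usually carries values such as the model name and serving port. The keys below are illustrative assumptions, not the actual schema - check the config.json shipped in the repo for the real fields:

{
    "model": "llama3",
    "host": "0.0.0.0",
    "port": 8080
}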

Then visit http://localhost:8080 with your browser to see the result.

Option 2) Docker container

# for Windows WSL2/Linux
docker run --name my-offline-llama -p 8080:8080 thisisyuwang/create-llama-ollama:latest-linux-amd64 

# for Mac arm64
docker run --name my-offline-llama -p 8080:8080 thisisyuwang/create-llama-ollama:latest-arm64 

⚠️ WARNING: Downloading the LLM can take several minutes the first time.

Then visit http://localhost:8080 with your browser to see the result.

For Development

First, start up the backend as described in the backend README.

Second, run the development server of the frontend as described in the frontend README.

Open http://localhost:3000 with your browser to see the result.

Backend

First, set up the environment:

poetry install
poetry shell

By default, we use the OpenAI LLM (though you can customize it; see app/context.py). As a result, you need to specify an OPENAI_API_KEY in an .env file in this directory.

Example .env file:

OPENAI_API_KEY=<openai_api_key>
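
Since this template targets Ollama, app/context.py is where you would swap the LLM. A minimal sketch using the llama-index v0.10 Ollama integration (assumes the llama-index-llms-ollama package is installed; the actual file in this repo may differ):

# Sketch only - the repo's app/context.py may look different.
from llama_index.core import Settings
from llama_index.llms.ollama import Ollama

Settings.llm = Ollama(model="llama3", request_timeout=120.0)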

Second, generate the embeddings of the documents in the ./data directory (if this folder exists; otherwise, skip this step):

python app/engine/generate.py
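
A create-llama generate script typically reads everything in ./data, builds a vector index, and persists it to disk. Roughly (a sketch, not this repo's exact code):

# Rough shape of the generate step (sketch, not the repo's exact code).
from llama_index.core import SimpleDirectoryReader, VectorStoreIndex

documents = SimpleDirectoryReader("./data").load_data()  # read every file in ./data
index = VectorStoreIndex.from_documents(documents)       # embed and index them
index.storage_context.persist(persist_dir="./storage")   # save the index for the server to load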

Third, run the development server:

python main.py

Then call the API endpoint /api/chat to see the result:

curl --location 'localhost:8000/api/chat' \
--header 'Content-Type: application/json' \
--data '{ "messages": [{ "role": "user", "content": "Hello" }] }'
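
The same call from Python, if that is more convenient than curl:

# Equivalent request using the `requests` library.
import requests

resp = requests.post(
    "http://localhost:8000/api/chat",
    json={"messages": [{"role": "user", "content": "Hello"}]},
)
print(resp.text)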

You can start editing the API by modifying app/api/routers/chat.py. The endpoint auto-updates as you save the file.
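
Schematically, the router has the usual FastAPI shape shown below; this is a sketch of the structure, not the file's actual contents (the real handler also wires in the LlamaIndex chat engine and streams the response):

# Schematic shape of app/api/routers/chat.py (sketch, not the actual file).
from fastapi import APIRouter
from pydantic import BaseModel

chat_router = APIRouter()

class Message(BaseModel):
    role: str
    content: str

class ChatData(BaseModel):
    messages: list[Message]

@chat_router.post("/chat")
async def chat(data: ChatData):
    # The real handler forwards data.messages to a chat engine
    # and streams the answer back to the client.
    ...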

Open http://localhost:8000/docs with your browser to see the Swagger UI of the API.

The API allows CORS for all origins to simplify development. You can change this behavior by setting the ENVIRONMENT environment variable to prod:

ENVIRONMENT=prod uvicorn main:app
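
That flag typically gates the CORS middleware along these lines (a sketch of the common create-llama pattern, not necessarily this repo's exact main.py):

# Sketch of how ENVIRONMENT usually gates CORS (not necessarily this repo's exact code).
import os
from fastapi import FastAPI
from fastapi.middleware.cors import CORSMiddleware

app = FastAPI()
if os.getenv("ENVIRONMENT", "dev") == "dev":
    app.add_middleware(
        CORSMiddleware,
        allow_origins=["*"],  # any origin - development only
        allow_methods=["*"],
        allow_headers=["*"],
    )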

Learn More

To learn more about LlamaIndex, check out the LlamaIndex GitHub repository - your feedback and contributions are welcome!
