ImageNarrator

Description

ImageNarrator is a versatile tool for image processing and descriptive analysis. This application leverages OpenCV and LibRaw for efficient conversion of raw images to JPEG format, and utilizes CLIP for generating meaningful textual descriptions of images. It is ideal for rapid and high-quality image interpretation.

Installation

This project requires Python 3.7 or later. We recommend installing the necessary dependencies in a virtual environment to avoid conflicting with your system Python.

To install the dependencies, navigate to the root of the project directory and run:

python3 -m venv env  # create a virtual environment
source env/bin/activate  # activate the virtual environment
pip install -r requirements.txt  # install the dependencies

Usage

Place your RAW format image files into the 'images' directory.
Update the 'custom.json' file to specify the categories you are interested in. Please note, categories should be described in English.
Run the main program using the command build sh.

The processed images will be saved in the 'output' directory, along with a text file containing their descriptions.

License

This project is licensed under the terms of the MIT License.

Acknowledgement

OpenCLIP
OpenCV
LibRaw

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.vscode		.vscode
assets		assets
datasets		datasets
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
build.sh		build.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ImageNarrator

Description

Installation

Usage

License

Acknowledgement

About

Releases

Packages

Languages

License

LouisTsang-jk/ImageNarrator

Folders and files

Latest commit

History

Repository files navigation

ImageNarrator

Description

Installation

Usage

License

Acknowledgement

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages