Skip to content

A versatile tool for image processing and descriptive analysis. This application leverages OpenCV and LibRaw for efficient conversion of raw images to JPEG format, and utilizes CLIP for generating meaningful textual descriptions of images. Ideal for rapid and high-quality image interpretation.

License

Notifications You must be signed in to change notification settings

LouisTsang-jk/ImageNarrator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ImageNarrator

screen-recording

Description

ImageNarrator is a versatile tool for image processing and descriptive analysis. This application leverages OpenCV and LibRaw for efficient conversion of raw images to JPEG format, and utilizes CLIP for generating meaningful textual descriptions of images. It is ideal for rapid and high-quality image interpretation.

Installation

This project requires Python 3.7 or later. We recommend installing the necessary dependencies in a virtual environment to avoid conflicting with your system Python.

To install the dependencies, navigate to the root of the project directory and run:

python3 -m venv env  # create a virtual environment
source env/bin/activate  # activate the virtual environment
pip install -r requirements.txt  # install the dependencies

Usage

  1. Place your RAW format image files into the 'images' directory.

  2. Update the 'custom.json' file to specify the categories you are interested in. Please note, categories should be described in English.

  3. Run the main program using the command build sh.

The processed images will be saved in the 'output' directory, along with a text file containing their descriptions.

License

This project is licensed under the terms of the MIT License.

Acknowledgement

OpenCLIP
OpenCV
LibRaw

About

A versatile tool for image processing and descriptive analysis. This application leverages OpenCV and LibRaw for efficient conversion of raw images to JPEG format, and utilizes CLIP for generating meaningful textual descriptions of images. Ideal for rapid and high-quality image interpretation.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published