This repository contains the code supporting the CLIP base model for use with Autodistill.
CLIP, developed by OpenAI, is a computer vision model trained using pairs of images and text. You can use CLIP with autodistill for image classification.
Read the full Autodistill documentation.
Read the CLIP Autodistill documentation.
To use CLIP with autodistill, you need to install the following dependency:
pip3 install autodistill-clip
from autodistill_clip import CLIP
from autodistill.detection import CaptionOntology
# define an ontology to map class names to our CLIP prompt
# the ontology dictionary has the format {caption: class}
# where caption is the prompt sent to the base model, and class is the label that will
# be saved for that caption in the generated annotations
# then, load the model
base_model = CLIP(
ontology=CaptionOntology(
{
"person": "person",
"a forklift": "forklift"
}
)
)
results = base_model.predict("./context_images/test.jpg")
print(results)
base_model.label("./context_images", extension=".jpeg")
The code in this repository is licensed under an MIT license.
We love your input! Please see the core Autodistill contributing guide to get started. Thank you 🙏 to all our contributors!