Uses a pretrained Google ViT vision encoder connected to GPT-2 to generate captions for images. Achieves a BLEU score of 8+ on the Flickr8k dataset.
llLucidll/Captionr
About
Integrates Google ViT-Base with OpenAI GPT-2 to create a sequence-to-sequence model that generates a text caption for an input image.
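For readers unfamiliar with the BLEU metric quoted for Flickr8k, here is a minimal pure-Python sketch of corpus-level BLEU with multiple reference captions per image. It is an illustration only: the repository does not state which BLEU variant, n-gram order, tokenization, or smoothing it used, so the choices here (up to 4-grams, no smoothing, a 0 to 100 scale) and the names `ngrams` and `corpus_bleu` are assumptions, not the project's evaluation code.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def corpus_bleu(candidates, references, max_n=4):
    """Corpus-level BLEU on a 0-100 scale (assumed scale, no smoothing).

    candidates: list of token lists, one generated caption per image.
    references: list of lists of token lists, several captions per image.
    """
    matches = [0] * max_n  # clipped n-gram matches, per n-gram order
    totals = [0] * max_n   # candidate n-gram counts, per n-gram order
    cand_len = 0
    ref_len = 0

    for cand, refs in zip(candidates, references):
        cand_len += len(cand)
        # Use the reference length closest to the candidate (ties: shorter).
        ref_len += min((abs(len(r) - len(cand)), len(r)) for r in refs)[1]
        for n in range(1, max_n + 1):
            cand_counts = ngrams(cand, n)
            # Clip each n-gram by its maximum count in any single reference.
            max_ref = Counter()
            for r in refs:
                for gram, count in ngrams(r, n).items():
                    max_ref[gram] = max(max_ref[gram], count)
            matches[n - 1] += sum(min(c, max_ref[g]) for g, c in cand_counts.items())
            totals[n - 1] += sum(cand_counts.values())

    if cand_len == 0 or min(matches) == 0:
        return 0.0  # without smoothing, any zero precision gives BLEU 0

    # Geometric mean of the modified n-gram precisions.
    log_precision = sum(math.log(m / t) for m, t in zip(matches, totals)) / max_n
    # Brevity penalty punishes candidates shorter than their references.
    brevity_penalty = 1.0 if cand_len > ref_len else math.exp(1 - ref_len / cand_len)
    return 100.0 * brevity_penalty * math.exp(log_precision)


# Hypothetical captions, made up for illustration (not Flickr8k data).
generated = [["a", "dog", "runs", "on", "the", "grass"]]
human = [[
    ["a", "dog", "runs", "on", "green", "grass"],
    ["the", "dog", "is", "running"],
]]
score = corpus_bleu(generated, human)
print(f"BLEU: {score:.2f}")
```

Established toolkits such as NLTK or sacreBLEU apply their own tokenization and smoothing, so their scores can differ from this sketch on the same captions.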