Adapt "Natural language image search with a Dual Encoder" Demo to Keras 3 and Keras Hub #2209

vinayak19th · 2025-10-31T05:50:12Z

Summary

This PR updates the 'Natural language image search with a Dual Encoder' example to Keras 3 with Torch and Tensorflow Backend support for training, however, still using tf.data for dataloading and pre-processing.

Changelog

Major Changes:

Migrated all TensorFlow Hub functionality to KerasHub
Removed all tf.x functions to Keras 3 ops.x supporting full multi-backend functionality
Created multiple train_step implementations for the Tensorflow and Torch backends in the Dual Encoder with automatic selection of the correct implementation based on the backend.
Updated vision_encoder to use the better and more stable EfficientNetV2 models over the older Xception model.
Moved text preprocessing and tokenization out of the model and into the tf.Data pipeline for more efficient GPU VRAM usage and training speed.
Added additional blocks of code for increased clarity, such as a cell showing the processed dataset elements.
Fixed issues in dataset download and unzip file pathing due to changes in modern versions of the wget package.

Minor Changes:

Updated the markdown documentation to better match modern Keras 3 standards.
Converted tf.matmul-based inner product to the simplified and more efficient ops.einsum.
Added additional explanations to functions for increased clarity, such as an explanation of the different tf.data stages.

gemini-code-assist · 2025-10-31T05:50:36Z

Summary of Changes

Hello @vinayak19th, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly updates the 'Natural language image search with a Dual Encoder' demo to align with Keras 3, introducing multi-backend support for training with both TensorFlow and Torch. The changes focus on modernizing the underlying libraries, improving model architecture with a more advanced vision encoder, and optimizing the data processing pipeline for better performance and clarity. The overall goal is to provide a robust and up-to-date example for natural language image search within the Keras ecosystem.

Highlights

Keras 3 Adaptation: The demo has been fully adapted to Keras 3, ensuring compatibility with both TensorFlow and Torch backends for training.
Keras Hub Migration: All TensorFlow Hub functionalities have been migrated to KerasHub, streamlining dependencies and aligning with Keras 3 standards.
Multi-Backend Training: The DualEncoder now includes separate train_step implementations for TensorFlow and Torch, with automatic selection based on the active Keras backend.
Improved Vision Encoder: The vision encoder has been upgraded from Xception to the more stable and performant EfficientNetV2B3 model.
Optimized Data Pipeline: Text preprocessing and tokenization have been moved from the model into the tf.data pipeline, enhancing GPU VRAM usage and training speed.
Dataset Handling Fixes: Issues with dataset download and unzip file pathing, caused by changes in modern wget package versions, have been resolved.
Code Clarity and Efficiency: Added code blocks for clarity (e.g., showing processed dataset elements) and converted tf.matmul operations to ops.einsum for improved efficiency.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

Removed accidental extra Adam Optimizer

gemini-code-assist

Code Review

This pull request successfully updates the "Natural language image search with a Dual Encoder" demo to Keras 3, including migrating from TensorFlow Hub to Keras Hub and adding multi-backend support. The changes are well-structured, with notable improvements like upgrading the vision encoder to EfficientNetV2 and moving text preprocessing into the tf.data pipeline for better performance. My feedback includes a critical suggestion to reintroduce shuffling in the data pipeline to ensure proper model training, along with several medium-severity recommendations to improve code clarity and maintainability by removing commented-out code, unused variables, and replacing hardcoded values with defined constants.

examples/vision/nl_image_search.py

vinayak19th · 2025-10-31T06:01:12Z

/gemini review

gemini-code-assist

Code Review

This is a great pull request that successfully updates the example to Keras 3 and Keras Hub, enabling multi-backend support and using modern components. The code is much clearer and follows current best practices. I've found a critical syntax error in a print statement that needs to be fixed. I also have some high-severity feedback regarding the dataset download and extraction logic, specifically about cleaning up large temporary files and ensuring path consistency. Additionally, I've included a few medium-severity suggestions to improve robustness and add support for the JAX backend to make it fully Keras 3 compatible. Overall, excellent work on this migration!

examples/vision/nl_image_search.py

gemini-code-assist · 2025-10-31T06:05:44Z

examples/vision/nl_image_search.py

+        if keras.config.backend() == "torch":
+            self.train_step = self.train_step_torch
+        elif keras.config.backend() == "tensorflow":
+            self.train_step = self.train_step_tf


This logic for selecting train_step is great for multi-backend support. However, it only covers TensorFlow and Torch. To make this example fully compatible with Keras 3, you could also add support for the JAX backend. This would require implementing a train_step_jax method.

Not sure how to implement in Jax fully yet

examples/vision/nl_image_search.py

vinayak19th added 3 commits October 30, 2025 02:20

Created Demo Base

11816c6

Merge branch 'keras-team:master' into master

004736e

removed temp dir and parts from gitignore

32264d0

github-actions bot assigned sachinprasadhs Oct 31, 2025

Code clean up

6cf28e0

Removed accidental extra Adam Optimizer

gemini-code-assist bot reviewed Oct 31, 2025

View reviewed changes

vinayak19th added 3 commits October 30, 2025 22:57

Added dataset_shuffle and removed commented out os.rm commands

f2a9792

Fixed duplicate stack exchange

6bd5e71

Fixed hardcoded number of epochs

7544abb

gemini-code-assist bot reviewed Oct 31, 2025

View reviewed changes

vinayak19th added 2 commits October 30, 2025 23:30

Multiple changes based on gemini code-review

bfd12ce

Fixed everything from gemini that I could

0205646

Adapt "Natural language image search with a Dual Encoder" Demo to Keras 3 and Keras Hub #2209

Are you sure you want to change the base?

Adapt "Natural language image search with a Dual Encoder" Demo to Keras 3 and Keras Hub #2209

Conversation

vinayak19th commented Oct 31, 2025

Summary

Changelog

Uh oh!

gemini-code-assist bot commented Oct 31, 2025

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

vinayak19th commented Oct 31, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

gemini-code-assist bot Oct 31, 2025

Choose a reason for hiding this comment

Uh oh!

vinayak19th Oct 31, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants