Move preprocess to base distributed embedding class. #93
Conversation
```diff
@@ -183,6 +190,128 @@ def _create_table_and_slot_variables(
     return output


+def create_feature_samples(
```
These are also added to `test_utils.py` below. This looks like a mistake.
The two are actually different. This one generates random samples. The one in `embedding_utils.py` combines the inputs and weights into a format for use with `jax_tpu_embedding`.
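For context, a minimal sketch of what the random-sample flavor might look like; the name, signature, and logic here are hypothetical illustrations, not the actual implementation:

```python
import numpy as np

def create_feature_samples(batch_size, vocab_size, max_ids_per_sample, seed=0):
    """Hypothetical sketch: generate ragged random token/weight samples."""
    rng = np.random.default_rng(seed)
    tokens, weights = [], []
    for _ in range(batch_size):
        # Each sample gets a random number of token ids (a ragged batch).
        n = int(rng.integers(1, max_ids_per_sample + 1))
        tokens.append(rng.integers(0, vocab_size, size=n))
        weights.append(rng.random(n, dtype=np.float32))
    return tokens, weights
```

The `embedding_utils.py` helper, by contrast, would take samples like these and pack them into the input/weight structure that `jax_tpu_embedding` consumes.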
This function was only used in the `embedding_lookup_test`, which I previously didn't include.
Renamed to `generate_feature_samples` to avoid confusion.
```python
model.fit(preprocessed_training_dataset, epochs=10)
```

For non-JAX backends, preprocessing will bundle together the inputs and
Can you explicitly mention that preprocessing is optional?
Done.
Added "optional" here again to reiterate.
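To make the optional part concrete, a usage sketch: `model`, `embedding_layer`, `training_dataset`, and the exact `preprocess` signature are assumptions for illustration, not the confirmed API.

```python
# Without preprocessing: pass the raw dataset straight to fit().
model.fit(training_dataset, epochs=10)

# With optional preprocessing: bundle the inputs (and weights) ahead of
# time inside the input pipeline, then train on the preprocessed dataset.
preprocessed_training_dataset = training_dataset.map(
    lambda inputs, labels: (embedding_layer.preprocess(inputs), labels)
)
model.fit(preprocessed_training_dataset, epochs=10)
```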
Force-pushed from 091472c to d4ff069.
```diff
@@ -275,8 +404,8 @@ def _compute_expected_lookup_grad(
     embedding_dim = activation_gradients.shape[1]
     sample_lengths = jnp.array([len(sample) for sample in samples.tokens])
     rows = jnp.repeat(jnp.arange(batch_size), sample_lengths)
-    cols = jnp.concatenate(jnp.unstack(samples.tokens))
-    vals = jnp.concatenate(jnp.unstack(samples.weights)).reshape(-1, 1)
+    cols = jnp.concatenate(np.unstack(samples.tokens))  # type: ignore[attr-defined]
```
I really don't understand the mypy issue with this. The mypy job passes locally with the exact same versions of `numpy` and `mypy` installed. The `np.unstack` attribute does exist in numpy 2.2.6, which is what the CI installs. I need `np.unstack` because the arrays are ragged, so `jnp.unstack` throws an error.
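A minimal repro of the ragged-array constraint, assuming `samples.tokens` is a 1-D NumPy object array of variable-length token arrays (`np.unstack` requires NumPy >= 2.1):

```python
import numpy as np
import jax.numpy as jnp

# Ragged per-sample token arrays stored in a 1-D object array.
tokens = np.empty(3, dtype=object)
tokens[0] = np.array([1, 2, 3])
tokens[1] = np.array([4])
tokens[2] = np.array([5, 6])

# jnp.unstack(tokens) would fail here: JAX first converts its argument to
# a rectangular device array, which is impossible for ragged input.

# np.unstack iterates over axis 0 without converting, yielding the
# per-sample arrays, which can then be flattened with a concatenate.
cols = jnp.concatenate(np.unstack(tokens))  # -> [1 2 3 4 5 6]
```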
Added `unused-ignore` to the `pyproject.toml` to prevent local failure.
Also updated some minor issues encountered in testing on sparsecores.