Update readme to match code, see cl/609177092

jan-wassenberg · dan-zheng · commit af715d24361c · 2024-02-23T22:34:08.000-08:00
PiperOrigin-RevId: 609912278
diff --git a/README.md b/README.md
@@ -308,19 +308,19 @@ The code defines two C++ structs, `ConfigGemma7B` and `ConfigGemma2B`, which are
 
 **ConfigGemma7B**:
 
-* `seq_len`: Stores the length of the sequence to be processed. It's set to 7168.
-* `vocab_size`: Stores the size of the vocabulary, which is 256128.
-* `n_layers`: Number of layers in the deep learning model. It's set to 28.
-* `dim_model`: Dimension of the model's internal representation. It's set to 3072.
-* `dim_ffw_hidden`: Dimension of the feedforward and recurrent layers' hidden representations. It's set to 16 * 3072 / 2.
+* `kSeqLen`: Stores the length of the sequence to be processed. It's set to 7168.
+* `kVocabSize`: Stores the size of the vocabulary, which is 256128.
+* `kLayers`: Number of layers in the deep learning model. It's set to 28.
+* `kModelDim`: Dimension of the model's internal representation. It's set to 3072.
+* `kFFHiddenDim`: Dimension of the feedforward and recurrent layers' hidden representations. It's set to 16 * 3072 / 2.
 
 **ConfigGemma2B**:
 
-* `seq_len`: Stores the length of the sequence to be processed. It's also set to 7168.
-* `vocab_size`: Size of the vocabulary, which is 256128.
-* `n_layers`: Number of layers in the deep learning model. It's set to 18.
-* `dim_model`: Dimension of the model's internal representation. It's set to 2048.
-* `dim_ffw_hidden`: Dimension of the feedforward and recurrent layers' hidden representations. It's set to 16 * 2048 / 2.
+* `kSeqLen`: Stores the length of the sequence to be processed. It's also set to 7168.
+* `kVocabSize`: Size of the vocabulary, which is 256128.
+* `kLayers`: Number of layers in the deep learning model. It's set to 18.
+* `kModelDim`: Dimension of the model's internal representation. It's set to 2048.
+* `kFFHiddenDim`: Dimension of the feedforward and recurrent layers' hidden representations. It's set to 16 * 2048 / 2.
 
 These structs are used to configure a deep learning model with specific parameters for either Gemma7B or Gemma2B architecture.
 ```