Gemma3 seems working fine, but TranslateGemma is not, I can't compare it with Gemma4, the converter gives an error:
File "C:\Python311\model\convert.py", line 6712, in dump_config assert txt_config.use_bidirectional_attention == 'vision'
TranslateGemma 4b/12b@Q8/FP16 (any backend) stops inference at some point, the problem always appears on the prompt: "List the programming languages whose code you can edit and what exactly you can do with the code of these languages"
chatllm version: 0.22+
CLI: main.exe -c 4096 -i -ngl all -m C:\Models\gemma-4-E2B-it-q8_0.bin
You are served by translategemma-4b-it,
with 3880263168 (3.9B) parameters.
You > List the programming languages whose code you can edit and what exactly you can do with the code of these languages.
A.I. > Okay, here's a list of programming languages I can work with, along with a description of what I can do with their code:
**Languages I Can Work With:**
* **Python:**
* **What I can do:**
* **Code Generation:** I can generate Python code snippets, functions, classes, and even entire programs based on your instructions.
* **Code Completion:** I can suggest code completions as you type, helping you write code faster and with fewer errors.
* **Code Explanation:** I can explain what a piece of Python code does, line by line or at a higher level.
* **Code Translation:** I can translate code from other languages (like JavaScript or C++) to Python, or vice versa (though the accuracy depends on the complexity).
* **Code Debugging:** I can help identify potential errors and suggest fixes in Python code.
* **Code Optimization:** I can suggest ways to improve the performance and efficiency of Python code.
* **Code Refactoring:** I can help you refactor your code to make it more readable, maintainable, and efficient.
* **Code Documentation:** I can generate documentation for your Python code.
* **Code Testing:** I can help you write unit tests for your Python code.
* **Code Style Enforcement:** I can check your code for adherence to Python style guides (like PEP 8).
* **JavaScript:**
* **What I can do:**
* **Code Generation:** I can generate JavaScript code snippets, functions, classes, and entire programs.
* **Code Completion:** I can suggest code completions.
* **Code Explanation:** I can explain what a piece of JavaScript code does.
* **Code Translation:** I can translate code from other languages (like Python or C++) to JavaScript, or vice versa.
* **Code Debugging:** I can help identify potential errors.
* **Code Optimization:** I can suggest ways to improve performance.
* **Code Refactoring:** I can help with refactoring.
* **Code Documentation:** I can generate documentation.
* **Code Testing:** I can help with writing tests.
* **Code Style Enforcement:** I can check for adherence to style guides (like Airbnb or Google).
*
i.e. the inference stops at about 30%
Gemma3 seems working fine, but TranslateGemma is not, I can't compare it with Gemma4, the converter gives an error:
File "C:\Python311\model\convert.py", line 6712, in dump_config assert txt_config.use_bidirectional_attention == 'vision'TranslateGemma 4b/12b@Q8/FP16 (any backend) stops inference at some point, the problem always appears on the prompt: "List the programming languages whose code you can edit and what exactly you can do with the code of these languages"
chatllm version: 0.22+
CLI:
main.exe -c 4096 -i -ngl all -m C:\Models\gemma-4-E2B-it-q8_0.bini.e. the inference stops at about 30%