Can you improve Gemma 4?

#2
by Regrin - opened

Hello!
I heard about your work in which you significantly improved a model's performance by duplicating several of its layers.
I really like the Gemma models and want to use them on my laptop, which is not very powerful.
Could you try improving the Gemma 4 models in the same way?
I'm primarily interested in Gemma 4 E4B, although an improved Gemma 4 31B could also produce excellent results!

It's very important to me that the model doesn't become even more erratic.

I would be very grateful if you could improve the Gemma series models. Perhaps you could contact the developers at Google? They could handle it themselves.

I wonder whether your method for improving models will work with Gemma 4 E4B.
After all, it uses Per-Layer Embeddings.
