Sebastian Raschka 80d4732456 add HF equivalency tests for standalone nbs (#774) před 3 měsíci
..
tests 80d4732456 add HF equivalency tests for standalone nbs (#774) před 3 měsíci
README.md a6b883c9f9 Gemma 3 270M From Scratch (#771) před 3 měsíci
standalone-gemma3.ipynb a6b883c9f9 Gemma 3 270M From Scratch (#771) před 3 měsíci

README.md

Gemma 3 270M From Scratch

This standalone-gemma3.ipynb Jupyter notebook in this folder contains a from-scratch implementation of Gemma 3 270M. It requires about 2 GB of RAM to run.

Below is a side-by-side comparison with Qwen3 0.6B as a reference model; if you are interested in the Qwen3 0.6B standalone notebook, you can find it here.



To learn more about the architecture differences and read about comparisons with other architectures, see my The Big LLM Architecture Comparison: From DeepSeek-V3 to Kimi K2: A Look At Modern LLM Architecture Designarticle.