Sebastian Raschka 80d4732456 add HF equivalency tests for standalone nbs (#774) 3 месяцев назад
..
tests 80d4732456 add HF equivalency tests for standalone nbs (#774) 3 месяцев назад
README.md a6b883c9f9 Gemma 3 270M From Scratch (#771) 3 месяцев назад
standalone-gemma3.ipynb a6b883c9f9 Gemma 3 270M From Scratch (#771) 3 месяцев назад

README.md

Gemma 3 270M From Scratch

This standalone-gemma3.ipynb Jupyter notebook in this folder contains a from-scratch implementation of Gemma 3 270M. It requires about 2 GB of RAM to run.

Below is a side-by-side comparison with Qwen3 0.6B as a reference model; if you are interested in the Qwen3 0.6B standalone notebook, you can find it here.



To learn more about the architecture differences and read about comparisons with other architectures, see my The Big LLM Architecture Comparison: From DeepSeek-V3 to Kimi K2: A Look At Modern LLM Architecture Designarticle.