Summary

The Gemma 3 models range from 1B to 27B parameters, support a context window of up to 128k tokens, accept both images and text as input, and cover 140+ languages.

Compared to Gemma 2, Gemma 3:

  • has a longer context length
  • is multimodal
  • is multilingual

The 1B version is limited to:

  • 32k tokens
  • text only
  • English only

Longer Context Length

The models are pre-trained with a 32k-token sequence length; the larger variants are then scaled to 128k tokens by adjusting the RoPE scale factor.
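The idea behind scaling RoPE is position interpolation: dividing position indices by a scale factor keeps the rotary angles of long sequences within the range the model saw during 32k-token pre-training. A minimal sketch (the `base` and `scale` values here are illustrative assumptions, not Gemma 3's exact configuration):

```python
import math

def rope_angles(pos, dim, base=10_000.0, scale=1.0):
    """Rotary embedding angles for one position.

    Dividing the position by `scale` (position interpolation)
    stretches a model trained on N tokens to cover N * scale
    positions. Values are illustrative, not Gemma 3's actual config.
    """
    return [
        (pos / scale) / base ** (2 * i / dim)
        for i in range(dim // 2)
    ]

# A position 8x beyond the original window maps back onto the
# same angles the model was trained on once scale=8 is applied:
orig = rope_angles(4096, dim=64)
stretched = rope_angles(4096 * 8, dim=64, scale=8.0)
assert all(abs(a - b) < 1e-9 for a, b in zip(orig, stretched))
```

The assertion shows why this extends the usable context: token 32,768 under `scale=8` produces the same rotary angles as token 4,096 did in pre-training, so attention behaves as if the sequence were still inside the original window.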

References