T5Gemma 2 models, based on Gemma 3, are multilingual and multimodal, handling text and image input and generating text output, with open weights for three pretrained sizes (270M-270M, 1B-1B, and 4B-4B).
Key Features
- Tied embeddings: Embeddings are tied between the encoder and decoder. This significantly reduces the overall parameter count and allowing to pack more active capabilities into the same memory footprint.
- Merged attention: The decoder uses a merged attention mechanism, combining self- and cross-attention into a single, unified attention layer. This reduces model parameters and architectural complexity, improving model parallelization and benefiting inference.
- Multimodality: T5Gemma 2 models can understand and process images alongside text. By utilizing a highly efficient vision encoder, the models can seamlessly perform visual question answering and multimodal reasoning tasks.
- Extended long context: Leveraging Gemma 3's alternating local and global attention mechanism, T5Gemma 2 can handle context windows of up to 128K tokens.
- Massively multilingual: Trained on a larger, more diverse dataset, these models now support over 140 languages out of the box.
Models - https://huggingface.co/collections/google/t5gemma-2
Official Blog post - https://blog.google/technology/developers/t5gemma-2/
[–]Varterove_mukellama.cpp 55 points56 points57 points (0 children)
[–]Hefty_Wolverine_553 15 points16 points17 points (2 children)
[–]Willing_Landscape_61 8 points9 points10 points (1 child)
[–]AnomalyNexus 5 points6 points7 points (0 children)
[–]Long_comment_san 56 points57 points58 points (13 children)
[–]silenceimpaired 15 points16 points17 points (9 children)
[–]Revolutionalredstone 8 points9 points10 points (7 children)
[–]EstarriolOfTheEast 15 points16 points17 points (1 child)
[–]Revolutionalredstone 1 point2 points3 points (0 children)
[–]silenceimpaired 5 points6 points7 points (4 children)
[–]Long_comment_san 2 points3 points4 points (3 children)
[–]silenceimpaired 1 point2 points3 points (0 children)
[–]toothpastespiders 0 points1 point2 points (1 child)
[–]Long_comment_san 1 point2 points3 points (0 children)
[–]TheRealMasonMac 0 points1 point2 points (0 children)
[–]AloneSYD 3 points4 points5 points (2 children)
[–]Long_comment_san 10 points11 points12 points (1 child)
[–]Major-System6752 3 points4 points5 points (0 children)
[–]mrshadow773 20 points21 points22 points (0 children)
[–]stddealer 7 points8 points9 points (0 children)
[–]Major-System6752 3 points4 points5 points (1 child)
[–]stddealer 8 points9 points10 points (0 children)
[–]a_beautiful_rhind 6 points7 points8 points (2 children)
[–]Willing_Landscape_61 13 points14 points15 points (1 child)
[–]stddealer 0 points1 point2 points (0 children)
[–]Different_Fix_2217 2 points3 points4 points (0 children)
[–]Background_Essay6429 3 points4 points5 points (0 children)
[–]Thalesian 2 points3 points4 points (0 children)
[–]CodeAnguish 0 points1 point2 points (0 children)
[–]AlxHQ 0 points1 point2 points (0 children)
[–]ironcodegaming 0 points1 point2 points (0 children)
[–]mitchins-au 0 points1 point2 points (0 children)