Gemma is a family of generative artificial intelligence (AI) models that you can use in a wide variety of generation tasks, including question answering, summarization, and reasoning. It is built on technology similar to that behind Gemini. The first version was released in February 2024, followed by Gemma 2 in June 2024 and Gemma 3 in March 2025. This repository contains the implementation of the gemma PyPI package. Gemma 3, Google's open model, can be deployed to production on Google Cloud: serverless Cloud Run for simplicity, or GKE for robust orchestration.
Gemma models are available in two sizes. Gemma 3n models support reduced-parameter operation, achieved using the flexible parameter technology built into them to help them run efficiently on lower-resource devices. The parameters in Gemma 3n models are divided into 4 main groups. Gemma 3 supports over 140 languages and offers advanced text and visual reasoning capabilities.

This page documents releases for the Gemma family of models:
Release of VaultGemma in 1B parameter size.
Release of Gemma 3 in 270M size.
Release of T5Gemma across different parameter sizes.
Release of MedGemma 27B parameter multimodal model.
Release of Gemma 3n in E2B and E4B sizes.