Google DeepMind Launches EmbeddingGemma, an Open Model for On-Device Embeddings

Google DeepMind has introduced EmbeddingGemma, a 308M parameter open embedding model designed to run efficiently on-device. The model aims to make applications like retrieval-augmented generation (RAG), semantic search, and text classification accessible without the need for a server or internet connection. By Robert Krzaczyński

September 12, 2025

2025-09-11 19:00 GMT · 10 months ago www.infoq.com

Google DeepMind has introduced EmbeddingGemma, a 308M parameter open embedding model designed to run efficiently on-device. The model aims to make applications like retrieval-augmented generation (RAG), semantic search, and text classification accessible without the need for a server or internet connection. By Robert Krzaczyński

Original: https://www.infoq.com/news/2025/09/embedding-gemma/?utm_campaign=infoq_content&utm_source=infoq&utm_medium=feed&utm_term=AI%2C+ML+%26+Data+Engineering