Gemma 3n
  • Technology

Announcing Gemma 3n preview: powerful, efficient, mobile-first AI

  • aster.cloud
  • May 22, 2025
  • 4 minute read

Following the exciting launches of Gemma 3 and Gemma 3 QAT, our family of state-of-the-art open models capable of running on a single cloud or desktop accelerator, we’re pushing our vision for accessible AI even further. Gemma 3 delivered powerful capabilities for developers, and we’re now extending that vision to highly capable, real-time AI operating directly on the devices you use every day – your phones, tablets, and laptops.

To power the next generation of on-device AI and support a diverse range of applications, including advancing the capabilities of Gemini Nano, we engineered a new, cutting-edge architecture. This next-generation foundation was created in close collaboration with mobile hardware leaders like Qualcomm Technologies, MediaTek, and Samsung System LSI, and is optimized for lightning-fast, multimodal AI, enabling truly personal and private experiences directly on your device.


Gemma 3n is our first open model built on this groundbreaking, shared architecture, allowing developers to begin experimenting with this technology today in an early preview. The same advanced architecture also powers the next generation of Gemini Nano, which brings these capabilities to a broad range of features in Google apps and our on-device ecosystem, and will become available later this year. Gemma 3n enables you to start building on this foundation that will come to major platforms such as Android and Chrome.

This chart ranks AI models by Chatbot Arena Elo scores; higher scores (top numbers) indicate greater user preference. Gemma 3n ranks highly amongst both popular proprietary and open models.
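
To make the Elo scale more concrete, here is a minimal sketch of the standard Elo expected-score formula, which converts a rating gap into an approximate preference probability. The ratings below are hypothetical and are not read from the chart, and the post does not describe how Chatbot Arena fits its scores, so treat this only as a rough way to interpret a gap between two models.

```python
def elo_expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that model A is preferred over model B under the standard Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


# Hypothetical ratings, chosen only for illustration (not taken from the chart).
model_a, model_b = 1300.0, 1200.0
p = elo_expected_score(model_a, model_b)
print(f"Expected preference for A over B: {p:.2f}")  # prints 0.64
```

Under this formula, equal ratings give a 50% preference, and a 100-point lead corresponds to being preferred in roughly 64% of head-to-head comparisons.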

Gemma 3n leverages a Google DeepMind innovation called Per-Layer Embeddings (PLE) that delivers a significant reduction in RAM usage. While the raw parameter counts are 5B and 8B, PLE lets you run these larger models on mobile devices, or live-stream them from the cloud, with a memory overhead comparable to that of 2B and 4B models, respectively. As a result, the models can operate with a dynamic memory footprint of just 2GB and 3GB. Learn more in our documentation.
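
As a back-of-envelope illustration of why the gap between raw and effective parameter counts matters, the sketch below estimates weight memory for each count. The 4-bit weight width is an assumption made for this example and is not stated in the post. The 2GB and 3GB figures above are dynamic footprints that cover more than weights alone, so the numbers printed here are not expected to match them.

```python
def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Approximate memory needed to hold the weights, in GB (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


BITS = 4  # assumption: 4-bit quantized weights; the post does not state this

# (raw parameter count, effective parameter count) pairs from the paragraph above.
for raw, effective in [(5e9, 2e9), (8e9, 4e9)]:
    full = weight_memory_gb(raw, BITS)
    reduced = weight_memory_gb(effective, BITS)
    print(f"raw {raw / 1e9:.0f}B -> {full:.1f} GB, "
          f"effective {effective / 1e9:.0f}B -> {reduced:.1f} GB")
```

Whatever precision is actually used, the ratio is the point: if only the effective parameters count toward the footprint, weight memory drops in proportion to the effective-to-raw ratio.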

By exploring Gemma 3n, developers can get an early preview of the open model’s core capabilities and mobile-first architectural innovations that will be available on Android and Chrome with Gemini Nano.

In this post, we’ll explore Gemma 3n’s new capabilities, our approach to responsible development, and how you can access the preview today.


Key Capabilities of Gemma 3n

Engineered for fast, low-footprint AI experiences running locally, Gemma 3n delivers:

  • Optimized On-Device Performance & Efficiency: Gemma 3n starts responding approximately 1.5x faster on mobile, with significantly better quality, than Gemma 3 4B. It also has a reduced memory footprint, achieved through innovations like Per-Layer Embeddings, KVC sharing, and advanced activation quantization.
  • Many-in-1 Flexibility: A model with a 4B active memory footprint natively includes a nested, state-of-the-art submodel with a 2B active memory footprint (thanks to MatFormer training). This lets you trade off performance and quality on the fly without hosting separate models. We further introduce a mix’n’match capability in Gemma 3n that dynamically creates submodels from the 4B model to fit your specific use case and its associated quality/latency tradeoff. Stay tuned for more on this research in our upcoming technical report.
  • Privacy-First & Offline Ready: Local execution enables features that respect user privacy and function reliably, even without an internet connection.
  • Expanded Multimodal Understanding with Audio: Gemma 3n can understand and process audio, text, and images, and offers significantly enhanced video understanding. Its audio capabilities enable the model to perform high-quality Automatic Speech Recognition (transcription) and Translation (speech to translated text). Additionally, the model accepts interleaved inputs across modalities, enabling understanding of complex multimodal interactions. (Public implementation coming soon)
  • Improved Multilingual Capabilities: Stronger multilingual performance, particularly in Japanese, German, Korean, Spanish, and French, reflected in multilingual benchmark results such as 50.1% on WMT24++ (ChrF).
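
The nested-submodel and mix’n’match ideas from the Many-in-1 bullet can be pictured with a toy sketch. This is not Gemma 3n's implementation, which the post leaves to the upcoming technical report. It assumes that a nested submodel reuses a prefix of each feed-forward layer's hidden units, and that mix’n’match means choosing that width per layer. The weights, sizes, and function names are all hypothetical.

```python
from typing import List, Sequence, Tuple

Matrix = List[List[float]]


def ffn(x: List[float], w_in: Matrix, w_out: Matrix, width: int) -> List[float]:
    """Feed-forward block that uses only the first `width` hidden units.

    w_in and w_out each hold one row per hidden unit, so slicing the rows
    selects a smaller block nested inside the full one (shared weights).
    """
    hidden = [max(0.0, sum(w * xi for w, xi in zip(row, x))) for row in w_in[:width]]
    return [sum(h * w_out[j][i] for j, h in enumerate(hidden)) for i in range(len(x))]


def forward(x: List[float], layers: Sequence[Tuple[Matrix, Matrix]],
            widths: Sequence[int]) -> List[float]:
    """Run a stack of residual feed-forward layers with a per-layer width."""
    for (w_in, w_out), width in zip(layers, widths):
        y = ffn(x, w_in, w_out, width)
        x = [a + b for a, b in zip(x, y)]  # residual connection
    return x


# Toy weights (hypothetical): model dimension 2, full hidden width 4.
W_IN = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [1.0, -1.0]]
W_OUT = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5], [1.0, -1.0]]
LAYERS = [(W_IN, W_OUT), (W_IN, W_OUT)]

x0 = [1.0, 2.0]
print("full model   :", forward(x0, LAYERS, [4, 4]))
print("nested small :", forward(x0, LAYERS, [2, 2]))
print("mix'n'match  :", forward(x0, LAYERS, [4, 2]))
```

All three configurations read from the same weight matrices, so one set of stored weights can serve several quality/latency points without hosting separate models.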

This chart shows MMLU performance vs. model size for Gemma 3n’s mix’n’match (pretrained) capability.

Unlocking New On-the-go Experiences

Gemma 3n will empower a new wave of intelligent, on-the-go applications by enabling developers to:

  1. Build live, interactive experiences that understand and respond to real-time visual and auditory cues from the user’s environment.
  2. Power deeper understanding and contextual text generation using combined audio, image, video, and text inputs, all processed privately on-device.
  3. Develop advanced audio-centric applications, including real-time speech transcription, translation, and rich voice-driven interactions.


Here’s an overview and the types of experiences you can build:

Building Responsibly, Together

Our commitment to responsible AI development is paramount. Gemma 3n, like all Gemma models, underwent rigorous safety evaluations, data governance, and fine-tuning alignment with our safety policies. We approach open models with careful risk assessment, continually refining our practices as the AI landscape evolves.


Get Started: Preview Gemma 3n Today

We’re excited to get Gemma 3n into your hands through a preview starting today:


Initial Access (Available Now):

  • Cloud-based Exploration with Google AI Studio: Try Gemma 3n directly in your browser on Google AI Studio – no setup needed. Explore its text input capabilities instantly.
  • On-Device Development with Google AI Edge: For developers looking to integrate Gemma 3n locally, Google AI Edge provides tools and libraries. You can get started with text and image understanding/generation capabilities today.


Gemma 3n marks the next step in democratizing access to cutting-edge, efficient AI. We’re incredibly excited to see what you’ll build as we make this technology progressively available, starting with today’s preview.

Explore this announcement and all Google I/O 2025 updates on io.google starting May 22.

Source: zedreviews.com
