
Google releases Gemma 4 12B, a single multimodal model for text and images
Hacker News·6d·Google
Google's new Gemma 4 12B combines text and image understanding in one model without separate encoders, aimed at developers building on-device or cost-constrained AI applications. For indie makers, this means easier deployment of multimodal features without juggling multiple model architectures or managing complex pipelines.
Original story
Read the original on Hacker NewsRelated stories
⬢ HYVE SPOTLIGHT
The Owens AI Institute is giving K-12 AI education away free, foreverHyve Spotlight·2w·HyveCares