Comment by cedws

15 hours ago

>Multimodal by design: Gemma 3n natively supports image, audio, video, and text inputs and text outputs.

But I understand your point: Simon asked it to output SVG (text) instead of a raster image, so it's more difficult.