Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

>Multimodal by design: Gemma 3n natively supports image, audio, video, and text inputs and text outputs.

But I understood your point, Simon asked it to output SVG (text) instead of a raster image so it's more difficult.






It can handle image and audio inputs, but it cannot produce those as outputs - it's purely a text output model.

Yeah you're right. Also, you're Simon :)



Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: