Hacker News new | past | comments | ask | show | jobs | submit login

They say all the models were distilled from a teacher model but they didn't specify what that teacher model is. Interesting thing to hide.





It's a safe bet that it's either one of the Gemini models or a relative of it.

That's what I thought. And it could be pulicity of Gemini as well that it is so good that it can teach students say 5x faster. If it is Gemini, there isn't any reason to hide. My bet is it is some unreleased Gemma or some model.



Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: