Comment by jabroni_salad
1 day ago
Gemma is open source and apache 2.0 licensed. If you want to include it with an app you have to package it yourself.
gemini nano is an android api that you dont control at all.
1 day ago
Gemma is open source and apache 2.0 licensed. If you want to include it with an app you have to package it yourself.
gemini nano is an android api that you dont control at all.
> Gemma is open source and apache 2.0 licensed
Closed source but open weight. Let’s not ruin the definition of the term in advantage of big companies.
Your reply adds more confusion, imo.
The inference code and model architecture IS open source[0] and there are many other high quality open source implementations of the model (in many cases contributed by Google engineers[1]). To your point: they do not publish the data used to train the model so you can't re-create it from scratch.
[0] https://github.com/google-deepmind/gemma [1] https://github.com/vllm-project/vllm/pull/2964
If for some reason you had the training data, is it even possible to create an exact (possibly same hash?) copy of the model? Seems like there are a lot of other pieces missing like the training harness, hardware it was trained on, etc?
2 replies →
I am not sure if this adds even more confusion. Linked library is about fine-tuning which is completely different process.
Their publications about producing Gemma is not accurate enough that even with data you would get the same results.
2 replies →
Yes!! But I doubt how many are truly truly open source models since most just confuse open source with open weights and the definition has been changed really smh.
> Gemma is open source and apache 2.0 licensed.
Are you sure? On a quick look, it appears to use its own bespoke license, not the Apache 2.0 license. And that license appears to have field of use restrictions, which means it would not be classified as an open source license according to the common definitions (OSI, DFSG, FSF).
Wait, what files are you reading? https://github.com/google-deepmind/gemma/blob/main/LICENSE
(Even then, releasing some source code under Apache-2 does not make a model "open source".)
Ah I found https://ai.google.dev/gemma/terms
https://ai.google.dev/gemma/prohibited_use_policy
Yeah, definitely not open source, even if they had released all the training data.
Perhaps we could rephrase my statement to "there are a bunch of green checkmarks on github that may or may not mean anything depending on who you ask."