Comment by alfiedotwtf

1 day ago

Wasn’t it already obvious given the awfully familiar parameter numbers?

That only tells what base architecture they used, but fine tuning does not increase the number of weights, it just adapts the weights to improve better on a fine tuning dataset- something they claimed they had done