← Back to context

Comment by jwitthuhn

17 hours ago

They did upload the wrong model but as of the time of writing they have not fixed it. Right now, 12 hours after they took the old one down, there is simply no model present in their huggingface repo.

I guess they will upload it later, it seems like an honest mistake to me.

Anyways SwiTransformer paper looks interesting and doing a post training to optimize for it looks interesting as well.