Comment by jwitthuhn
17 hours ago
They did upload the wrong model but as of the time of writing they have not fixed it. Right now, 12 hours after they took the old one down, there is simply no model present in their huggingface repo.
17 hours ago
They did upload the wrong model but as of the time of writing they have not fixed it. Right now, 12 hours after they took the old one down, there is simply no model present in their huggingface repo.
I guess they will upload it later, it seems like an honest mistake to me.
Anyways SwiTransformer paper looks interesting and doing a post training to optimize for it looks interesting as well.