Comment by baldai

12 hours ago

They are not even close in capabilities. The only benchmark I've ever seen that captures their difference is DeepSWE. They are worse by a factor of 3.