← Back to context

Comment by throwa356262

17 hours ago

What would you suggest instead?

A non-autoregressive transformer trained with a classification objective.