Comment by throwa356262

14 hours ago

What would you suggest instead?

A non-autoregressive transformer trained with a classification objective.
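
For anyone unsure what that looks like in practice, here is a rough sketch, not a real implementation. It assumes "non-autoregressive" means one forward pass over the whole input with no causal mask, and "classification objective" means cross-entropy over a fixed set of labels. Every name and size in it (`classify`, `params`, `VOCAB`, `DIM`, `CLASSES`) is made up for illustration, the weights are random, and it leaves out positional encodings, layer norm, the feed-forward block and the training loop.

```python
import math
import random

def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def matvec(W, x):
    # Multiply a matrix (list of rows) by a vector.
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def rand_matrix(rows, cols, rng):
    return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]

def self_attention(X, Wq, Wk, Wv):
    # Single-head attention with NO causal mask:
    # every position attends to every other position.
    Q = [matvec(Wq, x) for x in X]
    K = [matvec(Wk, x) for x in X]
    V = [matvec(Wv, x) for x in X]
    d = len(Q[0])
    out = []
    for q in Q:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in K]
        weights = softmax(scores)
        out.append([sum(w * v[j] for w, v in zip(weights, V)) for j in range(d)])
    return out

def classify(tokens, params):
    # One forward pass over the full sequence; no token-by-token decoding.
    X = [params["embed"][t] for t in tokens]
    H = self_attention(X, params["Wq"], params["Wk"], params["Wv"])
    H = [[h + x for h, x in zip(hr, xr)] for hr, xr in zip(H, X)]  # residual
    pooled = [sum(col) / len(H) for col in zip(*H)]                # mean pool
    return softmax(matvec(params["Wout"], pooled))                 # class probs

def cross_entropy(probs, label):
    # Classification objective: negative log-likelihood of the true class.
    return -math.log(probs[label])

# Hypothetical sizes, chosen only to keep the example small.
VOCAB, DIM, CLASSES = 10, 8, 3
rng = random.Random(0)
params = {
    "embed": rand_matrix(VOCAB, DIM, rng),
    "Wq": rand_matrix(DIM, DIM, rng),
    "Wk": rand_matrix(DIM, DIM, rng),
    "Wv": rand_matrix(DIM, DIM, rng),
    "Wout": rand_matrix(CLASSES, DIM, rng),
}

probs = classify([1, 4, 2, 7], params)
loss = cross_entropy(probs, label=2)
```

In this sketch the output is a single distribution over `CLASSES` labels rather than a next-token distribution that gets sampled and fed back in. Training would minimise `loss` over labelled examples. Because positional encodings are omitted and the pooling is a mean, this toy version ignores token order, which a real model would not.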