← Back to context Comment by naasking 5 days ago I was thining of something like LLaDa that uses a Transformer to predict forward masked tokens:https://arxiv.org/abs/2502.09992 0 comments naasking Reply No comments yet Contribute on Hacker News ↗
No comments yet
Contribute on Hacker News ↗