Comment by gabriel666smith
2 days ago
Thank you, I agree. I think it'd be helpful to publish parts of it.
> Are you saying the training task is to ask for the (fib_i)th token rather than the next token?
Yes, functionally - I explained in more detail in another comment.
I'm not sure which is the key point (sort of what I'm trying to work out), but I'll get the model-generation code into the repo. Is that the best thing for you?
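For anyone following along, here is a rough sketch of one way to read the "(fib_i)th token" idea from the quoted question: instead of pairing each position with the token 1 step ahead, pair it with the token a Fibonacci number of steps ahead. This is only an illustration of that reading, not the code from the repo; `fib_offsets` and `shifted_targets` are hypothetical names, and the real setup may differ.

```python
def fib_offsets(max_offset):
    """Return the distinct Fibonacci numbers 1, 2, 3, 5, 8, ... up to max_offset."""
    offsets = []
    a, b = 1, 2
    while a <= max_offset:
        offsets.append(a)
        a, b = b, a + b
    return offsets


def shifted_targets(tokens, offset):
    """Pair each token with the token `offset` positions ahead of it.

    offset=1 is ordinary next-token prediction; larger offsets are the
    assumed 'Fibonacci-ahead' variant (an assumption, not the repo's method).
    """
    return [(tokens[t], tokens[t + offset]) for t in range(len(tokens) - offset)]


if __name__ == "__main__":
    tokens = list("abcdef")
    for k in fib_offsets(5):
        print(k, shifted_targets(tokens, k))
```

With `offset=1` this reduces to the usual (input, next token) pairs, so the standard task is just the first Fibonacci case under this reading.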