← Back to context Comment by electroglyph 1 day ago any divergence (even if the benchmark is better) from full precision is error 1 comment electroglyph Reply 7e 15 hours ago Just pretend that it is the next step update when training. You didn’t train your model to step=inf, I hope?
7e 15 hours ago Just pretend that it is the next step update when training. You didn’t train your model to step=inf, I hope?
Just pretend that it is the next step update when training. You didn’t train your model to step=inf, I hope?