Comment by leumon
11 hours ago
My locally running nemotron-3-nano, quantized to Q4_K_M, gets this right (although it used 20k thought tokens before answering the question).